dc.contributor.author | Sas, Jerzy |
dc.date.accessioned | 2018-04-17T09:41:46Z |
dc.date.available | 2018-04-17T09:41:46Z |
dc.date.issued | 2018-04-03 |
dc.identifier.uri | http://hdl.handle.net/11321/466 |
dc.description | This folder contains data and software tools (in python) that can be used in experiments with phoneme recognition in speech samples recorder in Polish. Acoustic data used here were extracted from CLARIN-PL speech corpus after rejecting speech samples, where recorded sequence of words does not correspond strictly to the word sequence declared as the sample orthographic transcription. In order to use the python programs and data published here, the appropriate folder structure should be created. Follow the steps below: 1) create the root folder and set the environment variable ASR_DATASET_ROOT that points to this folder (let's call it ROOT), 2) create the subfolders in the ROOT folder: train, devel, test, doc and src in the root folder, 3) download ar files: test.tar.gz, devel.tar.gz, train.tar.gz, doc.tar.gz, src.tar.gz and unpack then in the corresponding folders, 4)download aux.tar.gz and unpack it directly to ROOT folder. More information can be found in doc/README.pdf. If you find this dataset useful, please make reference in your related papers to the paper: " Acoustic Data Building Toolset for Easy Experimentation with Neural Network-based Speech Recognition in Polish and English" (https://ieeexplore.ieee.org/document/8431366/) Bibtex: @INPROCEEDINGS{8431366, author={J. Sas}, booktitle={2018 11th International Conference on Human System Interaction (HSI)}, title={Acoustic Data Building Toolset for Easy Experimentation with Neural Network-based Speech Recognition in Polish and English}, year={2018}, volume={}, number={}, pages={93-99}, doi={10.1109/HSI.2018.8431366}, ISSN={}, month={July},} |
dc.language.iso | pol |
dc.publisher | Wrocław University of Science and Technology |
dc.rights | Creative Commons - Attribution 3.0 Unported (CC BY 3.0) |
dc.rights.uri | http://creativecommons.org/licenses/by/3.0/ |
dc.rights.label | CC |
dc.source.uri | https://github.com/ASR-K2-WrUT/nn asr |
dc.subject | speech recognition |
dc.subject | deep neural networks |
dc.subject | machine learning |
dc.title | Acoustic Data Building Toolset |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | audio |
has.files | yes |
branding | CLARIN-PL |
contact.person | Jerzy Sas jerzy.sas@pwr.edu.pl Wrocław University of Science and Technology |
size.info | 29 hours |
files.size | 18661660774 |
files.count | 13 |
Files in this item
This item is
Creative Commons - Attribution 3.0 Unported (CC BY 3.0)
Distributed under Creative Commons
and licensed under:Creative Commons - Attribution 3.0 Unported (CC BY 3.0)