Acoustic Data Building Toolset

Sas, Jerzy

dc.contributor.author	Sas, Jerzy
dc.date.accessioned	2018-04-17T09:41:46Z
dc.date.available	2018-04-17T09:41:46Z
dc.date.issued	2018-04-03
dc.identifier.uri	http://hdl.handle.net/11321/466
dc.description	This folder contains data and software tools (in python) that can be used in experiments with phoneme recognition in speech samples recorder in Polish. Acoustic data used here were extracted from CLARIN-PL speech corpus after rejecting speech samples, where recorded sequence of words does not correspond strictly to the word sequence declared as the sample orthographic transcription. In order to use the python programs and data published here, the appropriate folder structure should be created. Follow the steps below: 1) create the root folder and set the environment variable ASR_DATASET_ROOT that points to this folder (let's call it ROOT), 2) create the subfolders in the ROOT folder: train, devel, test, doc and src in the root folder, 3) download ar files: test.tar.gz, devel.tar.gz, train.tar.gz, doc.tar.gz, src.tar.gz and unpack then in the corresponding folders, 4)download aux.tar.gz and unpack it directly to ROOT folder. More information can be found in doc/README.pdf. If you find this dataset useful, please make reference in your related papers to the paper: " Acoustic Data Building Toolset for Easy Experimentation with Neural Network-based Speech Recognition in Polish and English" (https://ieeexplore.ieee.org/document/8431366/) Bibtex: @INPROCEEDINGS{8431366, author={J. Sas}, booktitle={2018 11th International Conference on Human System Interaction (HSI)}, title={Acoustic Data Building Toolset for Easy Experimentation with Neural Network-based Speech Recognition in Polish and English}, year={2018}, volume={}, number={}, pages={93-99}, doi={10.1109/HSI.2018.8431366}, ISSN={}, month={July},}
dc.language.iso	pol
dc.publisher	Wrocław University of Science and Technology
dc.rights	Creative Commons - Attribution 3.0 Unported (CC BY 3.0)
dc.rights.uri	http://creativecommons.org/licenses/by/3.0/
dc.rights.label	CC
dc.source.uri	https://github.com/ASR-K2-WrUT/nn asr
dc.subject	speech recognition
dc.subject	deep neural networks
dc.subject	machine learning
dc.title	Acoustic Data Building Toolset
dc.type	corpus
metashare.ResourceInfo#ContentInfo.mediaType	audio
has.files	yes
branding	CLARIN-PL
contact.person	Jerzy Sas jerzy.sas@pwr.edu.pl Wrocław University of Science and Technology
size.info	29 hours
files.size	18661660774
files.count	13