
 
dc.contributor.author Sas, Jerzy
dc.date.accessioned 2018-04-17T09:41:46Z
dc.date.available 2018-04-17T09:41:46Z
dc.date.issued 2018-04-03
dc.identifier.uri http://hdl.handle.net/11321/466
dc.description This folder contains data and software tools (in Python) that can be used in experiments with phoneme recognition in speech samples recorded in Polish. The acoustic data were extracted from the CLARIN-PL speech corpus after rejecting speech samples in which the recorded sequence of words does not correspond strictly to the word sequence declared as the sample's orthographic transcription. To use the Python programs and data published here, create the appropriate folder structure by following these steps: 1) create the root folder (let's call it ROOT) and set the environment variable ASR_DATASET_ROOT to point to it, 2) create the subfolders train, devel, test, doc and src in the ROOT folder, 3) download the tar files test.tar.gz, devel.tar.gz, train.tar.gz, doc.tar.gz and src.tar.gz and unpack them in the corresponding folders, 4) download aux.tar.gz and unpack it directly into the ROOT folder. More information can be found in doc/README.pdf. If you find this dataset useful, please cite the following paper in your related work: "Acoustic Data Building Toolset for Easy Experimentation with Neural Network-based Speech Recognition in Polish and English" (https://ieeexplore.ieee.org/document/8431366/). BibTeX: @INPROCEEDINGS{8431366, author={J. Sas}, booktitle={2018 11th International Conference on Human System Interaction (HSI)}, title={Acoustic Data Building Toolset for Easy Experimentation with Neural Network-based Speech Recognition in Polish and English}, year={2018}, volume={}, number={}, pages={93-99}, doi={10.1109/HSI.2018.8431366}, ISSN={}, month={July},}
dc.language.iso pol
dc.publisher Wrocław University of Science and Technology
dc.rights Creative Commons - Attribution 3.0 Unported (CC BY 3.0)
dc.rights.uri http://creativecommons.org/licenses/by/3.0/
dc.rights.label CC
dc.source.uri https://github.com/ASR-K2-WrUT/nn_asr
dc.subject speech recognition
dc.subject deep neural networks
dc.subject machine learning
dc.title Acoustic Data Building Toolset
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType audio
has.files yes
branding CLARIN-PL
contact.person Jerzy Sas jerzy.sas@pwr.edu.pl Wrocław University of Science and Technology
size.info 29 hours
files.size 18661660774
files.count 13
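
The setup steps listed in dc.description can be sketched in Python roughly as follows. This is a minimal illustration, not part of the published toolset: the function names prepare_root and unpack_archives and the download_dir argument are hypothetical, and the sketch assumes the .tar.gz archives have already been downloaded into one local folder. Archives that are not present are skipped, which matters here because the training data is listed as a multi-part zip (train.zip plus train.z01 to train.z07) rather than as train.tar.gz and has to be unpacked separately.

```python
import tarfile
from pathlib import Path

# Subfolders named in step 2 of the description.
SUBFOLDERS = ("train", "devel", "test", "doc", "src")


def prepare_root(root: Path) -> list:
    """Create ROOT (if needed) and the five subfolders inside it."""
    created = []
    for name in SUBFOLDERS:
        folder = root / name
        folder.mkdir(parents=True, exist_ok=True)
        created.append(folder)
    return created


def unpack_archives(root: Path, download_dir: Path) -> list:
    """Unpack <name>.tar.gz into ROOT/<name> and aux.tar.gz into ROOT.

    download_dir is a hypothetical folder holding the downloaded archives.
    Returns the names of the archives that were found and unpacked.
    """
    targets = {f"{name}.tar.gz": root / name for name in SUBFOLDERS}
    targets["aux.tar.gz"] = root  # step 4: unpack directly into ROOT

    unpacked = []
    for archive_name, destination in targets.items():
        archive = download_dir / archive_name
        if not archive.is_file():
            continue  # not downloaded (or distributed in another format)
        with tarfile.open(archive, "r:gz") as tar:
            tar.extractall(destination)
        unpacked.append(archive_name)
    return unpacked
```

A typical call would read the root from the environment, for example prepare_root(Path(os.environ["ASR_DATASET_ROOT"])) after importing os, followed by unpack_archives with the same root and the download folder. The variable itself still has to be set in the shell so that the toolset's own programs can see it.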


 Files in this item

This item is distributed under Creative Commons and licensed under: Creative Commons - Attribution 3.0 Unported (CC BY 3.0). Attribution required.
Name          Size       Format            Description
doc.tar.gz    37.89 KB   application/gzip  documentation
src.tar.gz    31.64 KB   application/gzip  toolset source code
aux.tar.gz    514 bytes  application/gzip  aux
devel.tar.gz  1.35 GB    application/gzip  devel
test.tar.gz   1.6 GB     application/gzip
train.zip     439.52 MB  application/zip   train
train.z01     2 GB       Unknown           train
train.z02     2 GB       Unknown           train
train.z03     2 GB       Unknown           train
train.z04     2 GB       Unknown           train
train.z05     2 GB       Unknown           train
train.z06     2 GB       Unknown           train
train.z07     2 GB       Unknown           train
