This folder contains data and software tools (in python) that can be used in experiments with phoneme recognition in speech samples recorder in Polish. Acoustic data used here were extracted from CLARIN-PL speech corpus after rejecting speech samples, where recorded sequence of words does not correspond strictly to the word sequence declared as the sample orthographic transcription. In order to use the python programs and data published here, the appropriate folder structure should be created. Follow the steps below:
1) create the root folder and set the environment variable ASR_DATASET_ROOT that points to this folder (let's call it ROOT),
2) create the subfolders in the ROOT folder: train, devel, test, doc and src in the root folder,
3) download ar files: test.tar.gz, devel.tar.gz, train.tar.gz, doc.tar.gz, src.tar.gz and unpack then in the corresponding folders,
4)download aux.tar.gz and unpack it directly to ROOT folder.
More information can be found in doc/README.pdf.
If you find this dataset useful, please make reference in your related papers to the paper: "
Acoustic Data Building Toolset for Easy Experimentation with Neural Network-based Speech Recognition in Polish and English" (https://ieeexplore.ieee.org/document/8431366/)
Bibtex:
@INPROCEEDINGS{8431366,
author={J. Sas},
booktitle={2018 11th International Conference on Human System Interaction (HSI)},
title={Acoustic Data Building Toolset for Easy Experimentation with Neural Network-based Speech Recognition in Polish and English},
year={2018},
volume={},
number={},
pages={93-99},
doi={10.1109/HSI.2018.8431366},
ISSN={},
month={July},}