Acoustic Data Building Toolset

Acoustic Data Building Toolset

Authors: Sas, Jerzy

Project URL: https://github.com/ASR-K2-WrUT/nn asr

Date issued: 2018-04-03

Type: corpus

Size: 29 hours

Language(s): Polish

Description: This folder contains data and software tools (in python) that can be used in experiments with phoneme recognition in speech samples recorder in Polish. Acoustic data used here were extracted from CLARIN-PL speech corpus after rejecting speech samples, where recorded sequence of words does not correspond strictly to the word sequence declared as the sample orthographic transcription. In order to use the python programs and data published here, the appropriate folder structure should be created. Follow the steps below: 1) create the root folder and set the environment variable ASR_DATASET_ROOT that points to this folder (let's call it ROOT), 2) create the subfolders in the ROOT folder: train, devel, test, doc and src in the root folder, 3) download ar files: test.tar.gz, devel.tar.gz, train.tar.gz, doc.tar.gz, src.tar.gz and unpack then in the corresponding folders, 4)download aux.tar.gz and unpack it directly to ROOT folder. More information can be found in doc/README.pdf. If you find this dataset useful, please make reference in your related papers to the paper: " Acoustic Data Building Toolset for Easy Experimentation with Neural Network-based Speech Recognition in Polish and English" (https://ieeexplore.ieee.org/document/8431366/) Bibtex: @INPROCEEDINGS{8431366, author={J. Sas}, booktitle={2018 11th International Conference on Human System Interaction (HSI)}, title={Acoustic Data Building Toolset for Easy Experimentation with Neural Network-based Speech Recognition in Polish and English}, year={2018}, volume={}, number={}, pages={93-99}, doi={10.1109/HSI.2018.8431366}, ISSN={}, month={July},}

Publisher: Wrocław University of Science and Technology

Subject(s): speech recognition deep neural networks machine learning

Collection(s): CLARIN-PL

Show full item record

Files in this item

This item is

Distributed under Creative Commons

and licensed under:
Creative Commons - Attribution 3.0 Unported (CC BY 3.0)

Name: doc.tar.gz
Size: 37.89 KB
Format: application/gzip
Description: documentation

Name: src.tar.gz
Size: 31.64 KB
Format: application/gzip
Description: toolset source code

Name: aux.tar.gz
Size: 514 bytes
Format: application/gzip
Description: aux

Name: devel.tar.gz
Size: 1.35 GB
Format: application/gzip
Description: devel

Name: test.tar.gz
Size: 1.6 GB
Format: application/gzip

Name: train.zip
Size: 439.52 MB
Format: application/zip
Description: train

Name: train.z01
Size: 2 GB
Format: Unknown
Description: train

Name: train.z02
Size: 2 GB
Format: Unknown
Description: train

Name: train.z03
Size: 2 GB
Format: Unknown
Description: train

Name: train.z04
Size: 2 GB
Format: Unknown
Description: train

Name: train.z05
Size: 2 GB
Format: Unknown
Description: train

Name: train.z06
Size: 2 GB
Format: Unknown
Description: train

Name: train.z07
Size: 2 GB
Format: Unknown
Description: train