OLAC Record oai:lindat.mff.cuni.cz:11234/1-1510 |
Metadata | ||
Title: | STAZKA – Speech recordings from vehicles | |
Bibliographic Citation: | http://hdl.handle.net/11234/1-1510 | |
Creator: | Šmídl, Luboš | |
Stanislav, Petr | ||
Radová, Vlasta | ||
Date (W3CDTF): | 2015-09-03T07:44:19Z | |
Date Available: | 2015-09-03T07:44:19Z | |
Description: | The database actually contains two sets of recordings, both recorded in the moving or stationary vehicles (passenger cars or trucks). All data were recorded within the project “Intelligent Electronic Record of the Operation and Vehicle Performance” whose aim is to develop a voice-operated software for registering the vehicle operation data. The first part (full_noises.zip) consists of relatively long recordings from the vehicle cabin, containing spontaneous speech from the vehicle crew. The recordings are accompanied with detailed transcripts in the Transcriber XML-based format (.trs). Due to the recording settings, the audio contains many different noises, only sparsely interspersed with speech. As such, the set is suitable for robust estimation of the voice activity detector parameters. The second set (prompts.zip) consists of short prompts that were recorded in the controlled setting – the speakers either answered simple questions or they repeated commands and short phrases. The prompts were recorded by 26 different speakers. Each speaker recorded at least two sessions (with identical set of prompts) – first in stationary vehicle, with low level of noise (those recordings are marked by –A_ in the file name) and second while actually driving the car (marked by –B_ or, since several speakers recorded 3 sessions, by –C_). The recordings from this set are suitable mostly for training of the robust domain-specific speech recognizer and also ASR test purposes. | |
Identifier (URI): | http://hdl.handle.net/11234/1-1510 | |
Language: | Czech | |
Language (ISO639): | ces | |
Publisher: | University of West Bohemia, Department of Cybernetics | |
Rights: | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) | |
http://creativecommons.org/licenses/by-nc-sa/4.0/ | ||
Subject: | speech corpus | |
noisy speech | ||
voice activity detector | ||
speech recognition | ||
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:lindat.mff.cuni.cz:11234/1-1510 | |
DateStamp: | 2021-06-29 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Šmídl, Luboš; Stanislav, Petr; Radová, Vlasta. 2015. University of West Bohemia, Department of Cybernetics. | |
Terms: | area_Europe country_CZ dcmi_Text iso639_ces olac_primary_text |