OLAC Record: RATS Low Speech Density

OLAC Record
oai:www.ldc.upenn.edu:LDC2024S03

Metadata

Title: RATS Low Speech Density

Access Rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining

Bibliographic Citation: Walker, Kevin, et al. RATS Low Speech Density LDC2024S03. Web Download. Philadelphia: Linguistic Data Consortium, 2024

Contributor: Walker, Kevin

Graff, David

Ma, Xiaoyi

Strassel, Stephanie

Jones, Karen

Date (W3CDTF): 2024

Date Issued (W3CDTF): 2024-03-15

Description: *Introduction* RATS Low Speech Density was developed by the Linguistic Data Consortium (LDC) and comprises approximately 87 hours of English, Levantine Arabic, Farsi, Pashto and Urdu speech and non-speech samples. The recordings were assembled by concatenating a randomized selection of speech, communications systems sounds, and silence. This corpus was created to measure false alarm performance in RATS speech activity detection systems.

The goal of the RATS (Robust Automatic Transcription of Speech) program was to develop human language technology systems capable of performing speech detection, language identification, speaker identification and keyword spotting on the severely degraded audio signals that are typical of radio communication channels, especially those employing handheld portable transceiver systems. To support that goal, LDC assembled a system for the transmission, reception and digital capture of audio data that allowed a single source audio signal to be distributed and recorded over eight distinct transceiver configurations simultaneously. Those configurations covered three frequency bands (high, very high and ultra high), variously combined with amplitude modulation, frequency hopping spread spectrum, narrow-band frequency modulation, single-side-band or wide-band frequency modulation. Annotations on the clear source audio signal, e.g., time boundaries for the duration of speech activity, were projected onto the corresponding eight channels recorded from the radio receivers.

*Data* The source audio was extracted from RATS development and progress speech activity detection sets and from RATS keyword spotting development data. It consists of conversational telephone speech recordings collected by LDC: (1) data collected for the RATS program from Levantine Arabic, Farsi, Pashto and Urdu speakers; and (2) material from the Fisher English (LDC2004S13, LDC2005S13) and Fisher Levantine Arabic telephone studies (LDC2007S02), Levantine Arabic QT Training Data Set 5, Speech (LDC2006S29), and CALLFRIEND Farsi Second Edition Speech (LDC2014S01). Non-speech samples were selected from communications systems sounds, including telephone network special information tones, radio selective calling signals, HF/VHF/UHF digital mode radio traffic, radio network control channel signals, two-way radio traffic containing roger beeps, and short duration shift-key modulated handset data transmissions.

The data is divided into development, progress, and train sets, each with its own subdirectories. All audio files are presented as single-channel, 16-bit PCM, 16000 samples per second; lossless FLAC compression is used on all files. When uncompressed, the files have "MS-WAV" (RIFF) file headers. A collection of tables describing the design and assembly of the source audio files is included in the documentation accompanying this release.

*Sponsorship* This material is based upon work supported by the Defense Advanced Research Projects Agency (DARPA) under Contract No. D10PC20016. The content does not necessarily reflect the position or the policy of the Government, and no official endorsement should be inferred.

*Samples* Audio Sample (FLAC)

*Updates* None at this time.

Extent: Corpus size: 142610317 KB

Format: Sampling Rate: 16000

Sampling Format: pcm

Identifier: LDC2024S03

https://catalog.ldc.upenn.edu/LDC2024S03

ISLRN: 670-178-409-396-6

DOI: 10.35111/4ena-fg30

Language: English

Persian

Pushto

Urdu

Levantine Arabic

Language (ISO639): eng

fas

pus

urd

License: LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf

Medium: Distribution: Web Download

Publisher: Linguistic Data Consortium

Publisher (URI): https://www.ldc.upenn.edu

Relation (URI): https://catalog.ldc.upenn.edu/docs/LDC2024S03

Rights Holder: Portions © 1995-1996, 2003-2006, 2014, 2017, 2024 Trustees of the University of Pennsylvania

Type (DCMI): Sound

Text

Type (OLAC): primary_text
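
The Description and Format fields above state that each audio file, once decompressed from FLAC, is a RIFF/WAV file holding single-channel, 16-bit PCM at 16000 samples per second. The following is a minimal sketch of how a decoded file could be checked against those three values using Python's standard wave module. The function names, the constants and the in-memory test file are illustrative assumptions and are not part of the corpus or its documentation; decoding FLAC to WAV is assumed to have been done beforehand with a separate tool.

```python
import io
import wave

# Values taken from the record's Description and Format fields.
EXPECTED_CHANNELS = 1            # single-channel
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit PCM
EXPECTED_FRAME_RATE = 16000      # samples per second


def matches_stated_format(wav_source):
    """Return True if a RIFF/WAV file has the format stated in the record.

    wav_source may be a file path or a binary file-like object.
    """
    with wave.open(wav_source, "rb") as wav:
        return (
            wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getframerate() == EXPECTED_FRAME_RATE
        )


def make_silent_wav(channels, sample_width, frame_rate, n_frames=160):
    """Build a short silent WAV in memory (hypothetical helper for the demo)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)
        wav.setframerate(frame_rate)
        wav.writeframes(b"\x00" * (n_frames * channels * sample_width))
    buf.seek(0)  # rewind so the buffer can be read back
    return buf


if __name__ == "__main__":
    print(matches_stated_format(make_silent_wav(1, 2, 16000)))  # mono, 16-bit, 16 kHz
    print(matches_stated_format(make_silent_wav(2, 2, 16000)))  # stereo, so a mismatch
```

To check a real file, pass the path of a decoded WAV file to matches_stated_format in place of the in-memory buffer.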

OLAC Info

Archive: The LDC Corpus Catalog

Description: http://www.language-archives.org/archive/www.ldc.upenn.edu

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:www.ldc.upenn.edu:LDC2024S03

DateStamp: 2025-01-02

GetRecord: OAI-PMH request for simple DC format
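
The GetRecord entries above refer to OAI-PMH requests for this record in OLAC and simple Dublin Core formats. As a rough sketch, such a request is an HTTP GET with three query arguments: verb, identifier and metadataPrefix. The base URL below is a placeholder, since this record does not state the repository endpoint. The prefix oai_dc is the one the OAI-PMH specification requires for simple Dublin Core; the prefix olac is assumed here for the OLAC format. The function name is hypothetical.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Placeholder endpoint: the record does not give the repository's base URL.
BASE_URL = "https://example.org/oai"

# Identifier copied from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:www.ldc.upenn.edu:LDC2024S03"


def build_get_record_url(base_url, identifier, metadata_prefix):
    """Build an OAI-PMH GetRecord request URL (hypothetical helper)."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


if __name__ == "__main__":
    # "oai_dc" is the simple DC prefix; "olac" is an assumed prefix for OLAC.
    for prefix in ("oai_dc", "olac"):
        url = build_get_record_url(BASE_URL, OAI_IDENTIFIER, prefix)
        print(url)
        # Show the decoded arguments, since the colons are percent-encoded.
        print(parse_qs(urlsplit(url).query))
```

The colons in the identifier are percent-encoded by urlencode, which is why the sketch also prints the decoded query arguments.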

Search Info
Citation: Walker, Kevin; Graff, David; Ma, Xiaoyi; Strassel, Stephanie; Jones, Karen. 2024. RATS Low Speech Density. Linguistic Data Consortium.
Terms: area_Asia area_Europe country_GB country_PK dcmi_Sound dcmi_Text iso639_eng iso639_fas iso639_pus iso639_urd olac_primary_text

http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2024S03
Up-to-date as of: Wed Oct 29 7:02:11 EDT 2025
