OLAC Record
oai:www.ldc.upenn.edu:LDC2023S08

Metadata
Title:CALLFRIEND Russian Speech
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Miller, David, et al. CALLFRIEND Russian Speech LDC2023S08. Web Download. Philadelphia: Linguistic Data Consortium, 2023
Contributor:Miller, David
Walker, Kevin
Graff, David
Canavan, Alexandra
Date (W3CDTF):2023
Date Issued (W3CDTF):2023-09-15
Description:*Introduction* CALLFRIEND Russian Speech (LDC2023S08) was developed by the Linguistic Data Consortium (LDC) and consists of approximately 48 hours of telephone conversations (100 recordings) between native speakers of Russian. The calls were recorded in 1999 as part of the CALLFRIEND collection. One hundred native Russian speakers living in the continental United States each made a single phone call, lasting up to 30 minutes, to a family member or friend living in the United States. Corresponding transcripts and a lexicon are available in CALLFRIEND Russian Text (LDC2023T09). The CALLFRIEND series is a collection of telephone conversations in several languages conducted by LDC in support of language identification technology development. Languages covered in the collection include American English, Canadian French, Egyptian Arabic, Farsi, German, Hindi, Japanese, Korean, Mandarin Chinese, Russian, Spanish, Tamil and Vietnamese. *Data* All recordings involved domestic calls routed through the automated telephone collection platform at LDC and were stored as 2-channel (4-wire) 8-KHz mu-law samples taken directly from a public telephone network via a T-1 circuit. Each audio file is a FLAC-compressed MS-WAV (RIFF) format audio file containing 2-channel, 8-KHz, 16-bit PCM sample data. This release includes call metadata, including speaker gender, the number of speakers on each channel and call duration. *Samples* Please listen to this audio sample. *Updates* None at this time.
Extent:Corpus size: 2345105 KB
Format:Sampling Rate: 8000
Sampling Format: pcm
Identifier:LDC2023S08
https://catalog.ldc.upenn.edu/LDC2023S08
ISLRN: 607-488-391-299-4
DOI: 10.35111/x1s2-xv64
Language:Russian
Language (ISO639):rus
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2023S08
Rights Holder:Portions © 1999, 2023 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2023S08
DateStamp:  2024-01-01
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Miller, David; Walker, Kevin; Graff, David; Canavan, Alexandra. 2023. Linguistic Data Consortium.
Terms: area_Europe country_RU dcmi_Sound iso639_rus olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2023S08
Up-to-date as of: Fri Dec 6 7:49:12 EST 2024