OLAC Record
oai:www.ldc.upenn.edu:LDC2019S14

Metadata
Title:The DKU-JNU-EMA Electromagnetic Articulography Database
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Qin, Xiaoyi, et al. The DKU-JNU-EMA Electromagnetic Articulography Database LDC2019S14. Web Download. Philadelphia: Linguistic Data Consortium, 2019
Contributor:Qin, Xiaoyi
Liu, Xinzhong
Cai, Zexin
Li, Ming
Date (W3CDTF):2019
Date Issued (W3CDTF):2019-07-15
Description:*Introduction* The DKU-JNU-EMA Electromagnetic Articulography Database was developed by Duke Kunshan University and Jinan University and contains approximately 10 hours of articulography and speech data in Mandarin, Cantonese, Hakka, and Teochew Chinese from two to seven native speakers for each dialect. Electromagnetic articulography (EMA) is a method of measuring the position of parts of the mouth and their movement over time during speech and swallowing. Measurements are made from sensors placed in the mouth to capture real-time vocal tract variable trajectories. EMA is used in linguistics and language-related research to study phonetics, in particular, articulation (how sounds are made). *Data* Articulatory measurements were made using the NDI electromagnetic articulography wave research system. Subjects had six sensors placed in various locations in their mouth and one reference sensor was placed on the bridge of their nose. For simultaneous recording of speech signals, subjects also wore a head-mounted close-talk microphone. Speakers engaged in four different types of recording sessions: one in which they read complete sentences or short texts, and three sessions in which they read related words of a specific common consonant, vowel or tone. Audio data is presented as single channel, 16kHz, 16-bit flac compressed wav files. Articulography data is stored as UTF-8 plain text files. *Samples* Please view the following samples: * Cantonese Speech * Cantonese Sensor Data * Cantonese Parameter Data * Mandarin Speech * Mandarin Sensor Data * Mandarin Parameter Data *Updates* Chinese and English README files were updated on April 22, 2024.
Extent:Corpus size: 3122232 KB
Format:Sampling Rate: 16000
Sampling Format: pcm
Identifier:LDC2019S14
https://catalog.ldc.upenn.edu/LDC2019S14
ISBN: 1-58563-894-3
ISLRN: 147-070-436-975-2
DOI: 10.35111/61f9-zg33
Language:Yue Chinese
Hakka Chinese
Min Nan Chinese
Mandarin Chinese
Language (ISO639):yue
hak
nan
cmn
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2019S14
Rights Holder:Portions © 2019 Duke Kunshan University, © 2019 Jinan University, © 2019 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2019S14
DateStamp:  2024-04-22
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Qin, Xiaoyi; Liu, Xinzhong; Cai, Zexin; Li, Ming. 2019. Linguistic Data Consortium.
Terms: area_Asia country_CN dcmi_Sound dcmi_Text iso639_cmn iso639_hak iso639_nan iso639_yue olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2019S14
Up-to-date as of: Fri Dec 6 7:48:53 EST 2024