OLAC Record
oai:www.ldc.upenn.edu:LDC95T13

Metadata
Title:Mandarin Chinese News Text
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Wu, Zhibiao. Mandarin Chinese News Text LDC95T13. Web Download. Philadelphia: Linguistic Data Consortium, 1995
Contributor:Wu, Zhibiao
Date (W3CDTF):1995
Description:The Linguistic Data Consortium (LDC) announces the availability of a Mandarin Chinese text corpus. This corpus includes about 250 million GB-encoded text characters. The Mandarin News Corpus includes text from various journalistic sources: * newspaper text from Renmin Ribao (People's Daily) * radio scripts from China Radio International * newswire text from Xinhua newswire service The format of this corpus uses a labeled bracketing, expressed in the style of SGML (Standard Generalized Markup Language). The header fields provided by the sources, which give information such as topic, date and article ID, have been retained. The articles cover a variety of topics, including international and domestic news, sports and culture.
Identifier:LDC95T13
https://catalog.ldc.upenn.edu/LDC95T13
ISBN: 1-58563-052-7
ISLRN: 133-578-348-091-2
DOI: 10.35111/ajd2-0b82
Language:Mandarin Chinese
Language (ISO639):cmn
License:Mandarin Chinese News Text Agreement: https://catalog.ldc.upenn.edu/license/mandarin-chinese-news-text-corpus-user-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC95T13
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC95T13
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Wu, Zhibiao. 1995. Linguistic Data Consortium.
Terms: area_Asia country_CN dcmi_Text iso639_cmn olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC95T13
Up-to-date as of: Fri Dec 6 7:47:10 EST 2024