OLAC Record oai:www.ldc.upenn.edu:LDC2013T13 |
Metadata | ||
Title: | Chinese Proposition Bank 3.0 | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Xue, Nianwen, et al. Chinese Proposition Bank 3.0 LDC2013T13. Web Download. Philadelphia: Linguistic Data Consortium, 2013 | |
Contributor: | Xue, Nianwen | |
Bai, Xiaopeng | ||
Lu, Jill | ||
Zhang, Jennifer | ||
Palmer, Martha | ||
Chang, Meiyu | ||
Zhong, Hua | ||
Date (W3CDTF): | 2013 | |
Date Issued (W3CDTF): | 2013-07-15 | |
Description: | Chinese Proposition Bank 3.0 is a continuation of the Chinese Proposition Bank project which aims to create a corpus of text annotated with information about basic semantic propositions. Chinese Proposition Bank 3.0 adds predicate-argument annotation on 187,731 words from Chinese Treebank 7.0 (LDC2010T07). The data sources are comprised of newswire, magazine articles, various broadcast news and broadcast conversation programming, web newsgroups and weblogs. LDC has also released Chinese Proposition Bank 1.0 (LDC2005T23) and Chinese Proposition Bank 2.0 (LDC2008T07). *Data* This release contains the predicate-argument annotation of 173,206 verb instances and 14,525 noun instances. The annotation of nouns is limited to nominalizations that have a corresponding verb. The general annotation guidelines and the lexical guidelines (called frame files) for each verbal and nominal predicate are also included in this release. Below are some statistics about the corpus. * Total propositions for verbs - 173,206 * Total propositions for nouns - 14,525 * Total verbs framed - 24,642 * Total framesets - 26,467 * Verbs with multiple framesets - 1337 * Average framesets per verb - 1.07 * Total nouns framed - 1,421 * Total noun framesets - 1,528 * Nouns with multiple framesets - 48 * Average framesets per nouns - 1.08 *Samples* Please view the following samples. * Noun Sample * Verb Sample * XML Sample *Updates* None at this time. | |
Extent: | Corpus size: 217088 KB | |
Identifier: | LDC2013T13 | |
https://catalog.ldc.upenn.edu/LDC2013T13 | ||
ISBN: 1-58563-648-7 | ||
ISLRN: 460-638-744-650-2 | ||
DOI: 10.35111/828e-w727 | ||
Language: | Mandarin Chinese | |
Language (ISO639): | cmn | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2013T13 | |
Rights Holder: | Portions © 2006 Agence France Presse, © 2006 Anhui TV, © 2005 Cable News Network, LP, LLLP, © 2000-2001 China Broadcasting System, © 2000-2001, 2005-2006 China Central TV, © 2000-2001 China National Radio, © 2006 Chinanews.com, © 2000-2001 China Television System, © 2006 Guangming Daily, © 2006 National Broadcasting Company, Inc., © 2006 New Tang Dynasty TV, © 2006 Peoples Daily Online, © 2005-2006 Phoenix TV, © 1999-2001 Sinorama Magazine, © 1996-1998, 2006 Xinhua News Agency, © 2001, 2004, 2005, 2007, 2008, 2009, 2010, 2013 Trustees of the University of Pennsylvania | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2013T13 | |
DateStamp: | 2020-11-30 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Xue, Nianwen; Bai, Xiaopeng; Lu, Jill; Zhang, Jennifer; Palmer, Martha; Chang, Meiyu; Zhong, Hua. 2013. Linguistic Data Consortium. | |
Terms: | area_Asia country_CN dcmi_Text iso639_cmn olac_primary_text |