OLAC Record
oai:www.ldc.upenn.edu:LDC99T34

Metadata
Title:Japanese Business News Text Supplement
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Kobayashi, Masato, and Kevin Walker. Japanese Business News Text Supplement LDC99T34. Web Download. Philadelphia: Linguistic Data Consortium, 1999
Contributor:Kobayashi, Masato
Walker, Kevin
Date (W3CDTF):1999
Description:This corpus consists of newswire text from Nihon Keizai Shimbun, Inc. (NIKKEI), the largest Japanese daily financial newspaper, and Telerate, Inc. (formerly known as Dow Jones/Kyodo News Service), published primarily for managers of Japanese-owned corporations or Japanese employees working in North American financial institutions. The Telerate portion constitutes all newswire text collected by the LDC between December 1994 and September 1998. The Telerate data collected from June 1995 to September 1998 serves as a supplement to the original publication. All NIKKEI data was collected from December 1993 to November 1994 and is also available on the 1995 release of the Japanese Business News Text. The data, including SGML tags, breaks down as follows. # of Files Daily Average Size Total Size -------------------------------------------------- NIKKEI 364 514K 188MB Telerate 1060 336K 357MB The NIKKEI text was received on nine-track magnetic tape. The original character encoding was EBCDIC, but was converted to EUC encoding, which the LDC uses for its Japanese publications. The Telerate text was received via a digital transmission service installed at the LDC by Telerate. Custom software was written by the LDC to poll a central database and download articles individually. The character encoding is EUC. LDC added SGML tags automatically in order to identify individual stories within the daily collections. *Additional Licensing Instructions* This 'members-only' corpora is available to current members who can request the data at the listed reduced-license fee. Contact ldc@ldc.upenn.edu for information about becoming a member.
Identifier:LDC99T34
https://catalog.ldc.upenn.edu/LDC99T34
ISBN: 1-58563-143-4
ISLRN: 768-601-383-003-7
DOI: 10.35111/me5s-en17
Language:Japanese
Language (ISO639):jpn
License:Japanese Business News Text Supplement Individual: https://catalog.ldc.upenn.edu/license/japanese-business-news-text-supplement-individual.pdf
Japanese Business News Text Supplement Organization: https://catalog.ldc.upenn.edu/license/japanese-business-news-text-supplement.pdf
Nihon Keizai Shimbun Agreement: https://catalog.ldc.upenn.edu/license/nihon-keizai-shimbun-license.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC99T34
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC99T34
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Kobayashi, Masato; Walker, Kevin. 1999. Linguistic Data Consortium.
Terms: area_Asia country_JP dcmi_Text iso639_jpn olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC99T34
Up-to-date as of: Sun Jun 16 7:33:54 EDT 2024