![]() |
OLAC Record oai:catalogue.elra.info:ELRA-W0329 |
| Metadata | ||
| Title: | Bulgarian Event Corpus | |
| Access Rights: | Rights available for: attribution | |
| Date Available (W3CDTF): | 2022-10-03 | |
| Date Issued (W3CDTF): | 2022-10-03 | |
| Description: | The Bulgarian Event Corpus is composed 324,905 tokens appropriate for training Named Entity Recognition (NER), Named Entity Linking (NEL) and Event Recognition models for Bulgarian in a multidomain context within Humanities. The texts are domain related. They include documents from the area of Social Sciences and Humanities – scientific papers, archive documents, popular documents, and Wikipedia articles in the relevant areas. The annotation scheme reflects the rationale behind the CIDOC-CRM ontology since this ontology has been widely used in the areas of GLAM and Humanities. The annotation scheme envisages two main layers: the first one is the Named Entity (NE) layer - 16 types, and the second one is the event layer where each event is connected to its participants – 39 event labels. | |
| Identifier: | ELRA-W0329 | |
| ISLRN: 832-960-876-604-2 | ||
| Identifier (URI): | https://catalog.elra.info/en-us/repository/browse/ELRA-W0329/ | |
| Language: | Bulgarian | |
| Language (ISO639): | bul | |
| Medium: | Not specified | |
| Publisher: | ELRA (European Language Resources Association) | |
| Type (DCMI): | Text | |
| Type (OLAC): | primary_text | |
OLAC Info |
||
| Archive: | ELRA Catalogue of Language Resources | |
| Description: | http://www.language-archives.org/archive/catalogue.elra.info | |
| GetRecord: | OAI-PMH request for OLAC format | |
| GetRecord: | Pre-generated XML file | |
OAI Info |
||
| OaiIdentifier: | oai:catalogue.elra.info:ELRA-W0329 | |
| DateStamp: | 2022-10-03 | |
| GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
| Citation: | n.a. 2022. ELRA (European Language Resources Association). | |
| Terms: | area_Europe country_BG dcmi_Text iso639_bul olac_primary_text | |