OLAC Record oai:clarin.eurac.edu:20.500.12124/60 |
Metadata | ||
Title: | MT@BZ translation corpus v1.0 | |
Bibliographic Citation: | http://hdl.handle.net/20.500.12124/60 | |
Creator: | De Camillis, Flavia | |
Chiocchetti, Elena | ||
Stemle, Egon W. | ||
Date (W3CDTF): | 2023-06-18T18:33:02Z | |
Date Available: | 2023-06-18T18:33:02Z | |
Description: | The MT@BZ is a translation corpus that consists of 52 decrees published by the Autonomous Province of Bolzano (South Tyrol) aligned with their machine translated versions. More precisely, it consists of 26 decrees in German and the same 26 in Italian in their official versions, respectively machine translated by the project team into Italian and into German. 10 of them are COVID-19 related decress, while 16 are miscellaneous. Overall, they consist of around 130,000 words. Their machine translation was carried out with a customized version of ModernMT. Later, the corpus was uploaded first into the annotation platform Webanno, then transferred to Inception. Four annotators annotated the translation errors made by the machine according to an ad hoc error taxonomy for quality assessment. Finally, the annotations were curated to create a gold standard corpus. | |
Identifier (URI): | http://hdl.handle.net/20.500.12124/60 | |
Language: | Italian | |
German | ||
Language (ISO639): | ita | |
deu | ||
Publisher: | Institute for Applied Linguistics, Eurac Research | |
Rights: | Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) | |
https://creativecommons.org/licenses/by-nc/4.0/ | ||
Subject: | machine translation | |
annotation | ||
translation errors | ||
accuracy | ||
fluency | ||
Italian | ||
German | ||
South Tyrolean German | ||
legal language | ||
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | Eurac Research CLARIN Centre | |
Description: | http://www.language-archives.org/archive/clarin.eurac.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:clarin.eurac.edu:20.500.12124/60 | |
DateStamp: | 2023-06-18 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | De Camillis, Flavia; Chiocchetti, Elena; Stemle, Egon W. 2023. Institute for Applied Linguistics, Eurac Research. | |
Terms: | area_Europe country_DE country_IT dcmi_Text iso639_deu iso639_ita olac_primary_text |