OLAC Record
oai:scholarspace.manoa.hawaii.edu:10125/74810

Metadata
Title:Lang*Reg corpus: Documenting intraspeaker variation across languages and registers
Bibliographic Citation:2025-03; Article; Kaipuleohone University of Hawai'i Digital Language Archive;https://hdl.handle.net/10125/74810.
Date (W3CDTF):2025-03
Description:We present a new corpus design for multi-lingual corpora that involve intra-speaker variation in different situational-functional contexts, including primarily spoken but also the written mode, with the aim towards enhancing language documentation efforts and resources. We illustrate how this comparative design and the resulting cross-culturally applicable data collection procedure has been successfully realized in order to build the Lang*Reg corpus (Adli et. al. 2024), which currently includes five languages from three different language families: German, Persian, Southern Kurdish, Yucatec Maya and Javanese. For each of these languages, the same native speakers were asked to produce language in two types of activities that naturally occur in all the respective cultural contexts: telling a story to a friend, and talking freely with various interlocutors (friend, stranger, taxi driver, university professor). Moreover, our design included the storytelling in two modes, which allows for the comparison between spoken and written modes of the same language user. We show how Lang*Reg provides a versatile resource for many purposes – in particular research into register due to the variety of situational contexts involved, we show how German and Persian exploit the right periphery for different register distinctions, and we invite others to use this resource. At the same time, we show how the methodology developed can be used as a template to complement language resources by creating comparable intra-individual, multi-purpose data sets.
National Foreign Language Resource Center
Format:Article
27
Identifier:Lehmann, Nico, Vahid Mortezapour, Jozina Vander Klok, Zahra Farokhnejad, David Müller, Elisabeth Verhoeven, Aria Adli. 2025. Lang*Reg corpus: Documenting intra-speaker variation across languages and registers. Language Documentation & Conservation 19: 40-66.
1934-5275
Identifier (URI):https://hdl.handle.net/10125/74810
Language:English
Language (ISO639):eng
Publisher:University of Hawaii Press
Table Of Contents:Lehmann_etal_2025.pdf

OLAC Info

Archive:  Language Documentation and Conservation
Description:  http://www.language-archives.org/archive/ldc.scholarspace.manoa.hawaii.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:scholarspace.manoa.hawaii.edu:10125/74810
DateStamp:  2025-03-04
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2025. University of Hawaii Press.
Terms: area_Europe country_GB iso639_eng


http://www.language-archives.org/item.php/oai:scholarspace.manoa.hawaii.edu:10125/74810
Up-to-date as of: Thu Mar 27 1:10:09 EDT 2025