Spoken America Danish Corpus (SpAmDa Corpus)

Karoline Kühl (Ophavsmand)

Department of Nordic Studies and Linguistics

Data set

Beskrivelse

SpAmDa Corpus has been developed within the research project 'Varieties of Danish outside Denmark' (affiliated to the LANCHART-Center and the Department of Nordic Research, University of Copenhagen). SpAmDa Corpus is a linguistic corpus that consists of the transcriptions of 64 Danish conversations/interviews conducted between 1966–1976 in the USA. The informants are 58 mostly elderly people of Danish ancestry (emigrants and their descendants), the majority born before 1900 and migrated into the US before 1925. The transcriptions have been provided by the University of Copenhagen.

SpAmDa Corpus in xml-based, is largely TEI-conformant and follows the principles defined in PaCMan 2.0 plus a corpus-specific tagset. As in in the WriFD Corpus described above, the annotation categories are operational, explicitely defined, largely theory-neutral and language specific.

Some of the features that have been annotated and analysed in SpAmDa are word order in declarative clauses, placement of central adverbial in subclauses, subject marker in interrogative subclauses and plural formation.

Dato for tilgængelighed	2015
Forlag	ingen
Geografisk dækning	USA

Spoken America Danish Corpus (SpAmDa Corpus)

Beskrivelse

Citationsformater