The Seemingly (Un)systematic Linking Element in Danish

Sidsel Boldsen; Manex Aguirrezabal Zabaleta

The Seemingly (Un)systematic Linking Element in Danish

Sidsel Boldsen, Manex Aguirrezabal Zabaleta

Department of Nordic Studies and Linguistics

Abstract

The use of a linking element between compound members is a common phenomenon in Germanic languages. Still, the exact use and conditioning of such elements is a disputed topic in linguistics. In this paper we address the issue of predicting the use of linking elements in Danish. Following previous research that shows how the choice of linking element might be conditioned by phonology, we frame the problem as a language modeling task: Considering the linking elements -s/-∅ the problem becomes predicting what is most probable to encounter next, a syllable boundary or the joining element, s. We show that training a language model on this task reaches an accuracy of 94 %, and in the case of an unsupervised model, the accuracy reaches 80 %.

Original language	English
Title of host publication	The Seemingly (Un)systematic Linking Element in Danish
Place of Publication	The 22nd Nordic Conference on Computational Linguistics (NoDaLiDa’19)
Publication date	Oct 2019
Publication status	Published - Oct 2019

Cite this

@inproceedings{078b6a18f1de4903b63dc95f60f0c80e,

title = "The Seemingly (Un)systematic Linking Element in Danish",

abstract = "The use of a linking element between compound members is a common phenomenon in Germanic languages. Still, the exact use and conditioning of such elements is a disputed topic in linguistics. In this paper we address the issue of predicting the use of linking elements in Danish. Following previous research that shows how the choice of linking element might be conditioned by phonology, we frame the problem as a language modeling task: Considering the linking elements -s/-∅ the problem becomes predicting what is most probable to encounter next, a syllable boundary or the joining element, s. We show that training a language model on this task reaches an accuracy of 94 %, and in the case of an unsupervised model, the accuracy reaches 80 %.",

author = "Sidsel Boldsen and {Aguirrezabal Zabaleta}, Manex",

year = "2019",

month = oct,

language = "English",

booktitle = "The Seemingly (Un)systematic Linking Element in Danish",

}

TY - GEN

T1 - The Seemingly (Un)systematic Linking Element in Danish

AU - Boldsen, Sidsel

AU - Aguirrezabal Zabaleta, Manex

PY - 2019/10

Y1 - 2019/10

N2 - The use of a linking element between compound members is a common phenomenon in Germanic languages. Still, the exact use and conditioning of such elements is a disputed topic in linguistics. In this paper we address the issue of predicting the use of linking elements in Danish. Following previous research that shows how the choice of linking element might be conditioned by phonology, we frame the problem as a language modeling task: Considering the linking elements -s/-∅ the problem becomes predicting what is most probable to encounter next, a syllable boundary or the joining element, s. We show that training a language model on this task reaches an accuracy of 94 %, and in the case of an unsupervised model, the accuracy reaches 80 %.

AB - The use of a linking element between compound members is a common phenomenon in Germanic languages. Still, the exact use and conditioning of such elements is a disputed topic in linguistics. In this paper we address the issue of predicting the use of linking elements in Danish. Following previous research that shows how the choice of linking element might be conditioned by phonology, we frame the problem as a language modeling task: Considering the linking elements -s/-∅ the problem becomes predicting what is most probable to encounter next, a syllable boundary or the joining element, s. We show that training a language model on this task reaches an accuracy of 94 %, and in the case of an unsupervised model, the accuracy reaches 80 %.

M3 - Article in proceedings

BT - The Seemingly (Un)systematic Linking Element in Danish

CY - The 22nd Nordic Conference on Computational Linguistics (NoDaLiDa’19)

ER -

The Seemingly (Un)systematic Linking Element in Danish

Abstract

Fingerprint

Cite this