Unsupervised morphological segmentation in a language with reduplication

Type of content
Conference Contributions - Published
Todd S
Huang A
Needle J
King J
Hay, Jennifer

We present an extension of the Morfessor Baseline model of unsupervised morphological segmentation (Creutz and Lagus, 2007) that incorporates abstract templates for reduplication, a typologically common but computationally underaddressed process. Through a detailed investigation that applies the model to Māori, the Indigenous language of Aotearoa New Zealand, we show that incorporating templates improves Morfessor's ability to identify instances of reduplication, and does so most when there are multiple minimally-overlapping templates. We present an error analysis that reveals important factors to consider when applying the extended model and suggests useful future directions.

Todd S, Huang A, Needle J, Hay J, King J (2022). Unsupervised morphological segmentation in a language with reduplication. Seattle: Sigmorphon: Special Interest Group on Computational Morphology and Phonology. 14/07/2022-14/07/2022. To appear.
Ngā upoko tukutuku/Māori subject headings
Ngā Upoko Tukutuku / Māori Subject Headings::Reo Māori | Reo rangatira; Te reo Māori; Te reo rangatira; Māori language
ANZSRC fields of research
Fields of Research::47 - Language, communication and culture::4704 - Linguistics::470409 - Linguistic structures (incl. phonology, morphology and syntax)
Fields of Research::45 - Indigenous studies::4507 - Te ahurea, reo me te hītori o te Māori (Māori culture, language and history)
All rights reserved unless otherwise stated