Unsupervised morphological segmentation in a language with reduplication

Type of content
Conference Contributions - Published
Publisher's DOI/URI
Thesis discipline
Degree name
Publisher
Journal Title
Journal ISSN
Volume Title
Language
Date
2022
Authors
Todd S
Huang A
Needle J
King J
Hay, Jennifer
Abstract

We present an extension of the Morfessor Base line model of unsupervised morphological seg mentation (Creutz and Lagus, 2007) that in corporates abstract templates for reduplication, a typologically common but computationally underaddressed process. Through a detailed in vestigation that applies the model to Maori, the ¯ Indigenous language of Aotearoa New Zealand, we show that incorporating templates improves Morfessor’s ability to identify instances of redu plication, and does so most when there are multiple minimally-overlapping templates. We present an error analysis that reveals important factors to consider when applying the extended model and suggests useful future directions.

Description
Citation
Todd S, Huang A, Needle J, Hay J, King J (2022). Unsupervised morphological segmentation in a language with reduplication. Seattle: Sigmorphon: Special Interest Group on Computational Morphology and Phonology. 14/07/2022-14/07/2022. To appear.
Keywords
Ngā upoko tukutuku/Māori subject headings
Nga Upoko Tukutuku / Maori Subject Headings::Reo Māori | Reo rangatira; Te reo Māori; Te reo rangatira; Māori language
ANZSRC fields of research
Fields of Research::47 - Language, communication and culture::4704 - Linguistics::470409 - Linguistic structures (incl. phonology, morphology and syntax)
Fields of Research::45 - Indigenous studies::4507 - Te ahurea, reo me te hītori o te Māori (Māori culture, language and history)
Rights
All rights reserved unless otherwise stated