Stability of Syntactic Dialect Classification Over Space and Time

Type of content
Conference Contributions - Published
Publisher's DOI/URI
Thesis discipline
Degree name
Publisher
Journal Title
Journal ISSN
Volume Title
Language
Date
2022
Authors
Wong S
Dunn, Jonathan
Abstract

This paper analyses the degree to which dialect classifiers based on syntactic representations remain stable over space and time. While previous work has shown that the combination of grammar induction and geospatial text classification produces robust dialect models, we do not know what influence both changing grammars and changing populations have on dialect models. This paper constructs a test set for 12 dialects of English that spans three years at monthly intervals with a fixed spatial distribution across 1,120 cities. Syntactic representations are formulated within the usage-based Construction Grammar paradigm (CxG). The decay rate of classification performance for each dialect over time allows us to identify regions undergoing syntactic change. And the distribution of classification accuracy within dialect regions allows us to identify the degree to which the grammar of a dialect is internally heterogeneous. The main contribution of this paper is to show that a rigorous evaluation of dialect classification models can be used to find both variation over space and change over time.

Description
Citation
Dunn J, Wong S (2022). Stability of Syntactic Dialect Classification Over Space and Time. International Conference on Computational Linguistics. Proceedings of the International Conference on Computational Linguistics (COLING).
Keywords
Ngā upoko tukutuku/Māori subject headings
ANZSRC fields of research
Fields of Research::47 - Language, communication and culture::4704 - Linguistics::470404 - Corpus linguistics
Fields of Research::47 - Language, communication and culture::4704 - Linguistics::470407 - Language documentation and description
Fields of Research::47 - Language, communication and culture::4704 - Linguistics::470406 - Historical, comparative and typological linguistics
Rights
All rights reserved unless otherwise stated