Modeling Global Syntactic Variation in English Using Dialect Classification

Type of content
Conference Contributions - Published
Publisher's DOI/URI
Thesis discipline
Degree name
Publisher
Association for Computational Linguistics
Journal Title
Journal ISSN
Volume Title
Language
Date
2019
Authors
Dunn J
Abstract

This paper evaluates global-scale dialect identiļ¬cation for 14 national varieties of English as a means for studying syntactic variation. The paper makes three main contributions: (i) introducing data-driven language mapping as a method for selecting the inventory of national varieties to include in the task; (ii) producing a large and dynamic set of syntactic features using grammar induction rather than focusing on a few hand-selected features such as function words; and( iii) comparing models across both web corpora and social media corpora in order to measure the robustness of syntactic variation across registers.

Description
Citation
Dunn J (2019). Modeling Global Syntactic Variation in English Using Dialect Classification. North American Chapter of the Association for Computational Linguistics: Sixth Workshop on NLP for Similar Languages, Varieties and Dialects. Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects.
Keywords
Ngā upoko tukutuku/Māori subject headings
ANZSRC fields of research
Field of Research::20 - Language, Communication and Culture::2004 - Linguistics
Rights