University of Canterbury Home
    UC Research Repository
    UC Library
    View Item 
    1. UC Home
    2. Library
    3. UC Research Repository
    4. Faculty of Arts | Te Kaupeka Toi Tangata
    5. Arts: Conference Contributions
    6. View Item

    Stability of Syntactic Dialect Classification Over Space and Time (2022)

    Accepted version (551.8Kb)
    Type of Content
    Conference Contributions - Published
    UC Permalink
    https://hdl.handle.net/10092/104653
    
    Collections
    • Arts: Conference Contributions [216]
    Authors
    Wong S
    Dunn, Jonathan
    Abstract

    This paper analyses the degree to which dialect classifiers based on syntactic representations remain stable over space and time. Previous work has shown that combining grammar induction with geospatial text classification produces robust dialect models, but it remains unknown how changing grammars and changing populations influence those models. This paper constructs a test set for 12 dialects of English that spans three years at monthly intervals, with a fixed spatial distribution across 1,120 cities. Syntactic representations are formulated within the usage-based Construction Grammar (CxG) paradigm. The decay rate of classification performance for each dialect over time allows us to identify regions undergoing syntactic change, while the distribution of classification accuracy within dialect regions allows us to identify the degree to which the grammar of a dialect is internally heterogeneous. The main contribution of this paper is to show that a rigorous evaluation of dialect classification models can be used to find both variation over space and change over time.
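
    As a rough illustration of the two measurements named in the abstract, the sketch below computes a decay rate and a within-region spread from classification scores. It is not the authors' method: the function names `decay_rate` and `internal_spread`, the use of a least-squares slope as the decay rate, and the use of a population standard deviation as the heterogeneity measure are all assumptions made for this example, and the input scores are hypothetical.

```python
from statistics import mean, pstdev


def decay_rate(monthly_scores):
    """Least-squares slope of score against month index.

    Assumption: 'decay rate' is approximated here by a linear trend;
    a negative slope means performance falls over time.
    """
    n = len(monthly_scores)
    if n < 2:
        raise ValueError("need at least two monthly scores")
    xs = range(n)
    x_bar = mean(xs)
    y_bar = mean(monthly_scores)
    num = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, monthly_scores))
    den = sum((x - x_bar) ** 2 for x in xs)
    return num / den


def internal_spread(city_scores):
    """Population standard deviation of per-city accuracy within one dialect.

    Assumption: a larger spread is read as a more internally
    heterogeneous dialect region.
    """
    return pstdev(city_scores.values())


if __name__ == "__main__":
    # Hypothetical scores for illustration only; not taken from the paper.
    monthly = {
        "dialect_a": [0.95, 0.94, 0.93, 0.92],
        "dialect_b": [0.90, 0.90, 0.90, 0.90],
    }
    for name, scores in monthly.items():
        print(name, "decay rate per month:", decay_rate(scores))

    cities = {"city_1": 0.91, "city_2": 0.84, "city_3": 0.88}
    print("within-region spread:", internal_spread(cities))
```

    Under these assumptions, a dialect whose slope is clearly negative would be a candidate for ongoing syntactic change, and a dialect with a large per-city spread would be a candidate for internal heterogeneity.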

    Citation
    Dunn J, Wong S (2022). Stability of Syntactic Dialect Classification Over Space and Time. International Conference on Computational Linguistics. Proceedings of the International Conference on Computational Linguistics (COLING).
    This citation is automatically generated and may be unreliable. Use as a guide only.
    ANZSRC Fields of Research
    47 - Language, communication and culture::4704 - Linguistics::470404 - Corpus linguistics
    47 - Language, communication and culture::4704 - Linguistics::470407 - Language documentation and description
    47 - Language, communication and culture::4704 - Linguistics::470406 - Historical, comparative and typological linguistics
    Rights
    All rights reserved unless otherwise stated
    http://hdl.handle.net/10092/17651

    Related items

    Showing items related by title, author, creator and subject.

    • Register variation remains stable across 60 languages 

      Li H; Nini A; Dunn, Jonathan (Walter de Gruyter GmbH, 2022)
      This paper measures the stability of cross-linguistic register variation. A register is a variety of a language that is associated with extra-linguistic context. The relationship between a register and its context is ...
    • Learned Construction Grammars Converge Across Registers Given Increased Exposure 

      Tayyar Madabushi H; Dunn, Jonathan (Association for Computational Linguistics, 2021)
      This paper measures the impact of increased exposure on whether learned construction grammars converge onto shared representations when trained on data from different registers. Register influences the frequency of ...
    • Mapping Languages and Demographics with Georeferenced Corpora 

      Dunn, Jonathan; Adams, Ben (2019)
      This paper evaluates large georeferenced corpora, taken from both web-crawled and social media sources, against ground-truth population and language-census datasets. The goal is to determine (i) which dataset best ...
    • SUBMISSIONS
    • Research Outputs
    • UC Theses
    • CONTACTS
    • Send Feedback
    • +64 3 369 3853
    • ucresearchrepository@canterbury.ac.nz
    • ABOUT
    • UC Research Repository Guide
    • Copyright and Disclaimer