University of Canterbury Home
    • Admin
    UC Research Repository
    UC Library
    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    1. UC Home
    2. Library
    3. UC Research Repository
    4. Faculty of Arts | Te Kaupeka Toi Tangata
    5. Arts: Conference Contributions
    6. View Item
    1. UC Home
    2.  > 
    3. Library
    4.  > 
    5. UC Research Repository
    6.  > 
    7. Faculty of Arts | Te Kaupeka Toi Tangata
    8.  > 
    9. Arts: Conference Contributions
    10.  > 
    11. View Item

    Modeling Global Syntactic Variation in English Using Dialect Classification (2019)

    Thumbnail
    View/Open
    VarDial_19.pdf (198.2Kb)
    Type of Content
    Conference Contributions - Published
    UC Permalink
    http://hdl.handle.net/10092/17631
    
    Publisher
    Association for Computational Linguistics
    Collections
    • Arts: Conference Contributions [217]
    Authors
    Dunn J
    show all
    Abstract

    This paper evaluates global-scale dialect identification for 14 national varieties of English as a means for studying syntactic variation. The paper makes three main contributions: (i) introducing data-driven language mapping as a method for selecting the inventory of national varieties to include in the task; (ii) producing a large and dynamic set of syntactic features using grammar induction rather than focusing on a few hand-selected features such as function words; and( iii) comparing models across both web corpora and social media corpora in order to measure the robustness of syntactic variation across registers.

    Citation
    Dunn J (2019). Modeling Global Syntactic Variation in English Using Dialect Classification. North American Chapter of the Association for Computational Linguistics: Sixth Workshop on NLP for Similar Languages, Varieties and Dialects. Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects.
    This citation is automatically generated and may be unreliable. Use as a guide only.
    ANZSRC Fields of Research
    20 - Language, Communication and Culture::2004 - Linguistics

    Related items

    Showing items related by title, author, creator and subject.

    • Stability of Syntactic Dialect Classification Over Space and Time 

      Wong S; Dunn, Jonathan (2022)
      This paper analyses the degree to which dialect classifiers based on syntactic representations remain stable over space and time. While previous work has shown that the combination of grammar induction and geospatial ...
    • Global Syntactic Variation in Seven Languages: Towards a Computational Dialectology 

      Dunn J (2019)
      The goal of this paper is to provide a complete representation of regional linguistic variation on a global scale. To this end, the paper focuses on removing three constraints that have previously limited work within ...
    • Register variation remains stable across 60 languages 

      Li H; Nini A; Dunn, Jonathan (Walter de Gruyter GmbH, 2022)
      This paper measures the stability of cross-linguistic register variation. A register is a variety of a language that is associated with extra-linguistic context. The relationship between a register and its context is ...
    Advanced Search

    Browse

    All of the RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThesis DisciplineThis CollectionBy Issue DateAuthorsTitlesSubjectsThesis Discipline

    Statistics

    View Usage Statistics
    • SUBMISSIONS
    • Research Outputs
    • UC Theses
    • CONTACTS
    • Send Feedback
    • +64 3 369 3853
    • ucresearchrepository@canterbury.ac.nz
    • ABOUT
    • UC Research Repository Guide
    • Copyright and Disclaimer
    • SUBMISSIONS
    • Research Outputs
    • UC Theses
    • CONTACTS
    • Send Feedback
    • +64 3 369 3853
    • ucresearchrepository@canterbury.ac.nz
    • ABOUT
    • UC Research Repository Guide
    • Copyright and Disclaimer