University of Canterbury Home
    • Admin
    UC Research Repository
    UC Library
    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    1. UC Home
    2. Library
    3. UC Research Repository
    4. Faculty of Arts | Te Kaupeka Toi Tangata
    5. Arts: Conference Contributions
    6. View Item
    1. UC Home
    2.  > 
    3. Library
    4.  > 
    5. UC Research Repository
    6.  > 
    7. Faculty of Arts | Te Kaupeka Toi Tangata
    8.  > 
    9. Arts: Conference Contributions
    10.  > 
    11. View Item

    Learned Construction Grammars Converge Across Registers Given Increased Exposure (2021)

    Thumbnail
    View/Open
    Accepted version (520.3Kb)
    Type of Content
    Conference Contributions - Published
    UC Permalink
    https://hdl.handle.net/10092/102887
    
    Publisher
    Association for Computational Linguistics
    Collections
    • Arts: Conference Contributions [217]
    Authors
    Tayyar Madabushi H
    Dunn, Jonathan cc
    show all
    Abstract

    This paper measures the impact of increased exposure on whether learned construction grammars converge onto shared representations when trained on data from different registers. Register influences the frequency of constructions, with some structures common in formal but not informal usage. We expect that a grammar induction algorithm exposed to different registers will acquire different constructions. To what degree does increased exposure lead to the convergence of register-specific grammars? The experiments in this paper simulate language learning in 12 languages (half Germanic and half Romance) with corpora representing three registers (Twitter, Wikipedia, Web). These simulations are repeated with increasing amounts of exposure, from 100k to 2 million words, to measure the impact of exposure on the convergence of grammars. The results show that increased exposure does lead to converging grammars across all languages. In addition, a shared core of register-universal constructions remains constant across increasing amounts of exposure.

    Citation
    Dunn J, Tayyar Madabushi H (2021). Learned Construction Grammars Converge Across Registers Given Increased Exposure. Conference on Natural Language Learning (CoNLL). Proceedings of the Conference on Natural Language Learning.
    This citation is automatically generated and may be unreliable. Use as a guide only.
    ANZSRC Fields of Research
    47 - Language, communication and culture::4704 - Linguistics::470403 - Computational linguistics
    47 - Language, communication and culture::4704 - Linguistics::470409 - Linguistic structures (incl. phonology, morphology and syntax)
    47 - Language, communication and culture::4704 - Linguistics::470404 - Corpus linguistics
    47 - Language, communication and culture::4704 - Linguistics::470401 - Applied linguistics and educational linguistics
    47 - Language, communication and culture::4703 - Language studies::470304 - Comparative language studies
    Rights
    All rights reserved unless otherwise stated
    http://hdl.handle.net/10092/17651

    Related items

    Showing items related by title, author, creator and subject.

    • Production vs Perception: The Role of Individuality in Usage-Based Grammar Induction 

      Nini A; Dunn, Jonathan (Association for Computational Linguistics, 2021)
      This paper asks whether a distinction between production-based and perception-based grammar induction influences either (i) the growth curve of grammars and lexicons or (ii) the similarity between representations learned ...
    • How linguistic structure influences and helps to predict metaphoric meaning 

      Dunn J (Walter de Gruyter GmbH, 2013)
      This paper argues that two properties of the linguistic structure of an utterance influence and partially determine whether the utterance has a metaphoric meaning that results in a stable interpretation: (i) degree of ...
    • Multi-unit association measures: Moving beyond pairs of words 

      Dunn J (2018)
      This paper formulates and evaluates a series of multi-unit measures of directional association, building on the pairwise ΔP measure, that are able to quantify association in sequences of varying length and type ...
    Advanced Search

    Browse

    All of the RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThesis DisciplineThis CollectionBy Issue DateAuthorsTitlesSubjectsThesis Discipline

    Statistics

    View Usage Statistics
    • SUBMISSIONS
    • Research Outputs
    • UC Theses
    • CONTACTS
    • Send Feedback
    • +64 3 369 3853
    • ucresearchrepository@canterbury.ac.nz
    • ABOUT
    • UC Research Repository Guide
    • Copyright and Disclaimer
    • SUBMISSIONS
    • Research Outputs
    • UC Theses
    • CONTACTS
    • Send Feedback
    • +64 3 369 3853
    • ucresearchrepository@canterbury.ac.nz
    • ABOUT
    • UC Research Repository Guide
    • Copyright and Disclaimer