Learned Construction Grammars Converge Across Registers Given Increased Exposure

dc.contributor.authorTayyar Madabushi H
dc.contributor.authorDunn, Jonathan
dc.date.accessioned2021-11-07T23:42:03Z
dc.date.available2021-11-07T23:42:03Z
dc.date.issued2021en
dc.date.updated2021-10-13T07:14:27Z
dc.description.abstractThis paper measures the impact of increased exposure on whether learned construction grammars converge onto shared representations when trained on data from different registers. Register influences the frequency of constructions, with some structures common in formal but not informal usage. We expect that a grammar induction algorithm exposed to different registers will acquire different constructions. To what degree does increased exposure lead to the convergence of register-specific grammars? The experiments in this paper simulate language learning in 12 languages (half Germanic and half Romance) with corpora representing three registers (Twitter, Wikipedia, Web). These simulations are repeated with increasing amounts of exposure, from 100k to 2 million words, to measure the impact of exposure on the convergence of grammars. The results show that increased exposure does lead to converging grammars across all languages. In addition, a shared core of register-universal constructions remains constant across increasing amounts of exposure.en
dc.identifier.citationDunn J, Tayyar Madabushi H (2021). Learned Construction Grammars Converge Across Registers Given Increased Exposure. Conference on Natural Language Learning (CoNLL). Proceedings of the Conference on Natural Language Learning.en
dc.identifier.urihttps://hdl.handle.net/10092/102887
dc.language.isoen
dc.publisherAssociation for Computational Linguisticsen
dc.rightsAll rights reserved unless otherwise stateden
dc.rights.urihttp://hdl.handle.net/10092/17651en
dc.subject.anzsrcFields of Research::47 - Language, communication and culture::4704 - Linguistics::470403 - Computational linguisticsen
dc.subject.anzsrcFields of Research::47 - Language, communication and culture::4704 - Linguistics::470409 - Linguistic structures (incl. phonology, morphology and syntax)en
dc.subject.anzsrcFields of Research::47 - Language, communication and culture::4704 - Linguistics::470404 - Corpus linguisticsen
dc.subject.anzsrcFields of Research::47 - Language, communication and culture::4704 - Linguistics::470401 - Applied linguistics and educational linguisticsen
dc.subject.anzsrcFields of Research::47 - Language, communication and culture::4703 - Language studies::470304 - Comparative language studiesen
dc.titleLearned Construction Grammars Converge Across Registers Given Increased Exposureen
dc.typeConference Contributions - Publisheden
uc.collegeFaculty of Arts
uc.departmentLanguage, Social and Political Sciences
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
CoNLL_21.pdf
Size:
520.33 KB
Format:
Adobe Portable Document Format
Description:
Accepted version