Measuring Linguistic Diversity During COVID-19

dc.contributor.author: Dunn J
dc.contributor.author: Coupe T
dc.contributor.author: Adams B
dc.contributor.editor: Jurgens D
dc.contributor.editor: Volkova S
dc.contributor.editor: Bamman D
dc.contributor.editor: Hovy D
dc.contributor.editor: O'Connor B
dc.date.accessioned: 2020-11-05T22:09:47Z
dc.date.available: 2020-11-05T22:09:47Z
dc.date.issued: 2020 (en)
dc.date.updated: 2020-10-11T22:38:16Z
dc.description.abstract: Computational measures of linguistic diversity help us understand the linguistic landscape using digital language data. The contribution of this paper is to calibrate measures of linguistic diversity using restrictions on international travel resulting from the COVID-19 pandemic. Previous work has mapped the distribution of languages using geo-referenced social media and web data. The goal, however, has been to describe these corpora themselves rather than to make inferences about underlying populations. This paper shows that a difference-in-differences method based on the Herfindahl-Hirschman Index can identify the bias in digital corpora that is introduced by non-local populations. These methods tell us where significant changes have taken place and whether these changes lead to increased or decreased diversity. This is an important step in aligning digital corpora like social media with the real-world populations that have produced them. (en)
dc.identifier.citation: Dunn J, Coupe T, Adams B (2020). Measuring Linguistic Diversity During COVID-19. The Fourth Workshop on Natural Language Processing and Computational Social Science. 19/11/2020-20/11/2020. Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science. (en)
dc.identifier.uri: https://hdl.handle.net/10092/101220
dc.language.iso: en
dc.rights: All rights reserved unless otherwise stated (en)
dc.rights.uri: http://hdl.handle.net/10092/17651 (en)
dc.subject.anzsrc: Fields of Research::47 - Language, communication and culture::4704 - Linguistics::470404 - Corpus linguistics (en)
dc.subject.anzsrc: Fields of Research::47 - Language, communication and culture::4704 - Linguistics::470403 - Computational linguistics (en)
dc.subject.anzsrc: Fields of Research::47 - Language, communication and culture::4704 - Linguistics::470411 - Sociolinguistics (en)
dc.title: Measuring Linguistic Diversity During COVID-19 (en)
dc.type: Conference Contributions - Published (en)
uc.college: Faculty of Engineering
uc.department: Computer Science and Software Engineering
Files
Original bundle
Name: DunnCoupeAdams2020.pdf
Size: 980.98 KB
Format: Adobe Portable Document Format
Description: Accepted version