Finding variants for construction-based dialectometry: A corpus-based approach to regional CxGs

Type of content
Journal Article
Publisher
Walter de Gruyter GmbH
Journal Title
Cognitive Linguistics
Date
2018
Authors
Dunn J
Abstract

This paper develops a construction-based dialectometry capable of identifying previously unknown constructions and measuring the degree to which a given construction is subject to regional variation. The central idea is to learn a grammar of constructions (a CxG) using construction grammar induction and then to use these constructions as features for dialectometry. This offers a method for measuring the aggregate similarity between regional CxGs without limiting in advance the set of constructions subject to variation. The learned CxG is evaluated on how well it describes held-out test corpora, while dialectometry is evaluated on how well it can model regional varieties of English. The method is tested using two distinct datasets: first, the International Corpus of English, representing eight outer-circle varieties; second, a web-crawled corpus representing five inner-circle varieties. Results show that the method (1) produces a grammar with stable quality across subsets of a single corpus that is (2) capable of distinguishing between regional varieties of English with a high degree of accuracy, thus (3) supporting dialectometric methods for measuring the similarity between varieties of English and (4) measuring the degree to which each construction is subject to regional variation. This is important for cognitive sociolinguistics because it operationalizes the idea that competition between constructions is organized at the functional level, so that dialectometry needs to represent as much of the available functional space as possible.
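
As a rough illustration of what it means to use constructions as features and to measure aggregate similarity between regional CxGs, the sketch below treats each variety as a vector of construction frequencies. Everything in it is an assumption made for illustration: the variety names, the construction labels, and the frequencies are invented, and cosine similarity and the coefficient of variation are simple stand-ins rather than the measures used in the paper.

```python
import math

# Hypothetical relative frequencies (per 1,000 words) of three made-up
# constructions in three made-up varieties. Not data from the paper.
FREQS = {
    "variety_a": {"cx_1": 4.0, "cx_2": 1.0, "cx_3": 2.0},
    "variety_b": {"cx_1": 4.0, "cx_2": 1.0, "cx_3": 2.0},
    "variety_c": {"cx_1": 1.0, "cx_2": 4.0, "cx_3": 2.0},
}


def cosine(u, v):
    """Aggregate similarity between two construction-frequency vectors."""
    keys = set(u) | set(v)
    dot = sum(u.get(k, 0.0) * v.get(k, 0.0) for k in keys)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v)


def variation(freqs, construction):
    """Coefficient of variation of one construction across all varieties.

    Zero means the construction is used at the same rate everywhere;
    larger values mean more regional variation.
    """
    values = [f.get(construction, 0.0) for f in freqs.values()]
    mean = sum(values) / len(values)
    if mean == 0:
        return 0.0
    variance = sum((x - mean) ** 2 for x in values) / len(values)
    return math.sqrt(variance) / mean


if __name__ == "__main__":
    print(cosine(FREQS["variety_a"], FREQS["variety_b"]))
    print(cosine(FREQS["variety_a"], FREQS["variety_c"]))
    for cx in ("cx_1", "cx_2", "cx_3"):
        print(cx, variation(FREQS, cx))
```

In this toy data, variety_a and variety_b have identical profiles while variety_c differs, and cx_3 shows no regional variation because every variety uses it at the same rate.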

Citation
Dunn J (2018). Finding variants for construction-based dialectometry: A corpus-based approach to regional CxGs. Cognitive Linguistics. 29(2). 275-311.
Keywords
construction grammar, CxG, dialectometry, dialectology, spatial variation
ANZSRC fields of research
Field of Research::20 - Language, Communication and Culture::2004 - Linguistics