Multi-unit association measures: Moving beyond pairs of words (2018)
This paper formulates and evaluates a series of multi-unit measures of directional association, building on the pairwise ΔP measure, that are able to quantify association in sequences of varying length and type of representation. Multi-unit measures face an additional segmentation problem: once the implicit length constraint of pairwise measures is abandoned, association measures must also identify the borders of meaningful sequences. This paper takes a vector-based approach to the segmentation problem by using 18 unique measures to describe different aspects of multi-unit association. An examination of these measures across eight languages shows that they are stable across languages and that each provides a unique rank of associated sequences. Taken together, these measures expand corpus-based approaches to association by generalizing across varying lengths and types of representation.
CitationDunn J (2018). Multi-unit association measures: Moving beyond pairs of words. International Journal of Corpus Linguistics. 23(2). 183-215.
This citation is automatically generated and may be unreliable. Use as a guide only.
Keywordsassociation strength; multi-unit association; sequences; ΔP; collocations
ANZSRC Fields of Research47 - Language, communication and culture::4704 - Linguistics::470409 - Linguistic structures (incl. phonology, morphology and syntax)
20 - Language, Communication and Culture::2004 - Linguistics::200402 - Computational Linguistics
47 - Language, communication and culture::4703 - Language studies::470304 - Comparative language studies
Showing items related by title, author, creator and subject.
Dunn J (Walter de Gruyter GmbH, 2013)This paper argues that two properties of the linguistic structure of an utterance influence and partially determine whether the utterance has a metaphoric meaning that results in a stable interpretation: (i) degree of ...
Tayyar Madabushi H; Dunn, Jonathan (Association for Computational Linguistics, 2021)This paper measures the impact of increased exposure on whether learned construction grammars converge onto shared representations when trained on data from different registers. Register influences the frequency of ...
Dunn J (2014)This paper presents the first computationally-derived scalar measurement of metaphoricity. Each input sentence is given a value between 0 and 1 which represents how metaphoric that sentence is. This measure achieves a ...