Semantic change is attested commonly in the historical development of lexicons
across the world’s languages. Extensive research has sought to characterize
regularity in semantic change, but existing studies have typically relied on manual
approaches or the analysis of a restricted set of languages. We present a
large-scale computational analysis to explore regular patterns in word meaning
change shared across many languages. We focus on two levels of analysis: (1)
regularity in directionality, ...
Semantic change is attested commonly in the historical development of lexicons
across the world’s languages. Extensive research has sought to characterize
regularity in semantic change, but existing studies have typically relied on manual
approaches or the analysis of a restricted set of languages. We present a
large-scale computational analysis to explore regular patterns in word meaning
change shared across many languages. We focus on two levels of analysis: (1)
regularity in directionality, which we explore by inferring the historical direction of
semantic change between a sourcemeaning and a targetmeaning; (2) regularity in
source-target mapping, which we explore by inferring the target meaning given a
sourcemeaning.We work with DatSemShift, the world’s largest public database of
semantic change that records thousands ofmeaning changes fromover hundreds
of languages. For directionality inference, we find that concreteness explains
directionality in more than 70% of the attested cases of semantic change and is
the strongest predictor among the alternatives including frequency and valence.
For target inference, we find that a parallelogram-style analogy model based on
contextual embeddings predicts the attested source-targetmappings substantially
better than chance and similarity-based models. Clustering the meaning pairs of
semantic change reveals regular meaning shiftings between domains, such as
body parts to geological formations. Our study provides an automated approach
and large-scale evidence for multifaceted regularity in semantic change across
languages.
+