View a PDF of the paper titled Investigating the Contextualised Word Embedding Dimensions Specified for Contextual and Temporal Semantic Changes, by Taichi Aida and 1 other authors
Abstract:The sense-aware contextualised word embeddings (SCWEs) encode semantic changes of words within the contextualised word embedding (CWE) spaces. Despite the superior performance of SCWEs in contextual/temporal semantic change detection (SCD) benchmarks, it remains unclear as to how the meaning changes are encoded in the embedding space. To study this, we compare pre-trained CWEs and their fine-tuned versions on contextual and temporal semantic change benchmarks under Principal Component Analysis (PCA) and Independent Component Analysis (ICA) transformations. Our experimental results reveal (a) although there exist a smaller number of axes that are specific to semantic changes of words in the pre-trained CWE space, this information gets distributed across all dimensions when fine-tuned, and (b) in contrast to prior work studying the geometry of CWEs, we find that PCA to better represent semantic changes than ICA within the top 10% of axes. These findings encourage the development of more efficient SCD methods with a small number of SCD-aware dimensions. Source code is available at this https URL .
Submission history
From: Taichi Aida [view email]
[v1]
Wed, 3 Jul 2024 05:42:20 UTC (1,242 KB)
[v2]
Tue, 3 Dec 2024 20:56:16 UTC (28,052 KB)
Source link
#Investigating #Contextualised #Word #Embedding #Dimensions #Contextual #Temporal #Semantic