Autori: Varvara, Rossella, Salvadori, Justine, Huyghe, Richard
Titolo: Lexical ambiguity in contextualized word embeddings: A case study of nominalizations
Periodico: Lingue e linguaggio
Anno: 2024 - Volume: 45 - Fascicolo: 1 - Pagina iniziale: 141 - Pagina finale: 182

In this paper we investigate the extent to which contextualized word embeddings can encode lexical ambiguity. Specifically, we focus on nominalizations in French, which constitute an interesting case for the study of ambiguity because of their frequent polysemy and their relationship with polyfunctional morphological processes. Given a random sample of occurrences of 90 nouns, we compute for each word the pairwise cosine similarity (SelfSim) among their token embeddings extracted from the pre-trained model FlauBERT and we test it as a predictor of the degree of ambiguity of nominalizations. For the evaluation we make use of a manual annotation of lexical ambiguity, testing different annotation strategies: defining word senses with different semantic classifications and granularities; annotating lexemes in isolation or based on a sample of tokens. Our findings contribute to the understanding of (i) the lexical semantic component of contextual embeddings, enhancing their interpretability, (ii) aspects of lexical ambiguity related to derivational semantics and to the contextual variation of meaning.




SICI: 1720-9331(2024)45:1<141:LAICWE>2.0.ZU;2-X
Testo completo: https://www.rivisteweb.it/download/article/10.1418/112743
Testo completo alternativo: https://www.rivisteweb.it/doi/10.1418/112743

Esportazione dati in Refworks (solo per utenti abilitati)

Record salvabile in Zotero

Biblioteche ACNP che possiedono il periodico