Analyzing how context size and symmetry influence word embedding information

Publication date

2022-09-14

Abstract

Master's thesis in Theoretical and Applied Linguistics. Supervisor: Dr. Thomas Brochhagen


Word embeddings represent word meaning in the form of a vector; however, the encoded information varies depending on the parameters the vector has been trained with. This paper analyzes how two parameters, context size and symmetry, influence word embedding information, and aims to determine whether a single distributional parametrization can capture both semantic similarity and relatedness. Models were trained with GloVe under different parametrizations and then quantitatively evaluated on a similarity task, using WordSim-353 (for relatedness) and SimLex-999 (for semantic similarity) as benchmarks. The results show minimal variation when manipulating some of the analyzed parameters, in particular between symmetric and asymmetric contexts, which leads us to conclude that training models with large contexts is not necessary to achieve good performance.
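
The evaluation described in the abstract follows the standard word-similarity protocol: the cosine similarity between the trained vectors of each benchmark word pair is correlated (Spearman's rho) with the corresponding human ratings. The following is a minimal sketch of that protocol in Python, not the thesis's actual code; the plain-text GloVe vector format and a tab-separated benchmark layout (word1, word2, rating) are assumptions.

import numpy as np
from scipy.stats import spearmanr

def load_glove(path):
    # Assumes the plain-text GloVe format: "word dim1 dim2 ..." per line.
    vectors = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip().split(" ")
            vectors[parts[0]] = np.array(parts[1:], dtype=np.float32)
    return vectors

def cosine(u, v):
    # Cosine similarity between two embedding vectors.
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def evaluate(vectors, benchmark_path):
    # Assumes a tab-separated benchmark file: word1, word2, human rating.
    # Returns Spearman's rho between model and human similarity scores.
    model_scores, human_scores = [], []
    with open(benchmark_path, encoding="utf-8") as f:
        for line in f:
            w1, w2, rating = line.rstrip().split("\t")[:3]
            if w1 in vectors and w2 in vectors:  # skip out-of-vocabulary pairs
                model_scores.append(cosine(vectors[w1], vectors[w2]))
                human_scores.append(float(rating))
    rho, _pvalue = spearmanr(model_scores, human_scores)
    return rho

Out-of-vocabulary pairs are skipped rather than assigned a default score, which is the common practice in word-similarity evaluations; the distributed SimLex-999 and WordSim-353 files also carry header rows and extra columns that would need to be handled when adapting this sketch.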

Document Type

Master's final project

Language

English

Rights

CC Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND 4.0)

https://creativecommons.org/licenses/by-nc-nd/4.0/
