LX Semantic Similarity

Developed at the University of Lisbon, Department of Informatics, by the NLX-Natural Language and Speech Group.


Features


Functions

LX Semantic Similarity is an online service for measuring the semantic similarity between words in Portuguese. This service uses the LX-DSemVectors, a distributional semantics model (a.k.a. word embeddings) of the Portuguese language.

The model represents each word in its vocabulary as a vector of real numbers. This vector representation lets the user measure how similar two or more words are: the similarity score is the cosine of the angle between the vectors of those words, so words whose vectors point in nearly the same direction score close to 1.
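As an illustration of the cosine measure mentioned above, here is a minimal sketch in Python. It is not the service's own code: the function name `cosine_similarity` and the three-dimensional toy vectors are assumptions made up for this example, and real LX-DSemVectors entries have many more dimensions.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimension")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Hypothetical toy vectors, invented for illustration only;
# they are NOT taken from the LX-DSemVectors model.
toy_vectors = {
    "gato": [0.9, 0.8, 0.1],
    "cão": [0.8, 0.9, 0.2],
    "mesa": [0.1, 0.2, 0.9],
}

sim_gato_cao = cosine_similarity(toy_vectors["gato"], toy_vectors["cão"])
sim_gato_mesa = cosine_similarity(toy_vectors["gato"], toy_vectors["mesa"])
```

With these toy values, `sim_gato_cao` comes out higher than `sim_gato_mesa`, which is the kind of ranking a similarity search relies on. If a distance is needed instead of a similarity, a common convention is to take 1 minus the cosine similarity.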

The online service provides two types of search:

This service was developed at the University of Lisbon, Department of Informatics, by the NLX-Natural Language and Speech Group.

Authorship

LX Semantic Similarity was developed by Marcos Garcia, João Silva and João Rodrigues, under the coordination of António Branco, at the NLX-Natural Language and Speech Group.

Reference

The LX-DSemVectors are described in the following publication:

and are distributed via GitHub.

Acknowledgments

This web demo makes use of wordcloud and tSNEJS.