MULTILEXICON - Multilingual Lexicon: English, French, Portuguese

MULTILEXICON

The MULTILEXICON is a multilingual word-based lexicon from English, French, and Brazilian Portuguese languages.

In this first version, all composed translations from Google Translate were excluded.

The focus was on the orthographic information, especially regarding the neighborhood. Four neighborhood measures were used: Coltheart's N, Levenshtein Distance, OLD20, and Uniqueness Point.

First, this categories were derived for mono-lexicons. Second, this categories were derived for bi-lexicons and all-lexicons.

Third, word-language-pairs were derived for Levenshtein Distance, Relative Levenshtein Distance, and Uniqueness Point.

Finally, complementary categories were derived, such as: frequency, word length, cvcv structure, reverse word, among others.

We hope the MULTILEXICON can be a useful tool for stimuli selection and control in psycholinguistic experiments, translation resource, and language modeling database! Enjoy!