Dictionary statistics

This file is updated automatically once a day, provided the dictionary has been modified since the last update. Do not edit it manually, since any changes will be overwritten by the next update. Last update: 2023-03-16.

Influence distribution

986 of 1605 entries (about 61.4%) are directly derived from source languages.

Related vocabulary percentages

For an explanation of how influences and related vocabulary are measured and why the latter can never be lower than the former, see the section “Influence distribution and similarity ratios” in this article.

Most common source language combinations

Combinations involving three languages:

Combinations involving two languages:

Each percentage gives the share of the entries directly derived from source languages that have been derived from that combination (and possibly from other source languages as well).
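The counting rule above can be sketched in a few lines of Python. This is only an illustration, not the script that generates this page: the function name `combination_shares`, the placeholder language codes and the sample entries are all made up. It assumes that each entry is represented by the set of source languages it is derived from.

```python
from collections import Counter
from itertools import combinations


def combination_shares(entry_sources, size):
    """Return {combination: percentage} for all source-language combinations of `size`."""
    counts = Counter()
    for sources in entry_sources:
        # Every subset of the requested size is counted, so an entry derived
        # from three languages contributes to three two-language combinations.
        for combo in combinations(sorted(set(sources)), size):
            counts[combo] += 1
    total = len(entry_sources)
    return {combo: 100 * n / total for combo, n in counts.items()}


# Hypothetical entries with placeholder language codes (not real data).
entries = [
    {"aa", "bb", "cc"},
    {"aa", "bb"},
    {"bb", "cc"},
    {"aa"},
]
pairs = combination_shares(entries, 2)
triples = combination_shares(entries, 3)
```

Because an entry counts toward every combination it contains, the percentages for combinations of a given size can add up to more than 100%.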

Word class distribution

1623 lemmas in total, 18 of which are synonyms of another lemma. 223 entries belong to more than one class.

Sound frequency distribution

Multi-word expressions are ignored.
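As a rough sketch of such a count, the Python below skips every entry that contains a space and tallies the remaining letters. It makes two assumptions that this page does not confirm: that a multi-word expression can be recognized by a space, and that each letter stands for exactly one sound. The function name `sound_frequencies` and the multi-word sample entry are hypothetical.

```python
from collections import Counter


def sound_frequencies(entries):
    """Return {letter: percentage}, skipping multi-word expressions."""
    counts = Counter()
    for entry in entries:
        if " " in entry:
            continue  # assumption: a space marks a multi-word expression
        # Assumption: one letter equals one sound; hyphens and the like are dropped.
        counts.update(ch for ch in entry.lower() if ch.isalpha())
    total = sum(counts.values())
    return {sound: 100 * n / total for sound, n in counts.most_common()}


# Three roots taken from this page plus one made-up multi-word entry.
sample = ["amen", "han", "islam", "amen han"]
freqs = sound_frequencies(sample)
```

If the orthography uses digraphs for single sounds, the tally would have to split words into sounds rather than letters before counting.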

Trivia

Average length of root words: 4.81 letters.

Longest roots (10 letters): demokrasia, horisontal, koresponde, papyamentu, Sakartvelo, Slovenesko.
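Both figures above are straightforward to compute from a list of roots. The following Python is a minimal sketch under the assumption that length is simply the number of characters in the written root; the function name `root_length_stats` is hypothetical, and the sample is a handful of roots taken from this page rather than the full dictionary, so its average differs from the 4.81 reported here.

```python
def root_length_stats(roots):
    """Return (average length, maximum length, roots of maximum length)."""
    lengths = [len(root) for root in roots]
    average = round(sum(lengths) / len(lengths), 2)
    longest = max(lengths)
    # Sort case-insensitively so capitalized names are not listed first.
    longest_roots = sorted(
        (root for root in roots if len(root) == longest), key=str.lower
    )
    return average, longest, longest_roots


# A small sample only; the real statistics cover all root words.
sample = ["demokrasia", "horisontal", "amen", "han", "islam", "jaket"]
average, longest, longest_roots = root_length_stats(sample)
```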

Roots with maximum number of relations in the source languages (10 or more): -istan, afegan, amen, Amerika, Andora, arabi, Argentina, Asturie, Balgaria, Belarus, Belgie, Bolivia, Brasil, Buda, Cile, Danmarke, Ekvador, Galisia, han, hindi, Indonesia, islam, Israel, Italya, jaket, Kanada, katalan, Katalunya, latin, Mexiko, Mormon, muslim, papyamentu, Paragvai, Serbia, Turkie, urdu.

(A word can have more relations than we have source languages if it’s derived from a language that is not normally among our sources, such as Italya ‘Italy’, derived from Italian.)