Lugamun

An easy and fair language for global communication

User Tools

Site Tools


en:statistics

Dictionary statistics

This file is automatically updated once a day, provided the dictionary has been modified since the last update. DON’T try to modify it manually, since all your changes will soon be overwritten! Last update: 2023-01-28.

Influence distribution

Influence distribution

  • Spanish: 14.3%
  • English: 13.5%
  • French: 13.3%
  • Hindi/Urdu: 10.6%
  • Arabic: 9.7%
  • Indonesian/Malay: 7.6%
  • Russian: 7.6%
  • Mandarin Chinese: 7.2%
  • Swahili: 7.2%
  • Japanese: 7.0%
  • Others: 2.0%

947 of 1543 entries directly derived from source languages.

Related vocabulary percentages

  • Spanish: 52.8%
  • French: 50.3%
  • English: 49.3%
  • Russian: 32.0%
  • Hindi/Urdu: 31.9%
  • Indonesian/Malay: 30.1%
  • Arabic: 29.7%
  • Swahili: 26.6%
  • Japanese: 25.2%
  • Mandarin Chinese: 17.5%

For an explanation of how influences and related vocabulary are measured and why the latter can never be lower than the former, see the section “Influence distribution and similarity ratios” in this article.

Most common source language combinations

Combinations involving three languages:

  • English/Spanish/French: 35.7%
  • Spanish/French/Russian: 22.1%
  • English/Spanish/Russian: 21.1%
  • English/French/Russian: 20.8%
  • Spanish/French/Indonesian: 15.9%
  • English/French/Indonesian: 15.5%
  • English/Spanish/Indonesian: 15.3%
  • Spanish/Indonesian/Russian: 14.0%
  • English/Indonesian/Russian: 13.9%
  • French/Indonesian/Russian: 13.7%
  • Spanish/French/Hindi: 13.4%
  • English/French/Hindi: 12.6%
  • English/Spanish/Hindi: 12.1%
  • English/French/Japanese: 12.1%
  • English/Spanish/Japanese: 12.0%
  • Spanish/French/Swahili: 11.6%
  • Arabic/Spanish/French: 11.1%
  • Mandarin/Spanish/French: 5.7%

Combinations involving two languages:

  • Spanish/French: 43.7%
  • English/French: 38.5%
  • English/Spanish: 38.3%
  • Spanish/Russian: 25.0%
  • English/Russian: 23.9%
  • French/Russian: 23.7%
  • Spanish/Indonesian: 18.3%
  • French/Indonesian: 18.0%
  • English/Indonesian: 17.6%
  • Arabic/Swahili: 17.1%
  • English/Japanese: 15.8%
  • Spanish/Hindi: 15.7%
  • Indonesian/Russian: 15.4%
  • French/Hindi: 15.3%
  • English/Hindi: 15.2%
  • Mandarin/Japanese: 8.1%

Percentages specify how many of the entries directly derived from source languages have been derived from this combination (and possibly other source languages).

Word class distribution

  • 759 Nouns
  • 340 Adjectives
  • 292 Verbs
  • 78 Proper nouns
  • 46 Adverbs
  • 39 Numbers
  • 35 Prepositions
  • 25 Pronouns
  • 23 Suffixes
  • 19 Quantifiers
  • 17 Conjunctions
  • 17 Interjections
  • 16 Prefixes
  • 12 Particles
  • 9 Auxiliary verbs
  • 9 Selectors
  • 8 Phrases
  • 7 Prepositional phrases

1561 lemmas in total, 18 of which are synonyms of another lemma. 205 entries belong to more than one class.

Sound frequency distribution

  • a: 14.01%
  • i: 10.93%
  • e: 9.19%
  • n: 7.93%
  • s: 7.24%
  • r: 7.11%
  • t: 5.18%
  • o: 4.99%
  • u: 4.49%
  • m: 4.12%
  • k: 4.11%
  • l: 3.89%
  • d: 2.96%
  • b: 2.31%
  • p: 1.81%
  • v: 1.55%
  • g: 1.50%
  • j: 1.29%
  • f: 1.10%
  • h: 1.10%
  • y: 0.95%
  • x: 0.73%
  • c: 0.65%
  • ai: 0.42%
  • au: 0.37%
  • oi: 0.08%

Multi-word expressions are ignored.

Trivia

Average length of root words: 4.78 letters.

Longest roots (10 letters): demokrasia, horisontal, papyamentu, Sakartvelo, Slovenesko.

Roots with maximum number of relations in the source languages (10 or more): -istan, afegan, amen, Amerika, Andora, arabi, Argentina, Asturie, Balgaria, Belarus, Belgie, Bolivia, Brasil, Buda, Cile, Danmarke, Ekvador, Galisia, han, hindi, Indonesia, islam, Israel, Italya, jaket, Kanada, katalan, Katalunya, latin, Mexiko, Mormon, muslim, papyamentu, Paragvai, Serbia, Turkie, urdu.

(A word can have more relations than we have source languages if it’s derived from a language that is not normally among our sources, such as Italya ‘Italy’, derived from Italian.)

en/statistics.txt · Last modified: 2023-01-28 03:17 by lugamun

Except where otherwise noted, content on this wiki is licensed under the following license: CC0 1.0 Universal
CC0 1.0 Universal Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki