Lugamun

An easy and fair language for global communication

User Tools

Site Tools


en:statistics

Dictionary statistics

This file is automatically updated once a day, provided the dictionary has been modified since the last update. DON’T try to modify it manually, since all your changes will soon be overwritten! Last update: 2022-05-22.

Influence distribution

  • Spanish: 12.5%
  • English: 12.2%
  • French: 11.9%
  • Hindi/Urdu: 11.2%
  • Arabic: 10.6%
  • Mandarin Chinese: 9.4%
  • Indonesian/Malay: 7.9%
  • Russian: 7.7%
  • Swahili: 7.5%
  • Japanese: 7.2%
  • Others: 1.8%

578 of 913 entries directly derived from source languages.

  • Spanish: 45.5%
  • French: 44.8%
  • English: 43.6%
  • Hindi/Urdu: 33.4%
  • Russian: 31.3%
  • Arabic: 30.3%
  • Indonesian/Malay: 29.9%
  • Swahili: 26.3%
  • Japanese: 24.4%
  • Mandarin Chinese: 20.9%

For an explanation of how influences and related vocabulary are measured and why the latter can never be lower than the former, see the section “Influence distribution and similarity ratios” in this article.

Word class distribution

  • 410 Nouns
  • 226 Adjectives
  • 137 Verbs
  • 47 Proper nouns
  • 31 Numbers
  • 28 Adverbs
  • 27 Prepositions
  • 17 Quantifiers
  • 15 Suffixes
  • 14 Pronouns
  • 12 Particles
  • 12 Prefixes
  • 11 Interjections
  • 10 Conjunctions
  • 9 Phrases
  • 8 Selectors
  • 5 Auxiliary verbs
  • 3 Prepositional phrases

929 words in total, 16 of which are synonyms of another word. 91 words belong to more than one class.

Sound frequency distribution

  • a: 14.91%
  • i: 11.31%
  • e: 7.77%
  • n: 7.65%
  • s: 7.23%
  • r: 6.92%
  • o: 5.07%
  • u: 4.73%
  • t: 4.68%
  • k: 4.08%
  • m: 4.08%
  • l: 3.83%
  • d: 2.95%
  • b: 2.86%
  • p: 1.67%
  • g: 1.56%
  • j: 1.30%
  • h: 1.28%
  • f: 1.02%
  • w: 1.02%
  • c: 0.96%
  • y: 0.96%
  • ai: 0.82%
  • x: 0.68%
  • au: 0.51%
  • oi: 0.14%

Multi-word expressions are ignored.

Trivia

Average length of root words: 4.47 letters.

Longest roots (10 letters): horisontal.

Roots with maximum number of relations in the source languages (10 or more): -istan, Amerika, Andora, arabi, Argentina, Bolibia, Brasil, Buda, Cile, Ekwador, Galisia, han, hindi, islam, Italya, jaket, katalan, Katalunya, Mormon, muslim, Paragwai, Turkie, urdu.

(A word can have more relations than we have source languages if it’s derived from a language that is not normally among our sources, such as Italya ‘Italy’, derived from Italian.)

en/statistics.txt · Last modified: 2022-05-22 03:17 by lugamun