Lugamun

An easy and fair language for global communication

User Tools

Site Tools


en:statistics

Dictionary statistics

This file is automatically updated once a day, provided the dictionary has been modified since the last update. DON’T try to modify it manually, since all your changes will soon be overwritten! Last update: 2022-10-04.

Influence distribution

  • Spanish: 13.9%
  • English: 13.2%
  • French: 12.8%
  • Hindi/Urdu: 11.1%
  • Arabic: 9.7%
  • Mandarin Chinese: 7.7%
  • Indonesian/Malay: 7.6%
  • Russian: 7.5%
  • Swahili: 7.4%
  • Japanese: 7.0%
  • Others: 2.3%

793 of 1294 entries directly derived from source languages.

  • Spanish: 50.9%
  • French: 48.5%
  • English: 48.0%
  • Hindi/Urdu: 33.8%
  • Russian: 31.7%
  • Arabic: 30.3%
  • Indonesian/Malay: 29.9%
  • Swahili: 27.4%
  • Japanese: 25.2%
  • Mandarin Chinese: 18.4%

For an explanation of how influences and related vocabulary are measured and why the latter can never be lower than the former, see the section “Influence distribution and similarity ratios” in this article.

Word class distribution

  • 599 Nouns
  • 305 Adjectives
  • 223 Verbs
  • 68 Proper nouns
  • 42 Adverbs
  • 34 Numbers
  • 33 Prepositions
  • 23 Pronouns
  • 18 Quantifiers
  • 18 Suffixes
  • 16 Conjunctions
  • 14 Interjections
  • 13 Prefixes
  • 12 Particles
  • 9 Selectors
  • 8 Auxiliary verbs
  • 8 Phrases
  • 6 Prepositional phrases

1315 lemmas in total, 21 of which are synonyms of another lemma. 152 entries belong to more than one class.

Sound frequency distribution

  • a: 14.55%
  • i: 10.80%
  • e: 9.02%
  • n: 7.91%
  • s: 7.22%
  • r: 6.97%
  • t: 5.28%
  • o: 4.91%
  • u: 4.32%
  • m: 4.11%
  • k: 3.97%
  • l: 3.80%
  • b: 2.82%
  • d: 2.80%
  • p: 1.78%
  • g: 1.57%
  • j: 1.30%
  • w: 1.19%
  • f: 1.17%
  • h: 1.07%
  • y: 0.98%
  • c: 0.71%
  • x: 0.71%
  • ai: 0.50%
  • au: 0.44%
  • oi: 0.10%

Multi-word expressions are ignored.

Trivia

Average length of root words: 4.68 letters.

Longest roots (10 letters): horisontal, papyamentu, Sakartwelo.

Roots with maximum number of relations in the source languages (10 or more): -istan, afegan, amen, Amerika, Andora, arabi, Argentina, Asturie, Balgaria, Belarus, Belgie, Bolibia, Brasil, Buda, Cile, Danmarke, Ekwador, Galisia, han, hindi, islam, Italya, jaket, Kanada, katalan, Katalunya, latin, Mormon, muslim, papyamentu, Paragwai, Serbia, Turkie, urdu.

(A word can have more relations than we have source languages if it’s derived from a language that is not normally among our sources, such as Italya ‘Italy’, derived from Italian.)

en/statistics.txt · Last modified: 2022-10-04 03:17 by lugamun

Except where otherwise noted, content on this wiki is licensed under the following license: CC0 1.0 Universal
CC0 1.0 Universal Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki