Changes between Version 8 and Version 9 of AmharicCorpus
- Timestamp:
- Jan 17, 2017, 12:39:47 PM (7 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
AmharicCorpus
v8 v9 22 22 ||=Token count =|| 20,287,250|| 23 23 ||=Ge'ez lexicon size=|| 955,628|| 24 ||=Sera lexicon size =|| 948,553||24 ||=Sera transliteration lexicon size =|| 948,553|| 25 25 26 26 Document count – the most frequent web domains and domain size distribution: … … 36 36 37 37 The most frequent words: 38 ||=Word (Ge'ez ) =||= Word (Sera) =||= Count =||38 ||=Word (Ge'ez script) =||= Word (Sera transliteration) =||= Count =|| 39 39 ||ነው ||new || 155,520|| 40 40 ||ላይ ||lay || 91,592||