Changes between Version 7 and Version 8 of OromoCorpus
- Timestamp:
- Jan 17, 2017, 12:42:19 PM (7 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
OromoCorpus
v7 v8 38 38 The content of news/politics and religious sites has a significant presence in the corpus sources. 39 39 40 The most frequent words: 41 ||=Word (Latin) =||= Count =|| 42 ||akka || 82,032|| 43 ||kan || 71,775|| 44 ||hin || 65,390|| 45 ||fi || 64,710|| 46 ||Oromoo || 32,189|| 47 ||kana || 26,818|| 48 ||tokko || 26,699|| 49 ||itti || 25,926|| 50 ||waan || 24,733|| 51 ||yeroo || 22,580|| 52 ||keessatti || 22,189|| 53 ||isa || 21,732|| 54 ||isaa || 21,636|| 55 ||irratti || 20,666|| 56 ||jiru || 20,655|| 57 40 58 == Corpus query interface == 41 The corpus has been indexed by corpus manager and query system Sketch Engine [5]. The corpus can be searched at http://corpora.fi.muni.cz/habit/ .59 The corpus has been indexed by corpus manager and query system Sketch Engine [5]. The corpus can be searched at http://corpora.fi.muni.cz/habit/run.cgi/first_form?corpname=orwac16. 42 60 43 61 == References ==