Changes between Version 4 and Version 5 of AmharicCorpus
- Timestamp:
- Jan 16, 2017, 7:16:42 PM (7 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
AmharicCorpus
v4 v5 33 33 ||info || 85|| ethsat.com || 894|| At least 1 document || 573|| 34 34 35 We observe the content of news/politic and religious portals has a significant presence in the corpus sources. Since there are only 149 domains with more than 10 documents represented in the corpus, the result collection would benefit from a greater variety of sources.35 We observe the content of news/politic and religious sites has a significant presence in the corpus sources. Since there are only 149 domains with more than 10 documents represented in the corpus, the result collection would benefit from a greater variety of sources. 36 36 37 37 The most frequent parts of speech in both corpora are nouns and verbs. The most frequent part of speech tags: