Changes between Version 4 and Version 5 of AmharicCorpus


Ignore:
Timestamp:
Jan 16, 2017, 7:16:42 PM (7 years ago)
Author:
xsuchom2
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • AmharicCorpus

    v4 v5  
    3333||info ||     85|| ethsat.com     ||   894|| At least 1 document      || 573||
    3434
    35 We observe the content of news/politic and religious portals has a significant presence in the corpus sources. Since there are only 149 domains with more than 10 documents represented in the corpus, the result collection would benefit from a greater variety of sources.
     35We observe the content of news/politic and religious sites has a significant presence in the corpus sources. Since there are only 149 domains with more than 10 documents represented in the corpus, the result collection would benefit from a greater variety of sources.
    3636
    3737The most frequent parts of speech in both corpora are nouns and verbs. The most frequent part of speech tags: