Changes between Version 27 and Version 28 of InterimResults


Timestamp: Jan 17, 2017, 11:20:37 AM
Author: hales

  • InterimResults

    v27  v28
    10   10    Amharic WIC corpus (News from Walta Information Center), manually tagged.
    11   11
    12         * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=amwac15&reload=1 Amharic WaC corpus], 17 million tokens
         12    * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=amwac16&reload=1 Amharic WaC corpus], 17 million tokens
    13   13
    14   14    Amharic web corpus. Crawled by !SpiderLing  in August 2013 and October 2015. Encoded in UTF-8, cleaned, deduplicated. Automatically tagged by !TreeTagger  trained on Amharic WiC
    15   15
    16         * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=oromo Oromo spoken corpus], 7,500 tokens.
         16    * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=or_spoken Oromo spoken corpus], 7,500 tokens.
    17   17
    18   18    Oromo spoken corpus containing 1205 utterances. Built by Text Laboratory, University of Oslo.