Changes between Version 27 and Version 28 of InterimResults
- Timestamp: Jan 17, 2017, 11:20:37 AM
InterimResults
v27  v28
10   10    Amharic WIC corpus (News from Walta Information Center), manually tagged.
11   11
12         * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=amwac15&reload=1 Amharic WaC corpus], 17 million tokens
     12    * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=amwac16&reload=1 Amharic WaC corpus], 17 million tokens
13   13
14   14    Amharic web corpus. Crawled by !SpiderLing in August 2013 and October 2015. Encoded in UTF-8, cleaned, deduplicated. Automatically tagged by !TreeTagger trained on Amharic WiC
15   15
16         * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=oromo Oromo spoken corpus], 7,500 tokens.
     16    * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=or_spoken Oromo spoken corpus], 7,500 tokens.
17   17
18   18    Oromo spoken corpus containing 1205 utterances. Built by Text Laboratory, University of Oslo.