= HaBiT - Harvesting big text data for under-resourced languages =

Start date: 1.10.2014 [[BR]]
End date: 30.4.2017

== Partners ==

 * MU: Masarykova univerzita, Brno
 * NTNU: Norges teknisk-naturvitenskapelige universitet, Trondheim

== Project Goals ==

 1. build a multi-billion-word Norwegian corpus
  * using tools co-developed by MU and used in a joint EU-funded project with NTNU
 1. support linguistic resource building in Ethiopia, funded by Norad within the NORHED project
 1. build shallow processing applications for Czech, Norwegian, and at least one Ethiopian language

[[int/InternalWikiStart Internal Wiki]]