Changes between Version 26 and Version 27 of WikiStart


Timestamp:
Jan 4, 2016, 4:12:45 PM (8 years ago)
Author:
xkocinc
Comment:

--

  • WikiStart

    v26 v27  
    33Start date:  1.10.2014 [[BR]]
    44End date:  30.4.2017
     5
     6== About the project ==
     7
     8The main objectives of the HaBiT project are to gather large-scale text data (corpora) from the Web for under-resourced languages, including Norwegian, partly Czech, and the major languages of Ethiopia (Amharic, Afaan Oromo, Tigrinya, Somali), and to build shallow processing applications. The gathered data will be processed to make it usable in many language applications, such as information extraction or machine translation. Furthermore, because the Ethiopian languages differ considerably from most European languages, existing tools for building web text resources will be further developed and improved in the process of collecting the corpus data. Applications will also be built for the given languages to separate and disambiguate multiple senses of words.
     9
     10
     11== Project Goals ==
     12
     13 1. Creating a repository for the investigated languages and making them freely accessible for further research (especially in Ethiopia and Norway).
     14 1. Presenting results obtained in the project to the research community and disseminating the results via the HaBiT project web pages.
     15 1. In general, the accessibility of the results will advance research on under-resourced languages and thereby contribute to promoting knowledge of these languages in the long term.
     16 1. The project results will make it possible to acquire information technologies in a less-developed country and contribute to its cultural development.
     17
    518
    619== Partners ==
     
    1124 AAU: Addis Ababa University and [[BR]]
    1225 HU: Hawassa University
     26
    1327
    1428== Project team ==
     
    2236The language processing team at NTNU belongs to the Artificial Intelligence division of the Department of Computer and Information Science. The Norwegian team in HaBiT will consist of Björn Gambäck (Professor of Language Technology, NTNU), Janne Bondi Johannessen (Professor at the Text Laboratory, University of Oslo), a PhD student (to be appointed), and the researchers L. Bungum and H. Moen, who together provide a strong background in language technology and knowledge representation, and in language resource building, both for Norwegian and for Ethiopian languages. Within the HaBiT project, the team will participate in and lead the research activities related to corpora building, annotation and processing for Norwegian and for the Ethiopian languages. Furthermore, NTNU is collaborating with the University of Oslo and the universities in Addis Ababa and Hawassa in Ethiopia in a project, funded by Norad through the NORHED programme, to support linguistic capacity building in Ethiopia.
    2337
    24 
    25 == Project Goals ==
    26 
    27  1. Creating a repository for the investigated languages and making them freely accessible for further research (especially in Ethiopia and Norway),
    28  1. Presenting results obtained in the project to the research community and disseminating the results via the HaBiT project web pages,
    29  1. In general, the accessibility of the results will advance research on under-resourced languages and thereby contribute to promoting knowledge of these languages in the long term.
    30  1. The project results will make it possible to acquire information technologies in a less-developed country and contribute to its cultural development.
    3138
    3239== Public outcomes (in progress) ==
     
    6168Contract no. MSMT-28477/2014.
    6269
     70
    6371== Contact ==
    6472
     
    6674* Project coordinator: pala@fi.muni.cz
    6775
     76
    6877----
    6978