Changes between Version 10 and Version 11 of HabitCorpusAnnotation


Timestamp:
Feb 21, 2017, 3:32:55 PM (7 years ago)
Author:
pary
Comment:

--

  • HabitCorpusAnnotation

    v10 v11  
    11= HaBiT Corpus Annotation =
     2 * login to [http://corpora.fi.muni.cz:8787/files/ Corpus Annotation tool]
     3 * documents with notes for annotation of:
     4   * [https://docs.google.com/document/d/1GzyvhJTNTG4kQTEMmlQ_9W7rZMKnkORXwA5H66nfioo/edit?usp=sharing Czech]
     5   * [https://docs.google.com/document/d/1Pui6dPEPD9A0wucWg12KIUWE-1Sksi0IkC85LgEo1rU/edit?usp=sharing Amharic]
     6   * [https://docs.google.com/document/d/1L-x-aBXnce-iMyAtI_HkKQ4OnKcQMGnkfskz9Al8PFU/edit?usp=sharing Afaan Oromo]
     7   * [https://docs.google.com/document/d/1VE9F-sC7QnsBvoTMdZ0z8TWcufFA7HSprvjMd0DVQcU/edit?usp=sharing Tigrinya]
     8   * [https://docs.google.com/document/d/1hZNahiUZRUZFxLJOrWBHdty1IXDGF-7QG3hW573E6-A/edit?usp=sharing Somali]
     9   * [https://docs.google.com/document/d/1gJSmCzSkXm4D-_R4ypMTc1vNVBBmflZgBiyDtb6qklM/edit?usp=sharing Norwegian]
    210
    3 * login to [http://corpora.fi.muni.cz:8787/files/ Corpus Annotation tool]
    4 * documents with notes for annotation of:
    5  * [https://docs.google.com/document/d/1GzyvhJTNTG4kQTEMmlQ_9W7rZMKnkORXwA5H66nfioo/edit?usp=sharing Czech]
    6  * [https://docs.google.com/document/d/1Pui6dPEPD9A0wucWg12KIUWE-1Sksi0IkC85LgEo1rU/edit?usp=sharing Amharic]
    7  * [https://docs.google.com/document/d/1L-x-aBXnce-iMyAtI_HkKQ4OnKcQMGnkfskz9Al8PFU/edit?usp=sharing Afaan Oromo]
    8  * [https://docs.google.com/document/d/1VE9F-sC7QnsBvoTMdZ0z8TWcufFA7HSprvjMd0DVQcU/edit?usp=sharing Tigrinya]
    9  * [https://docs.google.com/document/d/1hZNahiUZRUZFxLJOrWBHdty1IXDGF-7QG3hW573E6-A/edit?usp=sharing Somali]
    10  * [https://docs.google.com/document/d/1gJSmCzSkXm4D-_R4ypMTc1vNVBBmflZgBiyDtb6qklM/edit?usp=sharing Norwegian]
    11 
    12 * [https://nlp.fi.muni.cz/projects/habit/stats/ annotation statistics]
     11 * [https://nlp.fi.muni.cz/projects/habit/stats/ annotation statistics]
     12 * AnnotationResults
    1313
    1414We use ''' Universal POS tags ''' according to the '''[http://universaldependencies.org/u/pos/index.html Universal Dependencies v2]''' specification.
    1515
     16The work aims to annotate ''' Open class words ''' and ''' Closed class words ''' that are divided into these eight categories:
    1617
    17 The work aims to annotate ''' Open class words ''' and ''' Closed class words ''' that are divided into these eight categories:
     18''' Open class words '''
    1819
    19 ''' Open class words '''
    2020 * [http://universaldependencies.org/u/pos/ADJ.html adjective (ADJ)]
    2121 * [http://universaldependencies.org/u/pos/ADV.html adverb (ADV)]
     
    2626
    2727''' Closed class words '''
    28  * [http://universaldependencies.org/u/pos/ADP.html adposition (ADP)]
     28
     29 * [http://universaldependencies.org/u/pos/ADP.html adposition (ADP)]
    2930 * [http://universaldependencies.org/u/pos/AUX_.html auxiliary (AUX)]
    3031 * [http://universaldependencies.org/u/pos/CCONJ.html coordinating conjunction (CCONJ)]
     
    3334 * [http://universaldependencies.org/u/pos/PART.html particle (PART)]
    3435 * [http://universaldependencies.org/u/pos/PRON.html pronoun (PRON)]
    35  * [http://universaldependencies.org/u/pos/SCONJ.html subordinating conjunction (SCONJ)] 
     36 * [http://universaldependencies.org/u/pos/SCONJ.html subordinating conjunction (SCONJ)]
    3637
    3738The initial minimalistic tagger is based on seed-word examples for each POS category:
     39
    3840 * [https://docs.google.com/document/d/1exUa_2ndLIZvw1gCF7GqmcmiEpZrlLOcuj3fdC25mGQ/edit?usp=sharing English] ('''example document''')
    39  *  Ethiopian languages:
    40   * [https://docs.google.com/document/d/1Adg63YA1JsQ5wxlAOXRpcp2ivrZcPtGCB0bsRhiampk/edit?usp=sharing Amharic]
    41   * [https://docs.google.com/document/d/1CG6iROYvCiIbS1PmpzsumyNpuhar1WcnIUTcOzG9Iaw/edit?usp=sharing Afaan Oromo]
    42   * [https://docs.google.com/document/d/1Us1QbvA4p1xUhfWB0Xdh8sWMd0exZqyGIPTlkc7l4k4/edit?usp=sharing Tigrinya]
    43   * [https://docs.google.com/document/d/1TUGD1CSaFlqu8BmIXBIImjxGozdwXceAgy4zaAYal0A/edit?usp=sharing Somali]
     41 * Ethiopian languages:
     42   * [https://docs.google.com/document/d/1Adg63YA1JsQ5wxlAOXRpcp2ivrZcPtGCB0bsRhiampk/edit?usp=sharing Amharic]
     43   * [https://docs.google.com/document/d/1CG6iROYvCiIbS1PmpzsumyNpuhar1WcnIUTcOzG9Iaw/edit?usp=sharing Afaan Oromo]
     44   * [https://docs.google.com/document/d/1Us1QbvA4p1xUhfWB0Xdh8sWMd0exZqyGIPTlkc7l4k4/edit?usp=sharing Tigrinya]
     45   * [https://docs.google.com/document/d/1TUGD1CSaFlqu8BmIXBIImjxGozdwXceAgy4zaAYal0A/edit?usp=sharing Somali]
    4446 * [https://docs.google.com/document/d/1lSvT1JOYnWUYc2EY-B1z3kszdrqhGpzPN9j37QxM_e8/edit?usp=sharing Czech]
    4547 * [https://docs.google.com/document/d/1F4RTDWoLf3cgjvQAJju83SalKPvbOn-0eTdmQ4lntuI/edit?usp=sharing Norwegian]
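
The seed-word idea above can be illustrated with a plain lookup tagger. This is only a minimal sketch, not the project's actual tagger: the English seed words in `SEEDS`, the function names `build_lexicon` and `tag_tokens`, and the choice to return `None` for unseen tokens are all assumptions made for illustration. The tag inventory follows the open and closed class lists of Universal Dependencies v2.

```python
# Universal POS tags for open and closed class words (Universal Dependencies v2).
OPEN_CLASS = {"ADJ", "ADV", "INTJ", "NOUN", "PROPN", "VERB"}
CLOSED_CLASS = {"ADP", "AUX", "CCONJ", "DET", "NUM", "PART", "PRON", "SCONJ"}

# Hypothetical English seed words; the real seed lists are in the linked documents.
SEEDS = {
    "ADJ": ["big", "green"],
    "ADV": ["very", "often"],
    "NOUN": ["house", "corpus"],
    "VERB": ["run", "annotate"],
    "ADP": ["in", "on"],
    "AUX": ["is", "was"],
    "CCONJ": ["and", "or"],
    "DET": ["the", "a"],
    "PRON": ["she", "it"],
    "SCONJ": ["because", "if"],
}


def build_lexicon(seeds):
    """Invert a tag -> words mapping into word -> tag, rejecting unknown tags."""
    allowed = OPEN_CLASS | CLOSED_CLASS
    lexicon = {}
    for tag, words in seeds.items():
        if tag not in allowed:
            raise ValueError(f"not an open or closed class tag: {tag}")
        for word in words:
            lexicon[word.lower()] = tag
    return lexicon


def tag_tokens(tokens, lexicon, unknown=None):
    """Tag each token by case-insensitive seed lookup; unseen tokens get `unknown`."""
    return [(tok, lexicon.get(tok.lower(), unknown)) for tok in tokens]


LEXICON = build_lexicon(SEEDS)
```

Calling `tag_tokens(["The", "house", "is", "big"], LEXICON)` tags every token from the seed lists, while any token outside the lists is paired with `None`, which marks it as a candidate for manual annotation.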
    4648
    4749== Word Sketch questionnaires ==
    48 
    4950 * [https://docs.google.com/document/d/1UQv_tYOqSLjkXv0Xgu_IdhAhteARPxRmS0B4nutlelA/edit# Amharic]
    5051 * [https://docs.google.com/document/d/1cTLXL6RGvqGjZaHVcBOONhoG5DF_LpqxFT0urke6wQY/edit# Bengali]