Changes between Version 2 and Version 3 of HabitSystemFinal


Ignore:
Timestamp:
Jun 2, 2017, 11:30:43 AM (7 years ago)
Author:
xsuchom2
Comment:

Examples of Ethiopian Web Corpora use

Legend:

Unmodified
Added
Removed
Modified
  • HabitSystemFinal

    v2 v3  
    4040 * [AmharicCorpus Corpus deliverable/technical report]
    4141
    42 TODO Examples of concordance, sketches, wordlist
     42==== Examples of Amharic Web Corpus use ====
     43 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=amwac16 Corpus information]
     44 * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=amwac16&reload=&iquery=%E1%88%98%E1%8A%95%E1%8C%8D%E1%88%A5%E1%89%B5&queryselector=iqueryrow&phrase=&word=&char=&cql=&default_attr=word&fc_lemword_window_type=both&fc_lemword_wsize=5&fc_lemword=&fc_lemword_type=all&fsca_doc.t2ld=&fsca_doc.urldomain= Examples of the use of "መንግሥት" ("government")] – Words or phrases in a natural Amharic context. The base function for language study and the source of good dictionary examples.
     45 * [http://corpora.fi.muni.cz/habit/run.cgi/wsketch?corpname=amwac16&reload=&lemma=%E1%88%98%E1%8A%95%E1%8C%8D%E1%88%A5%E1%89%B5&minfreq=6&minscore=0.0&maxitems=20&sort_ws_columns=s&show_lemma_coverage=0&clustercolls=0&minsim=0.15&structured=0&structured=1&min_unary_score=5.0&min_mwlink_freq=100&nr_ws_cols=5&bim_corpname=&bim_lemma= Grammatical and collocational behaviour of "መንግሥት" ("government")] – An essential feature for creating dictionaries in Amharic.
     46 * [http://corpora.fi.muni.cz/habit/run.cgi/thes?corpname=amwac16&reload=&lemma=%E1%88%98%E1%8A%95%E1%8C%8D%E1%88%A5%E1%89%B5&maxthesitems=60&minthesscore=0.0&includeheadword=0&clusteritems=0&minsim=0.15 Words used in the same context as "መንግሥት" ("government")] – A useful list for creating an Amharic thesaurus.
     47 * [http://corpora.fi.muni.cz/habit/run.cgi/freqml?q=aword%2C%5Bword%3D%22.%7B3%2C%7D%22+%26+tag%3D%22N.*%22%5D&corpname=amwac16&viewmode=sen&refs=%3Ddoc.t2ld&ml=1&flimit=0&ml1attr=word&ml1ctx=0~0%3E0&freqlevel=2&ml2attr=sera&ml2ctx=0~0%3E0&ml3attr=word&ml3ctx=0~0%3E0&ml4attr=word&ml4ctx=0~0%3E0 List of Amharic nouns by frequency] – Useful for dictionary based applications, e.g. predictive text writing.
    4348
    4449=== Oromo Web Corpus ===
     
    4954 * [OromoCorpus Corpus deliverable/technical report]
    5055
    51 TODO Examples of concordance, sketches, wordlist
     56==== Examples of Oromo Web Corpus use ====
     57 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=orwac16 Corpus information]
     58 * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=orwac16&reload=&iquery=mootummaa&queryselector=iqueryrow&phrase=&word=&char=&cql=&default_attr=word&fc_lemword_window_type=both&fc_lemword_wsize=5&fc_lemword=&fc_lemword_type=all&fsca_doc.t2ld=&fsca_doc.urldomain= Examples of the use of "mootummaa" ("government") in context] – Words or phrases in a natural Oromo context. The base function for language study and the source of good dictionary examples.
     59 * [http://corpora.fi.muni.cz/habit/run.cgi/wsketch?corpname=orwac16&reload=&lemma=mootummaa&minfreq=auto&minscore=0.0&maxitems=20&sort_ws_columns=s&show_lemma_coverage=0&clustercolls=0&minsim=0.15&structured=0&structured=1&min_unary_score=5.0&min_mwlink_freq=100&nr_ws_cols=5&bim_corpname=&bim_lemma= Grammatical and collocational behaviour of "mootummaa" ("government")] – An essential feature for creating dictionaries in Oromo.
     60 * [http://corpora.fi.muni.cz/habit/run.cgi/thes?corpname=orwac16&reload=&lemma=mootummaa&maxthesitems=60&minthesscore=0.0&includeheadword=0&clusteritems=0&minsim=0.15 Words used in the same context as "mootummaa" ("government")] – A useful list for creating an Oromo thesaurus.
     61 * [http://corpora.fi.muni.cz/habit/run.cgi/struct_wordlist?corpname=orwac16&refs=%3Ddoc.t2ld&wlmaxitems=100&wlsort=f&subcnorm=freq&corpname=orwac16&reload=&wlattr=tag&usengrams=0&ngrams_n=2&wlpat=NOUN&wlminfreq=5&wlmaxfreq=0&wlfile=&wlblacklist=&wlnums=frq&wltype=multilevel&wlstruct_attr1=word&wlstruct_attr2=&wlstruct_attr3= List of Oromo nouns by frequency] – Useful for dictionary based applications, e.g. predictive text writing.
    5262
    5363=== Somali Web Corpus ===
     
    5868 * [SomaliCorpus Corpus deliverable/technical report]
    5969
    60 TODO Examples of concordance, sketches, wordlist
     70==== Examples of Somali Web Corpus use ====
     71 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=sowac16 Corpus information]
     72 * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=sowac16&reload=&iquery=dowladda&queryselector=iqueryrow&phrase=&word=&char=&cql=&default_attr=word&fc_lemword_window_type=both&fc_lemword_wsize=5&fc_lemword=&fc_lemword_type=all&fsca_doc.tld=&fsca_doc.t2ld=&fsca_doc.urldomain= Examples of the use of "dowladda" ("government") in context] – Words or phrases in a natural Somali context. The base function for language study and the source of good dictionary examples.
     73 * [http://corpora.fi.muni.cz/habit/run.cgi/wsketch?corpname=sowac16&reload=&lemma=dowladda&minfreq=auto&minscore=0.0&maxitems=20&sort_ws_columns=s&show_lemma_coverage=0&clustercolls=0&minsim=0.15&structured=0&structured=1&min_unary_score=5.0&min_mwlink_freq=100&nr_ws_cols=5&bim_corpname=&bim_lemma= Grammatical and collocational behaviour of "dowladda" ("government")] – An essential feature for creating dictionaries in Somali.
     74 * [http://corpora.fi.muni.cz/habit/run.cgi/thes?corpname=sowac16&reload=&lemma=dowladda&maxthesitems=60&minthesscore=0.0&includeheadword=0&clusteritems=0&minsim=0.15 Words used in the same context as "dowladda" ("government")] – A useful list for creating a Somali thesaurus.
     75 * [http://corpora.fi.muni.cz/habit/run.cgi/struct_wordlist?corpname=sowac16&refs=%3Ddoc.t2ld&wlmaxitems=100&wlsort=f&subcnorm=freq&corpname=sowac16&reload=&wlattr=tag&usengrams=0&ngrams_n=2&wlpat=NOUN&wlminfreq=5&wlmaxfreq=0&wlfile=&wlblacklist=&wlnums=frq&wltype=multilevel&wlstruct_attr1=word&wlstruct_attr2=&wlstruct_attr3= List of Somali nouns by frequency] – Useful for dictionary based applications, e.g. predictive text writing.
    6176
    6277=== Tigrinya Web Corpus ===
     
    6782 * [TigrinyaCorpus Corpus deliverable/technical report]
    6883
    69 TODO Examples of concordance, sketches, wordlist
     84==== Examples of Tigrinya Web Corpus use ====
     85 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=tiwac16 Corpus information]
     86 * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=tiwac16&reload=&iquery=%E1%88%98%E1%8A%95%E1%8C%8D%E1%88%B5%E1%89%B2&queryselector=iqueryrow&phrase=&word=&wpos=&char=&cql=&default_attr=word&fc_lemword_window_type=both&fc_lemword_wsize=5&fc_lemword=&fc_lemword_type=all&fc_pos_window_type=both&fc_pos_wsize=5&fc_pos_type=all&fsca_doc.t2ld=&fsca_doc.urldomain= Examples of the use of "መንግስቲ " ("government") in context] – Words or phrases in a natural Tigrinya context. The base function for language study and the source of good dictionary examples.
     87 * [http://corpora.fi.muni.cz/habit/run.cgi/wsketch?corpname=tiwac16&reload=&lemma=%E1%88%98%E1%8A%95%E1%8C%8D%E1%88%B5%E1%89%B2&minfreq=auto&minscore=0.0&maxitems=20&sort_ws_columns=s&show_lemma_coverage=0&clustercolls=0&minsim=0.15&structured=0&structured=1&min_unary_score=5.0&min_mwlink_freq=100&nr_ws_cols=5&bim_corpname=&bim_lemma= Grammatical and collocational behaviour of "መንግስቲ " ("government")] – An essential feature for creating dictionaries in Tigrinya.
     88 * [http://corpora.fi.muni.cz/habit/run.cgi/thes?corpname=tiwac16&reload=&lemma=%E1%88%98%E1%8A%95%E1%8C%8D%E1%88%B5%E1%89%B2&maxthesitems=60&minthesscore=0.0&includeheadword=0&clusteritems=0&minsim=0.15 Words used in the same context as "መንግስቲ " ("government")] – A useful list for creating a Tigrinya thesaurus.
     89 * [http://corpora.fi.muni.cz/habit/run.cgi/struct_wordlist?corpname=tiwac16&refs=%3Ddoc.t2ld&wlmaxitems=100&wlsort=f&subcnorm=freq&corpname=tiwac16&reload=&wlattr=tag&usengrams=0&ngrams_n=2&wlpat=NOUN&wlminfreq=5&wlmaxfreq=0&wlfile=&wlblacklist=&wlnums=frq&wltype=multilevel&wlstruct_attr1=word&wlstruct_attr2=&wlstruct_attr3= List of Tigrinya nouns by frequency] – Useful for dictionary based applications, e.g. predictive text writing.
    7090
    7191=== Norwegian Bokmål Web Corpus ===