Changes between Version 3 and Version 4 of HabitSystemFinal
- Timestamp:
- Jun 2, 2017, 12:19:16 PM (7 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
HabitSystemFinal
v3 v4 40 40 * [AmharicCorpus Corpus deliverable/technical report] 41 41 42 ==== Examples of Amharic Web Corpus use====42 ==== Examples of HaBiT System features for the Amharic Web Corpus ==== 43 43 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=amwac16 Corpus information] 44 44 * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=amwac16&reload=&iquery=%E1%88%98%E1%8A%95%E1%8C%8D%E1%88%A5%E1%89%B5&queryselector=iqueryrow&phrase=&word=&char=&cql=&default_attr=word&fc_lemword_window_type=both&fc_lemword_wsize=5&fc_lemword=&fc_lemword_type=all&fsca_doc.t2ld=&fsca_doc.urldomain= Examples of the use of "መንግሥት" ("government")] – Words or phrases in a natural Amharic context. The base function for language study and the source of good dictionary examples. … … 54 54 * [OromoCorpus Corpus deliverable/technical report] 55 55 56 ==== Examples of Oromo Web Corpus use====56 ==== Examples of HaBiT System features for the Oromo Web Corpus ==== 57 57 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=orwac16 Corpus information] 58 58 * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=orwac16&reload=&iquery=mootummaa&queryselector=iqueryrow&phrase=&word=&char=&cql=&default_attr=word&fc_lemword_window_type=both&fc_lemword_wsize=5&fc_lemword=&fc_lemword_type=all&fsca_doc.t2ld=&fsca_doc.urldomain= Examples of the use of "mootummaa" ("government") in context] – Words or phrases in a natural Oromo context. The base function for language study and the source of good dictionary examples. … … 68 68 * [SomaliCorpus Corpus deliverable/technical report] 69 69 70 ==== Examples of Somali Web Corpus use====70 ==== Examples of HaBiT System features for the Somali Web Corpus ==== 71 71 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=sowac16 Corpus information] 72 72 * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=sowac16&reload=&iquery=dowladda&queryselector=iqueryrow&phrase=&word=&char=&cql=&default_attr=word&fc_lemword_window_type=both&fc_lemword_wsize=5&fc_lemword=&fc_lemword_type=all&fsca_doc.tld=&fsca_doc.t2ld=&fsca_doc.urldomain= Examples of the use of "dowladda" ("government") in context] – Words or phrases in a natural Somali context. The base function for language study and the source of good dictionary examples. … … 82 82 * [TigrinyaCorpus Corpus deliverable/technical report] 83 83 84 ==== Examples of Tigrinya Web Corpus use====84 ==== Examples of HaBiT System features for the Tigrinya Web Corpus ==== 85 85 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=tiwac16 Corpus information] 86 86 * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=tiwac16&reload=&iquery=%E1%88%98%E1%8A%95%E1%8C%8D%E1%88%B5%E1%89%B2&queryselector=iqueryrow&phrase=&word=&wpos=&char=&cql=&default_attr=word&fc_lemword_window_type=both&fc_lemword_wsize=5&fc_lemword=&fc_lemword_type=all&fc_pos_window_type=both&fc_pos_wsize=5&fc_pos_type=all&fsca_doc.t2ld=&fsca_doc.urldomain= Examples of the use of "መንግስቲ " ("government") in context] – Words or phrases in a natural Tigrinya context. The base function for language study and the source of good dictionary examples. … … 95 95 * Tagged by [https://www.sketchengine.co.uk/norwegian-oslo-bergen-part-of-speech-tagset/ Oslo-Bergen Tagger]. 96 96 97 TODO Examples of concordance, sketches, wordlist 97 ==== Examples of HaBiT System features for the Norwegian Bokmål Web Corpus ==== 98 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=notenten15_4_bokmal Corpus information] 99 * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=notenten15_4_bokmal&reload=&iquery=regjering&queryselector=iqueryrow&lemma=&lpos=&phrase=&word=&wpos=&char=&cql=&default_attr=word&fc_lemword_window_type=both&fc_lemword_wsize=5&fc_lemword=&fc_lemword_type=all&fc_pos_window_type=both&fc_pos_wsize=5&fc_pos_type=all&fsca_doc.t2ld=&fsca_doc.urldomain= Examples of the use of "regjering" ("government") in context] – Words or phrases in a natural Norwegian Bokmål context. The base function for language study and the source of good dictionary examples. 100 * [http://corpora.fi.muni.cz/habit/run.cgi/wsketch?corpname=notenten15_4_bokmal&reload=&lemma=regjering&lpos=&minfreq=auto&minscore=0.0&maxitems=25&sort_ws_columns=s&show_lemma_coverage=0&clustercolls=0&minsim=0.15&structured=0&structured=1&min_unary_score=5.0&min_mwlink_freq=100&nr_ws_cols=4&bim_corpname=&bim_lemma= Grammatical and collocational behaviour of "regjering" ("government")] – An essential feature for creating dictionaries in Norwegian Bokmål. 101 * [http://corpora.fi.muni.cz/habit/run.cgi/thes?corpname=notenten15_4_bokmal&reload=&lemma=regjering&lpos=-n&maxthesitems=60&minthesscore=0.0&includeheadword=0&clusteritems=0&minsim=0.15 Words used in the same context as "regjering" ("government")] – A useful list for creating a Norwegian Bokmål thesaurus. 102 * [http://corpora.fi.muni.cz/habit/run.cgi/freqml?q=aword%2C%5Btag%3D%22subst%22%5D&corpname=notenten15_4_bokmal&viewmode=sen&attrs=word&ctxattrs=word&structs=p%2Cg&refs=%3Ddoc.urldomain&pagesize=40&gdexconf=&attr_tooltip=nott&ml=1&flimit=0&freqlevel=1&ml1attr=lemma&ml1ctx=0~0%3E0&ml2attr=word&ml2ctx=0~0%3E0&ml3attr=word&ml3ctx=0~0%3E0&ml4attr=word&ml4ctx=0~0%3E0 List of Norwegian Bokmål nouns by frequency] – Useful for dictionary based applications, e.g. predictive text writing. 98 103 99 104 === Norwegian Nynorsk Web Corpus === … … 103 108 * Tagged by [https://www.sketchengine.co.uk/norwegian-oslo-bergen-part-of-speech-tagset/ Oslo-Bergen Tagger]. 104 109 105 TODO Examples of concordance, sketches, wordlist 110 ==== Examples of HaBiT System features for the Norwegian Nynorsk Web Corpus ==== 111 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=notenten15_4_nynorsk Corpus information] 112 * [http://corpora.fi.muni.cz/habit/run.cgi/first?corpname=notenten15_4_nynorsk&reload=&iquery=regjering&queryselector=iqueryrow&phrase=&word=&char=&cql=&default_attr=word&fc_lemword_window_type=both&fc_lemword_wsize=5&fc_lemword=&fc_lemword_type=all&fsca_doc.t2ld=&fsca_doc.urldomain= Examples of the use of "regjering" ("government") in context] – Words or phrases in a natural Norwegian Nynorsk context. The base function for language study and the source of good dictionary examples. 106 113 107 114 === Czech Web Corpus === … … 111 118 * Tagged by [https://www.sketchengine.co.uk/tagset-reference-for-czech Czech POS tagger Majka]. 112 119 113 TODO Examples of concordance, sketches, wordlist 120 ==== Examples of HaBiT System features for the Czech Web Corpus ==== 121 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=cztenten16_0 Corpus information] 122 * [http://corpora.fi.muni.cz/habit/run.cgi/reduce?q=aword%2C%5Blc%3D%22vl%C3%A1da%22%7Clemma_lc%3D%22vl%C3%A1da%22%5D&q=Fdoc&corpname=cztenten16_0&viewmode=sen&attrs=word&ctxattrs=word&structs=p%2Cg&refs=doc&pagesize=40&gdexconf=&iquery=vl%C3%A1da&attr_tooltip=nott&rlines=250 Examples of the use of "vláda" ("government") in context] – Words or phrases in a natural Czech context. The base function for language study and the source of good dictionary examples. 123 * [http://corpora.fi.muni.cz/habit/run.cgi/wsketch?corpname=cztenten16_0&reload=&lemma=vl%C3%A1da&minfreq=auto&minscore=0.0&maxitems=25&sort_ws_columns=s&show_lemma_coverage=0&clustercolls=0&minsim=0.15&structured=0&structured=1&min_unary_score=5.0&min_mwlink_freq=100&nr_ws_cols=5&bim_corpname=&bim_lemma= Grammatical and collocational behaviour of "vláda" ("government")] – An essential feature for creating dictionaries in Czech. 124 * [http://corpora.fi.muni.cz/habit/run.cgi/thes?corpname=cztenten16_0&reload=&lemma=vl%C3%A1da&maxthesitems=60&minthesscore=0.0&includeheadword=0&clusteritems=0&minsim=0.15 Words used in the same context as "vláda" ("government")] – A useful list for creating a Czech thesaurus. 125 * [http://corpora.fi.muni.cz/habit/run.cgi/freqml?q=aword%2C%5Btag%3D%22k1.*%22%5D&corpname=cztenten16_0&viewmode=sen&attrs=word&ctxattrs=word&structs=p%2Cg&refs=doc&pagesize=40&gdexconf=&attr_tooltip=nott&ml=1&flimit=0&freqlevel=1&ml1attr=lemma&ml1ctx=0~0%3E0&ml2attr=word&ml2ctx=0~0%3E0&ml3attr=word&ml3ctx=0~0%3E0&ml4attr=word&ml4ctx=0~0%3E0 List of Czech nouns by frequency] – Useful for dictionary based applications, e.g. predictive text writing. 114 126 115 127 === Czech-!Norwegian/Norwegian-Czech Parallel Corpus === … … 119 131 * [ParallelCzechNorwegian Corpus deliverable/technical report] 120 132 121 TODO Examples of concordance, sketches, wordlist 133 ==== Examples of HaBiT System features for the Czech-!Norwegian/Norwegian-Czech Parallel Corpus ==== 134 * [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=czech_norwegian_opus__czech Czech-Norwegian Parallel Corpus information], [http://corpora.fi.muni.cz/habit/run.cgi/corp_info?corpname=czech_norwegian_opus__norwegian Norwegian-Czech Parallel Corpus information] 135 * [http://corpora.fi.muni.cz/habit/run.cgi/view?q=aword%2C%5Blc%3D%22vl%C3%A1da%22%7Clemma_lc%3D%22vl%C3%A1da%22%5D+within+czech_norwegian_opus__norwegian%3A%5Blc%3D%22regjering%22%5D;corpname=czech_norwegian_opus__czech;viewmode=align;attrs=word&ctxattrs=word&structs=p%2Cg&refs=align&pagesize=40&align=czech_norwegian_opus__norwegian&gdexconf=&iquery=vl%C3%A1da&maincorp=czech_norwegian_opus__czech&attr_tooltip=nott;fromp=1 Examples of the use of Czech "vláda" ("government") with aligned segments of Norwegian "regjering" ("government") in context] – Words or phrases in a natural Czech and Norwegian context. The base function for language study and translation services.