close
Warning:
AdminModule failed with TracError: Unable to instantiate component <class 'trac.admin.web_ui.BasicsAdminPanel'> (super(type, obj): obj must be an instance or subtype of type)
- Timestamp:
-
Jan 17, 2017, 11:13:39 AM (9 years ago)
- Author:
-
hales
- Comment:
-
--
Legend:
- Unmodified
- Added
- Removed
- Modified
-
|
v4
|
v5
|
|
| 2 | 2 | |
| 3 | 3 | == Building the Somali Web corpus == |
| 4 | | We have used the following steps to create a big Somali Web corpus: First, adopting the Corpus factory method [1] bigrams of Somali words from the Crúbadán database [2] were used to query Bing search engine for documents in Somali. URLs of 18,108 documents found by the search engine were used as starting points for web crawler SpiderLing [3]. |
| | 4 | We have used the following steps to create a big Somali Web corpus: First, adopting the Corpus factory method [1] bigrams of Somali words from the Crúbadán database [2] were used to query Bing search engine for documents in Somali. URLs of 18,108 documents found by the search engine were used as starting points for web crawler !SpiderLing [3]. |
| 5 | 5 | |
| 6 | 6 | The following language models were created: |