Changes between Version 4 and Version 5 of NorwegianWordSketches


Ignore:
Timestamp:
May 30, 2017, 10:22:15 PM (8 years ago)
Author:
xkovar3
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • NorwegianWordSketches

    v4 v5  
     1= A new definition of Word Sketches for Norwegian =
     2
     3This report describes the new sketch grammar for Norwegian created within the scope of the project.
     4
    15= State of the art =
    26The original sketch grammar for Norwegian contained only universal relations connected to the part-of-speech tagging, e.g. "noun on the left", "verb on the right", etc. The syntactic and semantic relationships among the words were not clear from the word sketch, and although there were some important collocations recorded, there was a lot of noise as well.
    37
    48= New sketch grammar =
    5 The new Norwegian sketch grammar was developed along the template used in the English and Spanish grammars within the Sketch Engine (so far the two most developed grammars), in terms of names of the grammatical relations (and their mapping to the template names which will enable matching the Norwegian word sketches to word sketches in other languages, in the Bilingual word sketch application) as well as in terms of the coding within the scope of the sketch grammar formalism which should simplify reading the grammar and make future modifications easy. The definitions of the relations are based on the does-it-have-a-name tagset [1].   In total, there is 19 relations in the current version of the grammar, covering the most important grammatical phenomena, such as modifiers of all parts of speech, subjects, objects, predicates and coordinations.
     9The new Norwegian sketch grammar was developed along the template used in the English and Spanish grammars within the Sketch Engine (so far the two most developed grammars), in terms of names of the grammatical relations (and their mapping to the template names which will enable matching the Norwegian word sketches to word sketches in other languages, in the Bilingual word sketch application) as well as in terms of the coding within the scope of the sketch grammar formalism which should simplify reading the grammar and make future modifications easy. The definitions of the relations are based on the [https://www.sketchengine.co.uk/norwegian-oslo-bergen-part-of-speech-tagset/ Oslo-Bergen tagset for Norwegian].   In total, there is 19 relations in the current version of the grammar, covering the most important grammatical phenomena, such as modifiers of all parts of speech, subjects, objects, predicates and coordinations.
    610
    711[[Image(fig1.png)]]
     
    913Figure 1: Word sketch for noun katt (cat)
    1014
    11 As can be seen from Figures 1-3, modifiers are the words that somehow modify or specify meaning of given word. This relation is called ''modifier of “word”''. There are also a few complement relations called ''nouns/verbs/adverbs modified by “word”'', where the word is the one who modifies the meaning (of noun, adjective, adverb or verb). These can be seen in Figure 3.
     15As can be seen from Figures 1-3, modifiers are the words that modify or specify meaning of given word. This relation is called ''modifier of “word”''. There are also a few complement relations called ''nouns/verbs/adverbs modified by “word”'', where the word is the one who modifies the meaning (of noun, adjective, adverb or verb). These can be seen in Figure 3.
    1216
    1317[[Image(fig2.png)]]
     
    2125Figure 3: Word sketch for adjective svart (black)
    2226
    23 The last relation is ''“word” and/or'', which shows frequent coordinations among words. This relation is shown in Figures 1-3.   The rules for the particular grammatical relations were developed in collaboration with native speakers of Norwegian who recorded their linguistic knowledge, needed to build a sketch grammar, as answers to specific language questions in a computer-aided environment. These answers were then manually reviewed and converted into a formal sketch grammar.   The new version of the grammar was reviewed by native speakers of Norwegian and it was evaluated as clearly better and more usable than the previous universal grammar.
     27The last relation is ''“word” and/or'', which shows frequent coordinations among words. This relation is shown in Figures 1-3.
    2428
    25 [1] https://www.researchgate.net/publication/237553785_Manual_Annotation_of_Amharic_News_Items_with_Part-of-Speech_Tags_and_its_Challenges
     29The rules for the particular grammatical relations were developed in collaboration with native speakers of Norwegian who recorded their linguistic knowledge, needed to build a sketch grammar, as answers to specific language questions in a computer-aided environment. These answers were then manually reviewed and converted into a formal sketch grammar.   The new version of the grammar was reviewed by native speakers of Norwegian and it was evaluated as clearly better and more usable than the previous universal grammar.
     30