Verb Valency Enhanced Croatian Lexicon
In this paper, the authors show how verb valency data, added to the Croatian dictionary in NooJ, enhances recognition of VP as well as NP and PP parts of a sentence. At the Department of Information Sciences, two parallel PhD theses were being developed. One is the construction of a chunker for Croatian...
Permalink: | http://skupnikatalog.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:316908/Details |
---|---|
Parent publication: | Applications of Finite-State Language Processing - Selected Papers from the 2008 International NooJ Conference, Cambridge Scholars Publishing, 2010 |
Main authors: | Kocijan, Kristina (-), Dovedan Han, Zdravko (Author), Mikelić Preradović, Nives |
Material type: | Article |
Language: | eng |
LEADER | 02716naa a2200265uu 4500 | ||
---|---|---|---|
008 | 131111s2010 xx 1 eng|d | ||
035 | |a (CROSBI)489877 | ||
040 | |a HR-ZaFF |b hrv |c HR-ZaFF |e ppiak | ||
100 | 1 | |9 446 |a Kocijan, Kristina | |
245 | 1 | 0 | |a Verb Valency Enhanced Croatian Lexicon / |c Vučković, Kristina ; Mikelić Preradović, Nives ; Zdravko Dovedan. |
246 | 3 | |i Title in English: |a Verb Valency Enhanced Croatian Lexicon | |
300 | |a 52-60 |f pp. | ||
520 | |a In this paper, the authors show how verb valency data, added to the Croatian dictionary in NooJ, enhances recognition of VP as well as NP and PP parts of a sentence. At the Department of Information Sciences, two parallel PhD theses were being developed: one on the construction of a chunker for Croatian using NooJ, and the other on the construction of a Croatian verb valency lexicon (CRVLLEX). Here we combine the two projects by adding the data from CRVLLEX to the existing NooJ dictionary, hoping to obtain improved Croatian chunker results. Our dictionary has over 36 000 entries, of which 1 884 are verbs. Each verb is marked only by its category and the FLX. Additional data is being added from CRVLLEX. The theoretical motivation behind the construction of CRVLLEX is the Prague Dependency Treebank (PDT), as well as the established Czech verb valency dictionaries VALLEX and VerbaLex. So far, CRVLLEX has 1 739 verbs with 5 118 valency frames (approx. 3 frames per verb) and 173 syntactic-semantic classes. Each entry in CRVLLEX contains a headword lemma, a reflexivity example (depending on whether the verb is reflexive or not) and a frame entry, which is the part that describes the valency frame for each verb. Every verb also has the following attributes: aspect, frequency and form, which we hope will be of great importance in disambiguating the parts of a sentence surrounding the verb. | ||
536 | |a MZOS project |f 130-1300646-1776 | ||
536 | |a MZOS project |f 130-1301679-1380 | ||
546 | |a ENG | ||
690 | |a 5.04 | ||
690 | |a 6.03 | ||
693 | |a verb valency, lexicon, grammars, NooJ, noun phrase, prepositional phrase, verb phrase |l hrv |2 crosbi | ||
693 | |a verb valency, lexicon, grammars, NooJ, noun phrase, prepositional phrase, verb phrase |l eng |2 crosbi | ||
773 | 0 | |a NooJ 2008 (08-10.06.2008. ; Budapest, Hungary) |t Applications of Finite-State Language Processing - Selected Papers from the 2008 International NooJ Conference |d Cambridge Scholars Publishing, 2010 |n Varadi, Tamás ; Kuti, Judit ; Silberztein, Max |z 1-4438-2573-5 |g pp. 52-60 | |
700 | 1 | |9 415 |a Dovedan Han, Zdravko |4 aut | |
700 | 1 | |9 449 |a Mikelić Preradović, Nives |4 aut | |
942 | |c RZB |u 2 |v Peer review |z Scientific - Lecture - Full paper |t 1.08 | ||
999 | |c 316908 |d 316906 |