Verb Valency Enhanced Croatian Lexicon

In this paper, the authors show how verb valency data, added to the Croatian dictionary in NooJ, enhances recognition of VP as well as NP and PP parts of a sentence. At the Department of Information Sciences, two parallel PhD theses were being developed. One is the construction of a chunker for Croatian...

Permalink: http://skupnikatalog.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:316908/Details
Parent publication: Applications of Finite-State Language Processing - Selected Papers from the 2008 International NooJ Conference
Cambridge Scholars Publishing, 2010
Main authors: Kocijan, Kristina (-), Dovedan Han, Zdravko (Author), Mikelić Preradović, Nives
Material type: Article
Language: English
LEADER 02716naa a2200265uu 4500
008 131111s2010 xx 1 eng|d
035 |a (CROSBI)489877 
040 |a HR-ZaFF  |b hrv  |c HR-ZaFF  |e ppiak 
100 1 |9 446  |a Kocijan, Kristina 
245 1 0 |a Verb Valency Enhanced Croatian Lexicon /  |c Vučković, Kristina ; Mikelić Preradović, Nives ; Zdravko Dovedan. 
246 3 |i Naslov na engleskom:  |a Verb Valency Enhanced Croatian Lexicon 
300 |a 52-60  |f str. 
520 |a In this paper, the authors show how verb valency data, added to the Croatian dictionary in NooJ, enhances recognition of VP as well as NP and PP parts of a sentence. At the Department of Information Sciences, two parallel PhD theses were being developed: one on the construction of a chunker for Croatian using NooJ, the other on the construction of a Croatian verb valency lexicon (CRVLLEX). Here we combine the two projects by adding the data from CRVLLEX to the existing NooJ dictionary, hoping to obtain improved Croatian chunker results. Our dictionary has over 36 000 entries, of which 1 884 are verbs. Each verb is marked only by its category and its FLX property. Additional data is being added from CRVLLEX. The theoretical motivation behind the construction of CRVLLEX is the Prague Dependency Treebank (PDT), as well as the established Czech verb valency dictionaries VALLEX and VerbaLex. So far, CRVLLEX has 1 739 verbs with 5 118 valency frames (approx. 3 frames per verb) and 173 syntactic-semantic classes. Each word entry in CRVLLEX contains a headword lemma, a reflexivity example (depending on whether the verb is reflexive or not), and a frame entry, which describes the valency frame for each verb. Every verb also has the following attributes: aspect, frequency and form, which we hope will be of great importance in disambiguating the parts of a sentence surrounding the verb. 
536 |a Projekt MZOS  |f 130-1300646-1776 
536 |a Projekt MZOS  |f 130-1301679-1380 
546 |a ENG 
690 |a 5.04 
690 |a 6.03 
693 |a verb valency, lexicon, grammars, NooJ, noun phrase, prepositional phrase, verb phrase  |l hrv  |2 crosbi 
693 |a verb valency, lexicon, grammars, NooJ, noun phrase, prepositional phrase, verb phrase  |l eng  |2 crosbi 
773 0 |a Nooj 2008 (08-10.06.2008. ; Budimpešta, Mađarska)  |t Applications of Finite-State Language Processing - Selected Papers from the 2008 International NooJ Conference  |d Cambridge Scholars Publishing, 2010  |n Varadi, Tamás ; Kuti, Judit ; Silberztein, Max  |z 1-4438-2573-5  |g str. 52-60 
700 1 |9 415  |a Dovedan Han, Zdravko  |4 aut 
700 1 |9 449  |a Mikelić Preradović, Nives  |4 aut 
942 |c RZB  |u 2  |v Recenzija  |z Znanstveni - Predavanje - CijeliRad  |t 1.08 
999 |c 316908  |d 316906
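
The abstract in field 520 describes what each CRVLLEX entry holds (headword lemma, reflexivity, frame entries, and the attributes aspect, frequency and form) and quotes roughly three frames per verb. A minimal Python sketch of such an entry follows. The class names, field types, the sample verb "dati" and its slot labels are all assumptions made for illustration; they are not the actual CRVLLEX or NooJ format, which this record does not specify. Only the counts 1 739 and 5 118 come from the abstract.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ValencyFrame:
    # Hypothetical: a frame is modelled here as a plain list of slot labels.
    slots: List[str]


@dataclass
class VerbEntry:
    # Field names mirror the attributes listed in the abstract;
    # the types are assumptions.
    lemma: str
    reflexive: bool
    aspect: str
    frequency: int
    form: str
    frames: List[ValencyFrame] = field(default_factory=list)


def average_frames_per_verb(total_frames: int, total_verbs: int) -> float:
    """Return the mean number of valency frames per verb."""
    return total_frames / total_verbs


# Invented sample entry, not taken from CRVLLEX.
entry = VerbEntry(
    lemma="dati",
    reflexive=False,
    aspect="perfective",
    frequency=0,  # placeholder value; real frequencies are not given here
    form="infinitive",
    frames=[ValencyFrame(slots=["ACT", "PAT", "ADDR"])],
)

# Counts quoted in the abstract: 5 118 frames over 1 739 verbs.
avg = average_frames_per_verb(5118, 1739)
print(f"{entry.lemma}: {len(entry.frames)} frame(s); lexicon average {avg:.2f}")
```

The computed average is about 2.94, consistent with the "approx. 3 frames per verb" stated in the abstract.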