Slovene-Croatian Treebank Transfer Using Bilingual Lexicon Improves Croatian Dependency Parsing

A method is presented for transferring dependency treebanks between similar languages by using a bilingual lexicon, aiming to improve dependency parsing accuracy on the target language. It is illustrated by transferring the Slovene Dependency Treebank to Croatian by using a GIZA++ bilingual lexicon...

Permalink: http://skupnikatalog.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:318021/Details
Parent publication: Proceedings of the 15th International Multiconference Information Society (IS 2012), Volume C, Proceedings of the 8th Language Technologies Conference
Ljubljana : Institut Jožef Stefan, 2012
Main authors: Agić, Željko; Berović, Daša; Merkler, Danijela
Material type: Article
Language: English
Online access: http://bib.irb.hr/datoteka/597440.zadmdb_islt_2012_final.pdf
http://nl.ijs.si/isjt12/
LEADER 02287naa a2200289uu 4500
008 131111s2012 xx 1 eng|d
035 |a (CROSBI)597440 
040 |a HR-ZaFF  |b hrv  |c HR-ZaFF  |e ppiak 
100 1 |9 495  |a Agić, Željko 
245 1 0 |a Slovene-Croatian Treebank Transfer Using Bilingual Lexicon Improves Croatian Dependency Parsing /  |c Agić, Željko ; Merkler, Danijela ; Berović, Daša. 
246 3 |i Naslov na engleskom:  |a Slovene-Croatian Treebank Transfer Using Bilingual Lexicon Improves Croatian Dependency Parsing 
300 |a 5-9  |f str. 
520 |a A method is presented for transferring dependency treebanks between similar languages by using a bilingual lexicon, aiming to improve dependency parsing accuracy on the target language. It is illustrated by transferring the Slovene Dependency Treebank to Croatian by using a GIZA++ bilingual lexicon constructed from the Croatian-Slovene 1984 parallel corpus from the Multext East project. The transferred treebank is merged with the Croatian Dependency Treebank and the merged treebank is used to train and test two graph-based dependency parsers. MSTParser and CroDep accuracy on parsing the 1984 fictional text shows a statistically significant increase and a similar decrease on parsing the Croatian Dependency Treebank newspaper text. 
536 |a Projekt MZOS  |f 130-1300646-0645 
536 |a Projekt MZOS  |f 130-1300646-1776 
546 |a ENG 
690 |a 5.04 
690 |a 6.03 
693 |a treebank transfer, bilingual lexicon, dependency parsing  |l hrv  |2 crosbi 
693 |a treebank transfer, bilingual lexicon, dependency parsing  |l eng  |2 crosbi 
773 0 |a Eighth Language Technologies Conference (8-9.10.2012. ; Ljubljana, Slovenija)  |t Proceedings of the 15th International Multiconference Information Society (IS 2012), Volume C, Proceedings of the 8th Language Technologies Conference  |d Ljubljana : Institut Jožef Stefan, 2012  |n Erjavec, Tomaž ; Žganec Gros, Jerneja  |x 1581-9973  |z 978-961-264-048-4  |g str. 5-9 
700 1 |9 839  |a Berović, Daša  |4 aut 
700 1 |9 868  |a Merkler, Danijela  |4 aut 
856 |u http://bib.irb.hr/datoteka/597440.zadmdb_islt_2012_final.pdf 
856 |u http://nl.ijs.si/isjt12/ 
942 |c RZB  |u 2  |v Recenzija  |z Znanstveni - Predavanje - CijeliRad  |t 1.08 
999 |c 318021  |d 318019