hrWaC and slWac: compiling web corpora for Croatian and Slovene
Web corpora have become an attractive source of linguistic content, yet are for many languages still not available. This paper introduces two new annotated web corpora: the Croatian hrWaC and the Slovene slWaC. Both were built using a modified standard “Web as Corpus” pipeline having in mind the lim...
Permalink: | http://skupnikatalog.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:312924/Similar |
---|---|
Matična publikacija: |
Text, Speech and Dialogue : 14th International Conference, TSD 2011, Pilsen, Czech Republic, September 1-5, 2011. : Proceedings Lecture Notes in Computer Science |
Glavni autori: | Ljubešić, Nikola, informatičar (-), Erjavec, Tomaž (Author) |
Vrsta građe: | Članak |
Jezik: | eng |
Online pristup: |
http://link.springer.com/book/10.1007/978-3-642-23538-2 |
APA stil citiranja
Ljubešić, N. (2011). hrWaC and slWac: compiling web corpora for Croatian and Slovene: HrWaC and slWac: compiling web corpora for Croatian and Slovene. Text, Speech and Dialogue : 14th International Conference, TSD 2011, Pilsen, Czech Republic, September 1-5, 2011. : Proceedings.
Chicago stil citiranjaLjubešić, Nikola. "hrWaC and slWac: compiling web corpora for Croatian and Slovene: HrWaC and slWac: compiling web corpora for Croatian and Slovene." 2011.
MLA stil citiranjaLjubešić, Nikola. "hrWaC and slWac: compiling web corpora for Croatian and Slovene: HrWaC and slWac: compiling web corpora for Croatian and Slovene." 2011.