Taja Kuzman
8 papers · 2023–2024 · 3 conferences · across top CS/AI conferences
Achievements
🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (15) 🌍 Conference Polyglot (3) 🗺️ Taxonomy Completionist (17)
⚡ Prolific Year (6) ❓ The Questioner
Conferences
COLING (4)
EACL (2)
NAACL (2)
Keywords
genre classification (2)
zero-shot learning (2)
text classification (2)
in-context learning (2)
linguistic annotation (1)
cross-lingual transfer (1)
transfer learning (1)
multilingual evaluation (1)
dialect translation (1)
language identification (1)
language model training (1)
web corpus (1)
corpus linguistics (1)
svm classification (1)
support vector machine (1)
low-resource language (1)
language model (1)
downstream task (1)
information retrieval (1)
machine translation (1)
Papers
CLASSLA-web: Comparable Web Corpora of South Slavic Languages Enriched with Linguistic and Genre Annotation
COLING 2024
Do Language Models Care about Text Quality? Evaluating Web-Crawled Corpora across 11 Languages
COLING 2024
ParlaMint Ngram viewer: Multilingual Comparative Diachronic Search Across 26 Parliaments
COLING 2024
Language Models on a Diet: Cost-Efficient Development of Encoders for Closely-Related Languages via Additional Pretraining
COLING 2024
DIALECT-COPA: Extending the Standard Translations of the COPA Causal Commonsense Reasoning Dataset to South Slavic Dialects
NAACL 2024
JSI and WüNLP at the DIALECT-COPA Shared Task: In-Context Learning From Just a Few Dialectal Examples Gets You Quite Far
NAACL 2024
Get to Know Your Parallel Data: Performing English Variety and Genre Classification over MaCoCu Corpora
EACL 2023
BENCHić-lang: A Benchmark for Discriminating between Bosnian, Croatian, Montenegrin and Serbian
EACL 2023