Tanja Samardžić
23 papers · 2012–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🏃 Academic Marathon (13) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🐝 Cross-Pollinator (9)
🐣
Hot Topic Early Bird
🌍
Conference Polyglot
(6)
🏃
Academic Marathon
(13)
🌱
Topic Pioneer
🔬
Deep Specialist
(10)
🔥
Unstoppable
(10)
💎
Century Club
(22)
⚡
Prolific Year
(5)
📈
Trend Setter
🗃️
Keyword Collector
(91)
Conferences
EMNLP (6)
COLING (4)
EACL (4)
NAACL (4)
ACL (3)
CONLL (2)
Top co-authors
Research topics
Keywords
dialect identification
(6)
cross-lingual transfer
(5)
subword tokenization
(4)
language identification
(3)
dialect classification
(3)
automatic speech recognition
(3)
transfer learning
(2)
typological distance
(2)
universal dependencies
(2)
language modeling
(2)
multilingual nlp
(2)
dependency parsing
(2)
swiss german
(2)
syntactic parsing
(2)
low-resource language
(2)
text classification
(2)
language distance
(2)
multilingual model
(2)
multi-label classification
(1)
question answering
(1)
Papers
Regional Variation in the Performance of ASR Models on Croatian and Serbian
EACL 2026
Tokenization and Representation Biases in Multilingual Models on Dialectal NLP Tasks
EMNLP 2025
DistaLs: a Comprehensive Collection of Language Distance Measures
EMNLP 2025
Functional Lexicon in Subword Tokenization
NAACL 2025
NLP_DI at NADI 2024 shared task: Multi-label Arabic Dialect Classifications with an Unsupervised Cross-Encoder
ACL 2024
System Description of the NordicsAlps Submission to the AmericasNLP 2024 Machine Translation Shared Task
NAACL 2024
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets
NAACL 2024
Optimizing the Size of Subword Vocabularies in Dialect Classification
EACL 2023
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions
ACL 2023
On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers
EMNLP 2022
NLP DI at NADI Shared Task Subtask-1: Sub-word Level Convolutional Neural Models and Pre-trained Binary Classifiers for Dialect Identification
EMNLP 2022
On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers
CONLL 2022
Subword Evenness (SuE) as a Predictor of Cross-lingual Transfer to Low-resource Languages
EMNLP 2022
Early Guessing for Dialect Identification
EMNLP 2022
Interpretability for Morphological Inflection: from Character-level Predictions to Subword-level Rules
EACL 2021
From characters to words: the turning point of BPE merges
EACL 2021
ASR for Non-standardised Languages with Dialectal Variation: the case of Swiss German
COLING 2020
A Report on the Third VarDial Evaluation Campaign
NAACL 2019
Encoder-Decoder Methods for Text Normalization
COLING 2018
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign
COLING 2018
Neural Sequence-to-sequence Learning of Internal Word Structure
CONLL 2017
TweetGeo - A Tool for Collecting, Processing and Analysing Geo-encoded Linguistic Data
COLING 2016
Lemmatisation as a Tagging Task
ACL 2012