Tommi Jauhiainen
19 papers · 2018–2024 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
๐ Interdisciplinary Bridge ๐ Cross-Pollinator (12) ๐ Conference Polyglot (5) ๐ Academic Marathon (6) ๐ Renaissance Researcher (6)
๐
Renaissance Researcher
(6)
๐บ๏ธ
Taxonomy Completionist
(18)
๐งญ
Keyword Pioneer
๐งฌ
Topic Evolution
๐
Keyword Champion
(14)
๐ค
Dynamic Duo
(14)
๐ฌ
Deep Specialist
(10)
๐๏ธ
Keyword Collector
(54)
๐
Century Club
(19)
๐ฅ
Unstoppable
(7)
Conferences
COLING (10)
EACL (4)
NAACL (3)
EMNLP (1)
INTERSPEECH (1)
Top co-authors
Research topics
Keywords
language identification
(14)
dialect identification
(13)
text classification
(5)
language variety
(5)
naive baye
(4)
character n-gram
(3)
multilingual nlp
(3)
shared task
(3)
naive bayes classifier
(2)
adaptive language model
(2)
language model adaptation
(1)
speaker recognition
(1)
intent detection
(1)
parameter optimization
(1)
corpus linguistics
(1)
speech recognition
(1)
classifier comparison
(1)
arabic dialect
(1)
adaptive system
(1)
end-to-end training
(1)
Papers
Language Variety Identification with True Labels
COLING 2024
Improving Language Coverage on HeLI-OTS
COLING 2024
Investigating Multilinguality in the Plenary Sessions of the Parliament of Finland with Automatic Language Identification
COLING 2024
Findings of the VarDial Evaluation Campaign 2023
EACL 2023
Italian Language and Dialect Identification and Regional French Variety Detection using Adaptive Naive Bayes
COLING 2022
Optimizing Naive Bayes for Arabic Dialect Identification
EMNLP 2022
Findings of the VarDial Evaluation Campaign 2021
EACL 2021
Naive Bayes-based Experiments in Romanian Dialect Identification
EACL 2021
Comparing Approaches to Dravidian Language Identification
EACL 2021
Experiments in Language Variety Geolocation and Dialect Identification
COLING 2020
Releasing a Toolkit and Comparing the Performance of Language Embeddings Across Various Spoken Language Identification Datasets
INTERSPEECH 2020
Uralic Language Identification (ULI) 2020 shared task dataset and the Wanca 2017 corpora
COLING 2020
A Report on the VarDial Evaluation Campaign 2020
COLING 2020
Discriminating between Mandarin Chinese and Swiss-German varieties using adaptive language models
NAACL 2019
A Report on the Third VarDial Evaluation Campaign
NAACL 2019
Language and Dialect Identification of Cuneiform Texts
NAACL 2019
Iterative Language Model Adaptation for Indo-Aryan Language Identification
COLING 2018
HeLI-based Experiments in Swiss German Dialect Identification
COLING 2018
HeLI-based Experiments in Discriminating Between Dutch and Flemish Subtitles
COLING 2018