Malte Ostendorff
14 papers · 2020–2026 · 6 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Taxonomy Completionist (27)
Academic Marathon (5)
Conference Polyglot (6)
Keyword Pioneer
Mega-Team (82)
Dynamic Duo (10)
Century Club (13)
Prolific Year (5)
Keyword Collector (58)
The Questioner (2)
Conferences
ACL (5)
EMNLP (4)
COLING (2)
ICLR (1)
IJCNLP (1)
NAACL (1)
Keywords
text classification (4)
large language model (4)
web data (2)
multi-class classification (2)
german news (2)
semi-supervised learning (2)
low-resource language (2)
political bias detection (2)
embedding learning (1)
multimodal learning (1)
benchmark evaluation (1)
text summarization (1)
language identification (1)
fine-grained classification (1)
named entity recognition (1)
extractive summarization (1)
efficient training (1)
instruction tuning (1)
nearest neighbor (1)
document representation (1)
Papers
CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data
ACL 2026
MMTEB: Massive Multilingual Text Embedding Benchmark
ICLR 2025
Multi-LMentry: Can Multilingual LLMs Solve Elementary Tasks Across Languages?
EMNLP 2025
Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data
ACL 2025
Community OSCAR: A Community Effort for Multilingual Web Data
EMNLP 2024
Symmetric Dot-Product Attention for Efficient Training of BERT Language Models
ACL 2024
A CURATEd CATalog: Rethinking the Extraction of Pretraining Corpora for Mid-Resourced Languages
COLING 2024
Occiglot at WMT24: European Open-source Large Language Models Evaluated on Translation
EMNLP 2024
Tokenizer Choice For LLM Training: Negligible or Crucial?
NAACL 2024
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings
EMNLP 2022
HiStruct+: Improving Extractive Text Summarization with Hierarchical Structure Information
ACL 2022
Fine-grained Classification of Political Bias in German News: A Data Set and Initial Experiments
IJCNLP 2021
Fine-grained Classification of Political Bias in German News: A Data Set and Initial Experiments
ACL 2021
Aspect-based Document Similarity for Research Papers
COLING 2020