Benjamin Minixhofer
9 papers · 2021–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
π Conference Polyglot (5) π£ Hot Topic Early Bird π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (29) π Interdisciplinary Bridge
π
Cross-Pollinator
(10)
π
Keyword Champion
π₯
Unstoppable
(5)
β
The Questioner
Conferences
ACL (3)
EMNLP (3)
IJCNLP (1)
NAACL (1)
NIPS (1)
Top co-authors
Keywords
language model
(3)
sentence segmentation
(2)
transfer learning
(2)
self-supervised learning
(2)
subword tokenization
(2)
subword embedding
(2)
token embedding
(2)
text preprocessing
(2)
language model efficiency
(2)
multilingual nlp
(2)
efficient computing
(1)
parameter-efficient fine-tuning
(1)
pre-trained language model
(1)
multilingual fairness
(1)
inference speed
(1)
language model fine-tuning
(1)
humanitarian response
(1)
multilingual model
(1)
cross-lingual model
(1)
sentence retrieval
(1)
Papers
Retrofitting Large Language Models with Dynamic Tokenization
ACL 2025
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation
EMNLP 2024
Zero-Shot Tokenizer Transfer
NIPS 2024
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models
EMNLP 2023
Whereβs the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation
ACL 2023
WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models
NAACL 2022
HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crises Response
EMNLP 2022
Enhancing Transformers with Gradient Boosted Decision Trees for NLI Fine-Tuning
IJCNLP 2021
Enhancing Transformers with Gradient Boosted Decision Trees for NLI Fine-Tuning
ACL 2021