Zébulon Goriely
10 papers · 2023–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+5 more ↓ Show less ↑
🗺️ Taxonomy Completionist (17) 🌍 Conference Polyglot (3) 🐝 Cross-Pollinator (8) 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge
🧭
Keyword Pioneer
🤝
Dynamic Duo
(10)
❓
The Questioner
⚡
Prolific Year
(5)
💎
Century Club
(10)
Conferences
CONLL (5)
EMNLP (3)
ACL (2)
Top co-authors
Keywords
language model
(5)
child-directed speech
(3)
self-supervised learning
(2)
grapheme-to-phoneme conversion
(2)
word segmentation
(2)
cross-lingual phonology
(2)
phoneme language model
(2)
phonological probing
(2)
token representation
(1)
cross-lingual analysis
(1)
phonological feature
(1)
subword tokenizer
(1)
word boundary
(1)
phonological representation
(1)
phoneme representation
(1)
sequence length
(1)
phonemic representation
(1)
distinctive feature
(1)
small-scale language model
(1)
syntactic smoothing
(1)
Papers
What is the Best Sequence Length for BabyLM?
EMNLP 2025
BabyLM’s First Words: Word Segmentation as a Phonological Probing Task
ACL 2025
IPA CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language Modeling
CONLL 2025
BabyLM’s First Words: Word Segmentation as a Phonological Probing Task
CONLL 2025
IPA CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language Modeling
ACL 2025
Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing
EMNLP 2024
From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes
CONLL 2024
Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies
CONLL 2024
CLIMB – Curriculum Learning for Infant-inspired Model Building
EMNLP 2023
CLIMB – Curriculum Learning for Infant-inspired Model Building
CONLL 2023