Xiaozhe Yao
6 papers · 2023–2026 · 4 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (3)
Interdisciplinary Bridge
Taxonomy Completionist (10)
Hot Topic Early Bird
Cross-Pollinator (15)
Mega-Team (47)
Conferences
ICML (2)
NIPS (2)
ACL (1)
COLING (1)
Keywords
large language model (3)
data quality (2)
multilingual language model (2)
code generation (1)
ai safety (1)
model training (1)
continual pre-training (1)
continual pretraining (1)
dataset benchmark (1)
data-centric ai (1)
multilingual model (1)
data curation (1)
open-source model (1)
evaluation platform (1)
responsible artificial intelligence (1)
data compliance (1)
catastrophic forgetting (1)
goldfish objective (1)
benchmark suite (1)
model evaluation (1)
Papers
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
ACL 2026
Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code
COLING 2025
Demystifying Cost-Efficiency in LLM Serving over Heterogeneous GPUs
ICML 2025
RedPajama: an Open Dataset for Training Large Language Models
NIPS 2024
HexGen: Generative Inference of Large Language Model over Heterogeneous Environment
ICML 2024
DataPerf: Benchmarks for Data-Centric AI Development
NIPS 2023