Jeffrey Li
10 papers · 2020–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (15) π Academic Marathon (5) π§ Keyword Pioneer π Conference Polyglot (5) π Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(15)
π₯
Mega-Team
(60)
π
Century Club
(10)
Conferences
ICLR (3)
NIPS (3)
ACL (1)
CORL (1)
EACL (1)
EMNLP (1)
Top co-authors
Keywords
large language model
(2)
catastrophic forgetting
(1)
semi-supervised learning
(1)
language model alignment
(1)
weak supervision
(1)
synthetic data generation
(1)
instruction tuning
(1)
model alignment
(1)
language model
(1)
synthetic datum
(1)
noisy label
(1)
continual pretraining
(1)
data curation
(1)
data filtering
(1)
knowledge retention
(1)
temporal adaptation
(1)
text extraction
(1)
data programming
(1)
label model
(1)
pretraining datum
(1)
Papers
Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pre-training
EACL 2026
TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining
ACL 2025
Language models scale reliably with over-training and on downstream tasks
ICLR 2025
SDS β See it, Do it, Sorted: Quadruped Skill Synthesis from Single Video Demonstration
CORL 2025
Better Alignment with Instruction Back-and-Forth Translation
EMNLP 2024
DataComp-LM: In search of the next generation of training sets for language models
NIPS 2024
Stronger Than You Think: Benchmarking Weak Supervision on Realistic Tasks
NIPS 2024
Characterizing the Impacts of Semi-supervised Learning for Weak Supervision
NIPS 2023
A Learning Theoretic Perspective on Local Explainability
ICLR 2021
Differentially Private Meta-Learning
ICLR 2020