conftrace_

Jeffrey Li

10 papers · 2020–2026 · 6 top CS/AI conferences

Achievements

🐝 Cross-Pollinator (15) 🏃 Academic Marathon (5) 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🌉 Interdisciplinary Bridge

🗺️ Taxonomy Completionist (15) 👥 Mega-Team (60) 💎 Century Club (10)

Conferences

ICLR (3) NIPS (3) ACL (1) CORL (1) EACL (1) EMNLP (1)

Top co-authors

Ludwig Schmidt (5) Vaishaal Shankar (3) Fartash Faghri (3) Hadi Pouransari (3) Thao Nguyen (2) Gabriel Ilharco (2) Reinhard Heckel (2) Jenia Jitsev (2) Yair Carmon (2) Oncel Tuzel (2)

Keywords

large language model (2) catastrophic forgetting (1) semi-supervised learning (1) language model alignment (1) weak supervision (1) synthetic data generation (1) instruction tuning (1) model alignment (1) language model (1) synthetic data (1) noisy label (1) continual pretraining (1) data curation (1) data filtering (1) knowledge retention (1) temporal adaptation (1) text extraction (1) data programming (1) label model (1) pretraining data (1)

Papers

Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pre-training · EACL 2026
TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining · ACL 2025
Language models scale reliably with over-training and on downstream tasks · ICLR 2025
SDS – See it, Do it, Sorted: Quadruped Skill Synthesis from Single Video Demonstration · CORL 2025
Better Alignment with Instruction Back-and-Forth Translation · EMNLP 2024
DataComp-LM: In search of the next generation of training sets for language models · NIPS 2024
Stronger Than You Think: Benchmarking Weak Supervision on Realistic Tasks · NIPS 2024
Characterizing the Impacts of Semi-supervised Learning for Weak Supervision · NIPS 2023
A Learning Theoretic Perspective on Local Explainability · ICLR 2021
Differentially Private Meta-Learning · ICLR 2020