Jiaxiang Li
5 papers · 2024–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
🌍
Conference Polyglot
(4)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🐝
Cross-Pollinator
(14)
Conferences
NIPS (2)
ICLR (1)
ICML (1)
JMLR (1)
Top co-authors
Keywords
stochastic optimization
(1)
robust optimization
(1)
inverse reinforcement learning
(1)
parameter efficient
(1)
riemannian manifold
(1)
bilevel optimization
(1)
reward model
(1)
supervised fine-tuning
(1)
llm alignment
(1)
memory efficient
(1)
sparse plus low-rank decomposition
(1)
self-play fine-tune
(1)
Papers
Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment
ICLR 2025
Riemannian Bilevel Optimization
JMLR 2025
SLTrain: a sparse plus low rank approach for parameter and memory efficient pretraining
NIPS 2024
Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment
NIPS 2024
Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark
ICML 2024