Yihao Feng
25 papers · 2019–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (13)
🐣
Hot Topic Early Bird
🌍
Conference Polyglot
(9)
🏃
Academic Marathon
(6)
🤝
Dynamic Duo
(12)
🏆
Grand Slam
🧬
Topic Evolution
📈
Trend Setter
⚡
Prolific Year
(6)
💎
Century Club
(25)
🗃️
Keyword Collector
(104)
🔥
Unstoppable
(7)
Conferences
NIPS (8)
ICLR (5)
NAACL (3)
AAAI (2)
ACL (2)
ICML (2)
CVPR (1)
ICCV (1)
IJCNLP (1)
Top co-authors
Keywords
unsupervised learning
(3)
preference optimization
(2)
out-of-domain detection
(2)
latent representation
(2)
pre-trained transformer
(2)
policy optimization
(2)
model-based reinforcement learning
(2)
offline reinforcement learning
(2)
diffusion model
(2)
domain generalization
(2)
controllable generation
(2)
agent system
(2)
off-policy evaluation
(2)
large language model
(2)
language model
(2)
policy learning
(2)
text generation
(2)
preference learning
(1)
gradient-based optimization
(1)
text classification
(1)
Papers
Text2Data: Low-Resource Data Generation with Textual Control
AAAI 2025
xLAM: A Family of Large Action Models to Empower AI Agent Systems
NAACL 2025
Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
NAACL 2025
Longhorn: State Space Models are Amortized Online Learners
ICLR 2025
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
ICLR 2025
Structured Policy Optimization: Enhance Large Vision-Language Model via Self-referenced Dialogue
ICCV 2025
HIVE: Harnessing Human Feedback for Instructional Visual Editing
CVPR 2024
APIGen: Automated PIpeline for Generating Verifiable and Diverse Function-Calling Datasets
NIPS 2024
FOFO: A Benchmark to Evaluate LLMs’ Format-Following Capability
ACL 2024
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
ICLR 2024
FAMO: Fast Adaptive Multitask Optimization
NIPS 2023
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
NIPS 2023
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems
ICLR 2023
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
NIPS 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
NIPS 2023
Metric Residual Network for Sample Efficient Goal-Conditioned Reinforcement Learning
AAAI 2023
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning
ICML 2022
A Unified Framework for Alternating Offline Model Training and Policy Learning
NIPS 2022
Unsupervised Out-of-Domain Detection via Pre-trained Transformers
ACL 2021
Unsupervised Out-of-Domain Detection via Pre-trained Transformers
IJCNLP 2021
Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System
NAACL 2021
Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds
ICLR 2021
Accountable Off-Policy Evaluation With Kernel Bellman Statistics
ICML 2020
Off-Policy Interval Estimation with Lipschitz Value Iteration
NIPS 2020
A Kernel Loss for Solving the Bellman Equation
NIPS 2019