Yihao Feng

25 papers · 2019–2025 · 9 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (13)

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (9) 🏃 Academic Marathon (6) 🤝 Dynamic Duo (12) 🏆 Grand Slam 🧬 Topic Evolution 📈 Trend Setter ⚡ Prolific Year (6) 💎 Century Club (25) 🗃️ Keyword Collector (104) 🔥 Unstoppable (7)

Conferences

NIPS (8) ICLR (5) NAACL (3) AAAI (2) ACL (2) ICML (2) CVPR (1) ICCV (1) IJCNLP (1)

Top co-authors

Caiming Xiong (12) Huan Wang (8) Qiang Liu (8) Silvio Savarese (7) Ran Xu (6) Shentao Yang (4) Weiran Yao (4) Shelby Heinecke (4) Bo Liu (4) Tian Lan (4)

Keywords

unsupervised learning (3) preference optimization (2) out-of-domain detection (2) latent representation (2) pre-trained transformer (2) policy optimization (2) model-based reinforcement learning (2) offline reinforcement learning (2) diffusion model (2) domain generalization (2) controllable generation (2) agent system (2) off-policy evaluation (2) large language model (2) language model (2) policy learning (2) text generation (2) preference learning (1) gradient-based optimization (1) text classification (1)

Papers

Text2Data: Low-Resource Data Generation with Textual Control AAAI 2025 xLAM: A Family of Large Action Models to Empower AI Agent Systems NAACL 2025 Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward NAACL 2025 Longhorn: State Space Models are Amortized Online Learners ICLR 2025 Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents ICLR 2025 Structured Policy Optimization: Enhance Large Vision-Language Model via Self-referenced Dialogue ICCV 2025 HIVE: Harnessing Human Feedback for Instructional Visual Editing CVPR 2024 APIGen: Automated PIpeline for Generating Verifiable and Diverse Function-Calling Datasets NIPS 2024 FOFO: A Benchmark to Evaluate LLMs’ Format-Following Capability ACL 2024 Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization ICLR 2024 FAMO: Fast Adaptive Multitask Optimization NIPS 2023 LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning NIPS 2023 Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems ICLR 2023 UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild NIPS 2023 Preference-grounded Token-level Guidance for Language Model Fine-tuning NIPS 2023 Metric Residual Network for Sample Efficient Goal-Conditioned Reinforcement Learning AAAI 2023 Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning ICML 2022 A Unified Framework for Alternating Offline Model Training and Policy Learning NIPS 2022 Unsupervised Out-of-Domain Detection via Pre-trained Transformers ACL 2021 Unsupervised Out-of-Domain Detection via Pre-trained Transformers IJCNLP 2021 Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System NAACL 2021 Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds ICLR 2021 Accountable Off-Policy Evaluation With Kernel Bellman Statistics ICML 2020 Off-Policy Interval Estimation with Lipschitz Value Iteration NIPS 2020 A Kernel Loss for Solving the Bellman Equation NIPS 2019