Jinyang Gao

21 papers · 2018–2026 · 9 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (8) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (7)

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🤝 Dynamic Duo (13) 👑 Triple Crown 🏆 Grand Slam 🏆 Keyword Champion 🚀 Conference Pioneer ⚡ Prolific Year (9) 📈 Trend Setter ❓ The Questioner (2) 🗃️ Keyword Collector (87) 💎 Century Club (20)

Conferences

ICML (4) IJCAI (4) AAAI (3) ICLR (3) ACL (2) CVPR (2) COLING (1) EACL (1) NIPS (1)

Top co-authors

Bolin Ding (14) Xiang Wang (6) xue wang (6) Junkang Wu (4) Yuexiang Xie (4) Xiangnan He (4) Jiancan Wu (4) Yanyan Shen (3) Shuchang Tao (3) Kexin Huang (3)

Keywords

large language model (3) reinforcement learning (3) bayesian optimization (2) hyperparameter optimization (2) confidence calibration (2) few-shot learning (1) adversarial learning (1) image classification (1) embedding space (1) transfer learning (1) network architecture (1) uncertainty quantification (1) direct preference optimization (1) attention mechanism (1) chain-of-thought reasoning (1) in-context learning (1) language model training (1) code generation (1) online learning (1) contrastive learning (1)

Papers

Incentivizing Strong Reasoning from Weak Supervision EACL 2026 Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization ICLR 2025 ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models ACL 2025 Language Adaptation of Large Language Models: An Empirical Study on LLaMA2 COLING 2025 Incorporating Dense Knowledge Alignment into Unified Multimodal Representation Models CVPR 2025 Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning CVPR 2025 What is Wrong with Perplexity for Long-context Language Modeling? ICLR 2025 Learning Bayesian Nash Equilibrium in Auction Games via Approximate Best Response ICML 2025 Larger or Smaller Reward Margins to Select Preferences for LLM Alignment? ICML 2025 AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization ICML 2025 CARD: Channel Aligned Robust Blend Transformer for Time Series Forecasting ICLR 2024 Auctionformer: A Unified Deep Learning Algorithm for Solving Equilibrium Strategies in Auction Games ICML 2024 When to Trust LLMs: Aligning Confidence with Response Quality ACL 2024 $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ NIPS 2024 MFES-HB: Efficient Hyperband with Multi-Fidelity Quality Measurements AAAI 2021 Intent Preference Decoupling for User Representation on Online Recommender System IJCAI 2020 Efficient Automatic CASH via Rising Bandits AAAI 2020 Towards Reliable Learning for High Stakes Applications AAAI 2019 Cuckoo Feature Hashing: Dynamic Weight Sharing for Sparse Analytics IJCAI 2018 Refine or Represent: Residual Networks with Explicit Channel-wise Configuration IJCAI 2018 Medical Concept Embedding with Time-Aware Attention IJCAI 2018