Jinyang Gao
21 papers · 2018–2026 · 9 conferences · across top CS/AI conferences
Achievements
🐣 Hot Topic Early Bird
🌍 Conference Polyglot (8)
🧭 Keyword Pioneer
🌉 Interdisciplinary Bridge
🏃 Academic Marathon (7)
🤝 Dynamic Duo (13)
👑 Triple Crown
🏆 Grand Slam
🏆 Keyword Champion
🚀 Conference Pioneer
⚡ Prolific Year (9)
📈 Trend Setter
❓ The Questioner (2)
🗃️ Keyword Collector (87)
💎 Century Club (20)
Conferences
ICML (4)
IJCAI (4)
AAAI (3)
ICLR (3)
ACL (2)
CVPR (2)
COLING (1)
EACL (1)
NIPS (1)
Top co-authors
Keywords
large language model (3)
reinforcement learning (3)
bayesian optimization (2)
hyperparameter optimization (2)
confidence calibration (2)
few-shot learning (1)
adversarial learning (1)
image classification (1)
embedding space (1)
transfer learning (1)
network architecture (1)
uncertainty quantification (1)
direct preference optimization (1)
attention mechanism (1)
chain-of-thought reasoning (1)
in-context learning (1)
language model training (1)
code generation (1)
online learning (1)
contrastive learning (1)
Papers
Incentivizing Strong Reasoning from Weak Supervision
EACL 2026
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
ICLR 2025
ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models
ACL 2025
Language Adaptation of Large Language Models: An Empirical Study on LLaMA2
COLING 2025
Incorporating Dense Knowledge Alignment into Unified Multimodal Representation Models
CVPR 2025
Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning
CVPR 2025
What is Wrong with Perplexity for Long-context Language Modeling?
ICLR 2025
Learning Bayesian Nash Equilibrium in Auction Games via Approximate Best Response
ICML 2025
Larger or Smaller Reward Margins to Select Preferences for LLM Alignment?
ICML 2025
AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
ICML 2025
CARD: Channel Aligned Robust Blend Transformer for Time Series Forecasting
ICLR 2024
Auctionformer: A Unified Deep Learning Algorithm for Solving Equilibrium Strategies in Auction Games
ICML 2024
When to Trust LLMs: Aligning Confidence with Response Quality
ACL 2024
$\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$
NIPS 2024
MFES-HB: Efficient Hyperband with Multi-Fidelity Quality Measurements
AAAI 2021
Intent Preference Decoupling for User Representation on Online Recommender System
IJCAI 2020
Efficient Automatic CASH via Rising Bandits
AAAI 2020
Towards Reliable Learning for High Stakes Applications
AAAI 2019
Cuckoo Feature Hashing: Dynamic Weight Sharing for Sparse Analytics
IJCAI 2018
Refine or Represent: Residual Networks with Explicit Channel-wise Configuration
IJCAI 2018
Medical Concept Embedding with Time-Aware Attention
IJCAI 2018