Yang Gao
194 papers · 2011–2026 · 22 top CS/AI conferences
Achievements
Taxonomy Completionist (20)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (6)
Hot Topic Early Bird
Keyword Trendsetter Combo (4)
Conference Loyalist (24)
Mega-Team (26)
Dynamic Duo (21)
Deep Specialist (20)
Triple Crown
Keyword Champion (4)
Grand Slam
Keyword Collector (54)
Century Club (185)
Conference Pioneer
Prolific Year (14)
Unstoppable (13)
The Questioner (4)
Trend Setter
Conferences
AAAI (29)
ICLR (23)
ACL (18)
CVPR (17)
EMNLP (16)
NIPS (14)
IJCAI (13)
ICCV (12)
CORL (10)
ECCV (9)
ICML (8)
IJCNLP (7)
COLING (3)
NAACL (3)
INTERSPEECH (2)
JMLR (2)
RSS (2)
UAI (2)
MICCAI (1)
AACL (1)
SEMEVAL (1)
WACV (1)
Keywords
few-shot learning (13)
large language model (13)
reinforcement learning (11)
domain adaptation (9)
transfer learning (8)
multi-agent reinforcement learning (7)
abstractive summarization (7)
metric learning (6)
multimodal learning (6)
representation learning (6)
imitation learning (6)
semi-supervised learning (5)
vision-language model (4)
text classification (4)
supervised fine-tuning (4)
convolutional neural network (4)
domain generalization (4)
semantic segmentation (4)
document summarization (4)
monte carlo tree search (4)
Papers
Think Better, Not Longer: Token-Level Marginal Utility for Efficient Reasoning in Large Reasoning Models
ACL 2026
Thinking-Based Non-Thinking: Solving the Reward Hacking Problem in Training Hybrid Reasoning Models via Reinforcement Learning
ACL 2026
ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation
AAAI 2026
Causality-Aware Efficient Exploration for Cooperative Multi-Agent Reinforcement Learning
AAAI 2026
Simulated Rewards, Skewed Strategies: Tracing the Acquired Preference Bias in LLM-Based Dialogue Planners
AAAI 2026
Identifying and Analyzing Performance-Critical Tokens in Large Language Models
AAAI 2026
EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scenarios
ACL 2026
Persona-E²: A Human-Grounded Dataset for Personality-Shaped Emotional Responses to Textual Events
ACL 2026
Faster Game Solving via Asymmetry of Step Sizes
AAAI 2026
SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation
IJCAI 2025
Sharpness-aware Zeroth-order Optimization for Graph Transformers
IJCAI 2025
Reducing Variance of Stochastic Optimization for Approximating Nash Equilibria in Normal-Form Games
ICML 2025
Causal Information Prioritization for Efficient Reinforcement Learning
ICLR 2025
Towards Empowerment Gain through Causal Structure Learning in Model-Based Reinforcement Learning
ICLR 2025
HuB: Learning Extreme Humanoid Balance
CORL 2025
FACET: Force-Adaptive Control via Impedance Reference Tracking for Legged Robots
CORL 2025
KineDex: Learning Tactile-Informed Visuomotor Policies via Kinesthetic Teaching for Dexterous Manipulation
CORL 2025
Adapting In-Domain Few-Shot Segmentation to New Domains without Source Domain Retraining
ICCV 2025
Dynamic Multi-Layer Null Space Projection for Vision-Language Continual Learning
ICCV 2025
Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos
ICCV 2025
Beyond Mandatory Federations: Balancing Egoism, Utilitarianism and Egalitarianism in Mixed-Motive Games
AAAI 2025
Large Language Models Enhanced Personalized Graph Neural Architecture Search in Federated Learning
AAAI 2025
NATRA: Noise-Agnostic Framework for Trajectory Prediction with Noisy Observations
ICCV 2025
Data Scaling Laws in Imitation Learning for Robotic Manipulation
ICLR 2025
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
ACL 2025
CoT-VTM: Visual-to-Music Generation with Chain-of-Thought Reasoning
ACL 2025
Unveiling and Addressing Pseudo Forgetting in Large Language Models
ACL 2025
LongSafety: Enhance Safety for Long-Context LLMs
ACL 2025
Near-Optimal Regret Bounds for Federated Multi-armed Bandits with Fully Distributed Communication
UAI 2025
Leveraging Flatness to Improve Information-Theoretic Generalization Bounds for SGD
ICLR 2025
SKIL: Semantic Keypoint Imitation Learning for Generalizable Data-efficient Manipulation
RSS 2025
High-Res Brain Source Imaging of MEG using a Vector Bayesian Beamformer with Noise learning
MICCAI 2025
GravMAD: Grounded Spatial Value Maps Guided Action Diffusion for Generalized 3D Manipulation
ICLR 2025
Learning to Plan Before Answering: Self-Teaching LLMs to Learn Abstract Plans for Problem Solving
ICLR 2025
RRM: Robust Reward Model Training Mitigates Reward Hacking
ICLR 2025
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters
CVPR 2025
Enhancing Few-Shot Class-Incremental Learning via Training-Free Bi-Level Modality Calibration
CVPR 2025
Firewall Routing: Blocking Leads to Better Hybrid Inference for LLMs
EMNLP 2025
Imitation Learning from Observation with Automatic Discount Scheduling
ICLR 2024
ViT-Calibrator: Decision Stream Calibration for Vision Transformer
AAAI 2024
Weakly Supervised Multimodal Affordance Grounding for Egocentric Images
AAAI 2024
Angle Robustness Unmanned Aerial Vehicle Navigation in GNSS-Denied Scenarios
AAAI 2024
PG-LBO: Enhancing High-Dimensional Bayesian Optimization with Pseudo-Label and Gaussian Process Guidance
AAAI 2024
DGA-GNN: Dynamic Grouping Aggregation GNN for Fraud Detection
AAAI 2024
Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
AAAI 2024
Optimistic Value Instructors for Cooperative Multi-Agent Reinforcement Learning
AAAI 2024
Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation
CVPR 2024
Bit_numeval at SemEval-2024 Task 7: Enhance Numerical Sensitivity and Reasoning Completeness for Quantitative Understanding
SEMEVAL 2024
Can Transformers Capture Spatial Relations between Objects?
ICLR 2024
PSST: A Benchmark for Evaluation-driven Text Public-Speaking Style Transfer
EMNLP 2024
How Far Can In-Context Alignment Go? Exploring the State of In-Context Alignment
EMNLP 2024
Scalable and Domain-General Abstractive Proposition Segmentation
EMNLP 2024
HOIAnimator: Generating Text-prompt Human-object Animations using Novel Perceptive Diffusion Models
CVPR 2024
Seer: Language Instructed Video Prediction with Latent Diffusion Models
ICLR 2024
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
NIPS 2024
InsertNeRF: Instilling Generalizability into NeRF with HyperNet Modules
ICLR 2024
Any-point Trajectory Modeling for Policy Learning
RSS 2024
Transformer Doctor: Diagnosing and Treating Vision Transformers
NIPS 2024
Fundamental Capabilities of Large Language Models and their Applications in Domain Scenarios: A Survey
ACL 2024
EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
ICML 2024
Safe and Robust Subgame Exploitation in Imperfect Information Games
ICML 2024
Word Matters: What Influences Domain Adaptation in Summarization?
ACL 2024
Bit_numeval at SemEval-2024 Task 7: Enhance Numerical Sensitivity and Reasoning Completeness for Quantitative Understanding
NAACL 2024
General Flow as Foundation Affordance for Scalable Robot Learning
CORL 2024
Multi-Transmotion: Pre-trained Model for Human Motion Prediction
CORL 2024
DexCatch: Learning to Catch Arbitrary Objects with Dexterous Hands
CORL 2024
Leveraging Locality to Boost Sample Efficiency in Robotic Manipulation
CORL 2024
Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation
CVPR 2024
Towards Explainable Evaluation Metrics for Machine Translation
JMLR 2024
Digital Life Project: Autonomous 3D Characters with Social Intelligence
CVPR 2024
STAR: Spatio-Temporal State Compression for Multi-Agent Tasks with Rich Observations
IJCAI 2024
Discriminative Feature Decoupling Enhancement for Speech Forgery Detection
IJCAI 2024
The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation
ECCV 2024
Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization
ECCV 2024
START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation
NIPS 2024
CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling
EMNLP 2024
Reinforcement Learning with Foundation Priors: Let Embodied Agent Efficiently Learn on Its Own
CORL 2024
SCaR: Refining Skill Chaining for Long-Horizon Robotic Manipulation via Dual Regularization
NIPS 2024
Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks
NIPS 2024
OpenMSD: Towards Multilingual Scientific Documents Similarity Measurement
COLING 2024
Social-Transmotion: Promptable Human Trajectory Prediction
ICLR 2024
Diving Segmentation Model into Pixels
ICLR 2024
Hybrid Sharing for Multi-Label Image Classification
ICLR 2024
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization
ICLR 2024
Predictive Inference with Feature Conformal Prediction
ICLR 2023
Programmatically Grounded, Compositionally Generalizable Robotic Manipulation
ICLR 2023
Entity-Agnostic Representation Learning for Parameter-Efficient Knowledge Graph Embedding
AAAI 2023
An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Games
AAAI 2023
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination
AAAI 2023
Enhanced Tensor Low-Rank and Sparse Representation Recovery for Incomplete Multi-View Clustering
AAAI 2023
Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient
AAAI 2023
The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics
AACL 2023
Efficient Subgame Refinement for Extensive-form Games
NIPS 2023
TemplateGEC: Improving Grammatical Error Correction with Detection Template
ACL 2023
DePA: Improving Non-autoregressive Translation with Dependency-Aware Decoder
ACL 2023
Modified Retrace for Off-Policy Temporal Difference Learning
UAI 2023
The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics
IJCNLP 2023
Orthogonal Annotation Benefits Barely-Supervised Medical Image Segmentation
CVPR 2023
Modeling Inter-Class and Intra-Class Constraints in Novel Class Discovery
CVPR 2023
Policy Contrastive Imitation Learning
ICML 2023
Graph vs. Sequence: An Empirical Study on Knowledge Forms for Knowledge-Grounded Dialogue
EMNLP 2023
For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal
ICML 2023
A Policy Optimization Method Towards Optimal-time Stability
CORL 2023
IOMatch: Simplifying Open-Set Semi-Supervised Learning with Joint Inliers and Outliers Utilization
ICCV 2023
DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-Centric Rendering
ICCV 2023
DomainAdaptor: A Novel Approach to Test-time Adaptation
ICCV 2023
A Universal Semantic-Geometric Representation for Robotic Manipulation
CORL 2023
Decision Transformer under Random Frame Dropping
ICLR 2023
SpeedyZero: Mastering Atari with Limited Data and Time
ICLR 2023
Become a Proficient Player with Limited Data through Watching Pure Videos
ICLR 2023
ST++: Make Self-Training Work Better for Semi-Supervised Semantic Segmentation
CVPR 2022
PSP: Pre-trained Soft Prompts for Few-Shot Abstractive Summarization
COLING 2022
Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions
NIPS 2022
Planning for Sample Efficient Imitation Learning
NIPS 2022
Individual Reward Assisted Multi-Agent Reinforcement Learning
ICML 2022
An Empirical Study on Disentanglement of Negative-free Contrastive Learning
NIPS 2022
Stage-wise Stylistic Headline Generation: Style Generation and Summarized Content Insertion
IJCAI 2022
Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction
ECCV 2022
Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning
NIPS 2022
CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation
ECCV 2022
Semantic-Aware Fine-Grained Correspondence
ECCV 2022
MVDG: A Unified Multi-View Framework for Domain Generalization
ECCV 2022
EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer
ECCV 2022
HuMMan: Multi-modal 4D Human Dataset for Versatile Sensing and Modeling
ECCV 2022
LaSSL: Label-Guided Self-Training for Semi-supervised Learning
AAAI 2022
Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming
ICML 2022
Prototypical Cross-Domain Self-Supervised Learning for Few-Shot Unsupervised Domain Adaptation
CVPR 2021
Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration
NIPS 2021
Reinforcement Learning with Latent Flow
NIPS 2021
Mastering Atari Games with Limited Data
NIPS 2021
Single View Point Cloud Generation via Unified 3D Prototype
AAAI 2021
ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation
AAAI 2021
Exploring Explainable Selection to Control Abstractive Summarization
AAAI 2021
Cross-Lingual Abstractive Summarization with Limited Parallel Resources
ACL 2021
Supporting Complaints Investigation for Nursing and Midwifery Regulatory Agencies
ACL 2021
Prediction or Comparison: Toward Interpretable Qualitative Reasoning
ACL 2021
To be Closer: Learning to Link up Aspects with Opinions
EMNLP 2021
The Eval4NLP Shared Task on Explainable Quality Estimation: Overview and Results
EMNLP 2021
Manifold Alignment for Semantically Aligned Style Transfer
ICCV 2021
LoFGAN: Fusing Local Representations for Few-Shot Image Generation
ICCV 2021
Mining Latent Classes for Few-Shot Segmentation
ICCV 2021
Discovering Non-monotonic Autoregressive Orderings with Variational Inference
ICLR 2021
Mutual Information State Intrinsic Control
ICLR 2021
Keyframe-Focused Visual Imitation Learning
ICML 2021
Cross-Lingual Abstractive Summarization with Limited Parallel Resources
IJCNLP 2021
Supporting Complaints Investigation for Nursing and Midwifery Regulatory Agencies
IJCNLP 2021
Prediction or Comparison: Toward Interpretable Qualitative Reasoning
IJCNLP 2021
Generalized Spoofing Detection Inspired from Audio Generation Artifacts
INTERSPEECH 2021
Attention-Based Spatial Guidance for Image-to-Image Translation
WACV 2021
Learning Task-aware Local Representations for Few-shot Learning
IJCAI 2020
From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning
AAAI 2020
Multi-Agent Game Abstraction via Graph Attention Neural Network
AAAI 2020
Layerwise Sparse Coding for Pruned Deep Neural Networks with Extreme Compression Ratio
AAAI 2020
Fighting Copycat Agents in Behavioral Cloning from Observation Histories
NIPS 2020
Interactive Text-to-Speech System via Joint Style Analysis
INTERSPEECH 2020
Graph Neural Architecture Search
IJCAI 2020
NAS-FCOS: Fast Neural Architecture Search for Object Detection
CVPR 2020
Consistent MetaReg: Alleviating Intra-task Discrepancy for Better Meta-knowledge
IJCAI 2020
Asymmetric Distribution Measure for Few-shot Learning
IJCAI 2020
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems
ICLR 2020
Differentiable Meta-Learning Model for Few-Shot Semantic Segmentation
AAAI 2020
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization
ACL 2020
On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation
ACL 2020
Unsupervised Domain Attention Adaptation Network for Caricature Attribute Recognition
ECCV 2020
SetConv: A New Approach for Learning from Imbalanced Data
EMNLP 2020
Biased Feature Learning for Occlusion Invariant Face Recognition
IJCAI 2020
Better Rewards Yield Better Summaries: Learning to Summarise Without References
EMNLP 2019
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance
EMNLP 2019
Value Function Transfer for Deep Multi-Agent Reinforcement Learning Based on N-Step Returns
IJCAI 2019
Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation
IJCAI 2019
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance
IJCNLP 2019
Concept Pointer Network for Abstractive Summarization
IJCNLP 2019
Better Rewards Yield Better Summaries: Learning to Summarise Without References
IJCNLP 2019
Distribution Consistency Based Covariance Metric Networks for Few-Shot Learning
AAAI 2019
Revisiting Local Descriptor Based Image-To-Class Measure for Few-Shot Learning
CVPR 2019
Crowdsourcing Lightweight Pyramids for Manual Summary Evaluation
NAACL 2019
Does My Rebuttal Matter? Insights from a Major NLP Conference
NAACL 2019
Multistream Classification with Relative Density Ratio Estimation
AAAI 2019
A Novel Unsupervised Camera-Aware Domain Adaptation Framework for Person Re-Identification
ICCV 2019
Concept Pointer Network for Abstractive Summarization
EMNLP 2019
Task-oriented Word Embedding for Text Classification
COLING 2018
APRIL: Interactively Learning to Summarise by Combining Active Preference Learning and Reinforcement Learning
EMNLP 2018
Using Argument-based Features to Predict and Analyse Review Helpfulness
EMNLP 2017
End-To-End Learning of Driving Models From Large-Scale Video Datasets
CVPR 2017
Revisiting Metric Learning for SPD Matrix Based Visual Representation
CVPR 2017
Generalized Orderless Pooling Performs Implicit Salient Matching
ICCV 2017
Compact Bilinear Pooling
CVPR 2016
Multi-layered Gesture Recognition with Kinect
JMLR 2015
Potential Based Reward Shaping for Hierarchical Reinforcement Learning
IJCAI 2015
Joint Coupled-Feature Representation and Coupled Boosting for AD Diagnosis
CVPR 2014
Aligning English Strings with Abstract Meaning Representation Graphs
EMNLP 2014
Deceptive Answer Prediction with User Preference Graph
ACL 2013
Prostate Segmentation in CT Images via Spatial-Constrained Transductive Lasso
CVPR 2013
Soft Dependency Constraints for Reordering in Hierarchical Phrase-Based Translation
EMNLP 2011