Yuan Yao
117 papers · 2012–2026 · 17 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (20) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (7) π£ Hot Topic Early Bird
π
Renaissance Researcher
(7)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(20)
π€
Dynamic Duo
(33)
π
Triple Crown
π
Grand Slam
π₯
Mega-Team
(35)
π¬
Deep Specialist
(11)
π§¬
Topic Evolution
π
Keyword Champion
π₯
Unstoppable
(8)
π
Trend Setter
ποΈ
Keyword Collector
(70)
π
Century Club
(112)
β
The Questioner
π
Conference Pioneer
β‘
Prolific Year
(25)
Conferences
CVPR (17)
AAAI (11)
ACL (11)
EMNLP (11)
NIPS (10)
ICML (9)
IJCAI (8)
ICLR (7)
ICCV (7)
ECCV (6)
COLING (4)
JMLR (4)
AISTATS (4)
IJCNLP (3)
MIDL (2)
NAACL (2)
WACV (1)
Top co-authors
Research topics
Keywords
neural network
(7)
deep learning
(6)
multimodal large language model
(6)
representation learning
(5)
reinforcement learning
(5)
relation extraction
(5)
distant supervision
(5)
deep neural network
(4)
vision-language model
(4)
monte carlo tree search
(4)
backdoor attack
(4)
knowledge base
(4)
3d reconstruction
(4)
multi-agent system
(4)
transfer learning
(3)
generative adversarial network
(3)
diffusion model
(3)
few-shot learning
(3)
multimodal learning
(3)
style transfer
(3)
Papers
CharTide: Data-Centric Chart-to-Code Generation via Tri-Perspective Tuning and Inquiry-Driven Evolution
ACL 2026
Hashed Watermark as a Filter: A Unified Defense Against Forging and Overwriting Attacks in Neural Network Watermarking
AAAI 2026
OpenGlass: A Sensing-Computing Split Architecture for Local MLLM-Driven Real-Time Visual Assistance
ACL 2026
ChessArena: A Chess Testbed for Evaluating Strategic Reasoning Capabilities of Large Language Models
ACL 2026
LLaVA-UHD v2: Exploiting Hierarchical Vision Granularity in MLLMs via Inverse Semantic Pyramid
AAAI 2026
Towards Fine-grained Interactive Segmentation in Images and Videos
ICCV 2025
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
CVPR 2025
Dynamic Guided and Domain Applicable Safeguards for Enhanced Security in Large Language Models
NAACL 2025
Optimal Sample Selection Through Uncertainty Estimation and Its Application in Deep Learning
JMLR 2025
Simulate, Refine and Integrate: Strategy Synthesis for Efficient SMT Solving
IJCAI 2025
Elucidating the design space of language models for image generation
ICML 2025
VLMInferSlow: Evaluating the Efficiency Robustness of Large Vision-Language Models as a Service
ACL 2025
EventRAG: Enhancing LLM Generation with Event Knowledge Graphs
ACL 2025
GUICourse: From General Vision Language Model to Versatile GUI Agent
ACL 2025
RiOT: Efficient Prompt Refinement with Residual Optimization Tree
ACL 2025
UniMatch: Universal Matching from Atom to Task for Few-Shot Drug Discovery
ICLR 2025
Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning
ICLR 2025
InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery
COLING 2025
UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images
ICCV 2025
LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians
CVPR 2025
Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation
ICCV 2025
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning
EMNLP 2025
FedMIA: An Effective Membership Inference Attack Exploiting "All for One" Principle in Federated Learning
CVPR 2025
GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior
CVPR 2025
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
CVPR 2025
En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
CVPR 2024
3DToonify: Creating Your High-Fidelity 3D Stylized Avatar Easily from 2D Portrait Images
CVPR 2024
Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion
ICML 2024
PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes
EMNLP 2024
MoleculeQA: A Dataset to Evaluate Factual Accuracy in Molecular Comprehension
EMNLP 2024
LLaVA-UHD: an LMM Perceiving any Aspect Ratio and High-Resolution Images
ECCV 2024
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages
ICLR 2024
Towards Global Optimal Visual In-Context Learning Prompt Selection
NIPS 2024
ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
NIPS 2024
UniGAD: Unifying Multi-level Graph Anomaly Detection
NIPS 2024
Neuro-Symbolic Data Generation for Math Reasoning
NIPS 2024
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
ECCV 2024
Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training
COLING 2024
From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning, Efficiency and beyond
COLING 2024
Random Smoothing Regularization in Kernel Gradient Descent Learning
JMLR 2024
Mitigating the Alignment Tax of RLHF
EMNLP 2024
Intention Progression with Temporally Extended Goals
IJCAI 2024
Towards Open Domain Text-Driven Synthesis of Multi-Person Motions
ECCV 2024
Inspecting Prediction Confidence for Detecting Black-Box Backdoor Attacks
AAAI 2024
KoLA: Carefully Benchmarking World Knowledge of Large Language Models
ICLR 2024
NExT-Chat: An LMM for Chat, Detection and Segmentation
ICML 2024
Rethinking Guidance Information to Utilize Unlabeled Samples: A Label Encoding Perspective
ICML 2024
PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models
ACL 2024
Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
CVPR 2024
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
CVPR 2024
Beyond Object Recognition: A New Benchmark towards Object Concept Learning
ICCV 2023
VPGTrans: Transfer Visual Prompt Generator across LLMs
NIPS 2023
Neuro-symbolic Learning Yielding Logical Constraints
NIPS 2023
Visually Grounded Commonsense Knowledge Acquisition
AAAI 2023
Inducing Neural Collapse in Deep Long-tailed Learning
AISTATS 2023
An Embarrassingly Simple Backdoor Attack on Self-supervised Learning
ICCV 2023
Softened Symbol Grounding for Neuro-symbolic Systems
ICLR 2023
Learning with Logical Constraints but without Shortcut Satisfaction
ICLR 2023
Multi-Agent Intention Recognition and Progression
IJCAI 2023
SegPrompt: Using Segmentation Map as a Better Prompt to Finetune Deep Models for Kidney Stone Classification
MIDL 2023
End-to-End Weakly Supervised Object Detection with Sparse Proposal Evolution
ECCV 2022
Private Streaming SCO in $\ell_p$ geometry with Applications in High Dimensional Online Decision Making
ICML 2022
Fine-Grained Scene Graph Generation with Data Transfer
ECCV 2022
Unsupervised Domain Adaptation through Shape Modeling for Medical Image Segmentation
MIDL 2022
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models
EMNLP 2022
Structure-Aware Flow Generation for Human Body Reshaping
CVPR 2022
Unpaired Cartoon Image Synthesis via Gated Cycle Mapping
CVPR 2022
A Deep Learning Dataloader with Shared Data Preparation
NIPS 2022
An Invisible Black-Box Backdoor Attack through Frequency Domain
ECCV 2022
Prompt Tuning for Discriminative Pre-trained Language Models
ACL 2022
Multi-Agent Intention Progression with Reward Machines
IJCAI 2022
Visual Distant Supervision for Scene Graph Generation
ICCV 2021
Multi-Agent Intention Progression with Black-Box Agents
IJCAI 2021
Adversarial Language Games for Advanced Natural Language Intelligence
AAAI 2021
StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding
AAAI 2021
CodRED: A Cross-Document Relation Extraction Dataset for Acquiring Knowledge in the Wild
EMNLP 2021
Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution
IJCNLP 2021
On ADMM in Deep Learning: Convergence and Saturation-Avoidance
JMLR 2021
Deep Partial Rank Aggregation for Personalized Attributes
AAAI 2021
ONION: A Simple and Effective Defense Against Textual Backdoor Attacks
EMNLP 2021
Open Hierarchical Relation Extraction
NAACL 2021
Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution
ACL 2021
Front2Back: Single View 3D Shape Reconstruction via Front to Back Prediction
CVPR 2020
Multiview Co-segmentation for Wide Baseline Images using Cross-view Supervision
WACV 2020
Who Likes What? β SplitLBI in Exploring Preferential Diversity of Ratings
AAAI 2020
Video Playback Rate Perception for Self-Supervised Spatio-Temporal Representation Learning
CVPR 2020
DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths
ICML 2020
Characterizing Membership Privacy in Stochastic Gradient Langevin Dynamics
AAAI 2020
Trading Personalization for Accuracy: Data Debugging in Collaborative Filtering
NIPS 2020
Generative Adversarial Nets for Robust Scatter Estimation: A Proper Scoring Rule Perspective
JMLR 2020
Intention Progression under Uncertainty
IJCAI 2020
Denoising Relation Extraction from Document-level Distant Supervision
EMNLP 2020
Meta-Information Guided Meta-Learning for Few-Shot Relation Classification
COLING 2020
Boosting Semantic Human Matting With Coarse Annotations
CVPR 2020
Deep Robust Subjective Visual Property Prediction in Crowdsourcing
CVPR 2019
MONET: Multiview Semi-Supervised Keypoint Detection via Epipolar Divergence
ICCV 2019
Hashtag Recommendation for Photo Sharing Services
AAAI 2019
OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction
EMNLP 2019
Orthogonal Decomposition Network for Pixel-Wise Binary Classification
CVPR 2019
ROBUST ESTIMATION VIA GENERATIVE ADVERSARIAL NETWORKS
ICLR 2019
An Integral Tag Recommendation Model for Textual Content
AAAI 2019
Attention-Aware Multi-Stroke Style Transfer
CVPR 2019
Global Convergence of Block Coordinate Descent in Deep Learning
ICML 2019
DocRED: A Large-Scale Document-Level Relation Extraction Dataset
ACL 2019
Commit Message Generation for Source Code Changes
IJCAI 2019
Open Relation Extraction: Relational Knowledge Transfer from Supervised Data to Unsupervised Data
IJCNLP 2019
OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction
IJCNLP 2019
iSplit LBI: Individualized Partial Ranking with Ties via Split LBI
NIPS 2019
Open Relation Extraction: Relational Knowledge Transfer from Supervised Data to Unsupervised Data
EMNLP 2019
FewRel: A Large-Scale Supervised Few-Shot Relation Classification Dataset with State-of-the-Art Evaluation
EMNLP 2018
MSplit LBI: Realizing Feature Selection and Dense Estimation Simultaneously in Few-shot and Zero-shot Learning
ICML 2018
Finding Global Optima in Nonconvex Stochastic Semidefinite Optimization with Variance Reduction
AISTATS 2018
A Unified Dynamic Approach to Sparse Model Selection
AISTATS 2018
Split LBI: An Iterative Regularization Path with Structural Sparsity
NIPS 2016
False Discovery Rate Control and Statistical Quality Assessment of Annotators in Crowdsourced Ranking
ICML 2016
Ice-Breaking: Mitigating Cold-Start Recommendation Problem by Rating Comparison
IJCAI 2015
Detecting Network Cliques with Radon Basis Pursuit
AISTATS 2012