Yuan Wang
56 papers · 2016–2026 · 14 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Conference Polyglot (14) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) π Academic Marathon (9)
π
Academic Marathon
(9)
π
Cross-Pollinator
(8)
πΊοΈ
Taxonomy Completionist
(90)
π§¬
Topic Evolution
π₯
Mega-Team
(22)
π€
Dynamic Duo
(12)
π
Grand Slam
π
Conference Pioneer
ποΈ
Keyword Collector
(253)
π
Trend Setter
π
Century Club
(49)
π₯
Unstoppable
(7)
β
The Questioner
β‘
Prolific Year
(18)
Conferences
AAAI (14)
CVPR (12)
ICCV (6)
IJCAI (5)
ACL (4)
EMNLP (4)
ECCV (2)
MICCAI (2)
NIPS (2)
COLING (1)
ICLR (1)
ICML (1)
MIDL (1)
NAACL (1)
Top co-authors
Research topics
Keywords
semantic segmentation
(8)
large language model
(6)
multimodal learning
(4)
vision-language model
(4)
few-shot learning
(3)
prototype learning
(3)
few-shot segmentation
(3)
representation learning
(2)
spiking neural network
(2)
spatial reasoning
(2)
image classification
(2)
state space model
(2)
point cloud
(2)
foundation model
(2)
feature learning
(2)
question answering
(2)
domain adaptation
(2)
knowledge distillation
(2)
federated learning
(2)
information retrieval
(2)
Papers
Act as you think: Reinforcing Consistent Reasoning in Medical Visual Question Answering
ACL 2026
Beyond N-grams: A Hierarchical Reward Learning Framework for Clinically-Aware Medical Report Generation
AAAI 2026
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
AAAI 2026
IPFormer: Instance Prompt-guided Transformer for Multi-modal Multi-shot Video Understanding
AAAI 2026
TCoT: Trajectory Chain-of-Thoughts for Robotic Manipulation with Failure Recovery in Vision-Language-Action Model
AAAI 2026
Unreal-MAP: Unreal-Engine-Based General Platform for Multi-agent Reinforcement Learning
AAAI 2026
Data Efficient RLVR via Off-Policy Influence Guidance
ACL 2026
Generalized Few-Shot Point Cloud Segmentation via LLM-Assisted Hyper-Relation Matching
ICCV 2025
Mamba-3VL: Taming State Space Model for 3D Vision Language Learning
ICCV 2025
Two Losses, One Goal: Balancing Conflict Gradients for Semi-supervised Semantic Segmentation
ICCV 2025
U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration
ICCV 2025
ComRAG: Retrieval-Augmented Generation with Dynamic Vector Stores for Real-time Community Question Answering in Industry
ACL 2025
V2T-CoT: From Vision to Text Chain-of-Thought for Medical Reasoning and Diagnosis
MICCAI 2025
LIBA: Language Instructed Multi-granularity Bridge Assistant for 3D Visual Grounding
AAAI 2025
Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters
CVPR 2025
HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction
CVPR 2025
Dual-Agent Optimization framework for Cross-Domain Few-Shot Segmentation
CVPR 2025
Golden Cudgel Network for Real-Time Semantic Segmentation
CVPR 2025
Towards Effective and Sparse Adversarial Attack on Spiking Neural Networks via Breaking Invisible Surrogate Gradients
CVPR 2025
A Survey of Optimization Modeling Meets LLMs: Progress and Future Directions
IJCAI 2025
Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning
AAAI 2025
Human-Centric Foundation Models: Perception, Generation and Agentic Modeling
IJCAI 2025
Beyond Confidence: Exploiting Homogeneous Pattern for Semi-Supervised Semantic Segmentation
ICML 2025
Exploring the Better Multimodal Synergy Strategy for Vision-Language Models
AAAI 2025
Evaluating Fairness in Large Vision-Language Models Across Diverse Demographic Attributes and Prompts
EMNLP 2025
Do Large Language Models Rank Fairly? An Empirical Study on the Fairness of LLMs as Rankers
NAACL 2024
Enhancing LLM Reasoning via Vision-Augmented Prompting
NIPS 2024
Frequency Shuffling and Enhancement for Open Set Recognition
AAAI 2024
Pay Attention to Target: Relation-Aware Temporal Consistency for Domain Adaptive Video Semantic Segmentation
AAAI 2024
Near-Optimal Resilient Aggregation Rules for Distributed Learning Using 1-Center and 1-Mean Clustering with Outliers
AAAI 2024
An Aggregation-Free Federated Learning for Tackling Data Heterogeneity
CVPR 2024
Exploring Pose-Aware Human-Object Interaction via Hybrid Learning
CVPR 2024
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation
CVPR 2024
G^3-LQ: Marrying Hyperbolic Alignment with Explicit Semantic-Geometric Modeling for 3D Visual Grounding
CVPR 2024
Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation
ECCV 2024
MedCoT: Medical Chain of Thought via Hierarchical Expert
EMNLP 2024
Aggregation and Purification: Dual Enhancement Network for Point Cloud Few-shot Segmentation
IJCAI 2024
MedSynth: Leveraging Generative Model for Healthcare Data Sharing
MICCAI 2024
A New ANN-SNN Conversion Method with High Accuracy, Low Latency and Good Robustness
IJCAI 2023
Neural TSP Solver with Progressive Distillation
AAAI 2023
Rethinking the Correlation in Few-Shot Segmentation: A Buoys View
CVPR 2023
Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation
ICCV 2023
Focus on Query: Adversarial Mining Transformer for Few-Shot Segmentation
NIPS 2023
Dynamic Graph Learning With Content-Guided Spatial-Frequency Relation Reasoning for Deepfake Detection
CVPR 2023
Adaptive Agent Transformer for Few-Shot Segmentation
ECCV 2022
Exploring Dual Encoder Architectures for Question Answering
EMNLP 2022
Estimation and Comparison of Linear Regions for ReLU Networks
IJCAI 2022
Learning to Detect 3D Facial Landmarks via Heatmap Regression with Graph Convolutional Network
AAAI 2022
Improving Adversarially Robust Few-Shot Image Classification With Generalizable Representations
CVPR 2022
Memory-efficient Segmentation of High-resolution Volumetric MicroCT Images
MIDL 2022
AdaFit: Rethinking Learning-Based Normal Estimation on Point Clouds
ICCV 2021
Anchor & Transform: Learning Sparse Embeddings for Large Vocabularies
ICLR 2021
Neural Dynamics and Gamma Oscillation on a Hybrid Excitatory-Inhibitory Complex Network (Student Abstract)
AAAI 2020
Toward Automated Content Feedback Generation for Non-native Spontaneous Speech
ACL 2019
Improving Usersβ Demographic Prediction via the Videos They Talk about
EMNLP 2016
Predicting Restaurant Consumption Level through Social Media Footprints
COLING 2016