Yizhou Wang
110 papers · 2013–2026 · 16 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (13) π Interdisciplinary Bridge π Conference Polyglot (16)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(13)
π§
Keyword Pioneer
π
Conference Loyalist
(29)
π€
Dynamic Duo
(17)
π
Triple Crown
π
Grand Slam
π±
Topic Pioneer
π¬
Deep Specialist
(14)
π
Keyword Champion
β‘
Prolific Year
(14)
π
Trend Setter
β
The Questioner
(2)
ποΈ
Keyword Collector
(409)
π
Century Club
(106)
π
Conference Pioneer
π₯
Unstoppable
(13)
Conferences
CVPR (29)
ICLR (14)
ICCV (13)
ICML (11)
NIPS (10)
ECCV (8)
AAAI (6)
ACL (5)
IJCAI (4)
WACV (3)
EMNLP (2)
AISTATS (1)
COLING (1)
CORL (1)
INTERSPEECH (1)
MICCAI (1)
Top co-authors
Research topics
Keywords
human pose estimation
(8)
pose estimation
(8)
object detection
(6)
transfer learning
(6)
large language model
(5)
3d reconstruction
(5)
contrastive learning
(4)
medical imaging
(4)
few-shot learning
(4)
multi-agent system
(4)
domain adaptation
(4)
attention mechanism
(4)
action recognition
(4)
video understanding
(4)
3d pose estimation
(4)
reinforcement learning
(3)
person re-identification
(3)
multi-agent reinforcement learning
(3)
representation learning
(3)
causal inference
(3)
Papers
Communication-Efficient Desire Alignment for Proactive Embodied HumanβAgent Interaction
ACL 2026
From Words to Pixels: A Comprehensive Survey on Large Language Models in Visual Segmentation
ACL 2026
Revealing the Seen, Imagining the Beyond: A Survey of Image-Grounded Chain-of-Thought Reasoning in Multimodal LLMs
ACL 2026
How do Role Models Shape Collective Morality? Exemplar-Driven Moral Learning in Multi-Agent Simulation
ACL 2026
Towards Zero-Shot 3D Anomaly Localization
WACV 2025
Human-Centric Foundation Models: Perception, Generation and Agentic Modeling
IJCAI 2025
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
ICML 2025
Bayesian Active Learning for Bivariate Causal Discovery
ICML 2025
Behavior-agnostic Task Inference for Robust Offline In-context Reinforcement Learning
ICML 2025
Simulating Human-like Daily Activities with Desire-driven Autonomy
ICLR 2025
AdaManip: Adaptive Articulated Object Manipulation Environments and Policy Learning
ICLR 2025
Learning Causal Alignment for Reliable Disease Diagnosis
ICLR 2025
Autoregressive Sequence Modeling for 3D Medical Image Representation
AAAI 2025
Cautious Next Token Prediction
ACL 2025
A Differential Inclusion Approach for Learning Heterogeneous Sparsity in Neuroimaging Analysis
AISTATS 2025
Exploring Fine-Grained Human Motion Video Captioning
COLING 2025
Aligning Human Motion Generation with Human Perceptions
ICLR 2025
DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation
ICCV 2025
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
ICCV 2025
Embodied Representation Alignment with Mirror Neurons
ICCV 2025
CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation
ICCV 2025
EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds
ICCV 2025
Representation Potentials of Foundation Models for Multimodal Alignment: A Survey
EMNLP 2025
FreeCloth: Free-form Generation Enhances Challenging Clothed Human Modeling
CVPR 2025
D-CoDe: Scaling Image-Pretrained VLMs to Video via Dynamic Compression and Question Decomposition
EMNLP 2025
SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens
CVPR 2025
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
CVPR 2025
Shift Equivariant Pose Network
WACV 2025
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
ECCV 2024
Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy
NIPS 2024
ScissorBot: Learning Generalizable Scissor Skill for Paper Cutting via Simulation, Imitation, and Sim2Real
CORL 2024
Rewrite the Stars
CVPR 2024
Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions
CVPR 2024
ScoreHypo: Probabilistic Human Mesh Estimation with Hypothesis Scoring
CVPR 2024
Real-time Holistic Robot Pose Estimation with Unknown States
ECCV 2024
Safe RLHF: Safe Reinforcement Learning from Human Feedback
ICLR 2024
Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World
ICLR 2024
Don't Judge by the Look: Towards Motion Coherent Video Representation
ICLR 2024
Causal Discovery via Conditional Independence Testing with Proxy Variables
ICML 2024
Fast Peer Adaptation with Context-aware Exploration
ICML 2024
Language Models Represent Beliefs of Self and Others
ICML 2024
Cross-Dimensional Medical Self-Supervised Representation Learning Based on a Pseudo-3D Transformation
MICCAI 2024
Learning Domain-Agnostic Representation for Disease Diagnosis
ICLR 2023
Proactive Multi-Camera Collaboration for 3D Human Pose Estimation
ICLR 2023
UniHCP: A Unified Model for Human-Centric Perceptions
CVPR 2023
3D Human Mesh Estimation From Virtual Markers
CVPR 2023
GFPose: Learning 3D Human Pose Prior With Gradient Fields
CVPR 2023
HumanBench: Towards General Human-Centric Perception With Projector Assisted Pretraining
CVPR 2023
Social Motion Prediction with Cognitive Hierarchies
NIPS 2023
Which Invariance Should We Transfer? A Causal Minimax Learning Approach
ICML 2023
Causal Discovery from Subsampled Time Series with Proxy Variables
NIPS 2023
MotionBERT: A Unified Perspective on Learning Human Motion Representations
ICCV 2023
RSPT: Reconstruct Surroundings and Predict Trajectory for Generalizable Active Object Tracking
AAAI 2023
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset
NIPS 2023
ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee Behaviors
NIPS 2023
Cycle-consistent Masked AutoEncoder for Unsupervised Domain Generalization
ICLR 2023
Intrinsic Image Decomposition by Pursuing Reflectance Image
IJCAI 2022
MemREIN: Rein the Domain Shift for Cross-Domain Few-Shot Learning
IJCAI 2022
LUNA: Localizing Unfamiliarity Near Acquaintance for Open-Set Long-Tailed Recognition
AAAI 2022
Unsupervised Object Detection Pretraining with Joint Object Priors Generation and Detector Learning
NIPS 2022
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind
ICLR 2022
Native phonotactic interference in L2 vowel processing: Mouse-tracking reveals cognitive conflicts during identification
INTERSPEECH 2022
MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control
NIPS 2022
MoCaNet: Motion Retargeting In-the-Wild via Canonicalization Networks
AAAI 2022
Disentangling Disease-related Representation from Obscure for Disease Prediction
ICML 2022
Revisiting the Transferability of Supervised Pretraining: An MLP Perspective
CVPR 2022
Adaptive Trajectory Prediction via Transferable GNN
CVPR 2022
VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data
ECCV 2022
Faster VoxelPose: Real-Time 3D Human Pose Estimation by Orthographic Projection
ECCV 2022
One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement
ECCV 2022
Domain Invariant Masked Autoencoders for Self-Supervised Learning from Multi-Domains
ECCV 2022
Causal Intervention for Subject-Deconfounded Facial Action Unit Recognition
AAAI 2022
Causal Hidden Markov Model for Time Series Disease Forecasting
CVPR 2021
An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation
ICCV 2021
ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot
ICCV 2021
RODNet: Radar Object Detection Using Cross-Modal Supervision
WACV 2021
Towards Distraction-Robust Active Visual Tracking
ICML 2021
Context Modeling in 3D Human Pose Estimation: A Unified Perspective
CVPR 2021
Towards Unified Surgical Skill Assessment
CVPR 2021
Forecasting Irreversible Disease via Progression Learning
CVPR 2021
TCGM: An Information-Theoretic Framework for Semi-Supervised Multi-Modality Learning
ECCV 2020
Learning Multi-Agent Coordination for Enhancing Target Coverage in Directional Sensor Networks
NIPS 2020
Pose-Assisted Multi-Camera Collaboration for Active Object Tracking
AAAI 2020
Cross-View Correspondence Reasoning Based on Bipartite Graph Convolutional Network for Mammogram Mass Detection
CVPR 2020
MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation
CVPR 2020
On Computation and Generalization of Generative Adversarial Imitation Learning
ICLR 2020
Align, Attend and Locate: Chest X-Ray Diagnosis via Contrast Induced Attention Network With Limited Supervision
ICCV 2019
Learning With Unsure Data for Medical Image Diagnosis
ICCV 2019
Optimizing Network Structure for 3D Human Pose Estimation
ICCV 2019
Max-MIG: an Information Theoretic Approach for Joint Learning from Crowds
ICLR 2019
Completeness Modeling and Context Separation for Weakly Supervised Temporal Action Localization
CVPR 2019
L_DMI: A Novel Information-theoretic Loss Function for Training Deep Nets Robust to Label Noise
NIPS 2019
AD-VAT: An Asymmetric Dueling mechanism for learning Visual Active Tracking
ICLR 2019
CRAVES: Controlling Robotic Arm With a Vision-Based Economic System
CVPR 2019
Multi-Agent Tensor Fusion for Contextual Trajectory Prediction
CVPR 2019
Cascaded Generative and Discriminative Learning for Microcalcification Detection in Breast Mammograms
CVPR 2019
MSplit LBI: Realizing Feature Selection and Dense Estimation Simultaneously in Few-shot and Zero-shot Learning
ICML 2018
End-to-end Active Object Tracking via Reinforcement Learning
ICML 2018
Video Object Segmentation by Learning Location-Sensitive Embeddings
ECCV 2018
Collaborative Deep Reinforcement Learning for Joint Object Search
CVPR 2017
Mining 3D Key-Pose-Motifs for Action Recognition
CVPR 2016
Maximal Sparsity with Deep Networks?
NIPS 2016
Quantized Correlation Hashing for Fast Cross-Modal Search
IJCAI 2015
Background Subtraction via Generalized Fused Lasso Foreground Modeling
CVPR 2015
Exploiting Object Similarity in 3D Reconstruction
ICCV 2015
Robust Estimation of 3D Human Poses from a Single Image
CVPR 2014
What Object Motion Reveals about Shape with Unknown BRDF and Lighting
CVPR 2013
A Method of Perceptual-Based Shape Decomposition
ICCV 2013
An Approach to Pose-Based Action Recognition
CVPR 2013
Weakly Supervised Learning for Attribute Localization in Outdoor Scenes
CVPR 2013