Jian Wang

143 papers · 2015–2026 · 18 conferences · across top CS/AI conferences

Achievements

+17 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (13) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (18)

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (13) 🧭 Keyword Pioneer 🌟 Keyword Trendsetter Combo (3) 🏠 Conference Loyalist (21) 🏆 Grand Slam 🔬 Deep Specialist (16) 🧬 Topic Evolution 🏆 Keyword Champion 🤝 Dynamic Duo (16) ❓ The Questioner (2) 📈 Trend Setter 🗃️ Keyword Collector (561) 💎 Century Club (126) 🚀 Conference Pioneer 🔥 Unstoppable (10) ⚡ Prolific Year (10)

Conferences

CVPR (28) AAAI (22) ICCV (21) ACL (15) ECCV (12) EMNLP (10) NIPS (9) ICML (6) IJCAI (4) MICCAI (3) COLING (3) WACV (3) NAACL (2) ICLR (1) IJCNLP (1) INTERSPEECH (1) MIDL (1) SEMEVAL (1)

Top co-authors

Jingdong Wang (16) Errui Ding (15) Wenjie Li (14) Jingdong Chen (10) Christian Theobalt (10) Junyu Han (8) Hongfei Lin (8) Jusheng Zhang (7) Keze Wang (7) Haocheng Feng (6)

Research topics

Privacy (1)

Keywords

large language model (11) human pose estimation (10) domain adaptation (7) egocentric vision (7) object detection (7) semantic segmentation (7) multimodal learning (7) neural network (6) diffusion model (5) contrastive learning (5) person re-identification (5) dialogue system (5) 3d pose estimation (5) graph neural network (5) generative model (4) attention mechanism (4) adversarial attack (4) vision-language model (4) depth estimation (4) image restoration (4)

Papers

OptScale: Probabilistic Optimality for Inference-time Scaling AAAI 2026 Efficient Reinforcement Learning for Zero-Shot Coordination in Evolving Games AAAI 2026 Top-Down Semantic Refinement for Image Captioning AAAI 2026 PulseMind: A Multi-Modal Medical Model for Real-World Clinical Diagnosis AAAI 2026 Invisible Triggers, Visible Threats! Road-Style Adversarial Creation Attack for Visual 3D Detection in Autonomous Driving AAAI 2026 3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale AAAI 2026 WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning ACL 2026 Cost-Effective Communication: An Auction-based Method for Language Agent Interaction AAAI 2026 Federated Context-Aware Personalized Recommendation AAAI 2026 Foresight Optimization for Strategic Reasoning in Large Language Models ACL 2026 Perplexity-Aware Data Scaling Law: Perplexity Landscapes Predict Performance for Continual Pre-training ACL 2026 Reinforcement Learning for Diffusion LLMs via Energy-Based Gibbs Alignment ACL 2026 Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars WACV 2026 PRGB Benchmark: A Robust Placeholder-Assisted Algorithm for Benchmarking Retrieval-Augmented Generation AAAI 2026 LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction AAAI 2026 Where and What: Reasoning Dynamic and Implicit Preferences in Situated Conversational Recommendation ACL 2026 Trustworthy AI-Assisted Programming: Detection and Repair of Unreliable Code AAAI 2026 MHB: Medical Hallucination Benchmark for Large Language Models in Complex Clinical Tasks AAAI 2026 Temporal Atlas-Guided Generation of Longitudinal Data via Geometric Latent Embeddings MICCAI 2025 Hierarchical Corpus-View-Category Refinement for Carotid Plaque Risk Grading in Ultrasound MICCAI 2025 Accurate and Efficient Fetal Birth Weight Estimation from 3D Ultrasound MICCAI 2025 An Empirical Study of Federated Prompt Learning for Vision Language Model IJCAI 2025 KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems ICML 2025 RAGDiffusion: Faithful Cloth Generation via External Knowledge Assimilation ICCV 2025 T2Bs: Text-to-Character Blendshapes via Video Generation ICCV 2025 TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control ICCV 2025 Training-Free Text-Guided Image Editing with Visual Autoregressive Model ICCV 2025 Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation ICCV 2025 Class Token as Proxy: Optimal Transport-assisted Proxy Learning for Weakly Supervised Semantic Segmentation ICCV 2025 Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation ICCV 2025 Inducing Argument Facets for Faithful Opinion Summarization EMNLP 2025 Benchmarking for Domain-Specific LLMs: A Case Study on Academia and Beyond EMNLP 2025 SceneMI: Motion In-betweening for Modeling Human-Scene Interaction ICCV 2025 FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video CVPR 2025 Style Quantization for Data-Efficient GAN Training CVPR 2025 POT: Prototypical Optimal Transport for Weakly Supervised Semantic Segmentation CVPR 2025 Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input CVPR 2025 KVQ: Boosting Video Quality Assessment via Saliency-guided Local Perception CVPR 2025 SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling CVPR 2025 HIRAG: Hierarchical-Thought Instruction-Tuning Retrieval-Augmented Generation EMNLP 2025 DrDiff: Dynamic Routing Diffusion with Hierarchical Attention for Breaking the Efficiency-Quality Trade-off EMNLP 2025 Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization AAAI 2025 Federated Recommendation with Explicitly Encoding Item Bias AAAI 2025 Discrete Curvature Graph Information Bottleneck AAAI 2025 Why Safeguarded Ships Run Aground? Aligned Large Language Models’ Safety Mechanisms Tend to Be Anchored in The Template Region ACL 2025 STeCa: Step-level Trajectory Calibration for LLM Agent Learning ACL 2025 Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors ACL 2025 Empowering Persuasion Detection in Slavic Texts through Two-Stage Generative Reasoning ACL 2025 Copy or Not? Reference-Based Face Image Restoration with Fine Details WACV 2025 Prototype Tuning: A Meta-Learning Approach for Few-Shot Document-Level Relation Extraction with Large Language Models NAACL 2025 EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching ECCV 2024 Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight NIPS 2024 Robust Communicative Multi-Agent Reinforcement Learning with Active Defense AAAI 2024 Cooper: Coordinating Specialized Agents towards a Complex Dialogue Goal AAAI 2024 Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue ACL 2024 Towards Better Vision-Inspired Vision-Language Models CVPR 2024 RobustSAM: Segment Anything Robustly on Degraded Images CVPR 2024 EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams CVPR 2024 DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer CVPR 2024 3D Human Pose Perception from Egocentric Stereo Videos CVPR 2024 Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement CVPR 2024 SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery CVPR 2024 POA: Pre-training Once for Models of All Sizes ECCV 2024 "Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation" ECCV 2024 Delving Deep into Engagement Prediction of Short Videos ECCV 2024 Holodepth: Programmable Depth-Varying Projection via Computer-Generated Holography ECCV 2024 ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models EMNLP 2024 E2CL: Exploration-based Error Correction Learning for Embodied Agents EMNLP 2024 MS$^3$D: A RG Flow-Based Regularization for GAN Training with Limited Data ICML 2024 Exponential Spectral Pursuit: An Effective Initialization Method for Sparse Phase Retrieval ICML 2024 Mobile Attention: Mobile-Friendly Linear-Attention for Vision Transformers ICML 2024 Joint Motion Estimation with Geometric Deformation Correction for Fetal Echo Planar Images Via Deep Learning MIDL 2024 CoT-based Data Augmentation Strategy for Persuasion Techniques Detection NAACL 2024 CoT-based Data Augmentation Strategy for Persuasion Techniques Detection SEMEVAL 2024 Unified Pre-Training with Pseudo Texts for Text-To-Image Person Re-Identification ICCV 2023 Self-Detoxifying Language Models via Toxification Reversal EMNLP 2023 Stroke Extraction of Chinese Character Based on Deep Structure Deformable Image Registration AAAI 2023 HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception NIPS 2023 COLA: Improving Conversational Recommender Systems by Collaborative Augmentation AAAI 2023 Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation EMNLP 2023 Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation CVPR 2023 PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation With Progressive Video Transformers CVPR 2023 Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue ACL 2023 Medical Dialogue Generation via Dual Flow Modeling ACL 2023 Self-Supervised 2D/3D Registration for X-Ray to CT Image Fusion WACV 2023 A Unified Conditional Framework for Diffusion-based Image Restoration NIPS 2023 Graph Contrastive Learning for Skeleton-based Action Recognition ICLR 2023 Energy-Efficient Adaptive 3D Sensing CVPR 2023 Scene-Aware Egocentric 3D Human Pose Estimation CVPR 2023 Uncertainty-guided Learning for Improving Image Manipulation Detection ICCV 2023 Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment ICCV 2023 Group Pose: A Simple Baseline for End-to-End Multi-Person Pose Estimation ICCV 2023 s-Adaptive Decoupled Prototype for Few-Shot Object Detection ICCV 2023 UFO: Unified Feature Optimization ECCV 2022 3D Photo Stylization: Learning To Generate Stylized Novel Views From a Single Image CVPR 2022 Implicit Sample Extension for Unsupervised Person Re-Identification CVPR 2022 Training Object Detectors From Scratch: An Empirical Study in the Era of Vision Transformer CVPR 2022 Estimating Egocentric 3D Human Pose in the Wild With External Weak Supervision CVPR 2022 MixFormer: Mixing Features Across Windows and Dimensions CVPR 2022 Human-Object Interaction Detection via Disentangled Transformer CVPR 2022 Uncertainty Modeling in Generative Compressed Sensing ICML 2022 Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning NIPS 2022 Geo-SIC: Learning Deformable Geometric Shapes in Deep Image Classifiers NIPS 2022 Self-Guided Hard Negative Generation for Unsupervised Person Re-Identification IJCAI 2022 RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer NIPS 2022 RealMedDial: A Real Telemedical Dialogue Dataset Collected from Online Chinese Short-Video Clips COLING 2022 Two Languages Are Better than One: Bilingual Enhancement for Chinese Named Entity Recognition COLING 2022 Domain-specific knowledge distillation yields smaller and better models for conversational commerce ACL 2022 Action Quality Assessment with Temporal Parsing Transformer ECCV 2022 UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture ECCV 2022 Seeing Far in the Dark with Patterned Flash ECCV 2022 Hierarchical Memory Learning for Fine-Grained Scene Graph Generation ECCV 2022 Focus on Interaction: A Novel Dynamic Graph Model for Joint Multiple Intent Detection and Slot Filling IJCAI 2021 One Shot Face Swapping on Megapixels CVPR 2021 Unsupervised Multi-Source Domain Adaptation for Person Re-Identification CVPR 2021 Estimating Egocentric 3D Human Pose in Global Space ICCV 2021 MFNet: Multi-Filter Directive Network for Weakly Supervised Salient Object Detection ICCV 2021 Mining Contextual Information Beyond Image for Semantic Segmentation ICCV 2021 RNNRepair: Automatic RNN Repair via Model-based Analysis ICML 2021 Seeing in Extra Darkness Using a Deep-Red Flash CVPR 2021 Group Contextual Encoding for 3D Point Clouds NIPS 2020 Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement ECCV 2020 Watch out! Motion is Blurring the Vision of Your Deep Neural Networks NIPS 2020 Working Memory-Driven Neural Networks with a Novel Knowledge Enhancement Paradigm for Implicit Discourse Relation Recognition AAAI 2020 Dual Dynamic Memory Network for End-to-End Multi-turn Task-oriented Dialog Systems COLING 2020 Improving Knowledge-Aware Dialogue Generation via Knowledge Base Question Answering AAAI 2020 TransS-Driven Joint Learning Architecture for Implicit Discourse Relation Recognition ACL 2020 FakeSpotter: A Simple yet Robust Baseline for Spotting AI-Synthesized Fake Faces IJCAI 2020 Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce AAAI 2020 DeepFLASH: An Efficient Network for Learning-Based Medical Image Registration CVPR 2020 Agile Depth Sensing Using Triangulation Light Curtains ICCV 2019 Micro-Baseline Structured Light ICCV 2019 Joint Maximization Decoder with Neural Converters for Fully Neural Network-Based Japanese Speech Recognition INTERSPEECH 2019 Re-Identification Supervised Texture Generation CVPR 2019 Think Visually: Question Answering through Virtual Imagery ACL 2018 Programmable Triangulation Light Curtains ECCV 2018 WECA: A WordNet-Encoded Collocation-Attention Network for Homographic Pun Recognition EMNLP 2018 Deep Metric Learning With Angular Loss ICCV 2017 Reflectance Capture Using Univariate Sampling of BRDFs ICCV 2017 Premise Selection for Theorem Proving by Deep Graph Embedding NIPS 2017 Alibaba at IJCNLP-2017 Task 2: A Boosted Deep System for Dimensional Sentiment Analysis of Chinese Phrases IJCNLP 2017 Photometric Stereo With Small Angular Variations ICCV 2015 Biography-Dependent Collaborative Entity Archiving for Slot Filling EMNLP 2015