Jian Wang
143 papers · 2015–2026 · 18 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (13) π Interdisciplinary Bridge π Renaissance Researcher (6) π Conference Polyglot (18)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(13)
π§
Keyword Pioneer
π
Keyword Trendsetter Combo
(3)
π
Conference Loyalist
(21)
π
Grand Slam
π¬
Deep Specialist
(16)
π§¬
Topic Evolution
π
Keyword Champion
π€
Dynamic Duo
(16)
β
The Questioner
(2)
π
Trend Setter
ποΈ
Keyword Collector
(561)
π
Century Club
(126)
π
Conference Pioneer
π₯
Unstoppable
(10)
β‘
Prolific Year
(10)
Conferences
CVPR (28)
AAAI (22)
ICCV (21)
ACL (15)
ECCV (12)
EMNLP (10)
NIPS (9)
ICML (6)
IJCAI (4)
MICCAI (3)
COLING (3)
WACV (3)
NAACL (2)
ICLR (1)
IJCNLP (1)
INTERSPEECH (1)
MIDL (1)
SEMEVAL (1)
Top co-authors
Research topics
Keywords
large language model
(11)
human pose estimation
(10)
domain adaptation
(7)
egocentric vision
(7)
object detection
(7)
semantic segmentation
(7)
multimodal learning
(7)
neural network
(6)
diffusion model
(5)
contrastive learning
(5)
person re-identification
(5)
dialogue system
(5)
3d pose estimation
(5)
graph neural network
(5)
generative model
(4)
attention mechanism
(4)
adversarial attack
(4)
vision-language model
(4)
depth estimation
(4)
image restoration
(4)
Papers
OptScale: Probabilistic Optimality for Inference-time Scaling
AAAI 2026
Efficient Reinforcement Learning for Zero-Shot Coordination in Evolving Games
AAAI 2026
Top-Down Semantic Refinement for Image Captioning
AAAI 2026
PulseMind: A Multi-Modal Medical Model for Real-World Clinical Diagnosis
AAAI 2026
Invisible Triggers, Visible Threats! Road-Style Adversarial Creation Attack for Visual 3D Detection in Autonomous Driving
AAAI 2026
3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale
AAAI 2026
WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning
ACL 2026
Cost-Effective Communication: An Auction-based Method for Language Agent Interaction
AAAI 2026
Federated Context-Aware Personalized Recommendation
AAAI 2026
Foresight Optimization for Strategic Reasoning in Large Language Models
ACL 2026
Perplexity-Aware Data Scaling Law: Perplexity Landscapes Predict Performance for Continual Pre-training
ACL 2026
Reinforcement Learning for Diffusion LLMs via Energy-Based Gibbs Alignment
ACL 2026
Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars
WACV 2026
PRGB Benchmark: A Robust Placeholder-Assisted Algorithm for Benchmarking Retrieval-Augmented Generation
AAAI 2026
LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction
AAAI 2026
Where and What: Reasoning Dynamic and Implicit Preferences in Situated Conversational Recommendation
ACL 2026
Trustworthy AI-Assisted Programming: Detection and Repair of Unreliable Code
AAAI 2026
MHB: Medical Hallucination Benchmark for Large Language Models in Complex Clinical Tasks
AAAI 2026
Temporal Atlas-Guided Generation of Longitudinal Data via Geometric Latent Embeddings
MICCAI 2025
Hierarchical Corpus-View-Category Refinement for Carotid Plaque Risk Grading in Ultrasound
MICCAI 2025
Accurate and Efficient Fetal Birth Weight Estimation from 3D Ultrasound
MICCAI 2025
An Empirical Study of Federated Prompt Learning for Vision Language Model
IJCAI 2025
KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems
ICML 2025
RAGDiffusion: Faithful Cloth Generation via External Knowledge Assimilation
ICCV 2025
T2Bs: Text-to-Character Blendshapes via Video Generation
ICCV 2025
TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control
ICCV 2025
Training-Free Text-Guided Image Editing with Visual Autoregressive Model
ICCV 2025
Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation
ICCV 2025
Class Token as Proxy: Optimal Transport-assisted Proxy Learning for Weakly Supervised Semantic Segmentation
ICCV 2025
Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation
ICCV 2025
Inducing Argument Facets for Faithful Opinion Summarization
EMNLP 2025
Benchmarking for Domain-Specific LLMs: A Case Study on Academia and Beyond
EMNLP 2025
SceneMI: Motion In-betweening for Modeling Human-Scene Interaction
ICCV 2025
FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video
CVPR 2025
Style Quantization for Data-Efficient GAN Training
CVPR 2025
POT: Prototypical Optimal Transport for Weakly Supervised Semantic Segmentation
CVPR 2025
Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input
CVPR 2025
KVQ: Boosting Video Quality Assessment via Saliency-guided Local Perception
CVPR 2025
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling
CVPR 2025
HIRAG: Hierarchical-Thought Instruction-Tuning Retrieval-Augmented Generation
EMNLP 2025
DrDiff: Dynamic Routing Diffusion with Hierarchical Attention for Breaking the Efficiency-Quality Trade-off
EMNLP 2025
Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization
AAAI 2025
Federated Recommendation with Explicitly Encoding Item Bias
AAAI 2025
Discrete Curvature Graph Information Bottleneck
AAAI 2025
Why Safeguarded Ships Run Aground? Aligned Large Language Modelsβ Safety Mechanisms Tend to Be Anchored in The Template Region
ACL 2025
STeCa: Step-level Trajectory Calibration for LLM Agent Learning
ACL 2025
Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors
ACL 2025
Empowering Persuasion Detection in Slavic Texts through Two-Stage Generative Reasoning
ACL 2025
Copy or Not? Reference-Based Face Image Restoration with Fine Details
WACV 2025
Prototype Tuning: A Meta-Learning Approach for Few-Shot Document-Level Relation Extraction with Large Language Models
NAACL 2025
EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching
ECCV 2024
Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
NIPS 2024
Robust Communicative Multi-Agent Reinforcement Learning with Active Defense
AAAI 2024
Cooper: Coordinating Specialized Agents towards a Complex Dialogue Goal
AAAI 2024
Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue
ACL 2024
Towards Better Vision-Inspired Vision-Language Models
CVPR 2024
RobustSAM: Segment Anything Robustly on Degraded Images
CVPR 2024
EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams
CVPR 2024
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer
CVPR 2024
3D Human Pose Perception from Egocentric Stereo Videos
CVPR 2024
Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement
CVPR 2024
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery
CVPR 2024
POA: Pre-training Once for Models of All Sizes
ECCV 2024
"Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation"
ECCV 2024
Delving Deep into Engagement Prediction of Short Videos
ECCV 2024
Holodepth: Programmable Depth-Varying Projection via Computer-Generated Holography
ECCV 2024
ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models
EMNLP 2024
E2CL: Exploration-based Error Correction Learning for Embodied Agents
EMNLP 2024
MS$^3$D: A RG Flow-Based Regularization for GAN Training with Limited Data
ICML 2024
Exponential Spectral Pursuit: An Effective Initialization Method for Sparse Phase Retrieval
ICML 2024
Mobile Attention: Mobile-Friendly Linear-Attention for Vision Transformers
ICML 2024
Joint Motion Estimation with Geometric Deformation Correction for Fetal Echo Planar Images Via Deep Learning
MIDL 2024
CoT-based Data Augmentation Strategy for Persuasion Techniques Detection
NAACL 2024
CoT-based Data Augmentation Strategy for Persuasion Techniques Detection
SEMEVAL 2024
Unified Pre-Training with Pseudo Texts for Text-To-Image Person Re-Identification
ICCV 2023
Self-Detoxifying Language Models via Toxification Reversal
EMNLP 2023
Stroke Extraction of Chinese Character Based on Deep Structure Deformable Image Registration
AAAI 2023
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
NIPS 2023
COLA: Improving Conversational Recommender Systems by Collaborative Augmentation
AAAI 2023
Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation
EMNLP 2023
Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation
CVPR 2023
PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation With Progressive Video Transformers
CVPR 2023
Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue
ACL 2023
Medical Dialogue Generation via Dual Flow Modeling
ACL 2023
Self-Supervised 2D/3D Registration for X-Ray to CT Image Fusion
WACV 2023
A Unified Conditional Framework for Diffusion-based Image Restoration
NIPS 2023
Graph Contrastive Learning for Skeleton-based Action Recognition
ICLR 2023
Energy-Efficient Adaptive 3D Sensing
CVPR 2023
Scene-Aware Egocentric 3D Human Pose Estimation
CVPR 2023
Uncertainty-guided Learning for Improving Image Manipulation Detection
ICCV 2023
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
ICCV 2023
Group Pose: A Simple Baseline for End-to-End Multi-Person Pose Estimation
ICCV 2023
s-Adaptive Decoupled Prototype for Few-Shot Object Detection
ICCV 2023
UFO: Unified Feature Optimization
ECCV 2022
3D Photo Stylization: Learning To Generate Stylized Novel Views From a Single Image
CVPR 2022
Implicit Sample Extension for Unsupervised Person Re-Identification
CVPR 2022
Training Object Detectors From Scratch: An Empirical Study in the Era of Vision Transformer
CVPR 2022
Estimating Egocentric 3D Human Pose in the Wild With External Weak Supervision
CVPR 2022
MixFormer: Mixing Features Across Windows and Dimensions
CVPR 2022
Human-Object Interaction Detection via Disentangled Transformer
CVPR 2022
Uncertainty Modeling in Generative Compressed Sensing
ICML 2022
Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning
NIPS 2022
Geo-SIC: Learning Deformable Geometric Shapes in Deep Image Classifiers
NIPS 2022
Self-Guided Hard Negative Generation for Unsupervised Person Re-Identification
IJCAI 2022
RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer
NIPS 2022
RealMedDial: A Real Telemedical Dialogue Dataset Collected from Online Chinese Short-Video Clips
COLING 2022
Two Languages Are Better than One: Bilingual Enhancement for Chinese Named Entity Recognition
COLING 2022
Domain-specific knowledge distillation yields smaller and better models for conversational commerce
ACL 2022
Action Quality Assessment with Temporal Parsing Transformer
ECCV 2022
UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture
ECCV 2022
Seeing Far in the Dark with Patterned Flash
ECCV 2022
Hierarchical Memory Learning for Fine-Grained Scene Graph Generation
ECCV 2022
Focus on Interaction: A Novel Dynamic Graph Model for Joint Multiple Intent Detection and Slot Filling
IJCAI 2021
One Shot Face Swapping on Megapixels
CVPR 2021
Unsupervised Multi-Source Domain Adaptation for Person Re-Identification
CVPR 2021
Estimating Egocentric 3D Human Pose in Global Space
ICCV 2021
MFNet: Multi-Filter Directive Network for Weakly Supervised Salient Object Detection
ICCV 2021
Mining Contextual Information Beyond Image for Semantic Segmentation
ICCV 2021
RNNRepair: Automatic RNN Repair via Model-based Analysis
ICML 2021
Seeing in Extra Darkness Using a Deep-Red Flash
CVPR 2021
Group Contextual Encoding for 3D Point Clouds
NIPS 2020
Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement
ECCV 2020
Watch out! Motion is Blurring the Vision of Your Deep Neural Networks
NIPS 2020
Working Memory-Driven Neural Networks with a Novel Knowledge Enhancement Paradigm for Implicit Discourse Relation Recognition
AAAI 2020
Dual Dynamic Memory Network for End-to-End Multi-turn Task-oriented Dialog Systems
COLING 2020
Improving Knowledge-Aware Dialogue Generation via Knowledge Base Question Answering
AAAI 2020
TransS-Driven Joint Learning Architecture for Implicit Discourse Relation Recognition
ACL 2020
FakeSpotter: A Simple yet Robust Baseline for Spotting AI-Synthesized Fake Faces
IJCAI 2020
Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce
AAAI 2020
DeepFLASH: An Efficient Network for Learning-Based Medical Image Registration
CVPR 2020
Agile Depth Sensing Using Triangulation Light Curtains
ICCV 2019
Micro-Baseline Structured Light
ICCV 2019
Joint Maximization Decoder with Neural Converters for Fully Neural Network-Based Japanese Speech Recognition
INTERSPEECH 2019
Re-Identification Supervised Texture Generation
CVPR 2019
Think Visually: Question Answering through Virtual Imagery
ACL 2018
Programmable Triangulation Light Curtains
ECCV 2018
WECA: A WordNet-Encoded Collocation-Attention Network for Homographic Pun Recognition
EMNLP 2018
Deep Metric Learning With Angular Loss
ICCV 2017
Reflectance Capture Using Univariate Sampling of BRDFs
ICCV 2017
Premise Selection for Theorem Proving by Deep Graph Embedding
NIPS 2017
Alibaba at IJCNLP-2017 Task 2: A Boosted Deep System for Dimensional Sentiment Analysis of Chinese Phrases
IJCNLP 2017
Photometric Stereo With Small Angular Variations
ICCV 2015
Biography-Dependent Collaborative Entity Archiving for Slot Filling
EMNLP 2015