Kai Wang
139 papers · 2010–2026 · 17 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (23) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (8) π Conference Polyglot (17)
π
Renaissance Researcher
(8)
π
Interdisciplinary Bridge
π£
Hot Topic Early Bird
π
Conference Loyalist
(21)
π€
Dynamic Duo
(26)
π
Triple Crown
π§¬
Topic Evolution
π
Grand Slam
π₯
Mega-Team
(20)
π¬
Deep Specialist
(19)
π
Keyword Champion
(5)
π₯
Unstoppable
(10)
π
Trend Setter
β‘
Prolific Year
(27)
π
Century Club
(130)
β
The Questioner
(3)
ποΈ
Keyword Collector
(55)
π
Conference Pioneer
Conferences
AAAI (21)
NIPS (21)
CVPR (21)
ICCV (11)
ICLR (10)
EMNLP (10)
ACL (10)
ECCV (9)
ICML (6)
IJCAI (6)
WACV (4)
INTERSPEECH (3)
MICCAI (2)
UAI (2)
COLING (1)
AISTATS (1)
OSDI (1)
Top co-authors
Research topics
Keywords
diffusion model
(15)
large language model
(10)
neural network
(8)
image generation
(8)
transfer learning
(5)
knowledge distillation
(5)
representation learning
(5)
dataset distillation
(5)
graph neural network
(5)
image synthesis
(4)
contrastive learning
(4)
prompt learning
(4)
reinforcement learning
(4)
continual learning
(4)
few-shot learning
(4)
vision-language model
(4)
zero-shot learning
(4)
generative model
(4)
decision-focused learning
(4)
multimodal learning
(3)
Papers
TeCES: Collaborative Geometric Knowledge Representation Framework under Evolving Fact Snapshots
ACL 2026
SDNet: LiDAR Semantic Scene Completion with Sparse-Dense Fusion and Input-Aware Label Refinement
AAAI 2026
Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
WACV 2026
Agent-based Substructure Counting under Local Differential Privacy
ACL 2026
Empowering Tabular Data Preparation with Language Models: Why and How?
ACL 2026
KOALA: Knowledge of Optimization and Learning Algorithms for Healthcare
AAAI 2026
Mixture of Heterogeneous Grouped Experts for Language Modeling
ACL 2026
MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models
AAAI 2026
HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment
AAAI 2026
EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision
AAAI 2026
Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
CVPR 2025
Optimizing for the Shortest Path in Denoising Diffusion Model
CVPR 2025
What is the Right Notion of Distance between Predict-then-Optimize Tasks?
UAI 2025
Topology-Constrained Learning for Efficient Laparoscopic Liver Landmark Detection
MICCAI 2025
MedPro-DG: Domain-Aware Masked Contrastive Prompt Learning of Institution Generalization for Outcome Prediction
MICCAI 2025
Time-Frequency Disentanglement Boosted Pre-Training: A Universal Spatio-Temporal Modeling Framework
IJCAI 2025
ElaD-Net: An Elastic Semantic Decoupling Network for Lesion Segmentation in Breast Ultrasound Images
IJCAI 2025
DcDsDiff: Dual-Conditional and Dual-Stream Diffusion Model for Generative Image Tampering Localization
IJCAI 2025
Info-Coevolution: An Efficient Framework for Data Model Coevolution
ICML 2025
Efficient Online Reinforcement Learning for Diffusion Policy
ICML 2025
Unsupervised Learning for Class Distribution Mismatch
ICML 2025
$InterLCM$: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration
ICLR 2025
Drawing Informative Gradients from Sources: A One-stage Transfer Learning Framework for Cross-city Spatiotemporal Forecasting
AAAI 2025
CALLIC: Content Adaptive Learning for Lossless Image Compression
AAAI 2025
InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models
AAAI 2025
Single-View Graph Contrastive Learning with Soft Neighborhood Awareness
AAAI 2025
FilterTS: Comprehensive Frequency Filtering for Multivariate Time Series Forecasting
AAAI 2025
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
ICLR 2025
One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
ICLR 2025
Dynamic Diffusion Transformer
ICLR 2025
Real-Time Video Generation with Pyramid Attention Broadcast
ICLR 2025
Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier
WACV 2025
MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
ACL 2025
Primal-Dual Spectral Representation for Off-policy Evaluation
AISTATS 2025
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance
ICCV 2025
AR-1-to-3: Single Image to Consistent 3D Object via Next-View Prediction
ICCV 2025
Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing
ICCV 2025
TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
ICCV 2025
Permitted Knowledge Boundary: Evaluating the Knowledge-Constrained Responsiveness of Large Language Models
EMNLP 2025
EA-Vit: Efficient Adaptation for Elastic Vision Transformer
ICCV 2025
Fuzzy Reasoning Chain (FRC): An Innovative Reasoning Framework from Fuzziness to Clarity
EMNLP 2025
Self-Improvement in Multimodal Large Language Models: A Survey
EMNLP 2025
ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion
EMNLP 2025
DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models
EMNLP 2025
Distilling Long-tailed Datasets
CVPR 2025
The Art of Deception: Color Visual Illusions and Diffusion Models
CVPR 2025
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
CVPR 2025
One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models
CVPR 2025
A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs
CVPR 2025
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
NIPS 2024
Aligning Large Language Models with Representation Editing: A Control Perspective
NIPS 2024
GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning
NIPS 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality
NIPS 2024
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
NIPS 2024
Causal Deciphering and Inpainting in Spatio-Temporal Dynamics via Diffusion Model
NIPS 2024
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation
NIPS 2024
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis
NIPS 2024
First-Order Methods for Linearly Constrained Bilevel Optimization
NIPS 2024
EnMatch: Matchmaking for Better Player Engagement via Neural Combinatorial Optimization
AAAI 2024
Summarizing Stream Data for Memory-Constrained Online Continual Learning
AAAI 2024
LLM as Prompter: Low-resource Inductive Reasoning on Arbitrary Knowledge Graphs
ACL 2024
ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement
ECCV 2024
Exemplar-free Continual Representation Learning via Learnable Drift Compensation
ECCV 2024
Dataset Growth
ECCV 2024
A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis
ECCV 2024
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
EMNLP 2024
Can We Evaluate Domain Adaptation Models Without Target-Domain Labels?
ICLR 2024
NuwaDynamics: Discovering and Updating in Causal Spatio-Temporal Modeling
ICLR 2024
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning
ICLR 2024
Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching
ICLR 2024
DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation
ICML 2024
Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness
ICML 2024
Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window Matching
ICML 2024
Synthesizing Long-Form Speech merely from Sentence-Level Corpus with Content Extrapolation and LLM Contextual Enrichment
INTERSPEECH 2024
FarSight: A Physics-Driven Whole-Body Biometric System at Large Distance and Altitude
WACV 2024
Plasticity-Optimized Complementary Networks for Unsupervised Continual Learning
WACV 2024
Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health
AAAI 2023
Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing
NIPS 2023
Divide to Adapt: Mitigating Confirmation Bias for Domain Adaptation of Black-Box Predictors
ICLR 2023
Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion
NIPS 2023
Does Graph Distillation See Like Vision Dataset Counterpart?
NIPS 2023
PRIOR: Personalized Prior for Reactivating the Information Overlooked in Federated Learning.
NIPS 2023
BiCro: Noisy Correspondence Rectification for Multi-Modality Data via Bi-Directional Cross-Modal Similarity Consistency
CVPR 2023
Expanding Small-Scale Datasets with Guided Imagination
NIPS 2023
Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning
CVPR 2023
MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID
CVPR 2023
Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models
ICCV 2023
Dataset Quantization
ICCV 2023
DREAM: Efficient Dataset Distillation by Representative Matching
ICCV 2023
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
ICCV 2023
CORE: Co-planarity Regularized Monocular Geometry Estimation with Weak Supervision
ICCV 2023
Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models To Learn Any Unseen Style
CVPR 2023
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
AAAI 2023
Smoothed Online Combinatorial Optimization Using Imperfect Predictions
AAAI 2023
The Shape Part Slot Machine: Contact-Based Reasoning for Generating 3D Shapes from Parts
ECCV 2022
Instance-Guided Prompt Learning for Few-Shot Text Matching
EMNLP 2022
MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning
CVPR 2022
Crafting Better Contrastive Views for Siamese Representation Learning
CVPR 2022
CAFE: Learning To Condense Dataset by Aligning Features
CVPR 2022
Modeling Motion With Multi-Modal Features for Text-Based Video Segmentation
CVPR 2022
Less-forgetting Multi-lingual Fine-tuning
NIPS 2022
Attracting and Dispersing: A Simple Approach for Source-free Domain Adaptation
NIPS 2022
Decision-Focused Learning without Decision-Making: Learning Locally Optimized Decision Losses
NIPS 2022
Coordinating Followers to Reach Better Equilibria: End-to-End Gradient Descent for Stackelberg Games
AAAI 2022
A Transfer and Multi-Task Learning based Approach for MOS Prediction
INTERSPEECH 2022
Dataset Distillation via Factorization
NIPS 2022
An Efficient Training Approach for Very Large Scale Face Recognition
CVPR 2022
Point-to-Box Network for Accurate Object Detection via Single Point Supervision
ECCV 2022
DLME: Deep Local-Flatness Manifold Embedding
ECCV 2022
Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning
NIPS 2021
End-to-End Speech Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain
INTERSPEECH 2021
Labeling Trick: A Theory of Using Graph Neural Networks for Multi-Node Representation Learning
NIPS 2021
Hyperbolic Geometry is Not Necessary: Lightweight Euclidean-Based Models for Low-Dimensional Knowledge Graph Embeddings
EMNLP 2021
Interpretable Visual Reasoning via Induced Symbolic Space
ICCV 2021
Neighborhood Intervention Consistency: Measuring Confidence for Knowledge Graph Link Prediction
IJCAI 2021
Dual-Mandate Patrols: Multi-Armed Bandits for Green Security
AAAI 2021
Reinforcement Learning with a Disentangled Universal Value Function for Item Recommendation
AAAI 2021
Suppressing Uncertainties for Large-Scale Facial Expression Recognition
CVPR 2020
Interactive Dual Generative Adversarial Networks for Image Captioning
AAAI 2020
Robust Spatial-Temporal Incident Prediction
UAI 2020
Automatically Learning Compact Quality-aware Surrogates for Optimization Problems
NIPS 2020
On the Generation of Medical Question-Answer Pairs
AAAI 2020
PSENet: Psoriasis Severity Evaluation Network
AAAI 2020
Semantic Drift Compensation for Class-Incremental Learning
CVPR 2020
Multi-Domain Dialogue Acts and Response Co-Generation
ACL 2020
Low-Resource Generation of Multi-hop Reasoning Questions
ACL 2020
Relational Graph Attention Network for Aspect-based Sentiment Analysis
ACL 2020
Suppressing Mislabeled Data via Grouping and Self-Attention
ECCV 2020
A Robust Local Spectral Descriptor for Matching Non-Rigid Shapes With Incompatible Shape Structures
CVPR 2019
BiSET: Bi-directional Selective Encoding with Template for Abstractive Summarization
ACL 2019
Adversarial Machine Learning with Double Oracle
IJCAI 2019
Fast and Flexible Indoor Scene Synthesis via Deep Convolutional Generative Models
CVPR 2019
The Price of Usability: Designing Operationalizable Strategies for Security Games
IJCAI 2018
Sub-GAN: An Unsupervised Generative Model via Subspaces
ECCV 2018
LIUM-CVC Submissions for WMT18 Multimodal Translation Task
EMNLP 2018
RStream: Marrying Relational Algebra with Streaming for Efficient Graph Mining on A Single Machine
OSDI 2018
Richer Convolutional Features for Edge Detection
CVPR 2017
Domain-Assisted Product Aspect Hierarchy Generation: Towards Hierarchical Organization of Unstructured Consumer Reviews
EMNLP 2011
Exploiting Salient Patterns for Question Detection and Question Retrieval in Community-based Question Answering
COLING 2010