Kai Wang

139 papers · 2010–2026 · 17 top CS/AI conferences

Achievements


🗺️ Taxonomy Completionist (23) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (8) 🌍 Conference Polyglot (17)

🐣 Hot Topic Early Bird 🏠 Conference Loyalist (21) 🤝 Dynamic Duo (26) 👑 Triple Crown 🧬 Topic Evolution 🏆 Grand Slam 👥 Mega-Team (20) 🔬 Deep Specialist (19) 🏆 Keyword Champion (5) 🔥 Unstoppable (10) 📈 Trend Setter ⚡ Prolific Year (27) 💎 Century Club (130) ❓ The Questioner (3) 🗃️ Keyword Collector (55) 🚀 Conference Pioneer

Conferences

AAAI (21) NIPS (21) CVPR (21) ICCV (11) ICLR (10) EMNLP (10) ACL (10) ECCV (9) ICML (6) IJCAI (6) WACV (4) INTERSPEECH (3) MICCAI (2) UAI (2) COLING (1) AISTATS (1) OSDI (1)

Top co-authors

Yang You (27) Joost van de Weijer (13) Milind Tambe (10) Xiaojiang Peng (9) Wangbo Zhao (8) Ming-Ming Cheng (7) Humphrey Shi (7) Andrew Perrault (6) Zhaoxiang Liu (6) Yaxing Wang (6)

Research topics

Differential Privacy (1)

Keywords

diffusion model (15) large language model (10) neural network (8) image generation (8) transfer learning (5) knowledge distillation (5) representation learning (5) dataset distillation (5) graph neural network (5) image synthesis (4) contrastive learning (4) prompt learning (4) reinforcement learning (4) continual learning (4) few-shot learning (4) vision-language model (4) zero-shot learning (4) generative model (4) decision-focused learning (4) multimodal learning (3)

Papers

TeCES: Collaborative Geometric Knowledge Representation Framework under Evolving Fact Snapshots · ACL 2026
SDNet: LiDAR Semantic Scene Completion with Sparse-Dense Fusion and Input-Aware Label Refinement · AAAI 2026
Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models · WACV 2026
Agent-based Substructure Counting under Local Differential Privacy · ACL 2026
Empowering Tabular Data Preparation with Language Models: Why and How? · ACL 2026
KOALA: Knowledge of Optimization and Learning Algorithms for Healthcare · AAAI 2026
Mixture of Heterogeneous Grouped Experts for Language Modeling · ACL 2026
MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models · AAAI 2026
HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment · AAAI 2026
EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision · AAAI 2026
Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios · CVPR 2025
Optimizing for the Shortest Path in Denoising Diffusion Model · CVPR 2025
What is the Right Notion of Distance between Predict-then-Optimize Tasks? · UAI 2025
Topology-Constrained Learning for Efficient Laparoscopic Liver Landmark Detection · MICCAI 2025
MedPro-DG: Domain-Aware Masked Contrastive Prompt Learning of Institution Generalization for Outcome Prediction · MICCAI 2025
Time-Frequency Disentanglement Boosted Pre-Training: A Universal Spatio-Temporal Modeling Framework · IJCAI 2025
ElaD-Net: An Elastic Semantic Decoupling Network for Lesion Segmentation in Breast Ultrasound Images · IJCAI 2025
DcDsDiff: Dual-Conditional and Dual-Stream Diffusion Model for Generative Image Tampering Localization · IJCAI 2025
Info-Coevolution: An Efficient Framework for Data Model Coevolution · ICML 2025
Efficient Online Reinforcement Learning for Diffusion Policy · ICML 2025
Unsupervised Learning for Class Distribution Mismatch · ICML 2025
InterLCM: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration · ICLR 2025
Drawing Informative Gradients from Sources: A One-stage Transfer Learning Framework for Cross-city Spatiotemporal Forecasting · AAAI 2025
CALLIC: Content Adaptive Learning for Lossless Image Compression · AAAI 2025
InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models · AAAI 2025
Single-View Graph Contrastive Learning with Soft Neighborhood Awareness · AAAI 2025
FilterTS: Comprehensive Frequency Filtering for Multivariate Time Series Forecasting · AAAI 2025
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks · ICLR 2025
One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt · ICLR 2025
Dynamic Diffusion Transformer · ICLR 2025
Real-Time Video Generation with Pyramid Attention Broadcast · ICLR 2025
Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier · WACV 2025
MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification · ACL 2025
Primal-Dual Spectral Representation for Off-policy Evaluation · AISTATS 2025
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance · ICCV 2025
AR-1-to-3: Single Image to Consistent 3D Object via Next-View Prediction · ICCV 2025
Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing · ICCV 2025
TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction · ICCV 2025
Permitted Knowledge Boundary: Evaluating the Knowledge-Constrained Responsiveness of Large Language Models · EMNLP 2025
EA-Vit: Efficient Adaptation for Elastic Vision Transformer · ICCV 2025
Fuzzy Reasoning Chain (FRC): An Innovative Reasoning Framework from Fuzziness to Clarity · EMNLP 2025
Self-Improvement in Multimodal Large Language Models: A Survey · EMNLP 2025
ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion · EMNLP 2025
DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models · EMNLP 2025
Distilling Long-tailed Datasets · CVPR 2025
The Art of Deception: Color Visual Illusions and Diffusion Models · CVPR 2025
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training · CVPR 2025
One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models · CVPR 2025
A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs · CVPR 2025
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning · NIPS 2024
Aligning Large Language Models with Representation Editing: A Control Perspective · NIPS 2024
GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning · NIPS 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality · NIPS 2024
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark · NIPS 2024
Causal Deciphering and Inpainting in Spatio-Temporal Dynamics via Diffusion Model · NIPS 2024
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation · NIPS 2024
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis · NIPS 2024
First-Order Methods for Linearly Constrained Bilevel Optimization · NIPS 2024
EnMatch: Matchmaking for Better Player Engagement via Neural Combinatorial Optimization · AAAI 2024
Summarizing Stream Data for Memory-Constrained Online Continual Learning · AAAI 2024
LLM as Prompter: Low-resource Inductive Reasoning on Arbitrary Knowledge Graphs · ACL 2024
ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement · ECCV 2024
Exemplar-free Continual Representation Learning via Learnable Drift Compensation · ECCV 2024
Dataset Growth · ECCV 2024
A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis · ECCV 2024
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation · EMNLP 2024
Can We Evaluate Domain Adaptation Models Without Target-Domain Labels? · ICLR 2024
NuwaDynamics: Discovering and Updating in Causal Spatio-Temporal Modeling · ICLR 2024
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning · ICLR 2024
Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching · ICLR 2024
DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation · ICML 2024
Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness · ICML 2024
Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window Matching · ICML 2024
Synthesizing Long-Form Speech merely from Sentence-Level Corpus with Content Extrapolation and LLM Contextual Enrichment · INTERSPEECH 2024
FarSight: A Physics-Driven Whole-Body Biometric System at Large Distance and Altitude · WACV 2024
Plasticity-Optimized Complementary Networks for Unsupervised Continual Learning · WACV 2024
Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health · AAAI 2023
Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing · NIPS 2023
Divide to Adapt: Mitigating Confirmation Bias for Domain Adaptation of Black-Box Predictors · ICLR 2023
Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion · NIPS 2023
Does Graph Distillation See Like Vision Dataset Counterpart? · NIPS 2023
PRIOR: Personalized Prior for Reactivating the Information Overlooked in Federated Learning · NIPS 2023
BiCro: Noisy Correspondence Rectification for Multi-Modality Data via Bi-Directional Cross-Modal Similarity Consistency · CVPR 2023
Expanding Small-Scale Datasets with Guided Imagination · NIPS 2023
Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning · CVPR 2023
MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID · CVPR 2023
Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models · ICCV 2023
Dataset Quantization · ICCV 2023
DREAM: Efficient Dataset Distillation by Representative Matching · ICCV 2023
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model · ICCV 2023
CORE: Co-planarity Regularized Monocular Geometry Estimation with Weak Supervision · ICCV 2023
Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models To Learn Any Unseen Style · CVPR 2023
Optimistic Whittle Index Policy: Online Learning for Restless Bandits · AAAI 2023
Smoothed Online Combinatorial Optimization Using Imperfect Predictions · AAAI 2023
The Shape Part Slot Machine: Contact-Based Reasoning for Generating 3D Shapes from Parts · ECCV 2022
Instance-Guided Prompt Learning for Few-Shot Text Matching · EMNLP 2022
MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning · CVPR 2022
Crafting Better Contrastive Views for Siamese Representation Learning · CVPR 2022
CAFE: Learning To Condense Dataset by Aligning Features · CVPR 2022
Modeling Motion With Multi-Modal Features for Text-Based Video Segmentation · CVPR 2022
Less-forgetting Multi-lingual Fine-tuning · NIPS 2022
Attracting and Dispersing: A Simple Approach for Source-free Domain Adaptation · NIPS 2022
Decision-Focused Learning without Decision-Making: Learning Locally Optimized Decision Losses · NIPS 2022
Coordinating Followers to Reach Better Equilibria: End-to-End Gradient Descent for Stackelberg Games · AAAI 2022
A Transfer and Multi-Task Learning based Approach for MOS Prediction · INTERSPEECH 2022
Dataset Distillation via Factorization · NIPS 2022
An Efficient Training Approach for Very Large Scale Face Recognition · CVPR 2022
Point-to-Box Network for Accurate Object Detection via Single Point Supervision · ECCV 2022
DLME: Deep Local-Flatness Manifold Embedding · ECCV 2022
Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning · NIPS 2021
End-to-End Speech Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain · INTERSPEECH 2021
Labeling Trick: A Theory of Using Graph Neural Networks for Multi-Node Representation Learning · NIPS 2021
Hyperbolic Geometry is Not Necessary: Lightweight Euclidean-Based Models for Low-Dimensional Knowledge Graph Embeddings · EMNLP 2021
Interpretable Visual Reasoning via Induced Symbolic Space · ICCV 2021
Neighborhood Intervention Consistency: Measuring Confidence for Knowledge Graph Link Prediction · IJCAI 2021
Dual-Mandate Patrols: Multi-Armed Bandits for Green Security · AAAI 2021
Reinforcement Learning with a Disentangled Universal Value Function for Item Recommendation · AAAI 2021
Suppressing Uncertainties for Large-Scale Facial Expression Recognition · CVPR 2020
Interactive Dual Generative Adversarial Networks for Image Captioning · AAAI 2020
Robust Spatial-Temporal Incident Prediction · UAI 2020
Automatically Learning Compact Quality-aware Surrogates for Optimization Problems · NIPS 2020
On the Generation of Medical Question-Answer Pairs · AAAI 2020
PSENet: Psoriasis Severity Evaluation Network · AAAI 2020
Semantic Drift Compensation for Class-Incremental Learning · CVPR 2020
Multi-Domain Dialogue Acts and Response Co-Generation · ACL 2020
Low-Resource Generation of Multi-hop Reasoning Questions · ACL 2020
Relational Graph Attention Network for Aspect-based Sentiment Analysis · ACL 2020
Suppressing Mislabeled Data via Grouping and Self-Attention · ECCV 2020
A Robust Local Spectral Descriptor for Matching Non-Rigid Shapes With Incompatible Shape Structures · CVPR 2019
BiSET: Bi-directional Selective Encoding with Template for Abstractive Summarization · ACL 2019
Adversarial Machine Learning with Double Oracle · IJCAI 2019
Fast and Flexible Indoor Scene Synthesis via Deep Convolutional Generative Models · CVPR 2019
The Price of Usability: Designing Operationalizable Strategies for Security Games · IJCAI 2018
Sub-GAN: An Unsupervised Generative Model via Subspaces · ECCV 2018
LIUM-CVC Submissions for WMT18 Multimodal Translation Task · EMNLP 2018
RStream: Marrying Relational Algebra with Streaming for Efficient Graph Mining on A Single Machine · OSDI 2018
Richer Convolutional Features for Edge Detection · CVPR 2017
Domain-Assisted Product Aspect Hierarchy Generation: Towards Hierarchical Organization of Unstructured Consumer Reviews · EMNLP 2011
Exploiting Salient Patterns for Question Detection and Question Retrieval in Community-based Question Answering · COLING 2010