Chao Wang

114 papers · 2003–2026 · 20 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🧭 Keyword Pioneer 🌈 Renaissance Researcher (8) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (27) 🐣 Hot Topic Early Bird

🗺️ Taxonomy Completionist (27) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🏠 Conference Loyalist (22) 🔬 Deep Specialist (13) 🏆 Keyword Champion 🏆 Grand Slam 👥 Mega-Team (32) 🗃️ Keyword Collector (70) 🔥 Unstoppable (9) ⚡ Prolific Year (10) 📈 Trend Setter 🚀 Conference Pioneer 💎 Century Club (101)

Conferences

AAAI (22) INTERSPEECH (16) ACL (12) CVPR (8) NIPS (6) IJCAI (6) ICML (6) ICCV (6) EMNLP (5) ECCV (5) NAACL (5) MICCAI (4) IJCNLP (3) WACV (3) COLING (2) ICLR (1) EACL (1) CONLL (1) NSDI (1) OSDI (1)

Top co-authors

Hui Xiong (10) Chuan Qin (8) Chieh-Chi Kao (8) Ming Sun (8) Hengshu Zhu (6) Weiran Wang (5) Shuangyong Song (5) Viktor Rozgic (4) Dazhong Shen (4) Zihan Wang (4)

Keywords

attention mechanism (6) domain adaptation (5) representation learning (5) neural network (5) diffusion model (5) vision-language model (4) reinforcement learning (4) knowledge distillation (4) image restoration (4) large language model (4) semi-supervised learning (4) graph neural network (4) transfer learning (4) zero-shot learning (4) weakly supervised learning (3) uncertainty estimation (3) transformer encoder (3) contrastive learning (3) knowledge graph (3) model compression (3)

Papers

Clear Sights on Site: A Spatial-Adaptive Channel Network for Deblurring Construction Site Images WACV 2026 Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching AAAI 2026 Enhancing Conversational Recommender Systems with Tree-Structured Knowledge and Pretrained Language Models AAAI 2026 AEDR: Training-Free AI-Generated Image Attribution via Autoencoder Double-Reconstruction AAAI 2026 Small but Mighty: Dynamic Wavelet Expert-Guided Fine-Tuning of Large-Scale Models for Optical Remote Sensing Object Segmentation AAAI 2026 Integrating Reweighted Least Squares with Plug-and-Play Diffusion Priors for Noisy Image Restoration AAAI 2026 TransLLM: A Unified Multi-Task Large Language Model for Urban Transportation via Learnable Prompting ACL 2026 CloserToMe: A Unified Framework for Accurate and Transferable Latency Prediction Across Heterogeneous Devices AAAI 2026 GenDis: Generative-Discriminative Dual-View Co-Training for Generalized Category Discovery ACL 2026 Image Restoration via Primal Dual Hybrid Gradient and Flow Generative Model AAAI 2026 MessToClean: Evidence-Grounded Structure-Preserving Reconstruction for Real-World Degraded Exam Paper Images ACL 2026 MSAnchor: De Novo Molecular Generation from Mass Spectrometry Data with Anchor-Extended Molecular Scaffolds AAAI 2026 StarFlow: Generating Structured Workflow Outputs From Sketch Images EACL 2026 Calibrated Speculative Decoding: Frequency-Guided Candidate Selection for Efficient Inference ACL 2026 High-Level Semantics and Low-Level Features Fusion for Multi-Scale Object Detection in Dynamic Construction Environments WACV 2026 Information Theoretic Text-to-Image Alignment ICLR 2025 LLMSR@XLLM25: A Language Model-Based Pipeline for Structured Reasoning Data Construction ACL 2025 Transparent Vision: A Theory of Hierarchical Invariant Representations ICCV 2025 VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization ICCV 2025 X-Dancer: Expressive Music to Human Dance Video Generation ICCV 2025 Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration CVPR 2025 LEDiff: Latent Exposure Diffusion for HDR Generation CVPR 2025 DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models CVPR 2025 X-Dyna: Expressive Dynamic Human Image Animation CVPR 2025 CD-PolypNet: Cross-Domain Polyp Segmentation Network with Internal Feature Distillation and Dual-Stream Boundary Focus via Large Vision Model MICCAI 2025 IMPACT: Iterative Mask-based Parallel Decoding for Text-to-Audio Generation with Diffusion Modeling ICML 2025 Generative Audio Language Modeling with Continuous-valued Tokens and Masked Next-Token Prediction ICML 2025 ITAdaptor: Image-Tag Adapter Framework with Knowledge Enhancement for Radiology Report Generation MICCAI 2025 Evolution of Aegis: Fault Diagnosis for AI Model Training Service in Production NSDI 2025 MG-UNet: A Memory-Guided UNet for Lesion Segmentation in Chest Images MICCAI 2025 TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection EMNLP 2025 Unaligned Message-Passing and Contextualized-Pretraining for Robust Geo-Entity Resolution AAAI 2025 SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models AAAI 2025 RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs AAAI 2025 Pre-DyGAE: Pre-training Enhanced Dynamic Graph Autoencoder for Occupational Skill Demand Forecasting IJCAI 2024 FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation IJCAI 2024 Prompt Learning with Extended Kalman Filter for Pre-trained Language Models IJCAI 2024 Job-SDF: A Multi-Granularity Dataset for Job Skill Demand Forecasting and Benchmarking NIPS 2024 Multi-Domain Multi-Scale Diffusion Model for Low-Light Image Enhancement AAAI 2024 Temporal Graph Contrastive Learning for Sequential Recommendation AAAI 2024 Emergent Communication for Numerical Concepts Generalization AAAI 2024 Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding AAAI 2024 icsPLMs: Exploring Pre-trained Language Models in Intelligent Customer Service (Student Abstract) AAAI 2024 TeleChat: An Open-source Billingual Large Language Model ACL 2024 Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical Graphs COLING 2024 DR2: Disentangled Recurrent Representation Learning for Data-Efficient Speech Video Synthesis WACV 2024 Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data ECCV 2024 Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery ECCV 2024 On the Target-kernel Alignment: a Unified Analysis with Kernel Complexity NIPS 2024 OwMatch: Conditional Self-Labeling with Consistency for Open-World Semi-Supervised Learning NIPS 2024 DiffFPR: Diffusion Prior for Oversampled Fourier Phase Retrieval ICML 2024 Towards Theoretical Understanding of Learning Large-scale Dependent Data via Random Features ICML 2024 A Scanning Laser Ophthalmoscopy Image Database and Trustworthy Retinal Disease Detection Method MICCAI 2024 Multi-modal Adversarial Training for Zero-Shot Voice Cloning INTERSPEECH 2024 DGR: A General Graph Desmoothing Framework for Recommendation via Global and Local Perspectives IJCAI 2024 Speech-Text Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment ACL 2023 End-to-End Neural Speaker Diarization with Absolute Speaker Loss INTERSPEECH 2023 Incremental Image De-raining via Associative Memory AAAI 2023 Towards Paralinguistic-Only Speech Representations for End-to-End Speech Emotion Recognition INTERSPEECH 2023 Causal Document-Grounded Dialogue Pre-training EMNLP 2023 SEPH: Scalable, Efficient, and Predictable Hashing on Persistent Memory OSDI 2023 GlowGAN: Unsupervised Learning of HDR Images from LDR Images in the Wild ICCV 2023 Image Cropping With Spatial-Aware Feature and Rank Consistency CVPR 2023 Context-Aware Pretraining for Efficient Blind Image Decomposition CVPR 2023 Towards a Unified Analysis of Kernel-based Methods Under Covariate Shift NIPS 2023 BabelTower: Learning to Auto-parallelized Program Translation ICML 2022 DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training AAAI 2022 Convolutions for Spatial Interaction Modeling CVPR 2022 Exploring Compositional Image Retrieval with Hybrid Compositional Learning and Heuristic Negative Mining EMNLP 2022 Smoothed Adaptive Weighting for Imbalanced Semi-Supervised Learning: Improve Reliability Against Unknown Distribution Data ICML 2022 Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task Learning Framework INTERSPEECH 2022 Towards high-fidelity singing voice conversion with acoustic reference and contrastive predictive coding INTERSPEECH 2022 Topic Modeling Revisited: A Document Graph-based Neural Network Perspective NIPS 2021 TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance IJCNLP 2021 Exploring Cross-Lingual Transfer Learning with Unsupervised Machine Translation IJCNLP 2021 Regularizing Variational Autoencoder with Diversity and Uncertainty Awareness IJCAI 2021 An Emotional Comfort Framework for Improving User Satisfaction in E-Commerce Customer Service Chatbots NAACL 2021 Event Specific Attention for Polyphonic Sound Event Detection INTERSPEECH 2021 Active Learning for Lane Detection: A Knowledge Distillation Approach ICCV 2021 Learning Term Embeddings for Lexical Taxonomies AAAI 2021 TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance ACL 2021 Our Learned Lessons from Cross-Lingual Speaker Verification: The CRMI-DKU System Description for the Short-Duration Speaker Verification Challenge 2021 INTERSPEECH 2021 Exploring Cross-Lingual Transfer Learning with Unsupervised Machine Translation ACL 2021 Bootstrapping Named Entity Recognition in E-Commerce with Positive Unlabeled Learning ACL 2020 Intra-Utterance Similarity Preserving Knowledge Distillation for Audio Tagging INTERSPEECH 2020 Acoustic Scene Analysis with Multi-Head Attention Networks INTERSPEECH 2020 Balanced Joint Adversarial Training for Robust Intent Detection and Slot Filling COLING 2020 Discriminative Partial Domain Adversarial Network ECCV 2020 Semi-Supervised ASR by End-to-End Self-Training INTERSPEECH 2020 SetRank: A Setwise Bayesian Approach for Collaborative Ranking from Implicit Feedback AAAI 2020 A Joint Framework for Audio Tagging and Weakly Supervised Acoustic Event Detection Using DenseNet with Global Average Pooling INTERSPEECH 2020 Session-Level User Satisfaction Prediction for Customer Service Chatbot in E-Commerce (Student Abstract) AAAI 2020 Molecular Property Prediction: A Multilevel Quantum Interactions Modeling Perspective AAAI 2019 Sub-Band Convolutional Neural Networks for Small-Footprint Spoken Term Classification INTERSPEECH 2019 Compression of Acoustic Event Detection Models with Quantized Distillation INTERSPEECH 2019 Improving Back-Translation with Uncertainty-based Confidence Estimation IJCNLP 2019 Relation Extraction Using Supervision from Topic Knowledge of Relation Labels IJCAI 2019 The Lower The Simpler: Simplifying Hierarchical Recurrent Models NAACL 2019 Improving Back-Translation with Uncertainty-based Confidence Estimation EMNLP 2019 Hierarchical Disentanglement of Discriminative Latent Features for Zero-Shot Learning CVPR 2019 Explicit Utilization of General Knowledge in Machine Reading Comprehension ACL 2019 Multimodal and Multi-view Models for Emotion Recognition ACL 2019 R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection INTERSPEECH 2018 Detecting Media Sound Presence in Acoustic Scenes INTERSPEECH 2018 A Simple Model for Detection of Rare Sound Events INTERSPEECH 2018 Coded Illumination and Imaging for Fluorescence Based Classification ECCV 2018 Discriminative Region Proposal Adversarial Networks for High-Quality Image-to-Image Translation ECCV 2018 Encoding High Dimensional Local Features by Sparse Coding Based Fisher Vectors NIPS 2014 Improving Graph Matching via Density Maximization ICCV 2013 Spoken Dialogue Systems for Language Learning NAACL 2007 Automatic Assessment of Student Translations for Foreign Language Tutoring NAACL 2007 Chinese Syntactic Reordering for Statistical Machine Translation EMNLP 2007 Chinese Syntactic Reordering for Statistical Machine Translation CONLL 2007 Automatic Acquisition of Names Using Speak and Spell Mode in Spoken Dialogue Systems NAACL 2003