Chao Wang
114 papers · 2003–2026 · 20 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π§ Keyword Pioneer π Renaissance Researcher (8) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (27) π£ Hot Topic Early Bird
πΊοΈ
Taxonomy Completionist
(27)
π£
Hot Topic Early Bird
π§
Keyword Pioneer
π
Conference Loyalist
(22)
π¬
Deep Specialist
(13)
π
Keyword Champion
π
Grand Slam
π₯
Mega-Team
(32)
ποΈ
Keyword Collector
(70)
π₯
Unstoppable
(9)
β‘
Prolific Year
(10)
π
Trend Setter
π
Conference Pioneer
π
Century Club
(101)
Conferences
AAAI (22)
INTERSPEECH (16)
ACL (12)
CVPR (8)
NIPS (6)
IJCAI (6)
ICML (6)
ICCV (6)
EMNLP (5)
ECCV (5)
NAACL (5)
MICCAI (4)
IJCNLP (3)
WACV (3)
COLING (2)
ICLR (1)
EACL (1)
CONLL (1)
NSDI (1)
OSDI (1)
Top co-authors
Keywords
attention mechanism
(6)
domain adaptation
(5)
representation learning
(5)
neural network
(5)
diffusion model
(5)
vision-language model
(4)
reinforcement learning
(4)
knowledge distillation
(4)
image restoration
(4)
large language model
(4)
semi-supervised learning
(4)
graph neural network
(4)
transfer learning
(4)
zero-shot learning
(4)
weakly supervised learning
(3)
uncertainty estimation
(3)
transformer encoder
(3)
contrastive learning
(3)
knowledge graph
(3)
model compression
(3)
Papers
Clear Sights on Site: A Spatial-Adaptive Channel Network for Deblurring Construction Site Images
WACV 2026
Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching
AAAI 2026
Enhancing Conversational Recommender Systems with Tree-Structured Knowledge and Pretrained Language Models
AAAI 2026
AEDR: Training-Free AI-Generated Image Attribution via Autoencoder Double-Reconstruction
AAAI 2026
Small but Mighty: Dynamic Wavelet Expert-Guided Fine-Tuning of Large-Scale Models for Optical Remote Sensing Object Segmentation
AAAI 2026
Integrating Reweighted Least Squares with Plug-and-Play Diffusion Priors for Noisy Image Restoration
AAAI 2026
TransLLM: A Unified Multi-Task Large Language Model for Urban Transportation via Learnable Prompting
ACL 2026
CloserToMe: A Unified Framework for Accurate and Transferable Latency Prediction Across Heterogeneous Devices
AAAI 2026
GenDis: Generative-Discriminative Dual-View Co-Training for Generalized Category Discovery
ACL 2026
Image Restoration via Primal Dual Hybrid Gradient and Flow Generative Model
AAAI 2026
MessToClean: Evidence-Grounded Structure-Preserving Reconstruction for Real-World Degraded Exam Paper Images
ACL 2026
MSAnchor: De Novo Molecular Generation from Mass Spectrometry Data with Anchor-Extended Molecular Scaffolds
AAAI 2026
StarFlow: Generating Structured Workflow Outputs From Sketch Images
EACL 2026
Calibrated Speculative Decoding: Frequency-Guided Candidate Selection for Efficient Inference
ACL 2026
High-Level Semantics and Low-Level Features Fusion for Multi-Scale Object Detection in Dynamic Construction Environments
WACV 2026
Information Theoretic Text-to-Image Alignment
ICLR 2025
LLMSR@XLLM25: A Language Model-Based Pipeline for Structured Reasoning Data Construction
ACL 2025
Transparent Vision: A Theory of Hierarchical Invariant Representations
ICCV 2025
VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization
ICCV 2025
X-Dancer: Expressive Music to Human Dance Video Generation
ICCV 2025
Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration
CVPR 2025
LEDiff: Latent Exposure Diffusion for HDR Generation
CVPR 2025
DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
CVPR 2025
X-Dyna: Expressive Dynamic Human Image Animation
CVPR 2025
CD-PolypNet: Cross-Domain Polyp Segmentation Network with Internal Feature Distillation and Dual-Stream Boundary Focus via Large Vision Model
MICCAI 2025
IMPACT: Iterative Mask-based Parallel Decoding for Text-to-Audio Generation with Diffusion Modeling
ICML 2025
Generative Audio Language Modeling with Continuous-valued Tokens and Masked Next-Token Prediction
ICML 2025
ITAdaptor: Image-Tag Adapter Framework with Knowledge Enhancement for Radiology Report Generation
MICCAI 2025
Evolution of Aegis: Fault Diagnosis for AI Model Training Service in Production
NSDI 2025
MG-UNet: A Memory-Guided UNet for Lesion Segmentation in Chest Images
MICCAI 2025
TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection
EMNLP 2025
Unaligned Message-Passing and Contextualized-Pretraining for Robust Geo-Entity Resolution
AAAI 2025
SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models
AAAI 2025
RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs
AAAI 2025
Pre-DyGAE: Pre-training Enhanced Dynamic Graph Autoencoder for Occupational Skill Demand Forecasting
IJCAI 2024
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation
IJCAI 2024
Prompt Learning with Extended Kalman Filter for Pre-trained Language Models
IJCAI 2024
Job-SDF: A Multi-Granularity Dataset for Job Skill Demand Forecasting and Benchmarking
NIPS 2024
Multi-Domain Multi-Scale Diffusion Model for Low-Light Image Enhancement
AAAI 2024
Temporal Graph Contrastive Learning for Sequential Recommendation
AAAI 2024
Emergent Communication for Numerical Concepts Generalization
AAAI 2024
Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding
AAAI 2024
icsPLMs: Exploring Pre-trained Language Models in Intelligent Customer Service (Student Abstract)
AAAI 2024
TeleChat: An Open-source Billingual Large Language Model
ACL 2024
Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical Graphs
COLING 2024
DR2: Disentangled Recurrent Representation Learning for Data-Efficient Speech Video Synthesis
WACV 2024
Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data
ECCV 2024
Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery
ECCV 2024
On the Target-kernel Alignment: a Unified Analysis with Kernel Complexity
NIPS 2024
OwMatch: Conditional Self-Labeling with Consistency for Open-World Semi-Supervised Learning
NIPS 2024
DiffFPR: Diffusion Prior for Oversampled Fourier Phase Retrieval
ICML 2024
Towards Theoretical Understanding of Learning Large-scale Dependent Data via Random Features
ICML 2024
A Scanning Laser Ophthalmoscopy Image Database and Trustworthy Retinal Disease Detection Method
MICCAI 2024
Multi-modal Adversarial Training for Zero-Shot Voice Cloning
INTERSPEECH 2024
DGR: A General Graph Desmoothing Framework for Recommendation via Global and Local Perspectives
IJCAI 2024
Speech-Text Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment
ACL 2023
End-to-End Neural Speaker Diarization with Absolute Speaker Loss
INTERSPEECH 2023
Incremental Image De-raining via Associative Memory
AAAI 2023
Towards Paralinguistic-Only Speech Representations for End-to-End Speech Emotion Recognition
INTERSPEECH 2023
Causal Document-Grounded Dialogue Pre-training
EMNLP 2023
SEPH: Scalable, Efficient, and Predictable Hashing on Persistent Memory
OSDI 2023
GlowGAN: Unsupervised Learning of HDR Images from LDR Images in the Wild
ICCV 2023
Image Cropping With Spatial-Aware Feature and Rank Consistency
CVPR 2023
Context-Aware Pretraining for Efficient Blind Image Decomposition
CVPR 2023
Towards a Unified Analysis of Kernel-based Methods Under Covariate Shift
NIPS 2023
BabelTower: Learning to Auto-parallelized Program Translation
ICML 2022
DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training
AAAI 2022
Convolutions for Spatial Interaction Modeling
CVPR 2022
Exploring Compositional Image Retrieval with Hybrid Compositional Learning and Heuristic Negative Mining
EMNLP 2022
Smoothed Adaptive Weighting for Imbalanced Semi-Supervised Learning: Improve Reliability Against Unknown Distribution Data
ICML 2022
Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task Learning Framework
INTERSPEECH 2022
Towards high-fidelity singing voice conversion with acoustic reference and contrastive predictive coding
INTERSPEECH 2022
Topic Modeling Revisited: A Document Graph-based Neural Network Perspective
NIPS 2021
TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance
IJCNLP 2021
Exploring Cross-Lingual Transfer Learning with Unsupervised Machine Translation
IJCNLP 2021
Regularizing Variational Autoencoder with Diversity and Uncertainty Awareness
IJCAI 2021
An Emotional Comfort Framework for Improving User Satisfaction in E-Commerce Customer Service Chatbots
NAACL 2021
Event Specific Attention for Polyphonic Sound Event Detection
INTERSPEECH 2021
Active Learning for Lane Detection: A Knowledge Distillation Approach
ICCV 2021
Learning Term Embeddings for Lexical Taxonomies
AAAI 2021
TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance
ACL 2021
Our Learned Lessons from Cross-Lingual Speaker Verification: The CRMI-DKU System Description for the Short-Duration Speaker Verification Challenge 2021
INTERSPEECH 2021
Exploring Cross-Lingual Transfer Learning with Unsupervised Machine Translation
ACL 2021
Bootstrapping Named Entity Recognition in E-Commerce with Positive Unlabeled Learning
ACL 2020
Intra-Utterance Similarity Preserving Knowledge Distillation for Audio Tagging
INTERSPEECH 2020
Acoustic Scene Analysis with Multi-Head Attention Networks
INTERSPEECH 2020
Balanced Joint Adversarial Training for Robust Intent Detection and Slot Filling
COLING 2020
Discriminative Partial Domain Adversarial Network
ECCV 2020
Semi-Supervised ASR by End-to-End Self-Training
INTERSPEECH 2020
SetRank: A Setwise Bayesian Approach for Collaborative Ranking from Implicit Feedback
AAAI 2020
A Joint Framework for Audio Tagging and Weakly Supervised Acoustic Event Detection Using DenseNet with Global Average Pooling
INTERSPEECH 2020
Session-Level User Satisfaction Prediction for Customer Service Chatbot in E-Commerce (Student Abstract)
AAAI 2020
Molecular Property Prediction: A Multilevel Quantum Interactions Modeling Perspective
AAAI 2019
Sub-Band Convolutional Neural Networks for Small-Footprint Spoken Term Classification
INTERSPEECH 2019
Compression of Acoustic Event Detection Models with Quantized Distillation
INTERSPEECH 2019
Improving Back-Translation with Uncertainty-based Confidence Estimation
IJCNLP 2019
Relation Extraction Using Supervision from Topic Knowledge of Relation Labels
IJCAI 2019
The Lower The Simpler: Simplifying Hierarchical Recurrent Models
NAACL 2019
Improving Back-Translation with Uncertainty-based Confidence Estimation
EMNLP 2019
Hierarchical Disentanglement of Discriminative Latent Features for Zero-Shot Learning
CVPR 2019
Explicit Utilization of General Knowledge in Machine Reading Comprehension
ACL 2019
Multimodal and Multi-view Models for Emotion Recognition
ACL 2019
R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection
INTERSPEECH 2018
Detecting Media Sound Presence in Acoustic Scenes
INTERSPEECH 2018
A Simple Model for Detection of Rare Sound Events
INTERSPEECH 2018
Coded Illumination and Imaging for Fluorescence Based Classification
ECCV 2018
Discriminative Region Proposal Adversarial Networks for High-Quality Image-to-Image Translation
ECCV 2018
Encoding High Dimensional Local Features by Sparse Coding Based Fisher Vectors
NIPS 2014
Improving Graph Matching via Density Maximization
ICCV 2013
Spoken Dialogue Systems for Language Learning
NAACL 2007
Automatic Assessment of Student Translations for Foreign Language Tutoring
NAACL 2007
Chinese Syntactic Reordering for Statistical Machine Translation
EMNLP 2007
Chinese Syntactic Reordering for Statistical Machine Translation
CONLL 2007
Automatic Acquisition of Names Using Speak and Spell Mode in Spoken Dialogue Systems
NAACL 2003