Tao Wang

122 papers · 2007–2026 · 18 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (18) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🐣 Hot Topic Early Bird

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (18) 🐣 Hot Topic Early Bird 🌟 Keyword Trendsetter Combo (3) 🤝 Dynamic Duo (20) 🏆 Keyword Champion 👥 Mega-Team (27) 🔬 Deep Specialist (13) 🧬 Topic Evolution 🔥 Unstoppable (7) 📈 Trend Setter ❓ The Questioner 🗃️ Keyword Collector (60) 💎 Century Club (116) 🚀 Conference Pioneer ⚡ Prolific Year (9)

Conferences

AAAI (16) ACL (15) CVPR (14) EMNLP (12) INTERSPEECH (11) ICCV (10) NIPS (8) IJCAI (7) ICML (6) IJCNLP (6) ECCV (5) NAACL (4) NSDI (3) L4DC (1) MICCAI (1) OSDI (1) SEMEVAL (1) WACV (1)

Top co-authors

Yong Jiang (20) Fei Huang (19) Kewei Tu (17) Zhongqiang Huang (17) Nguyen Bach (16) Xinyu Wang (12) Jiashi Feng (10) Li Yuan (9) Jianhua Tao (9) Zhengqi Wen (8)

Research topics

Computer Vision (1) Applications (1) Privacy (1)

Keywords

neural network (10) knowledge distillation (9) object detection (8) model compression (7) sequence labeling (7) structured prediction (7) domain adaptation (6) named entity recognition (6) zero-shot learning (6) reinforcement learning (6) transfer learning (6) image restoration (5) cross-lingual transfer (5) graph neural network (5) large language model (5) representation learning (4) multi-task learning (4) attention mechanism (4) speech translation (4) machine translation (4)

Papers

SDAR-VL: Stable and Efficient Block-wise Diffusion for Vision-Language Understanding ACL 2026 CrossCheck-Bench: Diagnosing Compositional Failures in Multimodal Conflict Resolution AAAI 2026 DiMA: Distinguishing Resident and Tourist Preferences via Multi-Modal LLM Alignment for Out-of-Town Cross-Domain Recommendation AAAI 2026 Generalizable and Efficient Automated Scoring with a Knowledge-Distilled Multi-Task Mixture-of-Experts AAAI 2026 LADR: Locality-Aware Dynamic Rescue for Efficient Text-to-Image Generation with Diffusion Large Language Models ACL 2026 Hybrid-DMKG: A Hybrid Reasoning Framework over Dynamic Multimodal Knowledge Graphs for Multimodal Multihop QA with Knowledge Editing AAAI 2026 Improving Value Estimation Critically Enhances Vanilla Policy Gradient ICML 2025 StickMotion: Generating 3D Human Motions by Drawing a Stickman CVPR 2025 GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization ACL 2025 Open-Det: An Efficient Learning Framework for Open-Ended Detection ICML 2025 Collaborative Multi-LoRA Experts with Achievement-based Multi-Tasks Loss for Unified Multimodal Information Extraction IJCAI 2025 HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model ICCV 2025 MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion ICCV 2025 MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration ICCV 2025 QCRD: Quality-guided Contrastive Rationale Distillation for Large Language Models EMNLP 2025 State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing NSDI 2025 UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model NAACL 2025 Synergy-Guided Regional Supervision of Pseudo Labels for Semi-Supervised Medical Image Segmentation MICCAI 2025 A Hubness Perspective on Representation Learning for Graph-Based Multi-View Clustering CVPR 2025 Mollification Effects of Policy Gradient Methods ICML 2024 OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models ECCV 2024 Controlled Decoding from Language Models ICML 2024 Updating Large Language Models’ Memories with Time Constraints EMNLP 2024 Generated and Pseudo Content guided Prototype Refinement for Few-shot Point Cloud Segmentation NIPS 2024 Trend-Aware Supervision: On Learning Invariance for Semi-supervised Facial Action Unit Intensity Estimation AAAI 2024 VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning AAAI 2024 Zero-Shot Aerial Object Detection with Visual Description Regularization AAAI 2024 Sparse Convolutional Networks for Surface Reconstruction From Noisy Point Clouds WACV 2024 Understanding the difficulty of solving Cauchy problems with PINNs L4DC 2024 PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation INTERSPEECH 2024 Multi-modal Adversarial Training for Zero-Shot Voice Cloning INTERSPEECH 2024 PANORAMIA: Privacy Auditing of Machine Learning Models without Retraining NIPS 2024 Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression ACL 2024 GroundingGPT: Language Enhanced Multi-modal Grounding Model ACL 2024 Residual Speaker Representation for One-Shot Voice Conversion INTERSPEECH 2024 TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking INTERSPEECH 2024 DFA-GNN: Forward Learning of Graph Neural Networks by Direct Feedback Alignment NIPS 2024 SynSP: Synergy of Smoothness and Precision in Pose Sequences Refinement CVPR 2024 Rethinking the Representation in Federated Unsupervised Learning with Non-IID Data CVPR 2024 BLEURT Has Universal Translations: An Analysis of Automatic Metrics by Minimum Risk Training ACL 2023 Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method AAAI 2023 Punctuation-level Attack: Single-shot and Single Punctuation Can Fool Text Models NIPS 2023 Learning To Detect and Segment for Open Vocabulary Object Detection CVPR 2023 Fractal Landscapes in Policy Optimization NIPS 2023 Improving Speech Translation by Fusing Speech and Text EMNLP 2023 GigaST: A 10,000-hour Pseudo Speech Translation Corpus INTERSPEECH 2023 Graph Propagation Transformer for Graph Representation Learning IJCAI 2023 Orion: Online Backdoor Sample Detection via Evolution Deviance IJCAI 2023 FedInv: Byzantine-Robust Federated Learning by Inversing Local Model Updates AAAI 2022 Rethinking Image Restoration for Object Detection NIPS 2022 Causal Intervention for Subject-Deconfounded Facial Action Unit Recognition AAAI 2022 Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-Supervised Action Recognition AAAI 2022 Pose-Guided Feature Disentangling for Occluded Person Re-identification Based on Transformer AAAI 2022 Powerful Graph Convolutional Networks with Adaptive Propagation Mechanism for Homophily and Heterophily AAAI 2022 A Novel Framework Based on Medical Concept Driven Attention for Explainable Medical Code Prediction via External Knowledge ACL 2022 PoseTriplet: Co-Evolving 3D Human Pose Estimation, Imitation, and Hallucination Under Self-Supervision CVPR 2022 On Mitigating Hard Clusters for Face Clustering ECCV 2022 BézierPalm: A Free Lunch for Palmprint Recognition ECCV 2022 Towards Real-World HDRTV Reconstruction: A Data Synthesis-Based Approach ECCV 2022 GLaM: Efficient Scaling of Language Models with Mixture-of-Experts ICML 2022 Uncertainty-Guided Pixel Contrastive Learning for Semi-Supervised Medical Image Segmentation IJCAI 2022 Discrete Listwise Personalized Ranking for Fast Top-N Recommendation with Implicit Feedback IJCAI 2022 ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition NAACL 2022 DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition NAACL 2022 NetVRM: Virtual Register Memory for Programmable Networks NSDI 2022 Isolation Mechanisms for High-Speed Packet-Processing Pipelines NSDI 2022 DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition SEMEVAL 2022 Ultra-High-Definition Image HDR Reconstruction via Collaborative Bilateral Learning ICCV 2021 PnP-DETR: Towards Efficient Visual Analysis With Transformers ICCV 2021 Deep Reinforcement Learning for Multi-contact Motion Planning of Hexapod Robots IJCAI 2021 Half-Truth: A Partially Fake Audio Detection Dataset INTERSPEECH 2021 Word Reordering for Zero-shot Cross-lingual Structured Prediction EMNLP 2021 Secoco: Self-Correcting Encoding for Neural Machine Translation EMNLP 2021 A Unified Encoding of Structures in Transition Systems EMNLP 2021 The Volctrans Neural Speech Translation System for IWSLT 2021 ACL 2021 Risk Minimization for Zero-shot Sequence Labeling ACL 2021 Multi-View Cross-Lingual Structured Prediction with Minimum Supervision ACL 2021 Automated Concatenation of Embeddings for Structured Prediction ACL 2021 Autocorrect in the Process of Translation — Multi-task Learning Improves Dialogue Machine Translation NAACL 2021 Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor ACL 2021 An Entity-Aware Adversarial Domain Adaptation Network for Cross-Domain Named Entity Recognition (Student Abstract) AAAI 2021 Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning ACL 2021 End-to-End Video Instance Segmentation via Spatial-Temporal Graph Neural Networks ICCV 2021 MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations EMNLP 2021 Direct Multi-view Multi-person 3D Pose Estimation NIPS 2021 Real-Time Image Enhancer via Learnable Spatial-Aware 3D Lookup Tables ICCV 2021 Multi-Scale Separable Network for Ultra-High-Definition Video Deblurring ICCV 2021 Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor IJCNLP 2021 Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning IJCNLP 2021 Automated Concatenation of Embeddings for Structured Prediction IJCNLP 2021 Multi-View Cross-Lingual Structured Prediction with Minimum Supervision IJCNLP 2021 Risk Minimization for Zero-shot Sequence Labeling IJCNLP 2021 The Volctrans Neural Speech Translation System for IWSLT 2021 IJCNLP 2021 Ultra-High-Definition Image Dehazing via Multi-Guided Bilateral Learning CVPR 2021 Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet ICCV 2021 Dynamic Soft Windowing and Language Dependent Style Token for Code-Switching End-to-End Speech Synthesis INTERSPEECH 2020 The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation EMNLP 2020 An Investigation of Potential Function Designs for Neural CRF EMNLP 2020 AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network EMNLP 2020 Task-oriented Domain-specific Meta-Embedding for Text Classification EMNLP 2020 The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation ECCV 2020 Overcoming Classifier Imbalance for Long-Tail Object Detection With Balanced Group Softmax CVPR 2020 Structure-Level Knowledge Distillation For Multilingual Sequence Labeling ACL 2020 Learning Combinatorial Solver for Graph Matching CVPR 2020 Revisiting Knowledge Distillation via Label Smoothing Regularization CVPR 2020 Central Similarity Quantization for Efficient Image and Video Retrieval CVPR 2020 Spoken Content and Voice Factorization for Few-Shot Speaker Adaptation INTERSPEECH 2020 Non-Autoregressive End-to-End TTS with Coarse-to-Fine Decoding INTERSPEECH 2020 Bi-Level Speaker Supervision for One-Shot Speech Synthesis INTERSPEECH 2020 Dynamic Speaker Representations Adjustment and Decoder Factorization for Speaker Adaptation in End-to-End Speech Synthesis INTERSPEECH 2020 Finding Action Tubes with a Sparse-to-Dense Framework AAAI 2020 Gauntlet: Finding Bugs in Compilers for Programmable Packet Processing OSDI 2020 More Embeddings, Better Sequence Labelers? EMNLP 2020 Distilling Object Detectors With Fine-Grained Feature Imitation CVPR 2019 Deformable Surface Tracking by Graph Matching ICCV 2019 Partial Multi-Label Learning by Low-Rank and Sparse Decomposition AAAI 2019 Few-Shot Adaptive Faster R-CNN CVPR 2019 Interactive Image Segmentation via Pairwise Likelihood Learning IJCAI 2017 Dual Training and Dual Prediction for Polarity Classification ACL 2013 Learning Structured Hough Voting for Joint Object Detection and Occlusion Reasoning CVPR 2013 Deep learning with COTS HPC systems ICML 2013 Stable Dual Dynamic Programming NIPS 2007