Tao Wang
122 papers · 2007–2026 · 18 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (18) π Interdisciplinary Bridge π Renaissance Researcher (5) π£ Hot Topic Early Bird
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(18)
π£
Hot Topic Early Bird
π
Keyword Trendsetter Combo
(3)
π€
Dynamic Duo
(20)
π
Keyword Champion
π₯
Mega-Team
(27)
π¬
Deep Specialist
(13)
π§¬
Topic Evolution
π₯
Unstoppable
(7)
π
Trend Setter
β
The Questioner
ποΈ
Keyword Collector
(60)
π
Century Club
(116)
π
Conference Pioneer
β‘
Prolific Year
(9)
Conferences
AAAI (16)
ACL (15)
CVPR (14)
EMNLP (12)
INTERSPEECH (11)
ICCV (10)
NIPS (8)
IJCAI (7)
ICML (6)
IJCNLP (6)
ECCV (5)
NAACL (4)
NSDI (3)
L4DC (1)
MICCAI (1)
OSDI (1)
SEMEVAL (1)
WACV (1)
Top co-authors
Research topics
Keywords
neural network
(10)
knowledge distillation
(9)
object detection
(8)
model compression
(7)
sequence labeling
(7)
structured prediction
(7)
domain adaptation
(6)
named entity recognition
(6)
zero-shot learning
(6)
reinforcement learning
(6)
transfer learning
(6)
image restoration
(5)
cross-lingual transfer
(5)
graph neural network
(5)
large language model
(5)
representation learning
(4)
multi-task learning
(4)
attention mechanism
(4)
speech translation
(4)
machine translation
(4)
Papers
SDAR-VL: Stable and Efficient Block-wise Diffusion for Vision-Language Understanding
ACL 2026
CrossCheck-Bench: Diagnosing Compositional Failures in Multimodal Conflict Resolution
AAAI 2026
DiMA: Distinguishing Resident and Tourist Preferences via Multi-Modal LLM Alignment for Out-of-Town Cross-Domain Recommendation
AAAI 2026
Generalizable and Efficient Automated Scoring with a Knowledge-Distilled Multi-Task Mixture-of-Experts
AAAI 2026
LADR: Locality-Aware Dynamic Rescue for Efficient Text-to-Image Generation with Diffusion Large Language Models
ACL 2026
Hybrid-DMKG: A Hybrid Reasoning Framework over Dynamic Multimodal Knowledge Graphs for Multimodal Multihop QA with Knowledge Editing
AAAI 2026
Improving Value Estimation Critically Enhances Vanilla Policy Gradient
ICML 2025
StickMotion: Generating 3D Human Motions by Drawing a Stickman
CVPR 2025
GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization
ACL 2025
Open-Det: An Efficient Learning Framework for Open-Ended Detection
ICML 2025
Collaborative Multi-LoRA Experts with Achievement-based Multi-Tasks Loss for Unified Multimodal Information Extraction
IJCAI 2025
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model
ICCV 2025
MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion
ICCV 2025
MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration
ICCV 2025
QCRD: Quality-guided Contrastive Rationale Distillation for Large Language Models
EMNLP 2025
State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing
NSDI 2025
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
NAACL 2025
Synergy-Guided Regional Supervision of Pseudo Labels for Semi-Supervised Medical Image Segmentation
MICCAI 2025
A Hubness Perspective on Representation Learning for Graph-Based Multi-View Clustering
CVPR 2025
Mollification Effects of Policy Gradient Methods
ICML 2024
OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models
ECCV 2024
Controlled Decoding from Language Models
ICML 2024
Updating Large Language Modelsβ Memories with Time Constraints
EMNLP 2024
Generated and Pseudo Content guided Prototype Refinement for Few-shot Point Cloud Segmentation
NIPS 2024
Trend-Aware Supervision: On Learning Invariance for Semi-supervised Facial Action Unit Intensity Estimation
AAAI 2024
VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning
AAAI 2024
Zero-Shot Aerial Object Detection with Visual Description Regularization
AAAI 2024
Sparse Convolutional Networks for Surface Reconstruction From Noisy Point Clouds
WACV 2024
Understanding the difficulty of solving Cauchy problems with PINNs
L4DC 2024
PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation
INTERSPEECH 2024
Multi-modal Adversarial Training for Zero-Shot Voice Cloning
INTERSPEECH 2024
PANORAMIA: Privacy Auditing of Machine Learning Models without Retraining
NIPS 2024
Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression
ACL 2024
GroundingGPT: Language Enhanced Multi-modal Grounding Model
ACL 2024
Residual Speaker Representation for One-Shot Voice Conversion
INTERSPEECH 2024
TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
INTERSPEECH 2024
DFA-GNN: Forward Learning of Graph Neural Networks by Direct Feedback Alignment
NIPS 2024
SynSP: Synergy of Smoothness and Precision in Pose Sequences Refinement
CVPR 2024
Rethinking the Representation in Federated Unsupervised Learning with Non-IID Data
CVPR 2024
BLEURT Has Universal Translations: An Analysis of Automatic Metrics by Minimum Risk Training
ACL 2023
Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method
AAAI 2023
Punctuation-level Attack: Single-shot and Single Punctuation Can Fool Text Models
NIPS 2023
Learning To Detect and Segment for Open Vocabulary Object Detection
CVPR 2023
Fractal Landscapes in Policy Optimization
NIPS 2023
Improving Speech Translation by Fusing Speech and Text
EMNLP 2023
GigaST: A 10,000-hour Pseudo Speech Translation Corpus
INTERSPEECH 2023
Graph Propagation Transformer for Graph Representation Learning
IJCAI 2023
Orion: Online Backdoor Sample Detection via Evolution Deviance
IJCAI 2023
FedInv: Byzantine-Robust Federated Learning by Inversing Local Model Updates
AAAI 2022
Rethinking Image Restoration for Object Detection
NIPS 2022
Causal Intervention for Subject-Deconfounded Facial Action Unit Recognition
AAAI 2022
Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-Supervised Action Recognition
AAAI 2022
Pose-Guided Feature Disentangling for Occluded Person Re-identification Based on Transformer
AAAI 2022
Powerful Graph Convolutional Networks with Adaptive Propagation Mechanism for Homophily and Heterophily
AAAI 2022
A Novel Framework Based on Medical Concept Driven Attention for Explainable Medical Code Prediction via External Knowledge
ACL 2022
PoseTriplet: Co-Evolving 3D Human Pose Estimation, Imitation, and Hallucination Under Self-Supervision
CVPR 2022
On Mitigating Hard Clusters for Face Clustering
ECCV 2022
BΓ©zierPalm: A Free Lunch for Palmprint Recognition
ECCV 2022
Towards Real-World HDRTV Reconstruction: A Data Synthesis-Based Approach
ECCV 2022
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
ICML 2022
Uncertainty-Guided Pixel Contrastive Learning for Semi-Supervised Medical Image Segmentation
IJCAI 2022
Discrete Listwise Personalized Ranking for Fast Top-N Recommendation with Implicit Feedback
IJCAI 2022
ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition
NAACL 2022
DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition
NAACL 2022
NetVRM: Virtual Register Memory for Programmable Networks
NSDI 2022
Isolation Mechanisms for High-Speed Packet-Processing Pipelines
NSDI 2022
DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition
SEMEVAL 2022
Ultra-High-Definition Image HDR Reconstruction via Collaborative Bilateral Learning
ICCV 2021
PnP-DETR: Towards Efficient Visual Analysis With Transformers
ICCV 2021
Deep Reinforcement Learning for Multi-contact Motion Planning of Hexapod Robots
IJCAI 2021
Half-Truth: A Partially Fake Audio Detection Dataset
INTERSPEECH 2021
Word Reordering for Zero-shot Cross-lingual Structured Prediction
EMNLP 2021
Secoco: Self-Correcting Encoding for Neural Machine Translation
EMNLP 2021
A Unified Encoding of Structures in Transition Systems
EMNLP 2021
The Volctrans Neural Speech Translation System for IWSLT 2021
ACL 2021
Risk Minimization for Zero-shot Sequence Labeling
ACL 2021
Multi-View Cross-Lingual Structured Prediction with Minimum Supervision
ACL 2021
Automated Concatenation of Embeddings for Structured Prediction
ACL 2021
Autocorrect in the Process of Translation β Multi-task Learning Improves Dialogue Machine Translation
NAACL 2021
Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor
ACL 2021
An Entity-Aware Adversarial Domain Adaptation Network for Cross-Domain Named Entity Recognition (Student Abstract)
AAAI 2021
Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning
ACL 2021
End-to-End Video Instance Segmentation via Spatial-Temporal Graph Neural Networks
ICCV 2021
MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations
EMNLP 2021
Direct Multi-view Multi-person 3D Pose Estimation
NIPS 2021
Real-Time Image Enhancer via Learnable Spatial-Aware 3D Lookup Tables
ICCV 2021
Multi-Scale Separable Network for Ultra-High-Definition Video Deblurring
ICCV 2021
Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor
IJCNLP 2021
Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning
IJCNLP 2021
Automated Concatenation of Embeddings for Structured Prediction
IJCNLP 2021
Multi-View Cross-Lingual Structured Prediction with Minimum Supervision
IJCNLP 2021
Risk Minimization for Zero-shot Sequence Labeling
IJCNLP 2021
The Volctrans Neural Speech Translation System for IWSLT 2021
IJCNLP 2021
Ultra-High-Definition Image Dehazing via Multi-Guided Bilateral Learning
CVPR 2021
Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet
ICCV 2021
Dynamic Soft Windowing and Language Dependent Style Token for Code-Switching End-to-End Speech Synthesis
INTERSPEECH 2020
The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation
EMNLP 2020
An Investigation of Potential Function Designs for Neural CRF
EMNLP 2020
AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network
EMNLP 2020
Task-oriented Domain-specific Meta-Embedding for Text Classification
EMNLP 2020
The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation
ECCV 2020
Overcoming Classifier Imbalance for Long-Tail Object Detection With Balanced Group Softmax
CVPR 2020
Structure-Level Knowledge Distillation For Multilingual Sequence Labeling
ACL 2020
Learning Combinatorial Solver for Graph Matching
CVPR 2020
Revisiting Knowledge Distillation via Label Smoothing Regularization
CVPR 2020
Central Similarity Quantization for Efficient Image and Video Retrieval
CVPR 2020
Spoken Content and Voice Factorization for Few-Shot Speaker Adaptation
INTERSPEECH 2020
Non-Autoregressive End-to-End TTS with Coarse-to-Fine Decoding
INTERSPEECH 2020
Bi-Level Speaker Supervision for One-Shot Speech Synthesis
INTERSPEECH 2020
Dynamic Speaker Representations Adjustment and Decoder Factorization for Speaker Adaptation in End-to-End Speech Synthesis
INTERSPEECH 2020
Finding Action Tubes with a Sparse-to-Dense Framework
AAAI 2020
Gauntlet: Finding Bugs in Compilers for Programmable Packet Processing
OSDI 2020
More Embeddings, Better Sequence Labelers?
EMNLP 2020
Distilling Object Detectors With Fine-Grained Feature Imitation
CVPR 2019
Deformable Surface Tracking by Graph Matching
ICCV 2019
Partial Multi-Label Learning by Low-Rank and Sparse Decomposition
AAAI 2019
Few-Shot Adaptive Faster R-CNN
CVPR 2019
Interactive Image Segmentation via Pairwise Likelihood Learning
IJCAI 2017
Dual Training and Dual Prediction for Polarity Classification
ACL 2013
Learning Structured Hough Voting for Joint Object Detection and Occlusion Reasoning
CVPR 2013
Deep learning with COTS HPC systems
ICML 2013
Stable Dual Dynamic Programming
NIPS 2007