Wenhao Wu
48 papers · 2018–2025 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (10) π Interdisciplinary Bridge π Renaissance Researcher (5) π£ Hot Topic Early Bird
π
Interdisciplinary Bridge
π£
Hot Topic Early Bird
πΊοΈ
Taxonomy Completionist
(10)
π€
Dynamic Duo
(17)
π§¬
Topic Evolution
π
Century Club
(48)
π
Conference Pioneer
β‘
Prolific Year
(11)
π₯
Unstoppable
(8)
β
The Questioner
(2)
ποΈ
Keyword Collector
(215)
π
Trend Setter
Conferences
CVPR (8)
EMNLP (8)
ECCV (7)
ACL (6)
ICCV (6)
AAAI (4)
NIPS (3)
ICLR (2)
IJCAI (1)
IJCNLP (1)
NAACL (1)
WACV (1)
Top co-authors
Keywords
large language model
(4)
text generation
(4)
abstractive summarization
(4)
vision-language model
(4)
contrastive learning
(4)
zero-shot learning
(4)
semi-supervised learning
(3)
video understanding
(3)
video recognition
(3)
video classification
(3)
convolutional neural network
(3)
multimodal large language model
(3)
reinforcement learning
(2)
few-shot learning
(2)
named entity recognition
(2)
multimodal learning
(2)
adversarial learning
(2)
transfer learning
(2)
weakly supervised learning
(2)
knowledge distillation
(2)
Papers
More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression
EMNLP 2025
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
EMNLP 2025
DistinctAD: Distinctive Audio Description Generation in Contexts
CVPR 2025
Retrieval Head Mechanistically Explains Long-Context Factuality
ICLR 2025
MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI
ICCV 2025
Automated Multi-level Preference for MLLMs
NIPS 2024
Dense Connector for MLLMs
NIPS 2024
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
NIPS 2024
InstructEval: Instruction-Tuned Text Evaluator from Human Preference
ACL 2024
Relational Matching for Weakly Semi-Supervised Oriented Object Detection
CVPR 2024
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
ECCV 2024
LongEmbed: Extending Embedding Models for Long Context Retrieval
EMNLP 2024
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement
EMNLP 2024
AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
EMNLP 2024
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
ICLR 2024
CoUDA: Coherence Evaluation via Unified Data Augmentation
NAACL 2024
Effective Invertible Arbitrary Image Rescaling
WACV 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
ICCV 2023
Debiasing Generative Named Entity Recognition by Calibrating Sequence Likelihood
ACL 2023
Exploring In-Context Learning for Knowledge Grounded Dialog Generation
EMNLP 2023
UATVR: Uncertainty-Adaptive Text-Video Retrieval
ICCV 2023
AdaCM: Adaptive ColorMLP for Real-Time Universal Photo-Realistic Style Transfer
AAAI 2023
Revisiting Classifier: Transferring Vision-Language Models for Video Recognition
AAAI 2023
WeCheck: Strong Factual Consistency Checker via Weakly Supervised Learning
ACL 2023
Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
CVPR 2023
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition With Pre-Trained Vision-Language Models
CVPR 2023
Semi-Supervised Stereo-Based 3D Object Detection via Cross-View Consensus
CVPR 2023
Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation
CVPR 2022
Temporal Action Proposal Generation with Background Constraint
AAAI 2022
NSNet: Non-Saliency Suppression Sampler for Efficient Video Recognition
ECCV 2022
Temporal Saliency Query Network for Efficient Video Recognition
ECCV 2022
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
ECCV 2022
Precisely the Point: Adversarial Augmentations for Faithful and Informative Text Generation
EMNLP 2022
FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual Robustness
EMNLP 2022
Learn and Review: Enhancing Continual Named Entity Recognition via Reviewing Synthetic Samples
ACL 2022
Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence
CVPR 2022
Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video
IJCAI 2021
BASS: Boosting Abstractive Summarization with Unified Semantic Graph
IJCNLP 2021
ASCNet: Self-Supervised Video Representation Learning With Appearance-Speed Consistency
ICCV 2021
MVFNet: Multi-View Fusion Network for Efficient Video Recognition
AAAI 2021
BASS: Boosting Abstractive Summarization with Unified Semantic Graph
ACL 2021
Attention-Driven Dynamic Graph Convolutional Network for Multi-Label Image Recognition
ECCV 2020
Composing Elementary Discourse Units in Abstractive Summarization
ACL 2020
Semi-Supervised Pedestrian Instance Synthesis and Detection With Mutual Reinforcement
ICCV 2019
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition
ICCV 2019
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes
ECCV 2018
TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes
ECCV 2018
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
CVPR 2018