Wenhao Wu

48 papers · 2018–2025 · 12 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (10) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🐣 Hot Topic Early Bird

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (10) 🤝 Dynamic Duo (17) 🧬 Topic Evolution 💎 Century Club (48) 🚀 Conference Pioneer ⚡ Prolific Year (11) 🔥 Unstoppable (8) ❓ The Questioner (2) 🗃️ Keyword Collector (215) 📈 Trend Setter

Conferences

CVPR (8) EMNLP (8) ECCV (7) ACL (6) ICCV (6) AAAI (4) NIPS (3) ICLR (2) IJCAI (1) IJCNLP (1) NAACL (1) WACV (1)

Top co-authors

Sujian Li (17) Dongliang He (8) Jingdong Wang (7) Wanli Ouyang (7) Errui Ding (6) Dawei Zhu (6) Yuxin Song (6) Yifan Song (6) Jiachen Liu (6) Xinyan Xiao (6)

Keywords

large language model (4) text generation (4) abstractive summarization (4) vision-language model (4) contrastive learning (4) zero-shot learning (4) semi-supervised learning (3) video understanding (3) video recognition (3) video classification (3) convolutional neural network (3) multimodal large language model (3) reinforcement learning (2) few-shot learning (2) named entity recognition (2) multimodal learning (2) adversarial learning (2) transfer learning (2) weakly supervised learning (2) knowledge distillation (2)

Papers

More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression EMNLP 2025 Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision EMNLP 2025 DistinctAD: Distinctive Audio Description Generation in Contexts CVPR 2025 Retrieval Head Mechanistically Explains Long-Context Factuality ICLR 2025 MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI ICCV 2025 Automated Multi-level Preference for MLLMs NIPS 2024 Dense Connector for MLLMs NIPS 2024 Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement NIPS 2024 InstructEval: Instruction-Tuned Text Evaluator from Human Preference ACL 2024 Relational Matching for Weakly Semi-Supervised Oriented Object Detection CVPR 2024 DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM ECCV 2024 LongEmbed: Extending Embedding Models for Long Context Retrieval EMNLP 2024 Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement EMNLP 2024 AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories EMNLP 2024 PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training ICLR 2024 CoUDA: Coherence Evaluation via Unified Data Augmentation NAACL 2024 Effective Invertible Arbitrary Image Rescaling WACV 2023 What Can Simple Arithmetic Operations Do for Temporal Modeling? ICCV 2023 Debiasing Generative Named Entity Recognition by Calibrating Sequence Likelihood ACL 2023 Exploring In-Context Learning for Knowledge Grounded Dialog Generation EMNLP 2023 UATVR: Uncertainty-Adaptive Text-Video Retrieval ICCV 2023 AdaCM: Adaptive ColorMLP for Real-Time Universal Photo-Realistic Style Transfer AAAI 2023 Revisiting Classifier: Transferring Vision-Language Models for Video Recognition AAAI 2023 WeCheck: Strong Factual Consistency Checker via Weakly Supervised Learning ACL 2023 Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? CVPR 2023 Bidirectional Cross-Modal Knowledge Exploration for Video Recognition With Pre-Trained Vision-Language Models CVPR 2023 Semi-Supervised Stereo-Based 3D Object Detection via Cross-View Consensus CVPR 2023 Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation CVPR 2022 Temporal Action Proposal Generation with Background Constraint AAAI 2022 NSNet: Non-Saliency Suppression Sampler for Efficient Video Recognition ECCV 2022 Temporal Saliency Query Network for Efficient Video Recognition ECCV 2022 CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval ECCV 2022 Precisely the Point: Adversarial Augmentations for Faithful and Informative Text Generation EMNLP 2022 FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual Robustness EMNLP 2022 Learn and Review: Enhancing Continual Named Entity Recognition via Reviewing Synthetic Samples ACL 2022 Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence CVPR 2022 Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video IJCAI 2021 BASS: Boosting Abstractive Summarization with Unified Semantic Graph IJCNLP 2021 ASCNet: Self-Supervised Video Representation Learning With Appearance-Speed Consistency ICCV 2021 MVFNet: Multi-View Fusion Network for Efficient Video Recognition AAAI 2021 BASS: Boosting Abstractive Summarization with Unified Semantic Graph ACL 2021 Attention-Driven Dynamic Graph Convolutional Network for Multi-Label Image Recognition ECCV 2020 Composing Elementary Discourse Units in Abstractive Summarization ACL 2020 Semi-Supervised Pedestrian Instance Synthesis and Detection With Mutual Reinforcement ICCV 2019 Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition ICCV 2019 Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes ECCV 2018 TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes ECCV 2018 Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation CVPR 2018