Teng Wang

23 papers · 2021–2026 · 10 top CS/AI conferences

Achievements

🐝 Cross-Pollinator (13) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (9) 🏃 Academic Marathon (5) 🌈 Renaissance Researcher (6) 🤝 Dynamic Duo (11) 🔥 Unstoppable (5) 💎 Century Club (19) 🗃️ Keyword Collector (98) ⚡ Prolific Year (6)

Conferences

ICCV (5) CVPR (4) AAAI (3) ACL (2) ICLR (2) ICML (2) IJCAI (2) COLING (1) ECCV (1) EMNLP (1)

Top co-authors

Feng Zheng (12) Tiantian Geng (4) Ping Luo (4) Jinrui Zhang (4) Ran Cheng (3) Yixiao Ge (3) Ying Shan (3) Jisheng Dang (2) Nannan Zhu (2) Bimei Wang (2)

Keywords

multimodal learning (4) vision-language model (4) video understanding (3) large language model (3) transfer learning (2) event detection (2) video-language model (2) prompt tuning (2) reinforcement learning (2) conformal prediction (1) attention mechanism (1) event camera (1) knowledge distillation (1) event understanding (1) multi-task learning (1) contrastive learning (1) motion estimation (1) uncertainty quantification (1) continual learning (1) multi-modal learning (1)

Papers

CP-Router: An Uncertainty-Aware Router Between LLM and LRM · AAAI 2026
R-AVST: Empowering Video-LLMs with Fine-Grained Spatio-Temporal Reasoning in Complex Audio-Visual Scenarios · AAAI 2026
ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge Evolution · ACL 2026
First Learn, Then Review: Human-Like Continual Learning for Cross-View Geo-Localization with Limited Field of View · AAAI 2026
Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors · EMNLP 2025
BPP-Search: Enhancing Tree of Thought Reasoning for Mathematical Modeling Problem Solving · ACL 2025
Large Language Models are good multi-lingual learners: When LLMs meet cross-lingual prompts · COLING 2025
EDCFlow: Exploring Temporally Dense Difference Maps for Event-based Optical Flow Estimation · CVPR 2025
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos · CVPR 2025
GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers · ICCV 2025
Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models · ICLR 2025
ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination · ICLR 2025
Diff-LMM: Diffusion Teacher-Guided Spatio-Temporal Perception for Video Large Multimodal Models · IJCAI 2025
Hallucination Reduction in Video-Language Models via Hierarchical Multimodal Consistency · IJCAI 2025
Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models · ECCV 2024
$\pi$-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation · ICML 2023
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline · CVPR 2023
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models · ICCV 2023
Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models · ICCV 2023
Accelerating Vision-Language Pretraining With Free Language Modeling · CVPR 2023
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning · ICCV 2023
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix · ICML 2022
End-to-End Dense Video Captioning With Parallel Decoding · ICCV 2021