Teng Wang
23 papers · 2021–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
🐝 Cross-Pollinator (13) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (9) 🏃 Academic Marathon (5)
🌍
Conference Polyglot
(9)
🌈
Renaissance Researcher
(6)
🐝
Cross-Pollinator
(13)
🤝
Dynamic Duo
(11)
🔥
Unstoppable
(5)
💎
Century Club
(19)
🗃️
Keyword Collector
(98)
⚡
Prolific Year
(6)
Conferences
ICCV (5)
CVPR (4)
AAAI (3)
ACL (2)
ICLR (2)
ICML (2)
IJCAI (2)
COLING (1)
ECCV (1)
EMNLP (1)
Top co-authors
Keywords
multimodal learning
(4)
vision-language model
(4)
video understanding
(3)
large language model
(3)
transfer learning
(2)
event detection
(2)
video-language model
(2)
prompt tuning
(2)
reinforcement learning
(2)
conformal prediction
(1)
attention mechanism
(1)
event camera
(1)
knowledge distillation
(1)
event understanding
(1)
multi-task learning
(1)
contrastive learning
(1)
motion estimation
(1)
uncertainty quantification
(1)
continual learning
(1)
multi-modal learning
(1)
Papers
CP-Router: An Uncertainty-Aware Router Between LLM and LRM
AAAI 2026
R-AVST: Empowering Video-LLMs with Fine-Grained Spatio-Temporal Reasoning in Complex Audio-Visual Scenarios
AAAI 2026
ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge Evolution
ACL 2026
First Learn, Then Review: Human-Like Continual Learning for Cross-View Geo-Localization with Limited Field of View
AAAI 2026
Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors
EMNLP 2025
BPP-Search: Enhancing Tree of Thought Reasoning for Mathematical Modeling Problem Solving
ACL 2025
Large Language Models are good multi-lingual learners : When LLMs meet cross-lingual prompts
COLING 2025
EDCFlow: Exploring Temporally Dense Difference Maps for Event-based Optical Flow Estimation
CVPR 2025
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos
CVPR 2025
GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers
ICCV 2025
Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models
ICLR 2025
ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination
ICLR 2025
Diff-LMM: Diffusion Teacher-Guided Spatio-Temporal Perception for Video Large Multimodal Models
IJCAI 2025
Hallucination Reduction in Video-Language Models via Hierarchical Multimodal Consistency
IJCAI 2025
Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
ECCV 2024
$\pi$-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation
ICML 2023
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline
CVPR 2023
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models
ICCV 2023
Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models
ICCV 2023
Accelerating Vision-Language Pretraining With Free Language Modeling
CVPR 2023
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
ICCV 2023
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix
ICML 2022
End-to-End Dense Video Captioning With Parallel Decoding
ICCV 2021