conftrace_

Teng Wang

23 papers · 2021–2026 · 10 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+8 more ↓ 🐝 Cross-Pollinator (13) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (9) 🏃 Academic Marathon (5)
🌍 Conference Polyglot (9) 🌈 Renaissance Researcher (6) 🐝 Cross-Pollinator (13) 🤝 Dynamic Duo (11) 🔥 Unstoppable (5) 💎 Century Club (19) 🗃️ Keyword Collector (98) Prolific Year (6)

Conferences

ICCV (5) CVPR (4) AAAI (3) ACL (2) ICLR (2) ICML (2) IJCAI (2) COLING (1) ECCV (1) EMNLP (1)

Papers

CP-Router: An Uncertainty-Aware Router Between LLM and LRM AAAI 2026 R-AVST: Empowering Video-LLMs with Fine-Grained Spatio-Temporal Reasoning in Complex Audio-Visual Scenarios AAAI 2026 ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge Evolution ACL 2026 First Learn, Then Review: Human-Like Continual Learning for Cross-View Geo-Localization with Limited Field of View AAAI 2026 Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors EMNLP 2025 BPP-Search: Enhancing Tree of Thought Reasoning for Mathematical Modeling Problem Solving ACL 2025 Large Language Models are good multi-lingual learners : When LLMs meet cross-lingual prompts COLING 2025 EDCFlow: Exploring Temporally Dense Difference Maps for Event-based Optical Flow Estimation CVPR 2025 LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos CVPR 2025 GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers ICCV 2025 Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models ICLR 2025 ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination ICLR 2025 Diff-LMM: Diffusion Teacher-Guided Spatio-Temporal Perception for Video Large Multimodal Models IJCAI 2025 Hallucination Reduction in Video-Language Models via Hierarchical Multimodal Consistency IJCAI 2025 Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models ECCV 2024 $\pi$-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation ICML 2023 Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline CVPR 2023 Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models ICCV 2023 Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models ICCV 2023 Accelerating Vision-Language Pretraining With Free Language Modeling CVPR 2023 Transferable Decoding with Visual Entities for Zero-Shot Image Captioning ICCV 2023 VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix ICML 2022 End-to-End Dense Video Captioning With Parallel Decoding ICCV 2021