Zhibin Wang

25 papers · 2017–2026 · 10 conferences · across top CS/AI conferences

Achievements

🏃 Academic Marathon (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🐝 Cross-Pollinator (10)

🗺️ Taxonomy Completionist (46) 👑 Triple Crown 🏆 Grand Slam 🤝 Dynamic Duo (11) 🧬 Topic Evolution 💎 Century Club (21) 🗃️ Keyword Collector (108) 🔥 Unstoppable (5) ⚡ Prolific Year (6)

Conferences

AAAI (6) CVPR (5) ICLR (3) NIPS (3) ACL (2) ICCV (2) ECCV (1) ICML (1) INTERSPEECH (1) WACV (1)

Top co-authors

Qiang Zhou (11) Fan Wang (9) Gang Yu (5) Bin Fu (5) Chunhua Shen (5) Hao Li (4) Fei Du (3) Xianfang Zeng (3) Jianlong Yuan (3) Shaofeng Zhang (3)

Keywords

semantic segmentation (4) diffusion model (3) weakly supervised learning (2) data augmentation (2) pseudo labeling (2) foundation model (2) video understanding (2) semi-supervised learning (2) pseudo label (2) image generation (2) image segmentation (1) policy optimization (1) human perception (1) object detection (1) self-attention mechanism (1) contrastive learning (1) question answering (1) vision transformer (1) catastrophic forgetting (1) knowledge distillation (1)

Papers

StreamKV: Streaming Video Question-Answering with Segment-based KV Cache Retrieval and Compression · AAAI 2026
MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs · AAAI 2026
Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning · AAAI 2026
Resonating with RoPE: Spectral Quantization for High-Fidelity Key Cache Compression · ACL 2026
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D · CVPR 2025
CR2PQ: Continuous Relative Rotary Positional Query for Dense Visual Representation Learning · ICLR 2025
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models · NIPS 2024
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach · ICLR 2024
Dynamic Token-Pass Transformers for Semantic Segmentation · WACV 2024
I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing · NIPS 2024
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models · CVPR 2024
Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data · ACL 2024
Data Pruning via Moving-one-Sample-out · NIPS 2023
SwinRDM: Integrate SwinRNN with Diffusion Model towards High-Resolution and High-Quality Weather Forecasting · AAAI 2023
Point-Teaching: Weakly Semi-supervised Object Detection with Point Annotations · AAAI 2023
Frequency Domain Disentanglement for Arbitrary Neural Style Transfer · AAAI 2023
Efficient Mask Correction for Click-Based Interactive Image Segmentation · CVPR 2023
Foundation Model Drives Weakly Incremental Learning for Semantic Segmentation · CVPR 2023
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering · ICCV 2023
LMSeg: Language-guided Multi-dataset Segmentation · ICLR 2023
Patch-level Contrastive Learning via Positional Query for Visual Pre-training · ICML 2023
Poseur: Direct Human Pose Regression with Transformers · ECCV 2022
A Simple Baseline for Semi-Supervised Semantic Segmentation With Strong Data Augmentation · ICCV 2021
Instant-Teaching: An End-to-End Semi-Supervised Object Detection Framework · CVPR 2021
The Opensesame NIST 2016 Speaker Recognition Evaluation System · INTERSPEECH 2017