Zhibin Wang
25 papers · 2017–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Academic Marathon (8) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (10) π Cross-Pollinator (10)
πΊοΈ
Taxonomy Completionist
(46)
π
Conference Polyglot
(10)
π
Triple Crown
π
Grand Slam
π€
Dynamic Duo
(11)
π§¬
Topic Evolution
π
Century Club
(21)
ποΈ
Keyword Collector
(108)
π₯
Unstoppable
(5)
β‘
Prolific Year
(6)
Conferences
AAAI (6)
CVPR (5)
ICLR (3)
NIPS (3)
ACL (2)
ICCV (2)
ECCV (1)
ICML (1)
INTERSPEECH (1)
WACV (1)
Top co-authors
Keywords
semantic segmentation
(4)
diffusion model
(3)
weakly supervised learning
(2)
data augmentation
(2)
pseudo labeling
(2)
foundation model
(2)
video understanding
(2)
semi-supervised learning
(2)
pseudo label
(2)
image generation
(2)
image segmentation
(1)
policy optimization
(1)
human perception
(1)
object detection
(1)
self-attention mechanism
(1)
contrastive learning
(1)
question answering
(1)
vision transformer
(1)
catastrophic forgetting
(1)
knowledge distillation
(1)
Papers
StreamKV: Streaming Video Question-Answering with Segment-based KV Cache Retrieval and Compression
AAAI 2026
MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs
AAAI 2026
Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning
AAAI 2026
Resonating with RoPE: Spectral Quantization for High-Fidelity Key Cache Compression
ACL 2026
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
CVPR 2025
CR2PQ: Continuous Relative Rotary Positional Query for Dense Visual Representation Learning
ICLR 2025
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models
NIPS 2024
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach
ICLR 2024
Dynamic Token-Pass Transformers for Semantic Segmentation
WACV 2024
I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing
NIPS 2024
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
CVPR 2024
Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data
ACL 2024
Data Pruning via Moving-one-Sample-out
NIPS 2023
SwinRDM: Integrate SwinRNN with Diffusion Model towards High-Resolution and High-Quality Weather Forecasting
AAAI 2023
Point-Teaching: Weakly Semi-supervised Object Detection with Point Annotations
AAAI 2023
Frequency Domain Disentanglement for Arbitrary Neural Style Transfer
AAAI 2023
Efficient Mask Correction for Click-Based Interactive Image Segmentation
CVPR 2023
Foundation Model Drives Weakly Incremental Learning for Semantic Segmentation
CVPR 2023
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering
ICCV 2023
LMSeg: Language-guided Multi-dataset Segmentation
ICLR 2023
Patch-level Contrastive Learning via Positional Query for Visual Pre-training
ICML 2023
Poseur: Direct Human Pose Regression with Transformers
ECCV 2022
A Simple Baseline for Semi-Supervised Semantic Segmentation With Strong Data Augmentation
ICCV 2021
Instant-Teaching: An End-to-End Semi-Supervised Object Detection Framework
CVPR 2021
The Opensesame NIST 2016 Speaker Recognition Evaluation System
INTERSPEECH 2017