Jiahao Wang

62 papers · 2020–2026 · 14 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🗺️ Taxonomy Completionist (19) 🌍 Conference Polyglot (14)

🌈 Renaissance Researcher (5) 🧭 Keyword Pioneer 🐝 Cross-Pollinator (10) 🏆 Grand Slam 🏆 Keyword Champion 👥 Mega-Team (20) 🔥 Unstoppable (6) 📈 Trend Setter 💎 Century Club (56) 🗃️ Keyword Collector (290) ⚡ Prolific Year (19)

Conferences

CVPR (11) AAAI (10) ACL (7) ICCV (7) NIPS (7) IJCAI (5) ECCV (3) EMNLP (3) ICML (3) COLING (2) ICLR (1) MICCAI (1) NAACL (1) WACV (1)

Top co-authors

Yujiu Yang (8) Taiqiang Wu (7) Yong Liu (6) Alan Yuille (6) Ping Luo (5) Weizhan Zhang (5) Ngai Wong (4) Xin Tao (4) Mengmeng Wang (4) Tong Lu (4)

Keywords

large language model (7) image generation (5) diffusion model (5) vision-language model (5) knowledge distillation (5) video understanding (4) multimodal learning (4) model compression (4) attention mechanism (3) 3d reconstruction (3) semantic segmentation (3) zero-shot learning (3) low-rank adaptation (3) code generation (3) cooperative perception (3) contrastive learning (2) benchmark evaluation (2) 3d vision (2) transformer architecture (2) autonomous driving (2)

Papers

Efficient Protein Optimization via Structure-aware Hamiltonian Dynamics AAAI 2026 SparseCoop: Cooperative Perception with Kinematic-Grounded Queries AAAI 2026 Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark AAAI 2026 HTTrack: Learning to Perceive Targets via Historical Trajectories in Satellite Video Tracking AAAI 2026 Revisiting Model Interpolation for Efficient Reasoning ACL 2026 Semantic Feature Purification for Adversarially-Aware RGB-T Tracking AAAI 2026 Spatiotemporal-Sensitive Network for Microvascular Obstruction Segmentation from Cine Cardiac Magnetic Resonance MICCAI 2025 CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception ICCV 2025 DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability ICCV 2025 LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation ICCV 2025 Imbalance in Balance: Online Concept Balancing in Generation Models ICCV 2025 Stepping Out of Similar Semantic Space for Open-Vocabulary Segmentation ICCV 2025 VP-MEL: Visual Prompts Guided Multimodal Entity Linking ACL 2025 CFPT: Empowering Time Series Forecasting through Cross-Frequency Interaction and Periodic-Aware Timestamp Modeling ICML 2025 RobustLight: Improving Robustness via Diffusion Reinforcement Learning for Traffic Signal Control ICML 2025 Function-to-Style Guidance of LLMs for Code Translation ICML 2025 Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner IJCAI 2025 Egocentric Object-Interaction Anticipation with Retentive and Predictive Learning IJCAI 2025 Edge-free but Structure-aware: Prototype-Guided Knowledge Distillation from GNNs to MLPs COLING 2025 Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models COLING 2025 Mamba-Reg: Vision Mamba Also Needs Registers CVPR 2025 SceneCrafter: Controllable Multi-View Driving Scene Editing CVPR 2025 Towards Precise Scaling Laws for Video Diffusion Transformers CVPR 2025 IWRN:A Robust Blind Watermarking Method for Artwork Image Copyright Protection Against Noise Attack AAAI 2025 SpotActor: Training-Free Layout-Controlled Consistent Image Generation AAAI 2025 PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models CVPR 2025 Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content CVPR 2025 EfficientQAT: Efficient Quantization-Aware Training for Large Language Models ACL 2025 EasyRet3D: Uncalibrated Multi-View Multi-Human 3D Reconstruction and Tracking WACV 2025 Speed Up Your Code: Progressive Code Acceleration Through Bidirectional Tree Editing ACL 2025 SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs EMNLP 2025 Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots NAACL 2025 LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition ACL 2024 OneActor: Consistent Subject Generation via Cluster-Conditioned Guidance NIPS 2024 Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery NIPS 2024 Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing NIPS 2024 Accelerating Non-Maximum Suppression: A Graph Theory Perspective NIPS 2024 Unveiling LoRA Intrinsic Ranks via Salience Analysis NIPS 2024 Unchosen Experts Can Contribute Too: Unleashing MoE Models’ Power by Self-Contrast NIPS 2024 CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers AAAI 2024 ViLT-CLIP: Video and Language Tuning CLIP with Multimodal Prompt Learning and Scenario-Guided Optimization AAAI 2024 SAUI: Scale-Aware Unseen Imagineer for Zero-Shot Object Detection AAAI 2024 LLaMA Pro: Progressive LLaMA with Block Expansion ACL 2024 Boosting Textural NER with Synthetic Image and Instructive Alignment ACL 2024 RepKPU: Point Cloud Upsampling with Kernel Point Representation and Deformation CVPR 2024 Structure-Aware Sparse-View X-ray 3D Reconstruction CVPR 2024 Universal Segmentation at Arbitrary Granularity with Language Instruction CVPR 2024 Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis ECCV 2024 Mixture-of-Subspaces in Low-Rank Adaptation EMNLP 2024 Generating Images with 3D Annotations Using Diffusion Models ICLR 2024 Fast and Continual Knowledge Graph Embedding via Incremental LoRA IJCAI 2024 Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge EMNLP 2023 RIFormer: Keep Your Vision Backbone Effective but Removing Token Mixer CVPR 2023 Animal3D: A Comprehensive Dataset of 3D Animal Pose and Shape ICCV 2023 Memory-and-Anticipation Transformer for Online Action Understanding ICCV 2023 PACE: Predictive and Contrastive Embedding for Unsupervised Action Segmentation IJCAI 2022 Global Spectral Filter Memory Network for Video Object Segmentation ECCV 2022 SAGA: Stochastic Whole-Body Grasping with Contact ECCV 2022 Accelerating Neural Network Optimization Through an Automated Control Theory Lens CVPR 2022 Learning Adaptive Warping for Real-World Rolling Shutter Correction CVPR 2022 Adder Attention for Vision Transformer NIPS 2021 Enhancing Urban Flow Maps via Neural ODEs IJCAI 2020