Jiahao Wang
62 papers · 2020–2026 · 14 top CS/AI conferences
Achievements
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (5)
Taxonomy Completionist (19)
Conference Polyglot (14)
Cross-Pollinator (10)
Grand Slam
Keyword Champion
Mega-Team (20)
Unstoppable (6)
Trend Setter
Century Club (56)
Keyword Collector (290)
Prolific Year (19)
Conferences
CVPR (11)
AAAI (10)
ACL (7)
ICCV (7)
NIPS (7)
IJCAI (5)
ECCV (3)
EMNLP (3)
ICML (3)
COLING (2)
ICLR (1)
MICCAI (1)
NAACL (1)
WACV (1)
Keywords
large language model
(7)
image generation
(5)
diffusion model
(5)
vision-language model
(5)
knowledge distillation
(5)
video understanding
(4)
multimodal learning
(4)
model compression
(4)
attention mechanism
(3)
3d reconstruction
(3)
semantic segmentation
(3)
zero-shot learning
(3)
low-rank adaptation
(3)
code generation
(3)
cooperative perception
(3)
contrastive learning
(2)
benchmark evaluation
(2)
3d vision
(2)
transformer architecture
(2)
autonomous driving
(2)
Papers
Efficient Protein Optimization via Structure-aware Hamiltonian Dynamics
AAAI 2026
SparseCoop: Cooperative Perception with Kinematic-Grounded Queries
AAAI 2026
Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark
AAAI 2026
HTTrack: Learning to Perceive Targets via Historical Trajectories in Satellite Video Tracking
AAAI 2026
Revisiting Model Interpolation for Efficient Reasoning
ACL 2026
Semantic Feature Purification for Adversarially-Aware RGB-T Tracking
AAAI 2026
Spatiotemporal-Sensitive Network for Microvascular Obstruction Segmentation from Cine Cardiac Magnetic Resonance
MICCAI 2025
CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception
ICCV 2025
DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability
ICCV 2025
LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation
ICCV 2025
Imbalance in Balance: Online Concept Balancing in Generation Models
ICCV 2025
Stepping Out of Similar Semantic Space for Open-Vocabulary Segmentation
ICCV 2025
VP-MEL: Visual Prompts Guided Multimodal Entity Linking
ACL 2025
CFPT: Empowering Time Series Forecasting through Cross-Frequency Interaction and Periodic-Aware Timestamp Modeling
ICML 2025
RobustLight: Improving Robustness via Diffusion Reinforcement Learning for Traffic Signal Control
ICML 2025
Function-to-Style Guidance of LLMs for Code Translation
ICML 2025
Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner
IJCAI 2025
Egocentric Object-Interaction Anticipation with Retentive and Predictive Learning
IJCAI 2025
Edge-free but Structure-aware: Prototype-Guided Knowledge Distillation from GNNs to MLPs
COLING 2025
Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models
COLING 2025
Mamba-Reg: Vision Mamba Also Needs Registers
CVPR 2025
SceneCrafter: Controllable Multi-View Driving Scene Editing
CVPR 2025
Towards Precise Scaling Laws for Video Diffusion Transformers
CVPR 2025
IWRN: A Robust Blind Watermarking Method for Artwork Image Copyright Protection Against Noise Attack
AAAI 2025
SpotActor: Training-Free Layout-Controlled Consistent Image Generation
AAAI 2025
PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models
CVPR 2025
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content
CVPR 2025
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
ACL 2025
EasyRet3D: Uncalibrated Multi-View Multi-Human 3D Reconstruction and Tracking
WACV 2025
Speed Up Your Code: Progressive Code Acceleration Through Bidirectional Tree Editing
ACL 2025
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs
EMNLP 2025
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots
NAACL 2025
LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition
ACL 2024
OneActor: Consistent Subject Generation via Cluster-Conditioned Guidance
NIPS 2024
Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
NIPS 2024
Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing
NIPS 2024
Accelerating Non-Maximum Suppression: A Graph Theory Perspective
NIPS 2024
Unveiling LoRA Intrinsic Ranks via Salience Analysis
NIPS 2024
Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast
NIPS 2024
CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers
AAAI 2024
ViLT-CLIP: Video and Language Tuning CLIP with Multimodal Prompt Learning and Scenario-Guided Optimization
AAAI 2024
SAUI: Scale-Aware Unseen Imagineer for Zero-Shot Object Detection
AAAI 2024
LLaMA Pro: Progressive LLaMA with Block Expansion
ACL 2024
Boosting Textual NER with Synthetic Image and Instructive Alignment
ACL 2024
RepKPU: Point Cloud Upsampling with Kernel Point Representation and Deformation
CVPR 2024
Structure-Aware Sparse-View X-ray 3D Reconstruction
CVPR 2024
Universal Segmentation at Arbitrary Granularity with Language Instruction
CVPR 2024
Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis
ECCV 2024
Mixture-of-Subspaces in Low-Rank Adaptation
EMNLP 2024
Generating Images with 3D Annotations Using Diffusion Models
ICLR 2024
Fast and Continual Knowledge Graph Embedding via Incremental LoRA
IJCAI 2024
Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge
EMNLP 2023
RIFormer: Keep Your Vision Backbone Effective but Removing Token Mixer
CVPR 2023
Animal3D: A Comprehensive Dataset of 3D Animal Pose and Shape
ICCV 2023
Memory-and-Anticipation Transformer for Online Action Understanding
ICCV 2023
PACE: Predictive and Contrastive Embedding for Unsupervised Action Segmentation
IJCAI 2022
Global Spectral Filter Memory Network for Video Object Segmentation
ECCV 2022
SAGA: Stochastic Whole-Body Grasping with Contact
ECCV 2022
Accelerating Neural Network Optimization Through an Automated Control Theory Lens
CVPR 2022
Learning Adaptive Warping for Real-World Rolling Shutter Correction
CVPR 2022
Adder Attention for Vision Transformer
NIPS 2021
Enhancing Urban Flow Maps via Neural ODEs
IJCAI 2020