Wenbo Hu
31 papers · 2017–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
🏃 Academic Marathon (8) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🐝 Cross-Pollinator (11)
🏃
Academic Marathon
(8)
🐝
Cross-Pollinator
(11)
🌈
Renaissance Researcher
(7)
🧬
Topic Evolution
🗃️
Keyword Collector
(136)
💎
Century Club
(28)
⚡
Prolific Year
(10)
Conferences
ICCV (7)
AAAI (5)
CVPR (5)
ECCV (4)
NIPS (3)
ACL (2)
IJCAI (2)
COLING (1)
EMNLP (1)
ICLR (1)
Top co-authors
Research topics
Keywords
diffusion model
(4)
multimodal large language model
(3)
multimodal learning
(3)
vision-language model
(3)
neural radiance field
(3)
temporal consistency
(3)
novel view synthesis
(2)
image restoration
(2)
jailbreak attack
(2)
image generation
(2)
few-shot learning
(2)
vision language model
(2)
time series forecasting
(2)
3d vision
(2)
visual question answering
(2)
3d reconstruction
(2)
point cloud
(2)
depth estimation
(2)
safety alignment
(2)
variational autoencoder
(2)
Papers
Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning
AAAI 2026
Sparse-Scale Transformer with Bidirectional Awareness for Time Series Forecasting
AAAI 2026
Benchmarking Trustworthiness in Multimodal LLMs for Video Understanding
AAAI 2026
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models
ICLR 2025
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
ICCV 2025
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
ICCV 2025
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
ICCV 2025
Verbalized Representation Learning for Interpretable Few-Shot Generalization
ICCV 2025
Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh
CVPR 2025
NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images
CVPR 2025
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
CVPR 2025
Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models
COLING 2025
SATA: A Paradigm for LLM Jailbreak via Simple Assistive Task Linkage
ACL 2025
SURE: Safety Understanding and Reasoning Enhancement for Multimodal Large Language Models
EMNLP 2025
Texture-GS: Disentangle the Geometry and Texture for 3D Gaussian Splatting Editing
ECCV 2024
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
NIPS 2024
Matryoshka Query Transformer for Large Vision-Language Models
NIPS 2024
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
AAAI 2024
Spotting the Unseen: Reciprocal Consensus Network Guided by Visual Archetypes
AAAI 2024
VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models
ACL 2024
Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields
CVPR 2024
Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration
ECCV 2024
Pixel-GS Density Control with Pixel-aware Gradient for 3D Gaussian Splatting
ECCV 2024
HiFi-123: Towards High-fidelity One Image to 3D Content Generation
ECCV 2024
Tri-MipRF: Tri-Mip Representation for Efficient Anti-Aliasing Neural Radiance Fields
ICCV 2023
Sparse Needlets for Lighting Estimation With Spherical Transport Loss
ICCV 2021
Two Birds with One Stone: Series Saliency for Accurate and Interpretable Multivariate Time Series Forecasting
IJCAI 2021
Deep Halftoning With Reversible Binary Pattern
ICCV 2021
Bidirectional Projection Network for Cross Dimension Scene Understanding
CVPR 2021
Calibrated Reliable Regression using Maximum Mean Discrepancy
NIPS 2020
Semi-supervised Max-margin Topic Model with Manifold Posterior Regularization
IJCAI 2017