Wenbo Hu

31 papers · 2017–2026 · 10 conferences · across top CS/AI conferences

Achievements

+7 more ↓

🏃 Academic Marathon (8) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🐝 Cross-Pollinator (11)

🏃 Academic Marathon (8) 🐝 Cross-Pollinator (11) 🌈 Renaissance Researcher (7) 🧬 Topic Evolution 🗃️ Keyword Collector (136) 💎 Century Club (28) ⚡ Prolific Year (10)

Conferences

ICCV (7) AAAI (5) CVPR (5) ECCV (4) NIPS (3) ACL (2) IJCAI (2) COLING (1) EMNLP (1) ICLR (1)

Top co-authors

Ying Shan (8) Xiaoyu Li (6) Richang Hong (5) Xiangjun Gao (4) Nanyun Peng (4) Zi-Yi Dou (3) Long Quan (3) Jun Zhu (3) Kai-Wei Chang (3) Zijun Chen (3)

Research topics

Applications (1)

Keywords

diffusion model (4) multimodal large language model (3) multimodal learning (3) vision-language model (3) neural radiance field (3) temporal consistency (3) novel view synthesis (2) image restoration (2) jailbreak attack (2) image generation (2) few-shot learning (2) vision language model (2) time series forecasting (2) 3d vision (2) visual question answering (2) 3d reconstruction (2) point cloud (2) depth estimation (2) safety alignment (2) variational autoencoder (2)

Papers

Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning AAAI 2026 Sparse-Scale Transformer with Bidirectional Awareness for Time Series Forecasting AAAI 2026 Benchmarking Trustworthiness in Multimodal LLMs for Video Understanding AAAI 2026 MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models ICLR 2025 TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models ICCV 2025 GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors ICCV 2025 NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors ICCV 2025 Verbalized Representation Learning for Interpretable Few-Shot Generalization ICCV 2025 Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh CVPR 2025 NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images CVPR 2025 DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos CVPR 2025 Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models COLING 2025 SATA: A Paradigm for LLM Jailbreak via Simple Assistive Task Linkage ACL 2025 SURE: Safety Understanding and Reasoning Enhancement for Multimodal Large Language Models EMNLP 2025 Texture-GS: Disentangle the Geometry and Texture for 3D Gaussian Splatting Editing ECCV 2024 CV-VAE: A Compatible Video VAE for Latent Generative Video Models NIPS 2024 Matryoshka Query Transformer for Large Vision-Language Models NIPS 2024 BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions AAAI 2024 Spotting the Unseen: Reciprocal Consensus Network Guided by Visual Archetypes AAAI 2024 VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models ACL 2024 Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields CVPR 2024 Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration ECCV 2024 Pixel-GS Density Control with Pixel-aware Gradient for 3D Gaussian Splatting ECCV 2024 HiFi-123: Towards High-fidelity One Image to 3D Content Generation ECCV 2024 Tri-MipRF: Tri-Mip Representation for Efficient Anti-Aliasing Neural Radiance Fields ICCV 2023 Sparse Needlets for Lighting Estimation With Spherical Transport Loss ICCV 2021 Two Birds with One Stone: Series Saliency for Accurate and Interpretable Multivariate Time Series Forecasting IJCAI 2021 Deep Halftoning With Reversible Binary Pattern ICCV 2021 Bidirectional Projection Network for Cross Dimension Scene Understanding CVPR 2021 Calibrated Reliable Regression using Maximum Mean Discrepancy NIPS 2020 Semi-supervised Max-margin Topic Model with Manifold Posterior Regularization IJCAI 2017