Zhen Xu

35 papers · 2015–2026 · across 12 top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (12) 🏃 Academic Marathon (10) 🌈 Renaissance Researcher (10) 🗺️ Taxonomy Completionist (74) 🧬 Topic Evolution 🏆 Keyword Champion 📈 Trend Setter 💎 Century Club (32) 🔥 Unstoppable (6) ⚡ Prolific Year (6) 🗃️ Keyword Collector (179)

Conferences

CVPR (12) AAAI (5) EMNLP (4) ICCV (4) ACL (2) JMLR (2) COLING (1) IJCAI (1) IJCNLP (1) INTERSPEECH (1) NAACL (1) NIPS (1)

Top co-authors

Xiaowei Zhou (9) Sida Peng (9) Hujun Bao (8) Baoxun Wang (6) Liming Wang (4) Jiaming Sun (3) Yunjian Zhang (3) Zhuoran Wang (3) Bingquan Liu (3) Si Wu (3)

Research topics

Education (1)

Keywords

view synthesis (4) dialogue system (4) novel view synthesis (4) response generation (4) generative adversarial network (3) text generation (3) neural rendering (3) large language model (3) conversational agent (2) 3d reconstruction (2) deformation field (2) energy-based model (2) multimodal fusion (2) electronic health record (2) 3d gaussian splatting (2) neural architecture search (2) scene reconstruction (2) point cloud (2) diffusion model (2) dynamic scene (2)

Papers

ST-SAM: Multimodal Scene Text Segmentation with Dense Visual and Sparse Textual Prompts via SAM AAAI 2026
The Digital Dunning-Kruger Effect: Decoupling Hallucinations via Geometric Hidden-state Observation for Semantic Truthfulness ACL 2026
CR³: Boosting Compositional Reasoning in MLLMs Through Rule-Based Reinforcement Learning AAAI 2026
FreeTimeGS: Free Gaussian Primitives at Anytime Anywhere for Dynamic Scene Reconstruction CVPR 2025
StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models CVPR 2025
Task-aware Cross-modal Feature Refinement Transformer with Large Language Models for Visual Grounding CVPR 2025
EnvGS: Modeling View-Dependent Appearance with Environment Gaussian CVPR 2025
Anchoring-Guidance Fine-Tuning (AnGFT): Elevating Professional Response Quality in Role-Playing Conversational Agents EMNLP 2025
Bringing Pedagogy into Focus: Evaluating Virtual Teaching Assistants’ Question-Answering in Asynchronous Learning Environments EMNLP 2025
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models ICCV 2025
Hierarchy UGP: Hierarchy Unified Gaussian Primitive for Large-Scale Dynamic Scene Reconstruction ICCV 2025
ERNet: Efficient Non-Rigid Registration Network for Point Sequences ICCV 2025
Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement EMNLP 2024
Relightable and Animatable Neural Avatar from Sparse-View Video CVPR 2024
4K4D: Real-Time 4D View Synthesis at 4K Resolution CVPR 2024
Learning Neural Volumetric Representations of Dynamic Humans in Minutes CVPR 2023
CodaLab Competitions: An Open Source Platform to Organize Scientific Challenges JMLR 2023
Text-Guided Unsupervised Latent Transformation for Multi-Attribute Image Manipulation CVPR 2023
Blemish-Aware and Progressive Face Retouching With Limited Paired Data CVPR 2023
360-Attack: Distortion-Aware Perturbations From Perspective-Views CVPR 2022
Confidence Propagation Cluster: Unleash Full Potential of Object Detectors CVPR 2022
MUFASA: Multimodal Fusion Architecture Search for Electronic Health Records AAAI 2021
Empowering Adaptive Early-Exit Inference with Latency Awareness AAAI 2021
Multimodal Fusion with Co-Attention Networks for Fake News Detection IJCNLP 2021
Multimodal Fusion with Co-Attention Networks for Fake News Detection ACL 2021
MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction IJCAI 2021
LocalGAN: Modeling Local Distributions for Adversarial Response Generation JMLR 2021
AutoSpeech 2020: The Second Automated Machine Learning Challenge for Speech Classification INTERSPEECH 2020
Learning the Graphical Structure of Electronic Health Records with Graph Convolutional Transformer AAAI 2020
Flow Contrastive Estimation of Energy-Based Models CVPR 2020
A Prospective-Performance Network to Alleviate Myopia in Beam Search for Response Generation COLING 2018
LSDSCC: a Large Scale Domain-Specific Conversational Corpus for Response Generation with Diversity Oriented Evaluation Metrics NAACL 2018
Neural Response Generation via GAN with an Approximate Embedding Layer EMNLP 2017
Using Social Dynamics to Make Individual Predictions: Variational Inference with a Stochastic Kinetic Model NIPS 2016
Activity Auto-Completion: Predicting Human Activities From Partial Videos ICCV 2015