Zhen Xu
35 papers · 2015–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (12) π Academic Marathon (10) π Renaissance Researcher (10) πΊοΈ Taxonomy Completionist (74)
π
Renaissance Researcher
(10)
π
Interdisciplinary Bridge
π
Academic Marathon
(10)
π§¬
Topic Evolution
π
Keyword Champion
π
Trend Setter
π
Century Club
(32)
π₯
Unstoppable
(6)
β‘
Prolific Year
(6)
ποΈ
Keyword Collector
(179)
Conferences
CVPR (12)
AAAI (5)
EMNLP (4)
ICCV (4)
ACL (2)
JMLR (2)
COLING (1)
IJCAI (1)
IJCNLP (1)
INTERSPEECH (1)
NAACL (1)
NIPS (1)
Top co-authors
Research topics
Keywords
view synthesis
(4)
dialogue system
(4)
novel view synthesis
(4)
response generation
(4)
generative adversarial network
(3)
text generation
(3)
neural rendering
(3)
large language model
(3)
conversational agent
(2)
3d reconstruction
(2)
deformation field
(2)
energy-based model
(2)
multimodal fusion
(2)
electronic health record
(2)
3d gaussian splatting
(2)
neural architecture search
(2)
scene reconstruction
(2)
point cloud
(2)
diffusion model
(2)
dynamic scene
(2)
Papers
ST-SAM: Multimodal Scene Text Segmentation with Dense Visual and Sparse Textual Prompts via SAM
AAAI 2026
The Digital Dunning-Kruger Effect: Decoupling Hallucinations via Geometric Hidden-state Observation for Semantic Truthfulness
ACL 2026
CRΒ³: Boosting Compositional Reasoning in MLLMs Through Rule-Based Reinforcement Learning
AAAI 2026
FreeTimeGS: Free Gaussian Primitives at Anytime Anywhere for Dynamic Scene Reconstruction
CVPR 2025
StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models
CVPR 2025
Task-aware Cross-modal Feature Refinement Transformer with Large Language Models for Visual Grounding
CVPR 2025
EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
CVPR 2025
Anchoring-Guidance Fine-Tuning (AnGFT): Elevating Professional Response Quality in Role-Playing Conversational Agents
EMNLP 2025
Bringing Pedagogy into Focus: Evaluating Virtual Teaching Assistantsβ Question-Answering in Asynchronous Learning Environments
EMNLP 2025
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
ICCV 2025
Hierarchy UGP: Hierarchy Unified Gaussian Primitive for Large-Scale Dynamic Scene Reconstruction
ICCV 2025
ERNet: Efficient Non-Rigid Registration Network for Point Sequences
ICCV 2025
Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement
EMNLP 2024
Relightable and Animatable Neural Avatar from Sparse-View Video
CVPR 2024
4K4D: Real-Time 4D View Synthesis at 4K Resolution
CVPR 2024
Learning Neural Volumetric Representations of Dynamic Humans in Minutes
CVPR 2023
CodaLab Competitions: An Open Source Platform to Organize Scientific Challenges
JMLR 2023
Text-Guided Unsupervised Latent Transformation for Multi-Attribute Image Manipulation
CVPR 2023
Blemish-Aware and Progressive Face Retouching With Limited Paired Data
CVPR 2023
360-Attack: Distortion-Aware Perturbations From Perspective-Views
CVPR 2022
Confidence Propagation Cluster: Unleash Full Potential of Object Detectors
CVPR 2022
MUFASA: Multimodal Fusion Architecture Search for Electronic Health Records
AAAI 2021
Empowering Adaptive Early-Exit Inference with Latency Awareness
AAAI 2021
Multimodal Fusion with Co-Attention Networks for Fake News Detection
IJCNLP 2021
Multimodal Fusion with Co-Attention Networks for Fake News Detection
ACL 2021
MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
IJCAI 2021
LocalGAN: Modeling Local Distributions for Adversarial Response Generation
JMLR 2021
AutoSpeech 2020: The Second Automated Machine Learning Challenge for Speech Classification
INTERSPEECH 2020
Learning the Graphical Structure of Electronic Health Records with Graph Convolutional Transformer
AAAI 2020
Flow Contrastive Estimation of Energy-Based Models
CVPR 2020
A Prospective-Performance Network to Alleviate Myopia in Beam Search for Response Generation
COLING 2018
LSDSCC: a Large Scale Domain-Specific Conversational Corpus for Response Generation with Diversity Oriented Evaluation Metrics
NAACL 2018
Neural Response Generation via GAN with an Approximate Embedding Layer
EMNLP 2017
Using Social Dynamics to Make Individual Predictions: Variational Inference with a Stochastic Kinetic Model
NIPS 2016
Activity Auto-Completion: Predicting Human Activities From Partial Videos
ICCV 2015