Chen Gao

43 papers · 2019–2026 · 9 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (10) 🏃 Academic Marathon (6) 🌍 Conference Polyglot (9) 🗺️ Taxonomy Completionist (88)

🗺️ Taxonomy Completionist (88) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🤝 Dynamic Duo (14) 🧬 Topic Evolution 💎 Century Club (37) ❓ The Questioner (2) 🗃️ Keyword Collector (206) 🔥 Unstoppable (7) ⚡ Prolific Year (5)

Conferences

CVPR (10) ACL (7) AAAI (6) ICCV (5) ECCV (4) EMNLP (4) NIPS (4) IJCAI (2) ICLR (1)

Top co-authors

Yong Li (15) Jia-Bin Huang (9) Si Liu (9) Xinlei Chen (6) Johannes Kopf (5) Zhengqiu Zhu (5) Ayush Saraf (5) Changil Kim (5) Depeng Jin (4) Jirong Zha (4)

Research topics

Statistics (1)

Keywords

large language model (8) neural radiance field (4) vision-language model (3) embodied agent (3) vision-language navigation (3) embodied ai (3) spatial reasoning (3) view synthesis (3) recommendation system (2) novel view synthesis (2) radiance field (2) image generation (2) reinforcement learning (2) dynamic scene (2) generative adversarial network (2) neural architecture search (2) point cloud (2) object localization (2) 3d reconstruction (2) hierarchical planning (2)

Papers

Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology AAAI 2026 DIMM: Decoupled Multi-hierarchy Kalman Filter via Reinforcement Learning AAAI 2026 CityCube: Benchmarking Cross-view Spatial Reasoning on Vision-Language Models in Urban Environments ACL 2026 Learn to Relax with Large Language Models: Solving Constraint Optimization Problems via Bidirectional Coevolution ACL 2026 SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World AAAI 2026 AirCopBench: A Benchmark for Multi-drone Collaborative Embodied Perception and Reasoning AAAI 2026 Open-Set Living Need Prediction with Large Language Models ACL 2025 Iterative Sparse Attention for Long-sequence Recommendation AAAI 2025 MIA-Tuner: Adapting Large Language Models as Pre-training Text Detector AAAI 2025 Defining and Evaluating Visual Language Models’ Basic Spatial Abilities: A Perspective from Psychometrics ACL 2025 CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory ACL 2025 UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces ACL 2025 Textured Gaussians for Enhanced 3D Scene Appearance Modeling CVPR 2025 CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space EMNLP 2025 PychoAgent: Psychology-driven LLM Agents for Explainable Panic Prediction on Social Media during Sudden Disaster Events EMNLP 2025 Analyzing and Modeling LLM Response Lengths with Extreme Value Theory: Anchoring Effects and Hybrid Distributions EMNLP 2025 Depression Detection on Social Media with Large Language Models EMNLP 2025 Exploring View Consistency for Scene-Adaptive Low-Light Light Field Image Enhancement ICCV 2025 Epipolar Consistent Attention Aggregation Network for Unsupervised Light Field Disparity Estimation ICCV 2025 How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM IJCAI 2025 Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection ECCV 2024 SpecNeRF: Gaussian Directional Encoding for Specular Reflections CVPR 2024 Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration NIPS 2024 EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities ACL 2024 OmnimatteRF: Robust Omnimatte with 3D Background Modeling ICCV 2023 Adaptive Zone-Aware Hierarchical Planner for Vision-Language Navigation CVPR 2023 Progressively Optimized Local Radiance Fields for Robust View Synthesis CVPR 2023 Robust Dynamic Radiance Fields CVPR 2023 3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection CVPR 2022 Reinforced Structured State-Evolution for Vision-Language Navigation CVPR 2022 Dynamic View Synthesis From Dynamic Monocular Video ICCV 2021 Mining the Benefits of Two-stage and One-stage HOI Detection NIPS 2021 Learnable Embedding sizes for Recommender Systems ICLR 2021 Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression CVPR 2021 Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism ICCV 2021 Progressive Feature Interaction Search for Deep Sparse Network NIPS 2021 NAS-DIP: Learning Deep Image Prior with Neural Architecture Search ECCV 2020 Flow-edge Guided Video Completion ECCV 2020 PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer CVPR 2020 DRG: Dual Relation Graph for Human-Object Interaction Detection ECCV 2020 AdversarialNAS: Adversarial Neural Architecture Search for GANs CVPR 2020 Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition NIPS 2019 DeepAPF: Deep Attentive Probabilistic Factorization for Multi-site Video Recommendation IJCAI 2019