Chen Gao
43 papers · 2019–2026 · 9 top CS/AI conferences
Achievements
Interdisciplinary Bridge
Renaissance Researcher (10)
Academic Marathon (6)
Conference Polyglot (9)
Taxonomy Completionist (88)
Keyword Pioneer
Hot Topic Early Bird
Dynamic Duo (14)
Topic Evolution
Century Club (37)
The Questioner (2)
Keyword Collector (206)
Unstoppable (7)
Prolific Year (5)
Conferences
CVPR (10)
ACL (7)
AAAI (6)
ICCV (5)
ECCV (4)
EMNLP (4)
NIPS (4)
IJCAI (2)
ICLR (1)
Top co-authors
Research topics
Keywords
large language model
(8)
neural radiance field
(4)
vision-language model
(3)
embodied agent
(3)
vision-language navigation
(3)
embodied ai
(3)
spatial reasoning
(3)
view synthesis
(3)
recommendation system
(2)
novel view synthesis
(2)
radiance field
(2)
image generation
(2)
reinforcement learning
(2)
dynamic scene
(2)
generative adversarial network
(2)
neural architecture search
(2)
point cloud
(2)
object localization
(2)
3d reconstruction
(2)
hierarchical planning
(2)
Papers
Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology
AAAI 2026
DIMM: Decoupled Multi-hierarchy Kalman Filter via Reinforcement Learning
AAAI 2026
CityCube: Benchmarking Cross-view Spatial Reasoning on Vision-Language Models in Urban Environments
ACL 2026
Learn to Relax with Large Language Models: Solving Constraint Optimization Problems via Bidirectional Coevolution
ACL 2026
SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World
AAAI 2026
AirCopBench: A Benchmark for Multi-drone Collaborative Embodied Perception and Reasoning
AAAI 2026
Open-Set Living Need Prediction with Large Language Models
ACL 2025
Iterative Sparse Attention for Long-sequence Recommendation
AAAI 2025
MIA-Tuner: Adapting Large Language Models as Pre-training Text Detector
AAAI 2025
Defining and Evaluating Visual Language Models' Basic Spatial Abilities: A Perspective from Psychometrics
ACL 2025
CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory
ACL 2025
UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces
ACL 2025
Textured Gaussians for Enhanced 3D Scene Appearance Modeling
CVPR 2025
CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space
EMNLP 2025
PsychoAgent: Psychology-driven LLM Agents for Explainable Panic Prediction on Social Media during Sudden Disaster Events
EMNLP 2025
Analyzing and Modeling LLM Response Lengths with Extreme Value Theory: Anchoring Effects and Hybrid Distributions
EMNLP 2025
Depression Detection on Social Media with Large Language Models
EMNLP 2025
Exploring View Consistency for Scene-Adaptive Low-Light Light Field Image Enhancement
ICCV 2025
Epipolar Consistent Attention Aggregation Network for Unsupervised Light Field Disparity Estimation
ICCV 2025
How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM
IJCAI 2025
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection
ECCV 2024
SpecNeRF: Gaussian Directional Encoding for Specular Reflections
CVPR 2024
Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration
NIPS 2024
EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities
ACL 2024
OmnimatteRF: Robust Omnimatte with 3D Background Modeling
ICCV 2023
Adaptive Zone-Aware Hierarchical Planner for Vision-Language Navigation
CVPR 2023
Progressively Optimized Local Radiance Fields for Robust View Synthesis
CVPR 2023
Robust Dynamic Radiance Fields
CVPR 2023
3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection
CVPR 2022
Reinforced Structured State-Evolution for Vision-Language Navigation
CVPR 2022
Dynamic View Synthesis From Dynamic Monocular Video
ICCV 2021
Mining the Benefits of Two-stage and One-stage HOI Detection
NIPS 2021
Learnable Embedding Sizes for Recommender Systems
ICLR 2021
Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression
CVPR 2021
Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism
ICCV 2021
Progressive Feature Interaction Search for Deep Sparse Network
NIPS 2021
NAS-DIP: Learning Deep Image Prior with Neural Architecture Search
ECCV 2020
Flow-edge Guided Video Completion
ECCV 2020
PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer
CVPR 2020
DRG: Dual Relation Graph for Human-Object Interaction Detection
ECCV 2020
AdversarialNAS: Adversarial Neural Architecture Search for GANs
CVPR 2020
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition
NIPS 2019
DeepAPF: Deep Attentive Probabilistic Factorization for Multi-site Video Recommendation
IJCAI 2019