Xingyu Chen

48 papers · 2019–2026 · 13 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (10) 🏃 Academic Marathon (6) 🌍 Conference Polyglot (13) 🗺️ Taxonomy Completionist (86)

🏃 Academic Marathon (6) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (10) 🔬 Deep Specialist (10) 🏆 Grand Slam 🏆 Keyword Champion (2) ⚡ Prolific Year (6) 🔥 Unstoppable (7) 🗃️ Keyword Collector (233) 💎 Century Club (46) ❓ The Questioner

Conferences

CVPR (15) EMNLP (8) ICCV (4) ICML (4) AAAI (3) ECCV (3) NIPS (3) ACL (2) WACV (2) ICLR (1) IJCAI (1) NAACL (1) NSDI (1)

Top co-authors

Rui Wang (7) Kai Yu (6) Zhaopeng Tu (5) Yue Chen (5) Xuguang Lan (5) Baoyuan Wang (5) Matt Feiszli (4) Lipeng Wan (4) Zeyang Liu (4) Lu Chen (4)

Research topics

Optimization (1)

Keywords

3d reconstruction (6) large language model (4) generative model (4) neural radiance field (4) novel view synthesis (3) self-supervised learning (3) camera pose estimation (3) multi-agent reinforcement learning (3) image generation (3) neural rendering (3) pose estimation (3) few-shot learning (2) mesh generation (2) adversarial learning (2) metric learning (2) 3d generation (2) point cloud (2) monocular depth estimation (2) object detection (2) machine translation (2)

Papers

SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features AAAI 2026 BatonVoice: An Operationalist Framework for Enhancing Controllable Speech Synthesis with Linguistic Intelligence from LLMs ACL 2026 State Revisit and Re-explore: Bridging Sim-to-Real Gaps in Offline-and-Online Reinforcement Learning with An Imperfect Simulator IJCAI 2025 Feat2GS: Probing Visual Foundation Models with Gaussian Splatting CVPR 2025 HOIGPT: Learning Long-Sequence Hand-Object Interaction with Language Models CVPR 2025 HandOS: 3D Hand Reconstruction in One Stage CVPR 2025 Radio Frequency Ray Tracing with Neural Object Representation for Enhanced RF Modeling CVPR 2025 Draft Model Knows When to Stop: Self-Verification Speculative Decoding for Long-Form Generation EMNLP 2025 Alignment for Efficient Tool Calling of Large Language Models EMNLP 2025 CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards EMNLP 2025 Easi3R: Estimating Disentangled Motion from DUSt3R Without Training ICCV 2025 RaSA: Rank-Sharing Low-Rank Adaptation ICLR 2025 Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models ICML 2025 ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization CVPR 2024 HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images CVPR 2024 Grounded Answers for Multi-agent Decision-making Problem through Generative World Model NIPS 2024 Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data CVPR 2024 Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning AAAI 2024 Cell2Sentence: Teaching Large Language Models the Language of Biology ICML 2024 UV Volumes for Real-Time Rendering of Editable Free-View Human Performance CVPR 2023 Object Reprojection Error (ORE): Camera pose benchmarks from lightweight tracking annotations NIPS 2023 CAPro: Webly Supervised Learning with Cross-modality Aligned Prototypes NIPS 2023 Rethinking Word-Level Auto-Completion in Computer-Aided Translation EMNLP 2023 SJTU-MTLAB’s Submission to the WMT23 Word-Level Auto Completion Task EMNLP 2023 Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem WACV 2023 Reinforced Disentanglement for Face Swapping without Skip Connection ICCV 2023 Self-Supervised Object Detection from Egocentric Videos ICCV 2023 Mimic3D: Thriving 3D-Aware GANs via 3D-to-2D Imitation ICCV 2023 Frequency-Aware Self-Supervised Monocular Depth Estimation WACV 2023 FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning AAAI 2023 Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields CVPR 2023 Hand Avatar: Free-Pose Hand Animation and Rendering From Monocular Video CVPR 2023 META-GUI: Towards Multi-modal Conversational Agents on Mobile GUI EMNLP 2022 The AISP-SJTU Translation System for WMT 2022 EMNLP 2022 MobRecon: Mobile-Friendly Hand Mesh Reconstruction From Monocular Image CVPR 2022 Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning ICML 2022 The AISP-SJTU Simultaneous Translation System for IWSLT 2022 ACL 2022 TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages NAACL 2022 UC-OWOD: Unknown-Classified Open World Object Detection ECCV 2022 ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine Refinement ECCV 2022 Hallucinated Neural Radiance Fields in the Wild CVPR 2022 WebSRC: A Dataset for Web-Based Structural Reading Comprehension EMNLP 2021 img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation CVPR 2021 SDD-FIQA: Unsupervised Face Image Quality Assessment With Similarity Distribution Distance CVPR 2021 Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration CVPR 2021 Eingerprint: Robust Energy-related Fingerprinting for Passive RFID Tags NSDI 2020 A Boundary Based Out-of-Distribution Classifier for Generalized Zero-Shot Learning ECCV 2020 Proportionally Fair Clustering ICML 2019