Xingyu Chen
48 papers · 2019–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Interdisciplinary Bridge π Renaissance Researcher (10) π Academic Marathon (6) π Conference Polyglot (13) πΊοΈ Taxonomy Completionist (86)
π
Academic Marathon
(6)
π§
Keyword Pioneer
π
Renaissance Researcher
(10)
π¬
Deep Specialist
(10)
π
Grand Slam
π
Keyword Champion
(2)
β‘
Prolific Year
(6)
π₯
Unstoppable
(7)
ποΈ
Keyword Collector
(233)
π
Century Club
(46)
β
The Questioner
Conferences
CVPR (15)
EMNLP (8)
ICCV (4)
ICML (4)
AAAI (3)
ECCV (3)
NIPS (3)
ACL (2)
WACV (2)
ICLR (1)
IJCAI (1)
NAACL (1)
NSDI (1)
Top co-authors
Research topics
Keywords
3d reconstruction
(6)
large language model
(4)
generative model
(4)
neural radiance field
(4)
novel view synthesis
(3)
self-supervised learning
(3)
camera pose estimation
(3)
multi-agent reinforcement learning
(3)
image generation
(3)
neural rendering
(3)
pose estimation
(3)
few-shot learning
(2)
mesh generation
(2)
adversarial learning
(2)
metric learning
(2)
3d generation
(2)
point cloud
(2)
monocular depth estimation
(2)
object detection
(2)
machine translation
(2)
Papers
SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features
AAAI 2026
BatonVoice: An Operationalist Framework for Enhancing Controllable Speech Synthesis with Linguistic Intelligence from LLMs
ACL 2026
State Revisit and Re-explore: Bridging Sim-to-Real Gaps in Offline-and-Online Reinforcement Learning with An Imperfect Simulator
IJCAI 2025
Feat2GS: Probing Visual Foundation Models with Gaussian Splatting
CVPR 2025
HOIGPT: Learning Long-Sequence Hand-Object Interaction with Language Models
CVPR 2025
HandOS: 3D Hand Reconstruction in One Stage
CVPR 2025
Radio Frequency Ray Tracing with Neural Object Representation for Enhanced RF Modeling
CVPR 2025
Draft Model Knows When to Stop: Self-Verification Speculative Decoding for Long-Form Generation
EMNLP 2025
Alignment for Efficient Tool Calling of Large Language Models
EMNLP 2025
CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards
EMNLP 2025
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training
ICCV 2025
RaSA: Rank-Sharing Low-Rank Adaptation
ICLR 2025
Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models
ICML 2025
ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization
CVPR 2024
HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images
CVPR 2024
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
NIPS 2024
Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data
CVPR 2024
Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning
AAAI 2024
Cell2Sentence: Teaching Large Language Models the Language of Biology
ICML 2024
UV Volumes for Real-Time Rendering of Editable Free-View Human Performance
CVPR 2023
Object Reprojection Error (ORE): Camera pose benchmarks from lightweight tracking annotations
NIPS 2023
CAPro: Webly Supervised Learning with Cross-modality Aligned Prototypes
NIPS 2023
Rethinking Word-Level Auto-Completion in Computer-Aided Translation
EMNLP 2023
SJTU-MTLABβs Submission to the WMT23 Word-Level Auto Completion Task
EMNLP 2023
Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem
WACV 2023
Reinforced Disentanglement for Face Swapping without Skip Connection
ICCV 2023
Self-Supervised Object Detection from Egocentric Videos
ICCV 2023
Mimic3D: Thriving 3D-Aware GANs via 3D-to-2D Imitation
ICCV 2023
Frequency-Aware Self-Supervised Monocular Depth Estimation
WACV 2023
FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning
AAAI 2023
Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields
CVPR 2023
Hand Avatar: Free-Pose Hand Animation and Rendering From Monocular Video
CVPR 2023
META-GUI: Towards Multi-modal Conversational Agents on Mobile GUI
EMNLP 2022
The AISP-SJTU Translation System for WMT 2022
EMNLP 2022
MobRecon: Mobile-Friendly Hand Mesh Reconstruction From Monocular Image
CVPR 2022
Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning
ICML 2022
The AISP-SJTU Simultaneous Translation System for IWSLT 2022
ACL 2022
TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages
NAACL 2022
UC-OWOD: Unknown-Classified Open World Object Detection
ECCV 2022
ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine Refinement
ECCV 2022
Hallucinated Neural Radiance Fields in the Wild
CVPR 2022
WebSRC: A Dataset for Web-Based Structural Reading Comprehension
EMNLP 2021
img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation
CVPR 2021
SDD-FIQA: Unsupervised Face Image Quality Assessment With Similarity Distribution Distance
CVPR 2021
Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration
CVPR 2021
Eingerprint: Robust Energy-related Fingerprinting for Passive RFID Tags
NSDI 2020
A Boundary Based Out-of-Distribution Classifier for Generalized Zero-Shot Learning
ECCV 2020
Proportionally Fair Clustering
ICML 2019