Xin Yu

98 papers · 2017–2026 · 14 conferences · across top CS/AI conferences

Achievements


🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (17) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (13) 🏠 Conference Loyalist (27) 👥 Mega-Team (21) 🏆 Grand Slam 🔬 Deep Specialist (12) 🤝 Dynamic Duo (17) 🏆 Keyword Champion 🗃️ Keyword Collector (419) ❓ The Questioner (2) ⚡ Prolific Year (18) 🚀 Conference Pioneer 📈 Trend Setter 💎 Century Club (96) 🔥 Unstoppable (9)

Conferences

CVPR (27) ICCV (11) NIPS (11) WACV (10) AAAI (9) ICLR (9) ECCV (8) IJCAI (5) EMNLP (2) ICML (2) ACL (1) AISTATS (1) COLING (1) MIDL (1)

Top co-authors

Lincheng Li (17) Heming Du (14) Yi Yang (11) Hongdong Li (9) Changjie Fan (8) Xiaojuan Qi (8) Chen Liu (6) Shandian Zhe (6) Hongwei Sheng (6) Xin Shen (6)

Keywords

multi-modal learning (6) domain adaptation (5) 3d reconstruction (5) multimodal learning (4) reinforcement learning (4) human pose estimation (4) action recognition (4) diffusion model (4) self-supervised learning (4) one-shot learning (4) sign language recognition (4) semantic segmentation (4) image restoration (4) video understanding (4) neural network pruning (3) visual navigation (3) model compression (3) depth estimation (3) video generation (3) metric learning (3)

Papers

Mnemis: Dual-Route Retrieval on Hierarchical Graphs for Long-Term LLM Memory ACL 2026
Decoupling Understanding from Reasoning via Problem Space Mapping for Small-Scale Model Reasoning AAAI 2026
TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm WACV 2025
FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding WACV 2025
NL2Lean: Translating Natural Language into Lean 4 through Multi-Aspect Reinforcement Learning EMNLP 2025
ObjectMover: Generative Object Movement with Video Prior CVPR 2025
Multimodal Retina Image Analysis Survey: Datasets, Tasks and Methods IJCAI 2025
Zero-Shot Machine Unlearning with Proxy Adversarial Data Generation IJCAI 2025
NeuFrameQ: Neural Frame Fields for Scalable and Generalizable Anisotropic Quadrangulation ICCV 2025
3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views ICCV 2025
LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild ICCV 2025
M3GYM: A Large-Scale Multimodal Multi-view Multi-person Pose Dataset for Fitness Activity Understanding in Real-world Settings CVPR 2025
Understanding the Statistical Accuracy-Communication Trade-off in Personalized Federated Learning with Minimax Guarantees ICML 2025
EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting CVPR 2025
Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment CVPR 2025
Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics CVPR 2025
Blind Bitstream-corrupted Video Recovery via Metadata-guided Diffusion Model CVPR 2025
Cross-View Isolated Sign Language Recognition via View Synthesis and Feature Disentanglement ICCV 2025
RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation COLING 2025
Functional Bayesian Tucker Decomposition for Continuous-indexed Tensor Data ICLR 2024
Image Inpainting via Iteratively Decoupled Probabilistic Modeling ICLR 2024
Text-to-3D with Classifier Score Distillation ICLR 2024
Byzantine Robust Cooperative Multi-Agent Reinforcement Learning as a Bayesian Game ICLR 2024
Leveraging Partial Symmetry for Multi-Agent Reinforcement Learning AAAI 2024
Multi-Resolution Active Learning of Fourier Neural Operators AISTATS 2024
When 3D Bounding-Box Meets SAM: Point Cloud Instance Segmentation With Weak-and-Noisy Supervision WACV 2024
Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos CVPR 2024
EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Priors CVPR 2024
Text-Guided 3D Face Synthesis - From Generation to Editing CVPR 2024
Machine Unlearning via Null Space Calibration IJCAI 2024
UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation ECCV 2024
CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance ECCV 2024
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection ECCV 2024
An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models EMNLP 2024
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection NIPS 2024
MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset NIPS 2024
TPR: Topology-Preserving Reservoirs for Generalized Zero-Shot Learning NIPS 2024
Diverse 3D Hand Gesture Prediction From Body Dynamics by Bilateral Hand Disentanglement CVPR 2023
Hybrid Neural Rendering for Large-Scale Scenes With Motion Blur CVPR 2023
DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition ICCV 2023
Weakly-Supervised Point Cloud Instance Segmentation With Geometric Priors WACV 2023
TI2Net: Temporal Identity Inconsistency Network for Deepfake Detection WACV 2023
Alleviating tiling effect by random walk sliding window in high-resolution histological whole slide image synthesis MIDL 2023
Proactive Deepfake Defence via Identity Watermarking WACV 2023
RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation NIPS 2023
Streaming Factor Trajectory Learning for Temporal Tensor Decomposition NIPS 2023
Auslan-Daily: Australian Sign Language Translation for Daily Communication and News NIPS 2023
NeFII: Inverse Rendering for Reflectance Decomposition With Near-Field Indirect Illumination CVPR 2023
Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation States CVPR 2023
Meta Knowledge Condensation for Federated Learning ICLR 2023
Exploring Active 3D Object Detection from a Generalization Perspective ICLR 2023
Sim2RealVS: A New Benchmark for Video Stabilization With a Strong Baseline WACV 2023
FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping AAAI 2023
StyleTalk: One-Shot Talking Head Generation with Controllable Speaking Styles AAAI 2023
Is Synthetic Data from Generative Models Ready for Image Recognition? ICLR 2023
Texture Generation on 3D Meshes with Point-UV Diffusion ICCV 2023
Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields IJCAI 2022
Recall Distortion in Neural Network Pruning and the Undecayed Pruning Algorithm NIPS 2022
MHR-Net: Multiple-Hypothesis Reconstruction of Non-rigid Shapes from 2D Views ECCV 2022
Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoiréing ECCV 2022
Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation ECCV 2022
Video Demoireing With Relation-Based Temporal Consistency CVPR 2022
Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion AAAI 2022
One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning AAAI 2022
The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks ICML 2022
Batch Multi-Fidelity Active Learning with Budget Constraints NIPS 2022
RGB-D Saliency Detection via Cascaded Mutual Information Minimization ICCV 2021
Scaling Up Exact Neural Network Compression by ReLU Stability NIPS 2021
Modeling the Probabilistic Distribution of Unlabeled Data for One-shot Medical Image Segmentation AAAI 2021
Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation AAAI 2021
ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring CVPR 2021
Self-Supervised Visibility Learning for Novel View Synthesis CVPR 2021
DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency CVPR 2021
Removing Raindrops and Rain Streaks in One Go CVPR 2021
PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-Rigid Structure-From-Motion ICCV 2021
Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar ICCV 2021
Gait Recognition via Effective Global-Local Feature Representation and Local Temporal Aggregation ICCV 2021
RFNet: Region-Aware Fusion Network for Incomplete Multi-Modal Brain Tumor Segmentation ICCV 2021
VTNet: Visual Transformer Network for Object Goal Navigation ICLR 2021
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences ICLR 2021
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion IJCAI 2021
Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation WACV 2021
The IKEA ASM Dataset: Understanding People Assembling Furniture Through Actions, Objects and Pose WACV 2021
Learning Object Relation Graph and Tentative Policy for Visual Navigation ECCV 2020
Weakly-Supervised Salient Object Detection via Scribble Annotations CVPR 2020
Copy and Paste GAN: Face Hallucination From Shaded Thumbnails CVPR 2020
Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison WACV 2020
Optimal Feature Transport for Cross-View Image Geo-Localization AAAI 2020
TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation NIPS 2020
Transferring Cross-Domain Knowledge for Video Sign Language Recognition CVPR 2020
Where Am I Looking At? Joint Location and Orientation Estimation by Cross-View Matching CVPR 2020
SOSNet: Second Order Similarity Regularization for Local Descriptor Learning CVPR 2019
Spatial-Aware Feature Aggregation for Image based Cross-View Geo-Localization NIPS 2019
Bringing a Blurry Frame Alive at High Frame-Rate With an Event Camera CVPR 2019
Super-Resolving Very Low-Resolution Face Images With Supplementary Attributes CVPR 2018
Learning Strict Identity Mappings in Deep Residual Networks CVPR 2018
Face Super-resolution Guided by Facial Component Heatmaps ECCV 2018
Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders CVPR 2017