Xin Yu
98 papers · 2017–2026 · 14 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (17) π Interdisciplinary Bridge π Conference Polyglot (13)
π£
Hot Topic Early Bird
π
Interdisciplinary Bridge
π
Conference Polyglot
(13)
π
Conference Loyalist
(27)
π₯
Mega-Team
(21)
π
Grand Slam
π¬
Deep Specialist
(12)
π€
Dynamic Duo
(17)
π
Keyword Champion
ποΈ
Keyword Collector
(419)
β
The Questioner
(2)
β‘
Prolific Year
(18)
π
Conference Pioneer
π
Trend Setter
π
Century Club
(96)
π₯
Unstoppable
(9)
Conferences
CVPR (27)
ICCV (11)
NIPS (11)
WACV (10)
AAAI (9)
ICLR (9)
ECCV (8)
IJCAI (5)
EMNLP (2)
ICML (2)
ACL (1)
AISTATS (1)
COLING (1)
MIDL (1)
Top co-authors
Keywords
multi-modal learning
(6)
domain adaptation
(5)
3d reconstruction
(5)
multimodal learning
(4)
reinforcement learning
(4)
human pose estimation
(4)
action recognition
(4)
diffusion model
(4)
self-supervised learning
(4)
one-shot learning
(4)
sign language recognition
(4)
semantic segmentation
(4)
image restoration
(4)
video understanding
(4)
neural network pruning
(3)
visual navigation
(3)
model compression
(3)
depth estimation
(3)
video generation
(3)
metric learning
(3)
Papers
Mnemis: Dual-Route Retrieval on Hierarchical Graphs for Long-Term LLM Memory
ACL 2026
Decoupling Understanding from Reasoning via Problem Space Mapping for Small-Scale Model Reasoning
AAAI 2026
TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm
WACV 2025
FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding
WACV 2025
NL2Lean: Translating Natural Language into Lean 4 through Multi-Aspect Reinforcement Learning
EMNLP 2025
ObjectMover: Generative Object Movement with Video Prior
CVPR 2025
Multimodal Retina Image Analysis Survey: Datasets, Tasks and Methods
IJCAI 2025
Zero-Shot Machine Unlearning with Proxy Adversarial Data Generation
IJCAI 2025
NeuFrameQ: Neural Frame Fields for Scalable and Generalizable Anisotropic Quadrangulation
ICCV 2025
3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views
ICCV 2025
LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild
ICCV 2025
M3GYM: A Large-Scale Multimodal Multi-view Multi-person Pose Dataset for Fitness Activity Understanding in Real-world Settings
CVPR 2025
Understanding the Statistical Accuracy-Communication Trade-off in Personalized Federated Learning with Minimax Guarantees
ICML 2025
EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting
CVPR 2025
Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment
CVPR 2025
Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics
CVPR 2025
Blind Bitstream-corrupted Video Recovery via Metadata-guided Diffusion Model
CVPR 2025
Cross-View Isolated Sign Language Recognition via View Synthesis and Feature Disentanglement
ICCV 2025
RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation
COLING 2025
Functional Bayesian Tucker Decomposition for Continuous-indexed Tensor Data
ICLR 2024
Image Inpainting via Iteratively Decoupled Probabilistic Modeling
ICLR 2024
Text-to-3D with Classifier Score Distillation
ICLR 2024
Byzantine Robust Cooperative Multi-Agent Reinforcement Learning as a Bayesian Game
ICLR 2024
Leveraging Partial Symmetry for Multi-Agent Reinforcement Learning
AAAI 2024
Multi-Resolution Active Learning of Fourier Neural Operators
AISTATS 2024
When 3D Bounding-Box Meets SAM: Point Cloud Instance Segmentation With Weak-and-Noisy Supervision
WACV 2024
Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos
CVPR 2024
EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Priors
CVPR 2024
Text-Guided 3D Face Synthesis - From Generation to Editing
CVPR 2024
Machine Unlearning via Null Space Calibration
IJCAI 2024
UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation
ECCV 2024
CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance
ECCV 2024
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection
ECCV 2024
An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models
EMNLP 2024
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection
NIPS 2024
MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset
NIPS 2024
TPR: Topology-Preserving Reservoirs for Generalized Zero-Shot Learning
NIPS 2024
Diverse 3D Hand Gesture Prediction From Body Dynamics by Bilateral Hand Disentanglement
CVPR 2023
Hybrid Neural Rendering for Large-Scale Scenes With Motion Blur
CVPR 2023
DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition
ICCV 2023
Weakly-Supervised Point Cloud Instance Segmentation With Geometric Priors
WACV 2023
TI2Net: Temporal Identity Inconsistency Network for Deepfake Detection
WACV 2023
Alleviating tiling effect by random walk sliding window in high-resolution histological whole slide image synthesis
MIDL 2023
Proactive Deepfake Defence via Identity Watermarking
WACV 2023
RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation
NIPS 2023
Streaming Factor Trajectory Learning for Temporal Tensor Decomposition
NIPS 2023
Auslan-Daily: Australian Sign Language Translation for Daily Communication and News
NIPS 2023
NeFII: Inverse Rendering for Reflectance Decomposition With Near-Field Indirect Illumination
CVPR 2023
Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation States
CVPR 2023
Meta Knowledge Condensation for Federated Learning
ICLR 2023
Exploring Active 3D Object Detection from a Generalization Perspective
ICLR 2023
Sim2RealVS: A New Benchmark for Video Stabilization With a Strong Baseline
WACV 2023
FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping
AAAI 2023
StyleTalk: One-Shot Talking Head Generation with Controllable Speaking Styles
AAAI 2023
IS SYNTHETIC DATA FROM GENERATIVE MODELS READY FOR IMAGE RECOGNITION?
ICLR 2023
Texture Generation on 3D Meshes with Point-UV Diffusion
ICCV 2023
Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields
IJCAI 2022
Recall Distortion in Neural Network Pruning and the Undecayed Pruning Algorithm
NIPS 2022
MHR-Net: Multiple-Hypothesis Reconstruction of Non-rigid Shapes from 2D Views
ECCV 2022
Towards Efficient and Scale-Robust Ultra-High-Definition Image DemoirΓ©ing
ECCV 2022
Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation
ECCV 2022
Video Demoireing With Relation-Based Temporal Consistency
CVPR 2022
Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion
AAAI 2022
One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning
AAAI 2022
The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks
ICML 2022
Batch Multi-Fidelity Active Learning with Budget Constraints
NIPS 2022
RGB-D Saliency Detection via Cascaded Mutual Information Minimization
ICCV 2021
Scaling Up Exact Neural Network Compression by ReLU Stability
NIPS 2021
Modeling the Probabilistic Distribution of Unlabeled Data for One-shot Medical Image Segmentation
AAAI 2021
Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation
AAAI 2021
ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring
CVPR 2021
Self-Supervised Visibility Learning for Novel View Synthesis
CVPR 2021
DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency
CVPR 2021
Removing Raindrops and Rain Streaks in One Go
CVPR 2021
PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-Rigid Structure-From-Motion
ICCV 2021
Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar
ICCV 2021
Gait Recognition via Effective Global-Local Feature Representation and Local Temporal Aggregation
ICCV 2021
RFNet: Region-Aware Fusion Network for Incomplete Multi-Modal Brain Tumor Segmentation
ICCV 2021
VTNet: Visual Transformer Network for Object Goal Navigation
ICLR 2021
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences
ICLR 2021
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion
IJCAI 2021
Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation
WACV 2021
The IKEA ASM Dataset: Understanding People Assembling Furniture Through Actions, Objects and Pose
WACV 2021
Learning Object Relation Graph and Tentative Policy for Visual Navigation
ECCV 2020
Weakly-Supervised Salient Object Detection via Scribble Annotations
CVPR 2020
Copy and Paste GAN: Face Hallucination From Shaded Thumbnails
CVPR 2020
Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison
WACV 2020
Optimal Feature Transport for Cross-View Image Geo-Localization
AAAI 2020
TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation
NIPS 2020
Transferring Cross-Domain Knowledge for Video Sign Language Recognition
CVPR 2020
Where Am I Looking At? Joint Location and Orientation Estimation by Cross-View Matching
CVPR 2020
SOSNet: Second Order Similarity Regularization for Local Descriptor Learning
CVPR 2019
Spatial-Aware Feature Aggregation for Image based Cross-View Geo-Localization
NIPS 2019
Bringing a Blurry Frame Alive at High Frame-Rate With an Event Camera
CVPR 2019
Super-Resolving Very Low-Resolution Face Images With Supplementary Attributes
CVPR 2018
Learning Strict Identity Mappings in Deep Residual Networks
CVPR 2018
Face Super-resolution Guided by Facial Component Heatmaps
ECCV 2018
Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders
CVPR 2017