Yi Xu
135 papers · 2016–2026 · 18 top CS/AI conferences
Achievements
Taxonomy Completionist (17)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (6)
Hot Topic Early Bird
Conference Polyglot (18)
Conference Loyalist (29)
Deep Specialist (17)
Mega-Team (22)
Dynamic Duo (15)
Grand Slam
Triple Crown
Keyword Champion
Keyword Collector (555)
The Questioner (4)
Prolific Year (23)
Conference Pioneer
Century Club (132)
Unstoppable (11)
Conferences
CVPR (29)
NIPS (18)
AAAI (14)
INTERSPEECH (13)
ICML (12)
ECCV (9)
EMNLP (8)
ACL (6)
ICCV (5)
WACV (4)
NAACL (4)
IJCAI (4)
ICLR (4)
IJCNLP (1)
CORL (1)
COLING (1)
UAI (1)
AISTATS (1)
Keywords
contrastive learning (12)
non-convex optimization (10)
representation learning (8)
stochastic optimization (7)
graph neural network (7)
speech synthesis (6)
large language model (5)
stochastic gradient descent (5)
semi-supervised learning (5)
knowledge distillation (5)
self-supervised learning (5)
data augmentation (4)
multimodal learning (4)
depth estimation (4)
few-shot learning (4)
domain adaptation (4)
trajectory prediction (4)
transfer learning (4)
multi-task learning (4)
articulatory synthesis (4)
Papers
Cost-Sensitive Conformal Training with Provably Controllable Learning Bounds
AAAI 2026
Distillation-Guided Structural Transfer for Continual Learning Beyond Sparse Distributed Memory
AAAI 2026
Score-Based Model for Low-Rank Tensor Recovery
AAAI 2026
Towards Photorealistic Style Transfer with Multimodal Guidance and Robustness to Content Images in Arbitrary Styles
WACV 2026
SpikingYOLOX: Improved YOLOX Object Detection with Fast Fourier Convolution and Spiking Neural Networks
AAAI 2025
BIG-FUSION: Brain-Inspired Global-Local Context Fusion Framework for Multimodal Emotion Recognition in Conversations
AAAI 2025
FaStFact: Faster, Stronger Long-Form Factuality Evaluations in LLMs
EMNLP 2025
Predicting Spectral Information for Self-Supervised Signal Classification
IJCAI 2025
Representation Potentials of Foundation Models for Multimodal Alignment: A Survey
EMNLP 2025
VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision
CORL 2025
MAPS: Motivation-Aware Personalized Search via LLM-Driven Consultation Alignment
ACL 2025
AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs
ACL 2025
Similarity = Value? Consultation Value-Assessment and Alignment for Personalized Search
EMNLP 2025
ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model
EMNLP 2025
Sports-Traj: A Unified Trajectory Generation Model for Multi-Agent Movement in Sports
ICLR 2025
ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction
ICCV 2025
ActiveGAMER: Active GAussian Mapping through Efficient Rendering
CVPR 2025
Investigating Non-Transitivity in LLM-as-a-Judge
ICML 2025
Efficient ANN-SNN Conversion with Error Compensation Learning
ICML 2025
Better Representations via Adversarial Training in Pre-Training: A Theoretical Perspective
AISTATS 2024
SynPrompt: Syntax-aware Enhanced Prompt Engineering for Aspect-based Sentiment Analysis
COLING 2024
Evolutionary Contrastive Distillation for Language Model Alignment
EMNLP 2024
RepEval: Effective Text Evaluation with LLM Representation
EMNLP 2024
Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation
ECCV 2024
ParCo: Part-Coordinating Text-to-Motion Synthesis
ECCV 2024
PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance
ECCV 2024
Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis
CVPR 2024
Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation
CVPR 2024
NARUTO: Neural Active Reconstruction from Uncertain Target Observations
CVPR 2024
Adapting to Length Shift: FlexiLength Network for Trajectory Prediction
CVPR 2024
Dual-Consistency Model Inversion for Non-Exemplar Class Incremental Learning
CVPR 2024
OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising
CVPR 2024
Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models
NIPS 2024
Facilitating Multimodal Classification via Dynamically Learning Modality Gap
NIPS 2024
HuRef: HUman-REadable Fingerprint for Large Language Models
NIPS 2024
The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks
ICML 2024
Robust Multi-Task Learning with Excess Risks
ICML 2024
Diffusion Models for Multi-Task Generative Modeling
ICLR 2024
Show Your Face: Restoring Complete Facial Images From Partial Observations for VR Meeting
WACV 2024
Is Reference Necessary in the Evaluation of NLG Systems? When and Where?
NAACL 2024
Multi-Region Text-Driven Manipulation of Diffusion Imagery
AAAI 2024
Diverse and Stable 2D Diffusion Guided Text to 3D Generation with Noise Recalibration
AAAI 2024
FBLG: A Local Graph Based Approach for Handling Dual Skewed Non-IID Data in Federated Learning
IJCAI 2024
Exploring and Verbalizing Academic Ideas by Concept Co-occurrence
ACL 2023
Not All Out-of-Distribution Data Are Harmful to Open-Set Active Learning
NIPS 2023
Latent Graph Inference with Limited Supervision
NIPS 2023
OpenIllumination: A Multi-Illumination Dataset for Inverse Rendering Evaluation on Real Objects
NIPS 2023
Supported Value Regularization for Offline Reinforcement Learning
NIPS 2023
Temporal Knowledge Graph Reasoning with Historical Contrastive Learning
AAAI 2023
Mining and Applying Composition Knowledge of Dance Moves for Style-Concentrated Dance Generation
AAAI 2023
Unsupervised Graph-Text Mutual Conversion with a Unified Pretrained Language Model
ACL 2023
ReAugKD: Retrieval-Augmented Knowledge Distillation For Pre-trained Language Models
ACL 2023
Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning
CVPR 2023
Uncovering the Missing Pattern: Unified Framework Towards Trajectory Imputation and Prediction
CVPR 2023
RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo
CVPR 2023
High Fidelity 3D Hand Shape Reconstruction via Scalable Graph Frequency Decomposition
CVPR 2023
TWINS: A Fine-Tuning Framework for Improved Transferability of Adversarial Robustness and Generalization
CVPR 2023
AdamsFormer for Spatial Action Localization in the Future
CVPR 2023
Interventional Bag Multi-Instance Learning on Whole-Slide Pathological Images
CVPR 2023
3D-Aware Facial Landmark Detection via Multi-View Consistent Training on Synthetic Data
CVPR 2023
NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions
ICCV 2023
Uncertainty-aware State Space Transformer for Egocentric 3D Hand Trajectory Forecasting
ICCV 2023
In-sample Actor Critic for Offline Reinforcement Learning
ICLR 2023
Supported Trust Region Optimization for Offline Reinforcement Learning
ICML 2023
Self-Supervised Solution to the Control Problem of Articulatory Synthesis
INTERSPEECH 2023
Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and Depth
WACV 2023
Look More but Care Less in Video Recognition
NIPS 2022
GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping
ECCV 2022
SepLUT: Separable Image-Adaptive Lookup Tables for Real-Time Image Enhancement
ECCV 2022
MemREIN: Rein the Domain Shift for Cross-Domain Few-Shot Learning
IJCAI 2022
Improved Fine-Tuning by Better Leveraging Pre-Training Data
NIPS 2022
Why do We Need Large Batchsizes in Contrastive Learning? A Gradient-Bias Perspective
NIPS 2022
Posterior Refinement on Metric Matrix Improves Generalization Bound in Metric Learning
ECCV 2022
HCSC: Hierarchical Contrastive Selective Coding
CVPR 2022
CHEX: CHannel EXploration for CNN Model Compression
CVPR 2022
Vision-Language Pre-Training With Triple Contrastive Learning
CVPR 2022
Learning From Untrimmed Videos: Self-Supervised Video Representation Learning With Hierarchical Consistency
CVPR 2022
DynaMaR: Dynamic Prompt with Mask Token Representation
EMNLP 2022
PlaneMVS: 3D Plane Reconstruction From Multi-View Stereo
CVPR 2022
PoP-Net: Pose Over Parts Network for Multi-Person 3D Pose Estimation From a Depth Image
WACV 2022
AdaInt: Learning Adaptive Intervals for 3D Lookup Tables on Real-Time Image Enhancement
CVPR 2022
Asynchronous Convergence in Multi-Task Learning via Knowledge Distillation from Converged Tasks
NAACL 2022
EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification
NAACL 2022
Multi-Modal Alignment Using Representation Codebook
CVPR 2022
Evoc-Learn – High quality simulation of early vocal learning
INTERSPEECH 2022
Adaptive Trajectory Prediction via Transferable GNN
CVPR 2022
Articulatory Synthesis for Data Augmentation in Phoneme Recognition
INTERSPEECH 2022
Exploration strategies for articulatory synthesis of complex syllable onsets
INTERSPEECH 2022
Interventional Multi-Instance Learning with Deconfounded Instance-Level Prediction
AAAI 2022
Effective Model Sparsification by Scheduled Grow-and-Prune Methods
ICLR 2022
An Online Method for A Class of Distributionally Robust Optimization with Non-convex Objectives
NIPS 2021
Weakly-supervised Text Classification Based on Keyword Graph
EMNLP 2021
Dialogue-oriented Pre-training
ACL 2021
MonoIndoor: Towards Good Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments
ICCV 2021
Topic-Aware Multi-turn Dialogue Modeling
AAAI 2021
GIF Thumbnails: Attract More Clicks to Your Videos
AAAI 2021
Dash: Semi-Supervised Learning with Dynamic Thresholding
ICML 2021
Federated Deep AUC Maximization for Hetergeneous Data with a Constant Communication Complexity
ICML 2021
DP-SSL: Towards Robust Semi-supervised Learning with A Few Labeled Samples
NIPS 2021
Dialogue-oriented Pre-training
IJCNLP 2021
Model-Based Exploration of Linking Between Vowel Articulatory Space and Acoustic Space
INTERSPEECH 2021
Segmental Alignment of English Syllables with Singleton and Cluster Onsets
INTERSPEECH 2021
Semantic Aligned Multi-modal Transformer for Vision-Language Understanding: A Preliminary Study on Visual QA
NAACL 2021
Network as Regularization for Training Deep Neural Networks: Framework, Model and Performance
AAAI 2020
Quaternion Product Units for Deep Learning on 3D Rotation Groups
CVPR 2020
Adaptive Fractional Dilated Convolution Network for Image Aesthetics Assessment
CVPR 2020
CAM: Uninteresting Speech Detector
INTERSPEECH 2020
Coarticulation as Synchronised Sequential Target Approximation: An EMA Study
INTERSPEECH 2020
An Investigation of the Target Approximation Model for Tone Modeling and Recognition in Continuous Mandarin Speech
INTERSPEECH 2020
Finding Intelligible Consonant-Vowel Sounds Using High-Quality Articulatory Synthesis
INTERSPEECH 2020
Talking-head Generation with Rhythmic Head Motion
ECCV 2020
Optimal Epoch Stochastic Gradient Descent Ascent Methods for Min-Max Optimization
NIPS 2020
Stochastic Optimization for Non-convex Inf-Projection Problems
ICML 2020
CF-LSTM: Cascaded Feature-Based Long Short-Term Networks for Predicting Pedestrian Trajectory
AAAI 2020
Stochastic Optimization for DC Functions and Non-smooth Non-convex Regularizers with Non-asymptotic Convergence
ICML 2019
Katalyst: Boosting Convex Katayusha for Non-Convex Problems with a Large Condition Number
ICML 2019
Versatile Multiple Choice Learning and Its Application to Vision Computing
CVPR 2019
Non-Local ConvLSTM for Video Compression Artifact Reduction
ICCV 2019
On the Convergence of (Stochastic) Gradient Descent with Extrapolation for Non-Convex Minimization
IJCAI 2019
Non-asymptotic Analysis of Stochastic Methods for Non-Smooth Non-Convex Regularized Problems
NIPS 2019
Learning with Non-Convex Truncated Losses by SGD
UAI 2019
A Weighted Superposition of Functional Contours Model for Modelling Contextual Prominence of Elementary Prosodic Contours
INTERSPEECH 2018
SADAGRAD: Strongly Adaptive Stochastic Gradient Methods
ICML 2018
Crowd Counting via Adversarial Cross-Scale Consistency Pursuit
CVPR 2018
Scale-Transferrable Object Detection
CVPR 2018
Geometric Constrained Joint Lane Segmentation and Lane Boundary Detection
ECCV 2018
Quaternion Convolutional Neural Networks
ECCV 2018
First-order Stochastic Algorithms for Escaping From Saddle Points in Almost Linear Time
NIPS 2018
Adaptive SVRG Methods under Error Bound Conditions with Unknown Growth Parameter
NIPS 2017
ADMM without a Fixed Penalty Parameter: Faster Convergence with New Adaptive Penalization
NIPS 2017
Does Posh English Sound Attractive?
INTERSPEECH 2017
Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence
ICML 2017
Video Segmentation via Multiple Granularity Analysis
CVPR 2017
Homotopy Smoothing for Non-Smooth Problems with Lower Complexity than $O(1/\epsilon)$
NIPS 2016
Model-Based Parametric Prosody Synthesis with Deep Neural Network
INTERSPEECH 2016