Yi Xu

135 papers · 2016–2026 · 18 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🗺️ Taxonomy Completionist (17) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🐣 Hot Topic Early Bird

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (18) 🌈 Renaissance Researcher (6) 🏠 Conference Loyalist (29) 🔬 Deep Specialist (17) 👥 Mega-Team (22) 🤝 Dynamic Duo (15) 🏆 Grand Slam 👑 Triple Crown 🏆 Keyword Champion 🗃️ Keyword Collector (555) ❓ The Questioner (4) ⚡ Prolific Year (23) 🚀 Conference Pioneer 💎 Century Club (132) 🔥 Unstoppable (11)

Conferences

CVPR (29) NIPS (18) AAAI (14) INTERSPEECH (13) ICML (12) ECCV (9) EMNLP (8) ACL (6) ICCV (5) WACV (4) NAACL (4) IJCAI (4) ICLR (4) IJCNLP (1) CORL (1) COLING (1) UAI (1) AISTATS (1)

Top co-authors

Tianbao Yang (15) Belinda Zeng (12) Rong Jin (11) Trishul Chilimbi (10) Zhong Li (10) Yun Fu (9) Xiangyang Ji (9) Peter Birkholz (8) Junsong Yuan (7) Branislav Gerazov (7)

Keywords

contrastive learning (12) non-convex optimization (10) representation learning (8) stochastic optimization (7) graph neural network (7) speech synthesis (6) large language model (5) stochastic gradient descent (5) semi-supervised learning (5) knowledge distillation (5) self-supervised learning (5) data augmentation (4) multimodal learning (4) depth estimation (4) few-shot learning (4) domain adaptation (4) trajectory prediction (4) transfer learning (4) multi-task learning (4) articulatory synthesis (4)

Papers

Cost-Sensitive Conformal Training with Provably Controllable Learning Bounds AAAI 2026 Distillation-Guided Structural Transfer for Continual Learning Beyond Sparse Distributed Memory AAAI 2026 Score-Based Model for Low-Rank Tensor Recovery AAAI 2026 Towards Photorealistic Style Transfer with Multimodal Guidance and Robustness to Content Images in Arbitrary Styles WACV 2026 SpikingYOLOX: Improved YOLOX Object Detection with Fast Fourier Convolution and Spiking Neural Networks AAAI 2025 BIG-FUSION: Brain-Inspired Global-Local Context Fusion Framework for Multimodal Emotion Recognition in Conversations AAAI 2025 FaStFact: Faster, Stronger Long-Form Factuality Evaluations in LLMs EMNLP 2025 Predicting Spectral Information for Self-Supervised Signal Classification IJCAI 2025 Representation Potentials of Foundation Models for Multimodal Alignment: A Survey EMNLP 2025 VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision CORL 2025 MAPS: Motivation-Aware Personalized Search via LLM-Driven Consultation Alignment ACL 2025 AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs ACL 2025 Similarity = Value? Consultation Value-Assessment and Alignment for Personalized Search EMNLP 2025 ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model EMNLP 2025 Sports-Traj: A Unified Trajectory Generation Model for Multi-Agent Movement in Sports ICLR 2025 ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction ICCV 2025 ActiveGAMER: Active GAussian Mapping through Efficient Rendering CVPR 2025 Investigating Non-Transitivity in LLM-as-a-Judge ICML 2025 Efficient ANN-SNN Conversion with Error Compensation Learning ICML 2025 Better Representations via Adversarial Training in Pre-Training: A Theoretical Perspective AISTATS 2024 SynPrompt: Syntax-aware Enhanced Prompt Engineering for Aspect-based Sentiment Analysis COLING 2024 Evolutionary Contrastive Distillation for Language Model Alignment EMNLP 2024 RepEval: Effective Text Evaluation with LLM Representation EMNLP 2024 Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation ECCV 2024 ParCo: Part-Coordinating Text-to-Motion Synthesis ECCV 2024 PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance ECCV 2024 Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis CVPR 2024 Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation CVPR 2024 NARUTO: Neural Active Reconstruction from Uncertain Target Observations CVPR 2024 Adapting to Length Shift: FlexiLength Network for Trajectory Prediction CVPR 2024 Dual-Consistency Model Inversion for Non-Exemplar Class Incremental Learning CVPR 2024 OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising CVPR 2024 Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models NIPS 2024 Facilitating Multimodal Classification via Dynamically Learning Modality Gap NIPS 2024 HuRef: HUman-REadable Fingerprint for Large Language Models NIPS 2024 The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks ICML 2024 Robust Multi-Task Learning with Excess Risks ICML 2024 Diffusion Models for Multi-Task Generative Modeling ICLR 2024 Show Your Face: Restoring Complete Facial Images From Partial Observations for VR Meeting WACV 2024 Is Reference Necessary in the Evaluation of NLG Systems? When and Where? NAACL 2024 Multi-Region Text-Driven Manipulation of Diffusion Imagery AAAI 2024 Diverse and Stable 2D Diffusion Guided Text to 3D Generation with Noise Recalibration AAAI 2024 FBLG: A Local Graph Based Approach for Handling Dual Skewed Non-IID Data in Federated Learning IJCAI 2024 Exploring and Verbalizing Academic Ideas by Concept Co-occurrence ACL 2023 Not All Out-of-Distribution Data Are Harmful to Open-Set Active Learning NIPS 2023 Latent Graph Inference with Limited Supervision NIPS 2023 OpenIllumination: A Multi-Illumination Dataset for Inverse Rendering Evaluation on Real Objects NIPS 2023 Supported Value Regularization for Offline Reinforcement Learning NIPS 2023 Temporal Knowledge Graph Reasoning with Historical Contrastive Learning AAAI 2023 Mining and Applying Composition Knowledge of Dance Moves for Style-Concentrated Dance Generation AAAI 2023 Unsupervised Graph-Text Mutual Conversion with a Unified Pretrained Language Model ACL 2023 ReAugKD: Retrieval-Augmented Knowledge Distillation For Pre-trained Language Models ACL 2023 Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning CVPR 2023 Uncovering the Missing Pattern: Unified Framework Towards Trajectory Imputation and Prediction CVPR 2023 RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo CVPR 2023 High Fidelity 3D Hand Shape Reconstruction via Scalable Graph Frequency Decomposition CVPR 2023 TWINS: A Fine-Tuning Framework for Improved Transferability of Adversarial Robustness and Generalization CVPR 2023 AdamsFormer for Spatial Action Localization in the Future CVPR 2023 Interventional Bag Multi-Instance Learning on Whole-Slide Pathological Images CVPR 2023 3D-Aware Facial Landmark Detection via Multi-View Consistent Training on Synthetic Data CVPR 2023 NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions ICCV 2023 Uncertainty-aware State Space Transformer for Egocentric 3D Hand Trajectory Forecasting ICCV 2023 In-sample Actor Critic for Offline Reinforcement Learning ICLR 2023 Supported Trust Region Optimization for Offline Reinforcement Learning ICML 2023 Self-Supervised Solution to the Control Problem of Articulatory Synthesis INTERSPEECH 2023 Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and Depth WACV 2023 Look More but Care Less in Video Recognition NIPS 2022 GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping ECCV 2022 SepLUT: Separable Image-Adaptive Lookup Tables for Real-Time Image Enhancement ECCV 2022 MemREIN: Rein the Domain Shift for Cross-Domain Few-Shot Learning IJCAI 2022 Improved Fine-Tuning by Better Leveraging Pre-Training Data NIPS 2022 Why do We Need Large Batchsizes in Contrastive Learning? A Gradient-Bias Perspective NIPS 2022 Posterior Refinement on Metric Matrix Improves Generalization Bound in Metric Learning ECCV 2022 HCSC: Hierarchical Contrastive Selective Coding CVPR 2022 CHEX: CHannel EXploration for CNN Model Compression CVPR 2022 Vision-Language Pre-Training With Triple Contrastive Learning CVPR 2022 Learning From Untrimmed Videos: Self-Supervised Video Representation Learning With Hierarchical Consistency CVPR 2022 DynaMaR: Dynamic Prompt with Mask Token Representation EMNLP 2022 PlaneMVS: 3D Plane Reconstruction From Multi-View Stereo CVPR 2022 PoP-Net: Pose Over Parts Network for Multi-Person 3D Pose Estimation From a Depth Image WACV 2022 AdaInt: Learning Adaptive Intervals for 3D Lookup Tables on Real-Time Image Enhancement CVPR 2022 Asynchronous Convergence in Multi-Task Learning via Knowledge Distillation from Converged Tasks NAACL 2022 EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification NAACL 2022 Multi-Modal Alignment Using Representation Codebook CVPR 2022 Evoc-Learn — High quality simulation of early vocal learning INTERSPEECH 2022 Adaptive Trajectory Prediction via Transferable GNN CVPR 2022 Articulatory Synthesis for Data Augmentation in Phoneme Recognition INTERSPEECH 2022 Exploration strategies for articulatory synthesis of complex syllable onsets INTERSPEECH 2022 Interventional Multi-Instance Learning with Deconfounded Instance-Level Prediction AAAI 2022 Effective Model Sparsification by Scheduled Grow-and-Prune Methods ICLR 2022 An Online Method for A Class of Distributionally Robust Optimization with Non-convex Objectives NIPS 2021 Weakly-supervised Text Classification Based on Keyword Graph EMNLP 2021 Dialogue-oriented Pre-training ACL 2021 MonoIndoor: Towards Good Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments ICCV 2021 Topic-Aware Multi-turn Dialogue Modeling AAAI 2021 GIF Thumbnails: Attract More Clicks to Your Videos AAAI 2021 Dash: Semi-Supervised Learning with Dynamic Thresholding ICML 2021 Federated Deep AUC Maximization for Hetergeneous Data with a Constant Communication Complexity ICML 2021 DP-SSL: Towards Robust Semi-supervised Learning with A Few Labeled Samples NIPS 2021 Dialogue-oriented Pre-training IJCNLP 2021 Model-Based Exploration of Linking Between Vowel Articulatory Space and Acoustic Space INTERSPEECH 2021 Segmental Alignment of English Syllables with Singleton and Cluster Onsets INTERSPEECH 2021 Semantic Aligned Multi-modal Transformer for Vision-LanguageUnderstanding: A Preliminary Study on Visual QA NAACL 2021 Network as Regularization for Training Deep Neural Networks: Framework, Model and Performance AAAI 2020 Quaternion Product Units for Deep Learning on 3D Rotation Groups CVPR 2020 Adaptive Fractional Dilated Convolution Network for Image Aesthetics Assessment CVPR 2020 CAM: Uninteresting Speech Detector INTERSPEECH 2020 Coarticulation as Synchronised Sequential Target Approximation: An EMA Study INTERSPEECH 2020 An Investigation of the Target Approximation Model for Tone Modeling and Recognition in Continuous Mandarin Speech INTERSPEECH 2020 Finding Intelligible Consonant-Vowel Sounds Using High-Quality Articulatory Synthesis INTERSPEECH 2020 Talking-head Generation with Rhythmic Head Motion ECCV 2020 Optimal Epoch Stochastic Gradient Descent Ascent Methods for Min-Max Optimization NIPS 2020 Stochastic Optimization for Non-convex Inf-Projection Problems ICML 2020 CF-LSTM: Cascaded Feature-Based Long Short-Term Networks for Predicting Pedestrian Trajectory AAAI 2020 Stochastic Optimization for DC Functions and Non-smooth Non-convex Regularizers with Non-asymptotic Convergence ICML 2019 Katalyst: Boosting Convex Katayusha for Non-Convex Problems with a Large Condition Number ICML 2019 Versatile Multiple Choice Learning and Its Application to Vision Computing CVPR 2019 Non-Local ConvLSTM for Video Compression Artifact Reduction ICCV 2019 On the Convergence of (Stochastic) Gradient Descent with Extrapolation for Non-Convex Minimization IJCAI 2019 Non-asymptotic Analysis of Stochastic Methods for Non-Smooth Non-Convex Regularized Problems NIPS 2019 Learning with Non-Convex Truncated Losses by SGD UAI 2019 A Weighted Superposition of Functional Contours Model for Modelling Contextual Prominence of Elementary Prosodic Contours INTERSPEECH 2018 SADAGRAD: Strongly Adaptive Stochastic Gradient Methods ICML 2018 Crowd Counting via Adversarial Cross-Scale Consistency Pursuit CVPR 2018 Scale-Transferrable Object Detection CVPR 2018 Geometric Constrained Joint Lane Segmentation and Lane Boundary Detection ECCV 2018 Quaternion Convolutional Neural Networks ECCV 2018 First-order Stochastic Algorithms for Escaping From Saddle Points in Almost Linear Time NIPS 2018 Adaptive SVRG Methods under Error Bound Conditions with Unknown Growth Parameter NIPS 2017 ADMM without a Fixed Penalty Parameter: Faster Convergence with New Adaptive Penalization NIPS 2017 Does Posh English Sound Attractive? INTERSPEECH 2017 Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence ICML 2017 Video Segmentation via Multiple Granularity Analysis CVPR 2017 Homotopy Smoothing for Non-Smooth Problems with Lower Complexity than $O(1/\epsilon)$ NIPS 2016 Model-Based Parametric Prosody Synthesis with Deep Neural Network INTERSPEECH 2016