Xiaokang Yang
153 papers · 2013–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (10) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) π Conference Polyglot (13)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(10)
π
Conference Polyglot
(13)
π
Conference Loyalist
(43)
π
Keyword Champion
(2)
π¬
Deep Specialist
(13)
π€
Dynamic Duo
(35)
π
Grand Slam
π
Triple Crown
ποΈ
Keyword Collector
(554)
π₯
Unstoppable
(11)
π
Conference Pioneer
π
Century Club
(148)
β‘
Prolific Year
(16)
π
Trend Setter
Conferences
CVPR (43)
ICCV (21)
ICLR (17)
AAAI (16)
NIPS (15)
ECCV (13)
IJCAI (12)
ICML (8)
MICCAI (3)
ACL (2)
AUTOML (1)
JMLR (1)
WACV (1)
Top co-authors
Keywords
diffusion model
(10)
neural network
(8)
transfer learning
(8)
graph matching
(8)
action recognition
(7)
vision-language model
(6)
3d reconstruction
(6)
recurrent neural network
(6)
domain adaptation
(5)
neural architecture search
(5)
generative model
(5)
object tracking
(5)
neural radiance field
(5)
video prediction
(5)
video understanding
(4)
convolutional neural network
(4)
adversarial attack
(4)
reinforcement learning
(4)
computer vision
(4)
adversarial learning
(4)
Papers
Latent Knowledge-Guided Video Diffusion for Scientific Phenomena Generation from a Single Initial Frame
AAAI 2026
ChemReason-Bench: Benchmarking Large Language Models for Procedural Reasoning in Experimental Chemistry
ACL 2026
Coordinated Humanoid Robot Locomotion with Symmetry Equivariant Reinforcement Learning Policy
AAAI 2026
Keep On Going: Learning Robust Humanoid Motion Skills via Selective Adversarial Training
AAAI 2026
MetaTrader: Learning to Generalize RL Trading Policies Beyond Offline Data
AAAI 2026
Domain Generalization in CLIP via Learning with Diverse Text Prompts
CVPR 2025
PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution
CVPR 2025
S^3-Face: SSS-Compliant Facial Reflectance Estimation via Diffusion Priors
CVPR 2025
OSDFace: One-Step Diffusion Model for Face Restoration
CVPR 2025
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
CVPR 2025
Star with Bilinear Mapping
CVPR 2025
POMP: Physics-consistent Motion Generative Model through Phase Manifolds
CVPR 2025
Domain Prompt Learning with Quaternion Networks (Extended Abstract)
IJCAI 2025
Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach
ICML 2025
Human Body Restoration with One-Step Diffusion Model and A New Benchmark
ICML 2025
EvoMesh: Adaptive Physical Simulation with Hierarchical Graph Evolutions
ICML 2025
FATE: Feature-Adapted Parameter Tuning for Vision-Language Models
AAAI 2025
ChemActor: Enhancing Automated Extraction of Chemical Synthesis Actions with LLM-Generated Data
ACL 2025
ARB-LLM: Alternating Refined Binarizations for Large Language Models
ICLR 2025
AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstruction
ICLR 2025
Rethinking Classifier Re-Training in Long-Tailed Recognition: Label Over-Smooth Can Balance
ICLR 2025
Open-World Reinforcement Learning over Long Short-Term Imagination
ICLR 2025
Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning
ICLR 2025
PostCast: Generalizable Postprocessing for Precipitation Nowcasting via Unsupervised Blurriness Modeling
ICLR 2025
PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing
ICLR 2025
Maintaining Structural Integrity in Parameter Spaces for Parameter Efficient Fine-tuning
ICLR 2025
KinFormer: Generalizable Dynamical Symbolic Regression for Catalytic Organic Reaction Kinetics
ICLR 2025
Disentangled Clothed Avatar Generation with Layered Representation
ICCV 2025
Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions
ICCV 2025
A Token-level Text Image Foundation Model for Document Understanding
ICCV 2025
Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations
ICCV 2025
Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation
ICCV 2025
Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography
ICCV 2025
QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation
ICCV 2025
Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning
ICCV 2025
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction
ICCV 2025
EndoDAV: Depth Any Video in Endoscopy with Spatiotemporal Accuracy
MICCAI 2025
Boundary Matters: A Bi-Level Active Finetuning Method
NIPS 2024
Recursive Generalization Transformer for Image Super-Resolution
ICLR 2024
Xformer: Hybrid X-Shaped Transformer for Image Denoising
ICLR 2024
HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting
NIPS 2024
Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning
NIPS 2024
Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction
NIPS 2024
NeuMA: Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics
NIPS 2024
EndoGSLAM: Real-Time Dense Reconstruction and Tracking in Endoscopic Surgeries using Gaussian Splatting
MICCAI 2024
Pygmtools: A Python Graph Matching Toolkit
JMLR 2024
DynaVol: Unsupervised Learning for Dynamic Scenes through Object-Centric Voxelization
ICLR 2024
Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video
ICLR 2024
ReLIZO: Sample Reusable Linear Interpolation-based Zeroth-order Optimization
NIPS 2024
Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis
ECCV 2024
CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling
ICML 2024
Domain-Controlled Prompt Learning
AAAI 2024
SAM-PARSER: Fine-Tuning SAM Efficiently by Parameter Space Reconstruction
AAAI 2024
Partial Label Learning with a Partner
AAAI 2024
LERE: Learning-Based Low-Rank Matrix Recovery with Rank Estimation
AAAI 2024
HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects
ECCV 2024
VidToMe: Video Token Merging for Zero-Shot Video Editing
CVPR 2024
Missing as Masking: Arbitrary Cross-modal Feature Reconstruction for Incomplete Multimodal Brain Tumor Segmentation
MICCAI 2024
Domain Prompt Learning with Quaternion Networks
CVPR 2024
ReGenNet: Towards Human Action-Reaction Synthesis
CVPR 2024
Monocular Identity-Conditioned Facial Reflectance Reconstruction
CVPR 2024
Inter-X: Towards Versatile Human-Human Interaction Analysis
CVPR 2024
Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
ECCV 2024
Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation
ECCV 2024
PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer
ECCV 2024
Deep Learning of Partial Graph Matching via Differentiable Top-K
CVPR 2023
NeRF-IBVS: Visual Servo Based on NeRF for Visual Localization and Navigation
NIPS 2023
Poisson Process for Bayesian Optimization
AUTOML 2023
NeRFVS: Neural Radiance Fields for Free View Synthesis via Geometry Scaffolds
CVPR 2023
Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective
CVPR 2023
3D-Aware Face Swapping
CVPR 2023
Improving Fairness in Facial Albedo Estimation via Visual-Textual Cues
CVPR 2023
Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm
CVPR 2023
Self-Supervised Character-to-Character Distillation for Text Recognition
ICCV 2023
Dual Aggregation Transformer for Image Super-Resolution
ICCV 2023
ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradient Accumulation
ICCV 2023
ActFormer: A GAN-based Transformer towards General Action-Conditioned 3D Human Motion Generation
ICCV 2023
Graph Signal Sampling for Inductive One-Bit Matrix Completion: a Closed-form Solution
ICLR 2023
ROCO: A General Framework for Evaluating Robustness of Combinatorial Optimization Solvers on Graphs
ICLR 2023
Towards One-shot Neural Combinatorial Solvers: Theoretical and Empirical Notes on the Cardinality-Constrained Case
ICLR 2023
LinSATNet: The Positive Linear Satisfiability Neural Networks
ICML 2023
StockFormer: Learning Hybrid Trading Machines with Predictive Coding
IJCAI 2023
End-to-End Reconstruction-Classification Learning for Face Forgery Detection
CVPR 2022
Align Representations With Base: A New Approach to Self-Supervised Learning
CVPR 2022
Continual Predictive Learning From Videos
CVPR 2022
Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition
NIPS 2022
CageNeRF: Cage-based Neural Radiance Field for Generalized 3D Deformation and Animation
NIPS 2022
Exploring Frequency Adversarial Attacks for Face Forgery Detection
CVPR 2022
Learning Invisible Markers for Hidden Codes in Offline-to-Online Photography
CVPR 2022
Zero-CL: Instance and Feature decorrelation for negative-free symmetric contrastive learning
ICLR 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models
NIPS 2022
ZARTS: On Zero-order Optimization for Neural Architecture Search
NIPS 2022
Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop
NIPS 2022
NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields
ICML 2022
Exploring Visual Context for Weakly Supervised Person Search
AAAI 2022
Learning Mixture of Neural Temporal Point Processes for Multi-dimensional Event Sequence Clustering
IJCAI 2022
EAutoDet: Efficient Architecture Search for Object Detection
ECCV 2022
Self-Supervised Learning of Visual Graph Matching
ECCV 2022
Context-Aware Image Inpainting with Learned Semantic Priors
IJCAI 2021
Learning Spectral Dictionary for Local Representation of Mesh
IJCAI 2021
Learning Local Neighboring Structure for Robust 3D Shape Representation
AAAI 2021
Learning Comprehensive Motion Representation for Action Recognition
AAAI 2021
Scalable and Explainable 1-Bit Matrix Completion via Graph Signal Learning
AAAI 2021
Rethinking Bi-Level Optimization in Neural Architecture Search: A Gibbs Sampling Perspective
AAAI 2021
A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs
NIPS 2021
Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation
ICML 2021
Cross-Modality 3D Object Detection
WACV 2021
PointAugmenting: Cross-Modal Augmentation for 3D Object Detection
CVPR 2021
IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking
CVPR 2021
Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction
CVPR 2021
Combinatorial Learning of Graph Edit Distance via Dynamic Embedding
CVPR 2021
Learning To Track Objects From Unlabeled Videos
ICCV 2021
Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering
ECCV 2020
Video Prediction via Example Guidance
ICML 2020
Deep Kinematics Analysis for Monocular 3D Human Pose Estimation
CVPR 2020
Layered Neighborhood Expansion for Incremental Multiple Graph Matching
ECCV 2020
Hierarchical Style-based Networks for Motion Synthesis
ECCV 2020
Robust Tracking against Adversarial Attacks
ECCV 2020
Graduated Assignment for Joint Multi-Graph Matching and Clustering with Application to Unsupervised Graph Matching Network Learning
NIPS 2020
MergeNAS: Merge Operations into One for Differentiable Architecture Search
IJCAI 2020
Human Action Transfer Based on 3D Model Reconstruction
AAAI 2019
Learning Context Graph for Person Search
CVPR 2019
Learning Interpretable Deep State Space Model for Probabilistic Time Series Forecasting
IJCAI 2019
Variational Few-Shot Learning
ICCV 2019
Learning Combinatorial Embedding Networks for Deep Graph Matching
ICCV 2019
Efficient Quantization for Neural Networks with Binary Weights and Low Bitwidth Activations
AAAI 2019
Fine-Grained Video Captioning for Sports Narrative
CVPR 2018
Multiple Granularity Group Interaction Prediction
CVPR 2018
Deep Regression Tracking with Shrinkage Loss
ECCV 2018
Video Prediction via Selective Sampling
NIPS 2018
Crowd Counting via Adversarial Cross-Scale Consistency Pursuit
CVPR 2018
Structure Preserving Video Prediction
CVPR 2018
Attention-GAN for Object Transfiguration in Wild Images
ECCV 2018
Video Segmentation via Multiple Granularity Analysis
CVPR 2017
Performance Guaranteed Network Acceleration via High-Order Residual Quantization
ICCV 2017
Image Matching via Loopy RNN
IJCAI 2017
Predicting Human Interaction via Relative Attention Model
IJCAI 2017
Recurrent Modeling of Interaction Context for Collective Activity Recognition
CVPR 2017
Factors in Finetuning Deep Model for Object Detection With Long-Tail Distribution
CVPR 2016
Progressively Parsing Interactional Objects for Fine Grained Action Detection
CVPR 2016
Cascaded Interactional Targeting Network for Egocentric Video Analysis
CVPR 2016
Temporal Action Localization With Pyramid of Score Distribution Features
CVPR 2016
On Modeling and Predicting Individual Paper Citation Count over Time
IJCAI 2016
Modeling Contagious Merger and Acquisition via Point Processes with a Profile Regression Prior
IJCAI 2016
Long-Term Correlation Tracking
CVPR 2015
A Matrix Decomposition Perspective to Multiple Graph Matching
ICCV 2015
Motion Part Regularization: Improving Action Recognition via Trajectory Selection
CVPR 2015
Hierarchical Convolutional Features for Visual Tracking
ICCV 2015
Multi-Task Multi-Dimensional Hawkes Processes for Modeling Event Sequences
IJCAI 2015
Cross-Scene Crowd Counting via Deep Convolutional Neural Networks
CVPR 2015
Discrete Hyper-Graph Matching
CVPR 2015
Joint Optimization for Consistent Multiple Graph Matching
ICCV 2013
Action Recognition with Actons
ICCV 2013