Shuicheng Yan
174 papers · 2010–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+20 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (23) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (9) π£ Hot Topic Early Bird
π
Renaissance Researcher
(9)
π
Interdisciplinary Bridge
π§
Keyword Pioneer
π
Keyword Trendsetter Combo
(9)
π
Conference Loyalist
(29)
π§¬
Topic Evolution
π€
Dynamic Duo
(52)
π
Grand Slam
π
Triple Crown
π₯
Mega-Team
(32)
π±
Topic Pioneer
π¬
Deep Specialist
(30)
π
Keyword Champion
(2)
ποΈ
Keyword Collector
(66)
π
Conference Pioneer
π
Century Club
(171)
π₯
Unstoppable
(16)
π
Trend Setter
β‘
Prolific Year
(26)
β
The Questioner
(2)
Conferences
CVPR (43)
NIPS (29)
ICCV (28)
ICLR (20)
ECCV (14)
ICML (12)
IJCAI (9)
AAAI (8)
ACL (4)
AISTATS (3)
EMNLP (3)
JMLR (1)
Top co-authors
Research topics
Keywords
convolutional neural network
(20)
semantic segmentation
(14)
image generation
(9)
image classification
(9)
model compression
(7)
object detection
(7)
dictionary learning
(6)
face recognition
(6)
diffusion model
(6)
generative adversarial network
(6)
large language model
(6)
image segmentation
(5)
human parsing
(5)
reinforcement learning
(5)
vision transformer
(4)
sparse representation
(4)
model architecture
(4)
convex optimization
(4)
action recognition
(4)
online learning
(4)
Papers
Deep-Reporter: Deep Research for Grounded Multimodal Long-Form Generation
ACL 2026
EvoRoute: Experience-Driven Self-Routing LLM Agent Systems
ACL 2026
PointDGRWKV: Generalizing RWKV-like Architecture to Unseen Domains for Point Cloud Classification
AAAI 2026
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
ICML 2025
Cradle: Empowering Foundation Agents towards General Computer Control
ICML 2025
MoH: Multi-Head Attention as Mixture-of-Head Attention
ICML 2025
On Path to Multimodal Generalist: General-Level and General-Bench
ICML 2025
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
ICLR 2025
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
ICLR 2025
EasyInv: Toward Fast and Better DDIM Inversion
ICML 2025
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts
ICLR 2025
Towards Semantic Equivalence of Tokenization in Multimodal LLM
ICLR 2025
Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation
ICCV 2025
MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes
ICCV 2025
Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models
EMNLP 2025
Policy Optimization under Imperfect Human Interactions with Agent-Gated Shared Autonomy
ICLR 2025
Poison-splat: Computation Cost Attack on 3D Gaussian Splatting
ICLR 2025
Explore In-Context Segmentation via Latent Diffusion Models
AAAI 2025
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning
AAAI 2025
PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model
AAAI 2025
Point Cloud Mamba: Point Cloud Learning via State Space Model
AAAI 2025
Masks Can be Learned as an Alternative to Experts
ACL 2025
Removing Prompt-template Bias in Reinforcement Learning from Human Feedback
ACL 2025
SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
ICLR 2025
AgentStudio: A Toolkit for Building General Virtual Agents
ICLR 2025
Improving Video Segmentation via Dynamic Anchor Queries
ECCV 2024
InceptionNeXt: When Inception Meets ConvNeXt
CVPR 2024
Non-confusing Generation of Customized Concepts in Diffusion Models
ICML 2024
Auto-Encoding Morph-Tokens for Multimodal LLM
ICML 2024
Reinforcement Learning from Diverse Human Preferences
IJCAI 2024
Win: Weight-Decay-Integrated Nesterov Acceleration for Faster Network Training
JMLR 2024
MVGamba: Unify 3D Content Generation as State Space Sequence Modeling
NIPS 2024
Action Imitation in Common Action Space for Customized Action Image Synthesis
NIPS 2024
Automating Dataset Updates Towards Reliable and Timely Evaluation of Large Language Models
NIPS 2024
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
NIPS 2024
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
NIPS 2024
Region-Native Visual Tokenization
ECCV 2024
LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement
EMNLP 2024
BAFFLE: A Baseline of Backpropagation-Free Federated Learning
ECCV 2024
Arbitrary Virtual Try-on Network: Characteristics Representation and Trade-off between Body and Clothing
ICLR 2023
Masked Diffusion Transformer is a Strong Image Synthesizer
ICCV 2023
STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition
ICCV 2023
Mutual Information Regularized Offline Reinforcement Learning
NIPS 2023
Gaussian Mixture Solvers for Diffusion Models
NIPS 2023
On Calibrating Diffusion Probabilistic Models
NIPS 2023
Efficient Diffusion Policies For Offline Reinforcement Learning
NIPS 2023
ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection
NIPS 2023
Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
AAAI 2023
Bag of Tricks for Training Data Extraction from Language Models
ICML 2023
Better Diffusion Models Further Improve Adversarial Training
ICML 2023
Nonparametric Generative Modeling with Conditional Sliced-Wasserstein Flows
ICML 2023
D4FT: A Deep Learning Approach to Kohn-Sham Density Functional Theory
ICLR 2023
Win: Weight-Decay-Integrated Nesterov Acceleration for Adaptive Gradient Algorithms
ICLR 2023
Spikformer: When Spiking Neural Network Meets Transformer
ICLR 2023
LPT: Long-tailed Prompt Tuning for Image Classification
ICLR 2023
Towards Understanding Why Mask Reconstruction Pretraining Helps in Downstream Tasks
ICLR 2023
Visual Imitation Learning with Patch Rewards
ICLR 2023
Position-Guided Text Prompt for Vision-Language Pre-Training
CVPR 2023
Exploring Incompatible Knowledge Transfer in Few-Shot Image Generation
CVPR 2023
Generative Table Pre-training Empowers Models for Tabular Prediction
EMNLP 2023
Bag of Tricks for Unsupervised Text-to-Speech
ICLR 2023
RPM: Generalizable Multi-Agent Policies for Multi-Agent Reinforcement Learning
ICLR 2023
Distributional Meta-Gradient Reinforcement Learning
ICLR 2023
Efficient Offline Policy Optimization with a Learned Model
ICLR 2023
Revisiting Intrinsic Reward for Exploration in Procedurally Generated Environments
ICLR 2023
Robustness and Accuracy Could Be Reconcilable by (Proper) Definition
ICML 2022
Inception Transformer
NIPS 2022
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine
NIPS 2022
Improving Vision Transformers by Revisiting High-Frequency Components
ECCV 2022
DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition
ECCV 2022
Video Graph Transformer for Video Question Answering
ECCV 2022
MetaFormer Is Actually What You Need for Vision
CVPR 2022
Deep Color Consistent Network for Low-Light Image Enhancement
CVPR 2022
Self-Promoted Supervision for Few-Shot Transformer
ECCV 2022
Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering
ECCV 2022
Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet
ICCV 2021
PnP-DETR: Towards Efficient Visual Analysis With Transformers
ICCV 2021
Partial-Label and Structure-constrained Deep Coupled Factorization Network
AAAI 2021
How Should Pre-Trained Language Models Be Fine-Tuned Towards Adversarial Robustness?
NIPS 2021
Direct Multi-view Multi-person 3D Pose Estimation
NIPS 2021
Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond
NIPS 2021
ConvBERT: Improving BERT with Span-based Dynamic Convolution
NIPS 2020
Rethinking Bottleneck Structure for Efficient Mobile Network Design
ECCV 2020
Highly Efficient Salient Object Detection with 100K Parameters
ECCV 2020
AdversarialNAS: Adversarial Neural Architecture Search for GANs
CVPR 2020
PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer
CVPR 2020
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution
ICCV 2019
Multi-Prototype Networks for Unconstrained Set-based Face Recognition
IJCAI 2019
Look across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition
AAAI 2019
Efficient Meta Learning via Minibatch Proximal Update
NIPS 2019
Very Long Natural Scenery Image Prediction by Outpainting
ICCV 2019
Single-Stage Multi-Person Pose Machines
ICCV 2019
A^2-Nets: Double Attention Networks
NIPS 2018
Exact Low Tubal Rank Tensor Recovery from Gaussian Measurements
IJCAI 2018
Human Pose Estimation With Parsing Induced Learner
CVPR 2018
Towards Pose Invariant Face Recognition in the Wild
CVPR 2018
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
CVPR 2018
Neural Style Transfer via Meta Networks
CVPR 2018
Pose Partition Networks for Multi-Person Pose Estimation
ECCV 2018
Mutual Learning to Adapt for Joint Human Parsing and Pose Estimation
ECCV 2018
Dynamic Conditional Networks for Few-Shot Learning
ECCV 2018
Multi-Fiber Networks for Video Recognition
ECCV 2018
WSNet: Compact and Efficient Networks Through Weight Sampling
ICML 2018
Sharing Residual Units Through Collective Tensor Factorization To Improve Deep Neural Networks
IJCAI 2018
High Resolution Feature Recovering for Accelerating Urban Scene Parsing
IJCAI 2018
3D-Aided Deep Pose-Invariant Face Recognition
IJCAI 2018
Interpretable Structure-Evolving LSTM
CVPR 2017
Predicting Scene Parsing and Motion Dynamics in the Future
NIPS 2017
Dual-Agent GANs for Photorealistic and Identity Preserving Profile Face Synthesis
NIPS 2017
Dual Path Networks
NIPS 2017
Perceptual Generative Adversarial Networks for Small Object Detection
CVPR 2017
Deep Joint Rain Detection and Removal From a Single Image
CVPR 2017
Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search
CVPR 2017
Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach
CVPR 2017
Semantic Segmentation via Structured Patch Prediction, Context CRF and Guidance CRF
CVPR 2017
More Is Less: A More Complicated Network With Less Inference Complexity
CVPR 2017
Neural Person Search Machines
ICCV 2017
FoveaNet: Perspective-Aware Urban Scene Parsing
ICCV 2017
Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection
ICCV 2017
Scale-Adaptive Convolutions for Scene Parsing
ICCV 2017
Video Scene Parsing With Predictive Feature Learning
ICCV 2017
Training Group Orthogonal Neural Networks with Privileged Information
IJCAI 2017
Online Robust Low-Rank Tensor Learning
IJCAI 2017
Global-residual and Local-boundary Refinement Networks for Rectifying Scene Parsing Predictions
IJCAI 2017
Recurrently Target-Attending Tracking
CVPR 2016
Recurrent Face Aging
CVPR 2016
Semantic Object Parsing With Local-Global Long Short-Term Memory
CVPR 2016
Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization
CVPR 2016
Tree-Structured Reinforcement Learning for Sequential Object Localization
NIPS 2016
Reversible Recursive Instance-Level Object Segmentation
CVPR 2016
Human Parsing With Contextualized Convolutional Neural Network
ICCV 2015
Task-Driven Feature Pooling for Image Classification
ICCV 2015
Cross-Domain Image Retrieval With a Dual Attribute-Aware Ranking Network
ICCV 2015
Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection
ICCV 2015
SOLD: Sub-Optimal Low-rank Decomposition for Efficient Video Segmentation
CVPR 2015
Deep Domain Adaptation for Describing People Based on Fine-Grained Clothing Attributes
CVPR 2015
Motion Part Regularization: Improving Action Recognition via Trajectory Selection
CVPR 2015
Simultaneous Feature Learning and Hash Coding With Deep Neural Networks
CVPR 2015
Matching-CNN Meets KNN: Quasi-Parametric Human Parsing
CVPR 2015
Shape Driven Kernel Adaptation in Convolutional Neural Network for Robust Facial Traits Recognition
CVPR 2015
Structural Sparse Tracking
CVPR 2015
Personalized Age Progression With Aging Dictionary
ICCV 2015
Conditional Convolutional Neural Network for Modality-Aware Face Recognition
ICCV 2015
Additive Nearest Neighbor Feature Maps
ICCV 2015
Towards Unified Human Parsing and Pose Estimation
CVPR 2014
On a Theory of Nonparametric Pairwise Similarity for Clustering: Connecting Clustering to Classification
NIPS 2014
Robust Logistic Regression and Classification
NIPS 2014
Convex Optimization Procedure for Clustering: Theoretical Revisit
NIPS 2014
Generalized Nonconvex Nonsmooth Low-Rank Minimization
CVPR 2014
Robust Subspace Segmentation with Block-diagonal Prior
CVPR 2014
DL-SFA: Deeply-Learned Slow Feature Analysis for Action Recognition
CVPR 2014
Towards Multi-view and Partially-Occluded Face Alignment
CVPR 2014
Learning Scalable Discriminative Dictionary with Sample Relatedness
CVPR 2014
Cross-Scale Cost Aggregation for Stereo Matching
CVPR 2014
Hierarchical Part Matching for Fine-Grained Visual Categorization
ICCV 2013
Efficient Maximum Appearance Search for Large-Scale Object Detection
CVPR 2013
A Divide-and-Conquer Method for Scalable Low-Rank Latent Matrix Pursuit
CVPR 2013
Subcategory-Aware Object Classification
CVPR 2013
Incorporating Structural Alternatives and Sharing into Hierarchy for Multiclass Object Recognition and Detection
CVPR 2013
Complex Event Detection via Multi-source Video Attributes
CVPR 2013
Compressed Hashing
CVPR 2013
Online PCA for Contaminated Data
NIPS 2013
Online Robust PCA via Stochastic Optimization
NIPS 2013
Robust Object Tracking with Online Multi-lifespan Dictionary Learning
ICCV 2013
Correntropy Induced L2 Graph for Robust Subspace Clustering
ICCV 2013
Semantic Segmentation without Annotating Segments
ICCV 2013
How Related Exemplars Help Complex Event Detection in Web Videos?
ICCV 2013
A Deformable Mixture Parsing Model with Parselets
ICCV 2013
Correlation Adaptive Subspace Segmentation by Trace Lasso
ICCV 2013
Super-Bit Locality-Sensitive Hashing
NIPS 2012
Forward Basis Selection for Sparse Approximation over Dictionary
AISTATS 2012
Exact Subspace Segmentation and Outlier Detection by Low-Rank Representation
AISTATS 2012
A Finite Newton Algorithm for Non-degenerate Piecewise Linear Systems
AISTATS 2011
Robust Clustering as Ensembles of Affinity Relations
NIPS 2010