Shuai Yang
45 papers · 2017–2026 · 11 top CS/AI conferences
Achievements
Conference Polyglot (11)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (5)
Academic Marathon (8)
Cross-Pollinator (12)
Taxonomy Completionist (76)
Dynamic Duo (13)
Keyword Champion (4)
Deep Specialist (10)
Century Club (44)
Unstoppable (9)
Keyword Collector (202)
Trend Setter
Conference Pioneer
The Questioner
Prolific Year (14)
Conferences
CVPR (13)
ICCV (11)
ECCV (4)
ICLR (4)
COLING (3)
NIPS (3)
AAAI (2)
IJCAI (2)
EMNLP (1)
INTERSPEECH (1)
NAACL (1)
Keywords
diffusion model (8)
style transfer (6)
generative adversarial network (5)
temporal coherence (4)
large language model (4)
domain adaptation (3)
temporal consistency (3)
image-to-image translation (3)
image translation (3)
multimodal learning (3)
video generation (3)
generative model (3)
text-to-image diffusion (2)
multimodal large language model (2)
image generation (2)
event argument extraction (2)
multi-modal learning (2)
feature extraction (2)
question answering (2)
computer graphics (2)
Papers
Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
AAAI 2026
GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
CVPR 2025
MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention
CVPR 2025
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model
CVPR 2025
Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space
CVPR 2025
Split-and-Combine: Enhancing Style Augmentation for Single Domain Generalization
ICCV 2025
GaussianAnything: Interactive Point Cloud Flow Matching for 3D Generation
ICLR 2025
Trajectory Attention for Fine-Grained Video Motion Control
ICLR 2025
Balanced Image Stylization with Style Matching Score
ICCV 2025
REAR: Reinforced Reasoning Optimization for Event Argument Extraction with Relation-Aware Support
EMNLP 2025
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images
ICCV 2025
TokensGen: Harnessing Condensed Tokens for Long Video Generation
ICCV 2025
AnyPortal: Zero-Shot Consistent Video Background Replacement
ICCV 2025
State Revisit and Re-explore: Bridging Sim-to-Real Gaps in Offline-and-Online Reinforcement Learning with An Imperfect Simulator
IJCAI 2025
Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding
NAACL 2025
Defect Spectrum: A Granular Look of Large-scale Defect Datasets with Rich Semantics
ECCV 2024
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations
NIPS 2024
Video Diffusion Models are Training-free Motion Interpreter and Controller
NIPS 2024
Demonstration Retrieval-Augmented Generative Event Argument Extraction
COLING 2024
KnowVrDU: A Unified Knowledge-aware Prompt-Tuning Framework for Visually-rich Document Understanding
COLING 2024
Word-level Commonsense Knowledge Selection for Event Detection
COLING 2024
Low-Rank Approximation for Sparse Attention in Multi-Modal LLMs
CVPR 2024
FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
CVPR 2024
VideoBooth: Diffusion-based Video Generation with Image Prompts
CVPR 2024
LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation
ECCV 2024
Unified Generative and Discriminative Training for Multi-modal Large Language Models
NIPS 2024
GroupDiff: Diffusion-based Group Portrait Editing
ECCV 2024
Forward Learning of Graph Neural Networks
ICLR 2024
Denoising Diffusion Step-aware Models
ICLR 2024
StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces
ICCV 2023
Text2Performer: Text-Driven Human Video Generation
ICCV 2023
Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation
ICCV 2023
Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion
CVPR 2023
Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation
ICCV 2023
DeformToon3D: Deformable Neural Radiance Fields for 3D Toonification
ICCV 2023
Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
CVPR 2022
Unsupervised Image-to-Image Translation With Generative Prior
CVPR 2022
Instance-Aware Coherent Video Style Transfer for Chinese Ink Wash Painting
IJCAI 2021
Deep Plastic Surgery: Robust and Controllable Image Editing with Human-Drawn Sketches
ECCV 2020
Controllable Artistic Text Style Transfer via Shape-Matching GAN
ICCV 2019
Typography With Decor: Intelligent Text Style Transfer
CVPR 2019
TET-GAN: Text Effects Transfer via Stylization and Destylization
AAAI 2019
Erase or Fill? Deep Joint Recurrent Rain Removal and Reconstruction in Videos
CVPR 2018
Detection of Glottal Closure Instants from Speech Signals: A Convolutional Neural Network Based Method
INTERSPEECH 2018
Awesome Typography: Statistics-Based Text Effects Transfer
CVPR 2017