Shuai Yang

45 papers · 2017–2026 · 11 top CS/AI conferences

Achievements

🌍 Conference Polyglot (11) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🏃 Academic Marathon (8) 🐝 Cross-Pollinator (12) 🗺️ Taxonomy Completionist (76) 🤝 Dynamic Duo (13) 🏆 Keyword Champion (4) 🔬 Deep Specialist (10) 💎 Century Club (44) 🔥 Unstoppable (9) 🗃️ Keyword Collector (202) 📈 Trend Setter 🚀 Conference Pioneer ❓ The Questioner ⚡ Prolific Year (14)

Conferences

CVPR (13) ICCV (11) ECCV (4) ICLR (4) COLING (3) NIPS (3) AAAI (2) IJCAI (2) EMNLP (1) INTERSPEECH (1) NAACL (1)

Top co-authors

Chen Change Loy (13) Jiaying Liu (8) Ziwei Liu (8) Liming Jiang (6) Xingang Pan (6) Zongming Guo (6) Yifan Zhou (5) Zeqi Xiao (4) Yushi Lan (4) Fangzhou Hong (4)

Keywords

diffusion model (8) style transfer (6) generative adversarial network (5) temporal coherence (4) large language model (4) domain adaptation (3) temporal consistency (3) image-to-image translation (3) image translation (3) multimodal learning (3) video generation (3) generative model (3) text-to-image diffusion (2) multimodal large language model (2) image generation (2) event argument extraction (2) multi-modal learning (2) feature extraction (2) question answering (2) computer graphics (2)

Papers

Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling AAAI 2026
GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation CVPR 2025
MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention CVPR 2025
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model CVPR 2025
Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space CVPR 2025
Split-and-Combine: Enhancing Style Augmentation for Single Domain Generalization ICCV 2025
GaussianAnything: Interactive Point Cloud Flow Matching for 3D Generation ICLR 2025
Trajectory attention for fine-grained video motion control ICLR 2025
Balanced Image Stylization with Style Matching Score ICCV 2025
REAR: Reinforced Reasoning Optimization for Event Argument Extraction with Relation-Aware Support EMNLP 2025
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images ICCV 2025
TokensGen: Harnessing Condensed Tokens for Long Video Generation ICCV 2025
AnyPortal: Zero-Shot Consistent Video Background Replacement ICCV 2025
State Revisit and Re-explore: Bridging Sim-to-Real Gaps in Offline-and-Online Reinforcement Learning with An Imperfect Simulator IJCAI 2025
Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding NAACL 2025
Defect Spectrum: A Granular Look of Large-scale Defect Datasets with Rich Semantics ECCV 2024
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations NIPS 2024
Video Diffusion Models are Training-free Motion Interpreter and Controller NIPS 2024
Demonstration Retrieval-Augmented Generative Event Argument Extraction COLING 2024
KnowVrDU: A Unified Knowledge-aware Prompt-Tuning Framework for Visually-rich Document Understanding COLING 2024
Word-level Commonsense Knowledge Selection for Event Detection COLING 2024
Low-Rank Approximation for Sparse Attention in Multi-Modal LLMs CVPR 2024
FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation CVPR 2024
VideoBooth: Diffusion-based Video Generation with Image Prompts CVPR 2024
LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation ECCV 2024
Unified Generative and Discriminative Training for Multi-modal Large Language Models NIPS 2024
GroupDiff: Diffusion-based Group Portrait Editing ECCV 2024
Forward Learning of Graph Neural Networks ICLR 2024
Denoising Diffusion Step-aware Models ICLR 2024
StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces ICCV 2023
Text2Performer: Text-Driven Human Video Generation ICCV 2023
Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation ICCV 2023
Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion CVPR 2023
Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation ICCV 2023
DeformToon3D: Deformable Neural Radiance Fields for 3D Toonification ICCV 2023
Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer CVPR 2022
Unsupervised Image-to-Image Translation With Generative Prior CVPR 2022
Instance-Aware Coherent Video Style Transfer for Chinese Ink Wash Painting IJCAI 2021
Deep Plastic Surgery: Robust and Controllable Image Editing with Human-Drawn Sketches ECCV 2020
Controllable Artistic Text Style Transfer via Shape-Matching GAN ICCV 2019
Typography With Decor: Intelligent Text Style Transfer CVPR 2019
TET-GAN: Text Effects Transfer via Stylization and Destylization AAAI 2019
Erase or Fill? Deep Joint Recurrent Rain Removal and Reconstruction in Videos CVPR 2018
Detection of Glottal Closure Instants from Speech Signals: A Convolutional Neural Network Based Method INTERSPEECH 2018
Awesome Typography: Statistics-Based Text Effects Transfer CVPR 2017