Jiashi Feng

191 papers · 2013–2025 · 14 conferences · across top CS/AI conferences

Achievements

+19 more ↓

🗺️ Taxonomy Completionist (17) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🐣 Hot Topic Early Bird

🏃 Academic Marathon (12) 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🏠 Conference Loyalist (34) 🌟 Keyword Trendsetter Combo (4) 🤝 Dynamic Duo (52) 👑 Triple Crown 🧬 Topic Evolution 🏆 Grand Slam 🌱 Topic Pioneer 🔬 Deep Specialist (29) 🏆 Keyword Champion (6) 🔥 Unstoppable (13) ⚡ Prolific Year (16) ❓ The Questioner (2) 💎 Century Club (191) 🗃️ Keyword Collector (697) 📈 Trend Setter 🚀 Conference Pioneer

Conferences

CVPR (70) NIPS (34) ICCV (26) ICLR (15) ECCV (13) ICML (12) IJCAI (11) AAAI (3) WACV (2) ACL (1) AISTATS (1) EMNLP (1) IJCNLP (1) UAI (1)

Top co-authors

Shuicheng Yan (52) Xiaojie Jin (19) Jun Hao Liew (19) Daquan Zhou (18) Bingyi Kang (18) Yunpeng Chen (15) Jianfeng Zhang (14) Yunchao Wei (13) Pan Zhou (13) Zequn Jie (11)

Research topics

Computer Vision (1)

Keywords

convolutional neural network (21) semantic segmentation (16) object detection (14) knowledge distillation (12) image classification (12) representation learning (11) image generation (10) generative adversarial network (9) model compression (9) reinforcement learning (7) diffusion model (7) weakly supervised learning (6) self-supervised learning (6) recurrent neural network (6) multimodal learning (6) depth estimation (6) vision transformer (6) transfer learning (6) object localization (6) neural network (6)

Papers

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos CVPR 2025 Parallelized Autoregressive Visual Generation CVPR 2025 Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders CVPR 2025 DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention CVPR 2025 Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation CVPR 2025 GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation ICCV 2025 Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation ICCV 2025 QK-Edit: Revisiting Attention-based Injection in MM-DiT for Image and Video Editing ICCV 2025 The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer ICCV 2025 Flash-VStream: Efficient Real-Time Understanding for Long Video Streams ICCV 2025 How Far Is Video Generation from World Model: A Physical Law Perspective ICML 2025 LightningDrag: Lightning Fast and Accurate Drag-based Image Editing Emerging from Videos ICML 2025 MagicArticulate: Make Your 3D Models Articulation-Ready CVPR 2025 Video Depth Anything: Consistent Depth Estimation for Super-Long Videos CVPR 2025 Depth Anything V2 NIPS 2024 DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution NIPS 2024 Image Understanding Makes for A Good Tokenizer for Image Generation NIPS 2024 PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator NIPS 2024 Classification Done Right for Vision-Language Pre-Training NIPS 2024 StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation NIPS 2024 AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models ICLR 2024 COSA: Concatenated Sample Pretrained Vision-Language Foundation Model ICLR 2024 Video Recognition in Portrait Mode CVPR 2024 MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval CVPR 2024 VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens CVPR 2024 Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data CVPR 2024 MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration EMNLP 2024 MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model CVPR 2024 PixelLM: Pixel Reasoning with Large Multimodal Model CVPR 2024 LVD-2M: A Long-take Video Dataset with Temporally Dense Captions NIPS 2024 GETAvatar: Generative Textured Meshes for Animatable Human Avatars ICCV 2023 Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring CVPR 2023 TAPS3D: Text-Guided 3D Textured Shape Generation From Pseudo Supervision CVPR 2023 Diffusion Probabilistic Model Made Slim CVPR 2023 OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis CVPR 2023 Clover: Towards a Unified Video-Language Alignment and Fusion Model CVPR 2023 XAGen: 3D Expressive Human Avatars Generation NIPS 2023 Expanding Small-Scale Datasets with Guided Imagination NIPS 2023 Divide to Adapt: Mitigating Confirmation Bias for Domain Adaptation of Black-Box Predictors ICLR 2023 PV3D: A 3D Generative Model for Portrait Video Generation ICLR 2023 Revisiting Intrinsic Reward for Exploration in Procedurally Generated Environments ICLR 2023 Dataset Quantization ICCV 2023 Global Knowledge Calibration for Fast Open-Vocabulary Segmentation ICCV 2023 PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient ICML 2023 Reachability-Aware Laplacian Representation in Reinforcement Learning ICML 2023 Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning NIPS 2022 PoseTriplet: Co-Evolving 3D Human Pose Estimation, Imitation, and Hallucination Under Self-Supervision CVPR 2022 Shunted Self-Attention via Multi-Scale Token Aggregation CVPR 2022 MetaFormer Is Actually What You Need for Vision CVPR 2022 Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning CVPR 2022 DINE: Domain Adaptation From Single and Multiple Black-Box Predictors CVPR 2022 Self-Supervised Aggregation of Diverse Experts for Test-Agnostic Long-Tailed Recognition NIPS 2022 Sharpness-Aware Training for Free NIPS 2022 Slim Scissors: Segmenting Thin Object from Synthetic Background ECCV 2022 Towards Adversarially Robust Deep Image Denoising IJCAI 2022 Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering ECCV 2022 Understanding The Robustness in Vision Transformers ICML 2022 The Geometry of Robust Value Functions ICML 2022 How Well Does Self-Supervised Pre-Training Perform with Streaming Data? ICLR 2022 Efficient Sharpness-aware Minimization for Improved Training of Neural Networks ICLR 2022 Generalizing Few-Shot NAS with Gradient Matching ICLR 2022 Task similarity aware meta learning: theory-inspired improvement on MAML UAI 2021 Continual Learning via Bit-Level Information Preserving CVPR 2021 PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation CVPR 2021 Body Meshes as Points CVPR 2021 LV-BERT: Exploiting Layer Variety for BERT ACL 2021 Coordinate Attention for Efficient Mobile Network Design CVPR 2021 No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data NIPS 2021 Towards Better Laplacian Representation in Reinforcement Learning with Generalized Graph Drawing ICML 2021 CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Selection ICML 2021 Domain Adaptation With Auxiliary Target Domain-Oriented Classifier CVPR 2021 DANCE: A Deep Attentive Contour Model for Efficient Instance Segmentation WACV 2021 Deep Interactive Thin Object Selection WACV 2021 AutoSpace: Neural Architecture Search With Less Human Interference ICCV 2021 Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet ICCV 2021 Voxel Transformer for 3D Object Detection ICCV 2021 PnP-DETR: Towards Efficient Visual Analysis With Transformers ICCV 2021 Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning NIPS 2021 Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond NIPS 2021 All Tokens Matter: Token Labeling for Training Better Vision Transformers NIPS 2021 Direct Multi-view Multi-person 3D Pose Estimation NIPS 2021 Exploring Balanced Feature Spaces for Representation Learning ICLR 2021 LV-BERT: Exploiting Layer Variety for BERT IJCNLP 2021 A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation ECCV 2020 Inference Stage Optimization for Cross-scenario 3D Human Pose Estimation NIPS 2020 Improving Generalization in Reinforcement Learning with Mixture Regularization NIPS 2020 Residual Distillation: Towards Portable Deep Neural Networks without Shortcuts NIPS 2020 ConvBERT: Improving BERT with Span-based Dynamic Convolution NIPS 2020 Towards Theoretically Understanding Why Sgd Generalizes Better Than Adam in Deep Learning NIPS 2020 PPDM: Parallel Point Detection and Matching for Real-Time Human-Object Interaction Detection CVPR 2020 Central Similarity Quantization for Efficient Image and Video Retrieval CVPR 2020 Revisiting Knowledge Distillation via Label Smoothing Regularization CVPR 2020 Strip Pooling: Rethinking Spatial Pooling for Scene Parsing CVPR 2020 PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer CVPR 2020 Overcoming Classifier Imbalance for Long-Tail Object Detection With Balanced Group Softmax CVPR 2020 Boosting Few-Shot Learning With Adaptive Margin Loss CVPR 2020 Improving Convolutional Networks With Self-Calibrated Convolutions CVPR 2020 Rethinking Bottleneck Structure for Efficient Mobile Network Design ECCV 2020 Adversarial Self-Supervised Learning for Semi-Supervised 3D Action Recognition ECCV 2020 The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation ECCV 2020 Query-efficient Meta Attack to Deep Neural Networks ICLR 2020 Neural Epitome Search for Architecture-Agnostic Network Compression ICLR 2020 Decoupling Representation and Classifier for Long-Tailed Recognition ICLR 2020 ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning ICLR 2020 On Robustness of Neural Ordinary Differential Equations ICLR 2020 Do We Really Need to Access the Source Data? Source Hypothesis Transfer for Unsupervised Domain Adaptation ICML 2020 Partial Order Pruning: For Best Speed/Accuracy Trade-Off in Neural Architecture Search CVPR 2019 Efficient Meta Learning via Minibatch Proximal Update NIPS 2019 Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization AAAI 2019 MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input ICCV 2019 Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution ICCV 2019 Dynamic Kernel Distillation for Efficient Pose Estimation in Videos ICCV 2019 Single-Stage Multi-Person Pose Machines ICCV 2019 Few-Shot Object Detection via Feature Reweighting ICCV 2019 Foreground-Aware Pyramid Reconstruction for Alignment-Free Occluded Person Re-Identification ICCV 2019 PANet: Few-Shot Image Semantic Segmentation With Prototype Alignment ICCV 2019 Dynamic Feature Fusion for Semantic Edge Detection IJCAI 2019 Learning to Localize Objects with Noisy Labeled Instances AAAI 2019 Look across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition AAAI 2019 Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds AISTATS 2019 Multi-Prototype Networks for Unconstrained Set-based Face Recognition IJCAI 2019 Graph-Based Global Reasoning Networks CVPR 2019 Generalized Majorization-Minimization for Non-Convex Optimization IJCAI 2019 Frame-Consistent Recurrent Video Deraining With Dual-Level Flow CVPR 2019 A Simple Pooling-Based Design for Real-Time Salient Object Detection CVPR 2019 Distilling Object Detectors With Fine-Grained Feature Imitation CVPR 2019 Few-Shot Adaptive Faster R-CNN CVPR 2019 Zigzag Learning for Weakly Supervised Object Detection CVPR 2018 Left-Right Comparative Recurrent Model for Stereo Matching CVPR 2018 Towards Pose Invariant Face Recognition in the Wild CVPR 2018 Pose Partition Networks for Multi-Person Pose Estimation ECCV 2018 ML-LocNet: Improving Object Localization with Multi-view Learning Network ECCV 2018 Attention-aware Deep Adversarial Hashing for Cross-Modal Retrieval ECCV 2018 Mutual Learning to Adapt for Joint Human Parsing and Pose Estimation ECCV 2018 Dynamic Conditional Networks for Few-Shot Learning ECCV 2018 Multi-Fiber Networks for Video Recognition ECCV 2018 TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object Detection ECCV 2018 Exact Low Tubal Rank Tensor Recovery from Gaussian Measurements IJCAI 2018 Human Pose Estimation With Parsing Induced Learner CVPR 2018 Deep Adversarial Subspace Clustering CVPR 2018 Adversarial Complementary Learning for Weakly Supervised Object Localization CVPR 2018 MoNet: Deep Motion Exploitation for Video Object Segmentation CVPR 2018 Empirical Risk Landscape Analysis for Understanding Deep Neural Networks ICLR 2018 WSNet: Compact and Efficient Networks Through Weight Sampling ICML 2018 Policy Optimization with Demonstrations ICML 2018 Understanding Generalization and Optimization Performance of Deep CNNs ICML 2018 Efficient Stochastic Gradient Hard Thresholding NIPS 2018 A^2-Nets: Double Attention Networks NIPS 2018 New Insight into Hybrid Stochastic Gradient Descent: Beyond With-Replacement Sampling and Convexity NIPS 2018 Sharing Residual Units Through Collective Tensor Factorization To Improve Deep Neural Networks IJCAI 2018 3D-Aided Deep Pose-Invariant Face Recognition IJCAI 2018 Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation CVPR 2018 Learning Markov Clustering Networks for Scene Text Detection CVPR 2018 Weakly Supervised Phrase Localization With Multi-Scale Anchored Transformer Network CVPR 2018 Dual Path Networks NIPS 2017 Multimodal Learning and Reasoning for Visual Question Answering NIPS 2017 Predicting Scene Parsing and Motion Dynamics in the Future NIPS 2017 Dual-Agent GANs for Photorealistic and Identity Preserving Profile Face Synthesis NIPS 2017 Training Group Orthogonal Neural Networks with Privileged Information IJCAI 2017 Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search CVPR 2017 Deep Self-Taught Learning for Weakly Supervised Object Localization CVPR 2017 Deep Joint Rain Detection and Removal From a Single Image CVPR 2017 Outlier-Robust Tensor PCA CVPR 2017 Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks CVPR 2017 Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach CVPR 2017 Online Robust Low-Rank Tensor Learning IJCAI 2017 Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection ICCV 2017 Interpretable Structure-Evolving LSTM CVPR 2017 Learning Detection With Diverse Proposals CVPR 2017 Video Scene Parsing With Predictive Feature Learning ICCV 2017 Regional Interactive Image Segmentation Networks ICCV 2017 FoveaNet: Perspective-Aware Urban Scene Parsing ICCV 2017 Neural Person Search Machines ICCV 2017 Perceptual Generative Adversarial Networks for Small Object Detection CVPR 2017 Tree-Structured Reinforcement Learning for Sequential Object Localization NIPS 2016 Semantic Object Parsing With Local-Global Long Short-Term Memory CVPR 2016 Highway Vehicle Counting in Compressed Domain CVPR 2016 Recurrent Face Aging CVPR 2016 Recurrently Target-Attending Tracking CVPR 2016 Reversible Recursive Instance-Level Object Segmentation CVPR 2016 DrMAD: Distilling Reverse-Mode Automatic Differentiation for Optimizing Hyperparameters of Deep Neural Networks IJCAI 2016 Deep Subspace Clustering with Sparsity Prior IJCAI 2016 Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization CVPR 2016 Natural Language Object Retrieval CVPR 2016 Learning The Structure of Deep Convolutional Networks ICCV 2015 Learning Scalable Discriminative Dictionary with Sample Relatedness CVPR 2014 Robust Logistic Regression and Classification NIPS 2014 Robust Subspace Segmentation with Block-diagonal Prior CVPR 2014 Correlation Adaptive Subspace Segmentation by Trace Lasso ICCV 2013 Online Robust PCA via Stochastic Optimization NIPS 2013 Online PCA for Contaminated Data NIPS 2013