Jiashi Feng
191 papers · 2013–2025 · 14 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+19 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (17) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (6) π£ Hot Topic Early Bird
π
Academic Marathon
(12)
π
Renaissance Researcher
(6)
π
Interdisciplinary Bridge
π
Conference Loyalist
(34)
π
Keyword Trendsetter Combo
(4)
π€
Dynamic Duo
(52)
π
Triple Crown
π§¬
Topic Evolution
π
Grand Slam
π±
Topic Pioneer
π¬
Deep Specialist
(29)
π
Keyword Champion
(6)
π₯
Unstoppable
(13)
β‘
Prolific Year
(16)
β
The Questioner
(2)
π
Century Club
(191)
ποΈ
Keyword Collector
(697)
π
Trend Setter
π
Conference Pioneer
Conferences
CVPR (70)
NIPS (34)
ICCV (26)
ICLR (15)
ECCV (13)
ICML (12)
IJCAI (11)
AAAI (3)
WACV (2)
ACL (1)
AISTATS (1)
EMNLP (1)
IJCNLP (1)
UAI (1)
Top co-authors
Research topics
Keywords
convolutional neural network
(21)
semantic segmentation
(16)
object detection
(14)
knowledge distillation
(12)
image classification
(12)
representation learning
(11)
image generation
(10)
generative adversarial network
(9)
model compression
(9)
reinforcement learning
(7)
diffusion model
(7)
weakly supervised learning
(6)
self-supervised learning
(6)
recurrent neural network
(6)
multimodal learning
(6)
depth estimation
(6)
vision transformer
(6)
transfer learning
(6)
object localization
(6)
neural network
(6)
Papers
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos
CVPR 2025
Parallelized Autoregressive Visual Generation
CVPR 2025
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders
CVPR 2025
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
CVPR 2025
Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation
CVPR 2025
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation
ICCV 2025
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
ICCV 2025
QK-Edit: Revisiting Attention-based Injection in MM-DiT for Image and Video Editing
ICCV 2025
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer
ICCV 2025
Flash-VStream: Efficient Real-Time Understanding for Long Video Streams
ICCV 2025
How Far Is Video Generation from World Model: A Physical Law Perspective
ICML 2025
LightningDrag: Lightning Fast and Accurate Drag-based Image Editing Emerging from Videos
ICML 2025
MagicArticulate: Make Your 3D Models Articulation-Ready
CVPR 2025
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
CVPR 2025
Depth Anything V2
NIPS 2024
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
NIPS 2024
Image Understanding Makes for A Good Tokenizer for Image Generation
NIPS 2024
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator
NIPS 2024
Classification Done Right for Vision-Language Pre-Training
NIPS 2024
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
NIPS 2024
AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models
ICLR 2024
COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
ICLR 2024
Video Recognition in Portrait Mode
CVPR 2024
MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval
CVPR 2024
VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens
CVPR 2024
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
CVPR 2024
MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration
EMNLP 2024
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
CVPR 2024
PixelLM: Pixel Reasoning with Large Multimodal Model
CVPR 2024
LVD-2M: A Long-take Video Dataset with Temporally Dense Captions
NIPS 2024
GETAvatar: Generative Textured Meshes for Animatable Human Avatars
ICCV 2023
Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring
CVPR 2023
TAPS3D: Text-Guided 3D Textured Shape Generation From Pseudo Supervision
CVPR 2023
Diffusion Probabilistic Model Made Slim
CVPR 2023
OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis
CVPR 2023
Clover: Towards a Unified Video-Language Alignment and Fusion Model
CVPR 2023
XAGen: 3D Expressive Human Avatars Generation
NIPS 2023
Expanding Small-Scale Datasets with Guided Imagination
NIPS 2023
Divide to Adapt: Mitigating Confirmation Bias for Domain Adaptation of Black-Box Predictors
ICLR 2023
PV3D: A 3D Generative Model for Portrait Video Generation
ICLR 2023
Revisiting Intrinsic Reward for Exploration in Procedurally Generated Environments
ICLR 2023
Dataset Quantization
ICCV 2023
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation
ICCV 2023
PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient
ICML 2023
Reachability-Aware Laplacian Representation in Reinforcement Learning
ICML 2023
Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning
NIPS 2022
PoseTriplet: Co-Evolving 3D Human Pose Estimation, Imitation, and Hallucination Under Self-Supervision
CVPR 2022
Shunted Self-Attention via Multi-Scale Token Aggregation
CVPR 2022
MetaFormer Is Actually What You Need for Vision
CVPR 2022
Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning
CVPR 2022
DINE: Domain Adaptation From Single and Multiple Black-Box Predictors
CVPR 2022
Self-Supervised Aggregation of Diverse Experts for Test-Agnostic Long-Tailed Recognition
NIPS 2022
Sharpness-Aware Training for Free
NIPS 2022
Slim Scissors: Segmenting Thin Object from Synthetic Background
ECCV 2022
Towards Adversarially Robust Deep Image Denoising
IJCAI 2022
Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering
ECCV 2022
Understanding The Robustness in Vision Transformers
ICML 2022
The Geometry of Robust Value Functions
ICML 2022
How Well Does Self-Supervised Pre-Training Perform with Streaming Data?
ICLR 2022
Efficient Sharpness-aware Minimization for Improved Training of Neural Networks
ICLR 2022
Generalizing Few-Shot NAS with Gradient Matching
ICLR 2022
Task similarity aware meta learning: theory-inspired improvement on MAML
UAI 2021
Continual Learning via Bit-Level Information Preserving
CVPR 2021
PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation
CVPR 2021
Body Meshes as Points
CVPR 2021
LV-BERT: Exploiting Layer Variety for BERT
ACL 2021
Coordinate Attention for Efficient Mobile Network Design
CVPR 2021
No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data
NIPS 2021
Towards Better Laplacian Representation in Reinforcement Learning with Generalized Graph Drawing
ICML 2021
CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Selection
ICML 2021
Domain Adaptation With Auxiliary Target Domain-Oriented Classifier
CVPR 2021
DANCE: A Deep Attentive Contour Model for Efficient Instance Segmentation
WACV 2021
Deep Interactive Thin Object Selection
WACV 2021
AutoSpace: Neural Architecture Search With Less Human Interference
ICCV 2021
Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet
ICCV 2021
Voxel Transformer for 3D Object Detection
ICCV 2021
PnP-DETR: Towards Efficient Visual Analysis With Transformers
ICCV 2021
Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning
NIPS 2021
Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond
NIPS 2021
All Tokens Matter: Token Labeling for Training Better Vision Transformers
NIPS 2021
Direct Multi-view Multi-person 3D Pose Estimation
NIPS 2021
Exploring Balanced Feature Spaces for Representation Learning
ICLR 2021
LV-BERT: Exploiting Layer Variety for BERT
IJCNLP 2021
A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation
ECCV 2020
Inference Stage Optimization for Cross-scenario 3D Human Pose Estimation
NIPS 2020
Improving Generalization in Reinforcement Learning with Mixture Regularization
NIPS 2020
Residual Distillation: Towards Portable Deep Neural Networks without Shortcuts
NIPS 2020
ConvBERT: Improving BERT with Span-based Dynamic Convolution
NIPS 2020
Towards Theoretically Understanding Why Sgd Generalizes Better Than Adam in Deep Learning
NIPS 2020
PPDM: Parallel Point Detection and Matching for Real-Time Human-Object Interaction Detection
CVPR 2020
Central Similarity Quantization for Efficient Image and Video Retrieval
CVPR 2020
Revisiting Knowledge Distillation via Label Smoothing Regularization
CVPR 2020
Strip Pooling: Rethinking Spatial Pooling for Scene Parsing
CVPR 2020
PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer
CVPR 2020
Overcoming Classifier Imbalance for Long-Tail Object Detection With Balanced Group Softmax
CVPR 2020
Boosting Few-Shot Learning With Adaptive Margin Loss
CVPR 2020
Improving Convolutional Networks With Self-Calibrated Convolutions
CVPR 2020
Rethinking Bottleneck Structure for Efficient Mobile Network Design
ECCV 2020
Adversarial Self-Supervised Learning for Semi-Supervised 3D Action Recognition
ECCV 2020
The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation
ECCV 2020
Query-efficient Meta Attack to Deep Neural Networks
ICLR 2020
Neural Epitome Search for Architecture-Agnostic Network Compression
ICLR 2020
Decoupling Representation and Classifier for Long-Tailed Recognition
ICLR 2020
ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning
ICLR 2020
On Robustness of Neural Ordinary Differential Equations
ICLR 2020
Do We Really Need to Access the Source Data? Source Hypothesis Transfer for Unsupervised Domain Adaptation
ICML 2020
Partial Order Pruning: For Best Speed/Accuracy Trade-Off in Neural Architecture Search
CVPR 2019
Efficient Meta Learning via Minibatch Proximal Update
NIPS 2019
Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization
AAAI 2019
MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input
ICCV 2019
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution
ICCV 2019
Dynamic Kernel Distillation for Efficient Pose Estimation in Videos
ICCV 2019
Single-Stage Multi-Person Pose Machines
ICCV 2019
Few-Shot Object Detection via Feature Reweighting
ICCV 2019
Foreground-Aware Pyramid Reconstruction for Alignment-Free Occluded Person Re-Identification
ICCV 2019
PANet: Few-Shot Image Semantic Segmentation With Prototype Alignment
ICCV 2019
Dynamic Feature Fusion for Semantic Edge Detection
IJCAI 2019
Learning to Localize Objects with Noisy Labeled Instances
AAAI 2019
Look across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition
AAAI 2019
Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds
AISTATS 2019
Multi-Prototype Networks for Unconstrained Set-based Face Recognition
IJCAI 2019
Graph-Based Global Reasoning Networks
CVPR 2019
Generalized Majorization-Minimization for Non-Convex Optimization
IJCAI 2019
Frame-Consistent Recurrent Video Deraining With Dual-Level Flow
CVPR 2019
A Simple Pooling-Based Design for Real-Time Salient Object Detection
CVPR 2019
Distilling Object Detectors With Fine-Grained Feature Imitation
CVPR 2019
Few-Shot Adaptive Faster R-CNN
CVPR 2019
Zigzag Learning for Weakly Supervised Object Detection
CVPR 2018
Left-Right Comparative Recurrent Model for Stereo Matching
CVPR 2018
Towards Pose Invariant Face Recognition in the Wild
CVPR 2018
Pose Partition Networks for Multi-Person Pose Estimation
ECCV 2018
ML-LocNet: Improving Object Localization with Multi-view Learning Network
ECCV 2018
Attention-aware Deep Adversarial Hashing for Cross-Modal Retrieval
ECCV 2018
Mutual Learning to Adapt for Joint Human Parsing and Pose Estimation
ECCV 2018
Dynamic Conditional Networks for Few-Shot Learning
ECCV 2018
Multi-Fiber Networks for Video Recognition
ECCV 2018
TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object Detection
ECCV 2018
Exact Low Tubal Rank Tensor Recovery from Gaussian Measurements
IJCAI 2018
Human Pose Estimation With Parsing Induced Learner
CVPR 2018
Deep Adversarial Subspace Clustering
CVPR 2018
Adversarial Complementary Learning for Weakly Supervised Object Localization
CVPR 2018
MoNet: Deep Motion Exploitation for Video Object Segmentation
CVPR 2018
Empirical Risk Landscape Analysis for Understanding Deep Neural Networks
ICLR 2018
WSNet: Compact and Efficient Networks Through Weight Sampling
ICML 2018
Policy Optimization with Demonstrations
ICML 2018
Understanding Generalization and Optimization Performance of Deep CNNs
ICML 2018
Efficient Stochastic Gradient Hard Thresholding
NIPS 2018
A^2-Nets: Double Attention Networks
NIPS 2018
New Insight into Hybrid Stochastic Gradient Descent: Beyond With-Replacement Sampling and Convexity
NIPS 2018
Sharing Residual Units Through Collective Tensor Factorization To Improve Deep Neural Networks
IJCAI 2018
3D-Aided Deep Pose-Invariant Face Recognition
IJCAI 2018
Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation
CVPR 2018
Learning Markov Clustering Networks for Scene Text Detection
CVPR 2018
Weakly Supervised Phrase Localization With Multi-Scale Anchored Transformer Network
CVPR 2018
Dual Path Networks
NIPS 2017
Multimodal Learning and Reasoning for Visual Question Answering
NIPS 2017
Predicting Scene Parsing and Motion Dynamics in the Future
NIPS 2017
Dual-Agent GANs for Photorealistic and Identity Preserving Profile Face Synthesis
NIPS 2017
Training Group Orthogonal Neural Networks with Privileged Information
IJCAI 2017
Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search
CVPR 2017
Deep Self-Taught Learning for Weakly Supervised Object Localization
CVPR 2017
Deep Joint Rain Detection and Removal From a Single Image
CVPR 2017
Outlier-Robust Tensor PCA
CVPR 2017
Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks
CVPR 2017
Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach
CVPR 2017
Online Robust Low-Rank Tensor Learning
IJCAI 2017
Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection
ICCV 2017
Interpretable Structure-Evolving LSTM
CVPR 2017
Learning Detection With Diverse Proposals
CVPR 2017
Video Scene Parsing With Predictive Feature Learning
ICCV 2017
Regional Interactive Image Segmentation Networks
ICCV 2017
FoveaNet: Perspective-Aware Urban Scene Parsing
ICCV 2017
Neural Person Search Machines
ICCV 2017
Perceptual Generative Adversarial Networks for Small Object Detection
CVPR 2017
Tree-Structured Reinforcement Learning for Sequential Object Localization
NIPS 2016
Semantic Object Parsing With Local-Global Long Short-Term Memory
CVPR 2016
Highway Vehicle Counting in Compressed Domain
CVPR 2016
Recurrent Face Aging
CVPR 2016
Recurrently Target-Attending Tracking
CVPR 2016
Reversible Recursive Instance-Level Object Segmentation
CVPR 2016
DrMAD: Distilling Reverse-Mode Automatic Differentiation for Optimizing Hyperparameters of Deep Neural Networks
IJCAI 2016
Deep Subspace Clustering with Sparsity Prior
IJCAI 2016
Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization
CVPR 2016
Natural Language Object Retrieval
CVPR 2016
Learning The Structure of Deep Convolutional Networks
ICCV 2015
Learning Scalable Discriminative Dictionary with Sample Relatedness
CVPR 2014
Robust Logistic Regression and Classification
NIPS 2014
Robust Subspace Segmentation with Block-diagonal Prior
CVPR 2014
Correlation Adaptive Subspace Segmentation by Trace Lasso
ICCV 2013
Online Robust PCA via Stochastic Optimization
NIPS 2013
Online PCA for Contaminated Data
NIPS 2013