Jiaya Jia
166 papers · 2013–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
🌍 Conference Polyglot (10) 🏃 Academic Marathon (12) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird
🐣
Hot Topic Early Bird
🧭
Keyword Pioneer
🏃
Academic Marathon
(12)
🌟
Keyword Trendsetter Combo
(4)
🏠
Conference Loyalist
(79)
🏆
Grand Slam
🌱
Topic Pioneer
🔬
Deep Specialist
(29)
🧬
Topic Evolution
🏆
Keyword Champion
🤝
Dynamic Duo
(42)
⚡
Prolific Year
(13)
❓
The Questioner
💎
Century Club
(163)
🔥
Unstoppable
(13)
📈
Trend Setter
🚀
Conference Pioneer
🗃️
Keyword Collector
(596)
Conferences
CVPR (79)
ICCV (44)
ECCV (17)
NIPS (11)
AAAI (3)
ICLR (3)
ICML (3)
ACL (2)
EMNLP (2)
COLING (1)
IJCAI (1)
Top co-authors
Keywords
semantic segmentation
(29)
object detection
(18)
point cloud
(17)
convolutional neural network
(15)
image restoration
(10)
3d object detection
(10)
3d vision
(9)
instance segmentation
(9)
large language model
(7)
image generation
(6)
contrastive learning
(6)
image segmentation
(5)
image synthesis
(5)
representation learning
(5)
diffusion model
(5)
attention mechanism
(5)
semi-supervised learning
(5)
feature extraction
(5)
generative model
(5)
depth estimation
(4)
Papers
TraveLLaMA: A Multimodal Travel Assistant with Large-Scale Dataset and Structured Reasoning
AAAI 2026
TRAC: Teacher-Guided Token Reward with Adaptive Calibration for Robust Policy Optimization
ACL 2026
SearchGym: Bootstrapping Real-World Search Agents via Cost-Effective and High-Fidelity Environment Simulation
ACL 2026
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition
ICCV 2025
Mixture-of-Scores: Robust Image-Text Data Valuation via Three Lines of Code
ICCV 2025
MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers
ICCV 2025
Generative Video Propagation
CVPR 2025
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization
ICML 2025
Enhancing LLM Knowledge Learning through Generalization
EMNLP 2025
Logits-Based Finetuning
EMNLP 2025
QuickLLaMA: Query-aware Inference Acceleration for Large Language Models
COLING 2025
GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning
ICML 2025
VisionZip: Longer is Better but Not Necessary in Vision Language Models
CVPR 2025
MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation
ICLR 2025
DreamOmni: Unified Image Generation and Editing
CVPR 2025
Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?
ICCV 2025
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
ECCV 2024
Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models
ECCV 2024
Video-P2P: Video Editing with Cross-attention Control
CVPR 2024
OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation
CVPR 2024
LISA: Reasoning Segmentation via Large Language Model
CVPR 2024
SaCo Loss: Sample-wise Affinity Consistency for Vision-Language Pre-training
CVPR 2024
GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
CVPR 2024
Unified Language-driven Zero-shot Domain Adaptation
CVPR 2024
Prompt Highlighter: Interactive Control for Multi-Modal LLMs
CVPR 2024
Scalable Language Model with Generalized Continual Learning
ICLR 2024
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
ICLR 2024
MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs
NIPS 2024
RL-GPT: Integrating Reinforcement Learning and Code-as-policy
NIPS 2024
LogoSticker: Inserting Logos into Diffusion Models for Customized Generation
ECCV 2024
LLMGA: Multimodal Large Language Model based Generation Assistant
ECCV 2024
TriVol: Point Cloud Rendering via Triple Volumes
CVPR 2023
Rethinking Out-of-Distribution (OOD) Detection: Masked Image Modeling Is All You Need
CVPR 2023
Point2Pix: Photo-Realistic Point Cloud Rendering via Neural Radiance Fields
CVPR 2023
Spherical Transformer for LiDAR-Based 3D Recognition
CVPR 2023
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking
CVPR 2023
Understanding Imbalanced Semantic Segmentation Through Neural Collapse
CVPR 2023
Deep Parametric 3D Filters for Joint Video Denoising and Illumination Enhancement in Video Super Resolution
AAAI 2023
High Quality Entity Segmentation
ICCV 2023
FocalFormer3D: Focusing on Hard Instance for 3D Object Detection
ICCV 2023
End-to-end 3D Tracking with Decoupled Queries
ICCV 2023
Mask-Attention-Free Transformer for 3D Instance Segmentation
ICCV 2023
Removing Anomalies as Noises for Industrial Defect Localization
ICCV 2023
Command-Driven Articulated Object Understanding and Manipulation
CVPR 2023
Hierarchical Dense Correlation Distillation for Few-Shot Segmentation
CVPR 2023
Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization
CVPR 2023
LargeKernel3D: Scaling Up Kernels in 3D Sparse CNNs
CVPR 2023
Real-World Image Variation by Aligning Diffusion Inversion Chain
NIPS 2023
DiffComplete: Diffusion-based Generative 3D Shape Completion
NIPS 2023
Learning Context-Aware Classifier for Semantic Segmentation
AAAI 2023
EfficientNeRF Efficient Neural Radiance Fields
CVPR 2022
Stratified Transformer for 3D Point Cloud Segmentation
CVPR 2022
SNR-Aware Low-Light Image Enhancement
CVPR 2022
High Quality Segmentation for Ultra High-Resolution Images
CVPR 2022
Multi-View Transformer for 3D Visual Grounding
CVPR 2022
Unifying Voxel-based Representation with Transformer for 3D Object Detection
NIPS 2022
Voxel Field Fusion for 3D Object Detection
CVPR 2022
MAT: Mask-Aware Transformer for Large Hole Image Inpainting
CVPR 2022
Focal Sparse Convolutional Networks for 3D Object Detection
CVPR 2022
Tracking Objects As Pixel-Wise Distributions
ECCV 2022
CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation
ECCV 2022
DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation
ECCV 2022
Video Frame Interpolation With Transformer
CVPR 2022
A Unified Query-Based Paradigm for Point Cloud Understanding
CVPR 2022
TWIST: Two-Way Inter-Label Self-Training for Semi-Supervised 3D Instance Segmentation
CVPR 2022
Generalized Few-Shot Semantic Segmentation
CVPR 2022
Seeing Dynamic Scene in the Dark: A High-Quality Video Dataset With Mechatronic Alignment
ICCV 2021
Blending Anti-Aliasing into Vision Transformer
NIPS 2021
Distilling Knowledge via Knowledge Review
CVPR 2021
Semi-Supervised Semantic Segmentation With Directional Context-Aware Consistency
CVPR 2021
Jigsaw Clustering for Unsupervised Visual Representation Learning
CVPR 2021
Self-Supervised 3D Mesh Reconstruction From Single Images
CVPR 2021
Multi-Scale Aligned Distillation for Low-Resolution Detection
CVPR 2021
Scale-Aware Automatic Augmentation for Object Detection
CVPR 2021
Fully Convolutional Networks for Panoptic Segmentation
CVPR 2021
MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution
CVPR 2021
Bidirectional Projection Network for Cross Dimension Scene Understanding
CVPR 2021
Improving Calibration for Long-Tailed Recognition
CVPR 2021
Image Synthesis via Semantic Composition
ICCV 2021
Guided Point Contrastive Learning for Semi-Supervised Point Cloud Semantic Segmentation
ICCV 2021
Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation
ICCV 2021
Deep Structured Instance Graph for Distilling Object Detectors
ICCV 2021
Video Instance Segmentation With a Propose-Reduce Paradigm
ICCV 2021
Learnable Boundary Guided Adversarial Training
ICCV 2021
Point Transformer
ICCV 2021
Parametric Contrastive Learning
ICCV 2021
Particularity beyond Commonality: Unpaired Identity Transfer with Multiple References
ECCV 2020
MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution
ECCV 2020
CN: Channel Normalization For Point Cloud Recognition
ECCV 2020
Memory Selection Network for Video Propagation
ECCV 2020
VCNet: A Robust Approach to Blind Image Inpainting
ECCV 2020
LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-resolution and Beyond
NIPS 2020
Exploring Self-Attention for Image Recognition
CVPR 2020
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
CVPR 2020
Attentive Normalization for Conditional Image Generation
CVPR 2020
Domain Adaptive Image-to-Image Translation
CVPR 2020
3DSSD: Point-Based 3D Single Stage Object Detector
CVPR 2020
DSGN: Deep Stereo Geometry Network for 3D Object Detection
CVPR 2020
Associatively Segmenting Instances and Semantics in Point Clouds
CVPR 2019
STD: Sparse-to-Dense 3D Object Detector for Point Cloud
ICCV 2019
Aggregation via Separation: Boosting Facial Landmark Detector With Semi-Supervised Style Translation
ICCV 2019
AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation
ICCV 2019
Attribute-Driven Spontaneous Motion in Unpaired Image Translation
ICCV 2019
Fast and Practical Neural Architecture Search
ICCV 2019
View Independent Generative Adversarial Network for Novel View Synthesis
ICCV 2019
Fast Point R-CNN
ICCV 2019
Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation
ICCV 2019
Wide-Context Semantic Image Extrapolation
CVPR 2019
PointWeb: Enhancing Local Neighborhood Features for Point Cloud Processing
CVPR 2019
3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis
CVPR 2019
Semantic Component Decomposition for Face Attribute Manipulation
CVPR 2019
Learning Shape-Aware Embedding for Scene Text Detection
CVPR 2019
Dynamic Scene Deblurring With Parameter Selective Sharing and Nested Skip Connections
CVPR 2019
Amodal Instance Segmentation With KINS Dataset
CVPR 2019
Underexposed Photo Enhancement Using Deep Illumination Estimation
CVPR 2019
Homomorphic Latent Space Interpolation for Unpaired Image-To-Image Translation
CVPR 2019
Semi-Parametric Image Synthesis
CVPR 2018
ICNet for Real-Time Semantic Segmentation on High-Resolution Images
ECCV 2018
SegStereo: Exploiting Semantic Information for Disparity Estimation
ECCV 2018
Image Inpainting via Generative Multi-column Convolutional Neural Networks
NIPS 2018
Sequential Context Encoding for Duplicate Removal
NIPS 2018
Facelet-Bank for Fast Portrait Manipulation
CVPR 2018
GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation
CVPR 2018
PSANet: Point-wise Spatial Attention Network for Scene Parsing
ECCV 2018
Compositing-aware Image Search
ECCV 2018
GAL: Geometric Adversarial Loss for Single-View 3D-Object Reconstruction
ECCV 2018
Referring Image Segmentation via Recurrent Refinement Networks
CVPR 2018
Scale-Recurrent Network for Deep Image Deblurring
CVPR 2018
Path Aggregation Network for Instance Segmentation
CVPR 2018
Situation Recognition With Graph Neural Networks
ICCV 2017
Pyramid Scene Parsing Network
CVPR 2017
Zero-Order Reverse Filtering
ICCV 2017
Unsupervised Learning of Stereo Matching
ICCV 2017
High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits
ICCV 2017
SGN: Sequential Grouping Networks for Instance Segmentation
ICCV 2017
Detail-Revealing Deep Video Super-Resolution
ICCV 2017
Makeup-Go: Blind Reversion of Portrait Edit
ICCV 2017
3D Graph Neural Networks for RGBD Semantic Segmentation
ICCV 2017
Visual Question Answering with Question Representation Update (QRU)
NIPS 2016
Multi-Scale Patch Aggregation (MPA) for Simultaneous Detection and Segmentation
CVPR 2016
ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation
CVPR 2016
Video Super-Resolution via Deep Draft-Ensemble Learning
ICCV 2015
Understanding and Diagnosing Visual Tracking Systems
ICCV 2015
Semantic Segmentation With Object Clique Potential
ICCV 2015
Box Aggregation for Proposal Decimation: Last Mile of Object Detection
ICCV 2015
Deep Edge-Aware Filters
ICML 2015
Handling Motion Blur in Multi-Frame Super-Resolution
CVPR 2015
Deep LAC: Deep Localization, Alignment and Classification for Fine-Grained Recognition
CVPR 2015
Just Noticeable Defocus Blur Detection and Estimation
CVPR 2015
Contour Box: Rejecting Object Proposals Without Explicit Closed Contours
ICCV 2015
Mutual-Structure for Joint Filtering
ICCV 2015
Two-Class Weather Classification
CVPR 2014
Deep Convolutional Neural Network for Image Deconvolution
NIPS 2014
Learning Important Spatial Pooling Regions for Scene Classification
CVPR 2014
Discriminative Blur Detection Features
CVPR 2014
100+ Times Faster Weighted Median Filter (WMF)
CVPR 2014
L0 Regularized Stationary Time Estimation for Crowd Group Analysis
CVPR 2014
Range-Sample Depth Feature for Action Recognition
CVPR 2014
Hierarchical Saliency Detection
CVPR 2013
Online Robust Dictionary Learning
CVPR 2013
SCMF: Sparse Covariance Matrix Factorization for Collaborative Filtering
IJCAI 2013
Abnormal Event Detection at 150 FPS in MATLAB
ICCV 2013
Cross-Field Joint Image Restoration via Scale Map
ICCV 2013
Forward Motion Deblurring
ICCV 2013
CoDeL: A Human Co-detection and Labeling Framework
ICCV 2013
Unnatural L0 Sparse Representation for Natural Image Deblurring
CVPR 2013