Rongrong Ji
209 papers · 2013–2025 · 12 top CS/AI conferences
Achievements
Keyword Pioneer
Hot Topic Early Bird
Taxonomy Completionist (14)
Interdisciplinary Bridge
Conference Polyglot (12)
Academic Marathon (12)
Conference Loyalist (24)
Dynamic Duo (45)
Triple Crown
Grand Slam
Topic Pioneer
Deep Specialist (21)
Topic Evolution
Keyword Champion (2)
Keyword Collector (641)
Prolific Year (25)
Century Club (209)
Trend Setter
Unstoppable (11)
Conference Pioneer
Conferences
CVPR (59)
ICCV (30)
ECCV (25)
AAAI (24)
NIPS (21)
ICML (19)
IJCAI (13)
ICLR (10)
EMNLP (3)
ACL (2)
COLING (2)
NAACL (1)
Keywords
model compression (23)
attention mechanism (14)
convolutional neural network (13)
semantic segmentation (12)
object detection (12)
neural network (10)
knowledge distillation (9)
person re-identification (9)
weakly supervised learning (9)
multimodal learning (8)
vision transformer (7)
neural architecture search (7)
image generation (7)
domain adaptation (6)
multimodal large language model (6)
image captioning (6)
feature learning (6)
contrastive learning (6)
diffusion model (6)
model quantization (5)
Papers
VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding
CVPR 2025
SVFR: A Unified Framework for Generalized Video Face Restoration
CVPR 2025
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression
CVPR 2025
Towards General Visual-Linguistic Face Forgery Detection
CVPR 2025
Monte Carlo Tree Search Based Prompt Autogeneration for Jailbreak Attacks against LLMs
COLING 2025
Automated Fine-Grained Mixture-of-Experts Quantization
ACL 2025
Training Long-Context LLMs Efficiently via Chunk-wise Optimization
ACL 2025
Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference
AAAI 2025
Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective
ICML 2025
GS-Bias: Global-Spatial Bias Learner for Single-Image Test-Time Adaptation of Vision-Language Models
ICML 2025
Dynamic Low-Rank Sparse Adaptation for Large Language Models
ICLR 2025
$\gamma$-MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models
ICLR 2025
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models
ICLR 2025
Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models
ICLR 2025
Enhancing Language Model Hypernetworks with Restart: A Study on Optimization
NAACL 2025
EasyInv: Toward Fast and Better DDIM Inversion
ICML 2025
BAME: Block-Aware Mask Evolution for Efficient N:M Sparse Training
ICML 2025
Polybasic Speculative Decoding Through a Theoretical Perspective
ICML 2025
FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification
ICML 2025
DS-VLM: Diffusion Supervision Vision Language Model
ICML 2025
Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective
ICML 2025
Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
ICLR 2025
Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models
ICCV 2025
From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning
ICCV 2025
OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography
ICCV 2025
Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive Segmentation
ICCV 2025
Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers
ICCV 2025
AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models
ICCV 2025
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models
ICML 2024
Cross-Modality Perturbation Synergy Attack for Person Re-identification
NIPS 2024
I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing
NIPS 2024
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models
NIPS 2024
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
NIPS 2024
DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion
NIPS 2024
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation
NIPS 2024
RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-Identification
NIPS 2024
Toward Open-Set Human Object Interaction Detection
AAAI 2024
Learning Image Demoiréing from Unpaired Real Data
AAAI 2024
MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization
COLING 2024
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything
CVPR 2024
PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
CVPR 2024
Aligning and Prompting Everything All at Once for Universal Visual Perception
CVPR 2024
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
CVPR 2024
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
CVPR 2024
UniPTS: A Unified Framework for Proficient Post-Training Sparsity
CVPR 2024
GraCo: Granularity-Controllable Interactive Segmentation
CVPR 2024
Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers
CVPR 2024
AccDiffusion: An Accurate Method for Higher-Resolution Image Generation
ECCV 2024
TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-Spoofing
ECCV 2024
Enhancing Tampered Text Detection through Frequency Feature Fusion and Decomposition
ECCV 2024
CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection
ECCV 2024
Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents
ECCV 2024
Multi-branch Collaborative Learning Network for 3D Visual Grounding
ECCV 2024
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model
ECCV 2024
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
ECCV 2024
AnyTrans: Translate AnyText in the Image with Large Scale Models
EMNLP 2024
Code Membership Inference for Detecting Unauthorized Data Use in Code Pre-trained Language Models
EMNLP 2024
AffineQuant: Affine Transformation Quantization for Large Language Models
ICLR 2024
Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs
ICLR 2024
Exploring Target Representations for Masked Autoencoders
ICLR 2024
Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity
ICML 2024
Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment
ICML 2024
Outlier-aware Slicing for Post-Training Quantization in Vision Transformer
ICML 2024
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
ICML 2024
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation
ICML 2024
CaM: Cache Merging for Memory-efficient LLMs Inference
ICML 2024
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
ICML 2024
ERQ: Error Reduction for Post-Training Quantization of Vision Transformers
ICML 2024
Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning
NIPS 2023
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models
NIPS 2023
CAPro: Webly Supervised Learning with Cross-modality Aligned Prototypes
NIPS 2023
Real-Time Image Demoiréing on Mobile Devices
ICLR 2023
Pseudo-label Alignment for Semi-supervised Instance Segmentation
ICCV 2023
AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration
ICCV 2023
DiffRate: Differentiable Compression Rate for Efficient Vision Transformers
ICCV 2023
Category-aware Allocation Transformer for Weakly Supervised Object Localization
ICCV 2023
InterFormer: Real-time Interactive Image Segmentation
ICCV 2023
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance
ICCV 2023
SMMix: Self-Motivated Image Mixing for Vision Transformers
ICCV 2023
Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle
ICCV 2023
Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective
CVPR 2023
Meta Architecture for Point Cloud Analysis
CVPR 2023
STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection
CVPR 2023
Clover: Towards a Unified Video-Language Alignment and Fusion Model
CVPR 2023
Discriminator-Cooperated Feature Map Distillation for GAN Compression
CVPR 2023
DistilPose: Tokenized Pose Regression With Heatmap Distillation
CVPR 2023
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension
CVPR 2023
Interactive Object Placement with Reinforcement Learning
ICML 2023
Improving Adversarial Robustness via Information Bottleneck Distillation
NIPS 2023
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension
CVPR 2023
You Only Segment Once: Towards Real-Time Panoptic Segmentation
CVPR 2023
OMPQ: Orthogonal Mixed Precision Quantization
AAAI 2023
CF-ViT: A General Coarse-to-Fine Method for Vision Transformer
AAAI 2023
Bi-directional Masks for Efficient N:M Sparse Training
ICML 2023
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models
NIPS 2023
Neural Architecture Search With Representation Mutual Information
CVPR 2022
Training-Free Transformer Architecture Search
CVPR 2022
IntraQ: Learning Synthetic Images With Intra-Class Heterogeneity for Zero-Shot Network Quantization
CVPR 2022
Learning to Learn Transferable Attack
AAAI 2022
DIFNet: Boosting Visual Information Flow for Image Captioning
CVPR 2022
Active Teacher for Semi-Supervised Object Detection
CVPR 2022
Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack
ECCV 2022
ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine Refinement
ECCV 2022
Fine-Grained Data Distribution Alignment for Post-Training Quantization
ECCV 2022
Privacy-Preserving Face Recognition with Learnable Privacy Budgets in Frequency Domain
ECCV 2022
An Information Theoretic Approach for Attention-Driven Face Forgery Detection
ECCV 2022
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation
ECCV 2022
Dynamic Dual Trainable Bounds for Ultra-Low Precision Super-Resolution Networks
ECCV 2022
ARM: Any-Time Super-Resolution Method
ECCV 2022
SeqTR: A Simple Yet Universal Network for Visual Grounding
ECCV 2022
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining
NIPS 2022
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach
NIPS 2022
Learning Best Combination for Efficient N:M Sparsity
NIPS 2022
Boosting Crowd Counting via Multifaceted Attention
CVPR 2022
Dual Contrastive Learning for General Face Forgery Detection
AAAI 2022
Aha! Adaptive History-Driven Attack for Decision-Based Black-Box Models
ICCV 2021
Parallel Detection-and-Segmentation Learning for Weakly Supervised Instance Segmentation
ICCV 2021
Occlude Them All: Occlusion-Aware Attention Network for Occluded Person Re-ID
ICCV 2021
Revisiting Discriminator in GAN Compression: A Generator-discriminator Cooperative Compression Scheme
NIPS 2021
Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation
ICCV 2021
Dual Distribution Alignment Network for Generalizable Person Re-Identification
AAAI 2021
Local Relation Learning for Face Forgery Detection
AAAI 2021
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
AAAI 2021
Dual-level Collaborative Transformer for Image Captioning
AAAI 2021
Toward Joint Thing-and-Stuff Mining for Weakly Supervised Panoptic Segmentation
CVPR 2021
HifiFace: 3D Shape and Semantic Prior Guided High Fidelity Face Swapping
IJCAI 2021
Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion
AAAI 2021
Domain General Face Forgery Detection by Learning to Weight
AAAI 2021
Towards Compact CNNs via Collaborative Compression
CVPR 2021
Discover Cross-Modality Nuances for Visible-Infrared Person Re-Identification
CVPR 2021
Image-to-Image Translation via Hierarchical Style Disentanglement
CVPR 2021
Beyond Max-Margin: Class Margin Equilibrium for Few-Shot Object Detection
CVPR 2021
Removing the Background by Adding the Background: Towards Background Robust Self-Supervised Video Representation Learning
CVPR 2021
RSTNet: Captioning With Adaptive Attention on Visual and Non-Visual Words
CVPR 2021
Towards Robustness Against Natural Language Word Substitutions
ICLR 2021
Architecture Disentanglement for Deep Neural Networks
ICCV 2021
TRAR: Routing the Attention Spans in Transformer for Visual Question Answering
ICCV 2021
ReCU: Reviving the Dead Weights in Binary Neural Networks
ICCV 2021
EC-DARTS: Inducing Equalized and Consistent Optimization Into DARTS
ICCV 2021
AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification
CVPR 2020
Filter Grafting for Deep Neural Networks
CVPR 2020
Rethinking Performance Estimation in Neural Architecture Search
CVPR 2020
Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
CVPR 2020
One-Shot Adversarial Attacks on Visual Tracking With Dual Attention
CVPR 2020
Noise-Aware Fully Webly Supervised Object Detection
CVPR 2020
Channel Pruning via Automatic Structure Search
IJCAI 2020
Asymmetric Co-Teaching for Unsupervised Cross-Domain Person Re-Identification
AAAI 2020
Fast Learning of Temporal Action Proposal via Dense Boundary Generator
AAAI 2020
Binarized Neural Architecture Search
AAAI 2020
Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning
AAAI 2020
Multiple Expert Brainstorming for Domain Adaptive Person Re-identification
ECCV 2020
Enabling Deep Residual Networks for Weakly Supervised Object Detection
ECCV 2020
Anti-Bandit Neural Architecture Search for Model Defense
ECCV 2020
API-Net: Robust Generative Classifier via a Single Discriminator
ECCV 2020
SSCGAN: Facial Attribute Editing via Style Skip Connections
ECCV 2020
Interpretable Neural Network Decoupling
ECCV 2020
PAMS: Quantized Super-Resolution via Parameterized Max Scale
ECCV 2020
Improving Face Recognition from Hard Samples via Distribution Distillation Loss
ECCV 2020
Rotated Binary Neural Network
NIPS 2020
UWSOD: Toward Fully-Supervised-Level Capacity Weakly Supervised Object Detection
NIPS 2020
HRank: Filter Pruning Using High-Rank Feature Map
CVPR 2020
Salience-Guided Cascaded Suppression Network for Person Re-Identification
CVPR 2020
Projection & Probability-Driven Black-Box Attack
CVPR 2020
Cogradient Descent for Bilinear Optimization
CVPR 2020
Siamese Box Adaptive Network for Visual Tracking
CVPR 2020
Learning Neural Bag-of-Matrix-Summarization with Riemannian Network
AAAI 2019
Scoot: A Perceptual Metric for Facial Sketches
ICCV 2019
Bayesian Optimized 1-Bit CNNs
ICCV 2019
Universal Adversarial Perturbation via Prior Driven Uncertainty Approximation
ICCV 2019
Multinomial Distribution Learning for Effective Neural Architecture Search
ICCV 2019
Variational Structured Semantic Inference for Diverse Image Captioning
NIPS 2019
Information Competing Process for Learning Diversified Representations
NIPS 2019
A Part Power Set Model for Scale-Free Person Retrieval
IJCAI 2019
FreeAnchor: Learning to Match Anchors for Visual Object Detection
NIPS 2019
Hypergraph Induced Convolutional Manifold Networks
IJCAI 2019
Generalized Zero-Shot Vehicle Detection in Remote Sensing Imagery via Coarse-to-Fine Framework
IJCAI 2019
Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation
CVPR 2019
Circulant Binary Convolutional Networks: Enhancing the Performance of 1-Bit DCNNs With Circulant Back Propagation
CVPR 2019
Towards Optimal Structured CNN Pruning via Generative Adversarial Learning
CVPR 2019
Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression
CVPR 2019
Towards Visual Feature Translation
CVPR 2019
Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training
CVPR 2019
Dynamic Capsule Attention for Visual Question Answering
AAAI 2019
Free VQA Models from Knowledge Inertia by Pairwise Inconformity Learning
AAAI 2019
Towards Optimal Fine Grained Retrieval via Decorrelated Centralized Loss with Normalize-Scale Layer
AAAI 2019
PVRNet: Point-View Relation Neural Network for 3D Shape Recognition
AAAI 2019
Towards Optimal Discrete Online Hashing with Balanced Similarity
AAAI 2019
Hypergraph Neural Networks
AAAI 2019
Universal Perturbation Attack Against Image Retrieval
ICCV 2019
Structured Modeling of Joint Deep Feature and Prediction Refinement for Salient Object Detection
ICCV 2019
Centralized Ranking Loss with Weakly Supervised Localization for Fine-Grained Object Retrieval
IJCAI 2018
Cross-Modality Person Re-Identification with Generative Adversarial Training
IJCAI 2018
Robust Face Sketch Synthesis via Generative Adversarial Fusion of Priors and Parametric Sigmoid
IJCAI 2018
GroupCap: Group-Based Image Captioning With Structured Relevance and Diversity Constraints
CVPR 2018
Accelerating Convolutional Networks via Global & Dynamic Filter Pruning
IJCAI 2018
Generative Adversarial Learning Towards Fast Weakly Supervised Detection
CVPR 2018
Modulated Convolutional Networks
CVPR 2018
GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition
CVPR 2018
Cross-Modality Binary Code Learning via Fusion Similarity Hashing
CVPR 2017
Supervised Matrix Factorization for Cross-Modality Hashing
IJCAI 2016
Towards Convolutional Neural Networks Compression via Global Error Reconstruction
IJCAI 2016
Variational Neural Discourse Relation Recognizer
EMNLP 2016
Top Rank Supervised Binary Coding for Visual Search
ICCV 2015
Towards 3D Object Detection With Bimodal Deep Boltzmann Machines Over RGBD Imagery
CVPR 2015
Modeling Inter- and Intra-Part Deformations for Object Structure Parsing
IJCAI 2015
Understanding Image Structure via Hierarchical Shape Parsing
CVPR 2015
Visual Reranking through Weakly Supervised Multi-graph Learning
ICCV 2013
Semi-Supervised Learning with Manifold Fitted Graphs
IJCAI 2013
Label Propagation from ImageNet to 3D Point Clouds
CVPR 2013