conftrace_

Rongrong Ji

209 papers · 2013–2025 · 12 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (14) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (12)

🏃 Academic Marathon (12) 🏠 Conference Loyalist (24) 🤝 Dynamic Duo (45) 👑 Triple Crown 🏆 Grand Slam 🌱 Topic Pioneer 🔬 Deep Specialist (21) 🧬 Topic Evolution 🏆 Keyword Champion (2) 🗃️ Keyword Collector (641) ⚡ Prolific Year (25) 💎 Century Club (209) 📈 Trend Setter 🔥 Unstoppable (11) 🚀 Conference Pioneer

Conferences

CVPR (59) ICCV (30) ECCV (25) AAAI (24) NIPS (21) ICML (19) IJCAI (13) ICLR (10) EMNLP (3) ACL (2) COLING (2) NAACL (1)

Top co-authors

Xiaoshuai Sun (45) Yongjian Wu (34) Xiawu Zheng (32) Feiyue Huang (30) Liujuan Cao (30) Baochang Zhang (25) Yunhang Shen (25) Fei Chao (24) Mingbao Lin (22) Yiyi Zhou (21)

Research topics

Computer Vision (1)

Keywords

model compression (23) attention mechanism (14) convolutional neural network (13) semantic segmentation (12) object detection (12) neural network (10) knowledge distillation (9) person re-identification (9) weakly supervised learning (9) multimodal learning (8) vision transformer (7) neural architecture search (7) image generation (7) domain adaptation (6) multimodal large language model (6) image captioning (6) feature learning (6) contrastive learning (6) diffusion model (6) model quantization (5)

Papers

VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding CVPR 2025
SVFR: A Unified Framework for Generalized Video Face Restoration CVPR 2025
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression CVPR 2025
Towards General Visual-Linguistic Face Forgery Detection CVPR 2025
Monte Carlo Tree Search Based Prompt Autogeneration for Jailbreak Attacks against LLMs COLING 2025
Automated Fine-Grained Mixture-of-Experts Quantization ACL 2025
Training Long-Context LLMs Efficiently via Chunk-wise Optimization ACL 2025
Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference AAAI 2025
Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective ICML 2025
GS-Bias: Global-Spatial Bias Learner for Single-Image Test-Time Adaptation of Vision-Language Models ICML 2025
Dynamic Low-Rank Sparse Adaptation for Large Language Models ICLR 2025
γ-MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models ICLR 2025
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models ICLR 2025
Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models ICLR 2025
Enhancing Language Model Hypernetworks with Restart: A Study on Optimization NAACL 2025
EasyInv: Toward Fast and Better DDIM Inversion ICML 2025
BAME: Block-Aware Mask Evolution for Efficient N:M Sparse Training ICML 2025
Polybasic Speculative Decoding Through a Theoretical Perspective ICML 2025
FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification ICML 2025
DS-VLM: Diffusion Supervision Vision Language Model ICML 2025
Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective ICML 2025
Learning Interleaved Image-Text Comprehension in Vision-Language Large Models ICLR 2025
Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models ICCV 2025
From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning ICCV 2025
OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography ICCV 2025
Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive Segmentation ICCV 2025
Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers ICCV 2025
AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models ICCV 2025
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models ICML 2024
Cross-Modality Perturbation Synergy Attack for Person Re-identification NIPS 2024
I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing NIPS 2024
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models NIPS 2024
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text NIPS 2024
DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion NIPS 2024
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation NIPS 2024
RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-Identification NIPS 2024
Toward Open-Set Human Object Interaction Detection AAAI 2024
Learning Image Demoiréing from Unpaired Real Data AAAI 2024
MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization COLING 2024
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything CVPR 2024
PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization CVPR 2024
Aligning and Prompting Everything All at Once for Universal Visual Perception CVPR 2024
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model CVPR 2024
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation CVPR 2024
UniPTS: A Unified Framework for Proficient Post-Training Sparsity CVPR 2024
GraCo: Granularity-Controllable Interactive Segmentation CVPR 2024
Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers CVPR 2024
AccDiffusion: An Accurate Method for Higher-Resolution Image Generation ECCV 2024
TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-Spoofing ECCV 2024
Enhancing Tampered Text Detection through Frequency Feature Fusion and Decomposition ECCV 2024
CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection ECCV 2024
Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents ECCV 2024
Multi-branch Collaborative Learning Network for 3D Visual Grounding ECCV 2024
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model ECCV 2024
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation ECCV 2024
AnyTrans: Translate AnyText in the Image with Large Scale Models EMNLP 2024
Code Membership Inference for Detecting Unauthorized Data Use in Code Pre-trained Language Models EMNLP 2024
AffineQuant: Affine Transformation Quantization for Large Language Models ICLR 2024
Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs ICLR 2024
Exploring Target Representations for Masked Autoencoders ICLR 2024
Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity ICML 2024
Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment ICML 2024
Outlier-aware Slicing for Post-Training Quantization in Vision Transformer ICML 2024
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation ICML 2024
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation ICML 2024
CaM: Cache Merging for Memory-efficient LLMs Inference ICML 2024
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization ICML 2024
ERQ: Error Reduction for Post-Training Quantization of Vision Transformers ICML 2024
Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning NIPS 2023
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models NIPS 2023
CAPro: Webly Supervised Learning with Cross-modality Aligned Prototypes NIPS 2023
Real-Time Image Demoiréing on Mobile Devices ICLR 2023
Pseudo-label Alignment for Semi-supervised Instance Segmentation ICCV 2023
AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration ICCV 2023
DiffRate: Differentiable Compression Rate for Efficient Vision Transformers ICCV 2023
Category-aware Allocation Transformer for Weakly Supervised Object Localization ICCV 2023
InterFormer: Real-time Interactive Image Segmentation ICCV 2023
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance ICCV 2023
SMMix: Self-Motivated Image Mixing for Vision Transformers ICCV 2023
Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle ICCV 2023
Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective CVPR 2023
Meta Architecture for Point Cloud Analysis CVPR 2023
STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection CVPR 2023
Clover: Towards a Unified Video-Language Alignment and Fusion Model CVPR 2023
Discriminator-Cooperated Feature Map Distillation for GAN Compression CVPR 2023
DistilPose: Tokenized Pose Regression With Heatmap Distillation CVPR 2023
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension CVPR 2023
Interactive Object Placement with Reinforcement Learning ICML 2023
Improving Adversarial Robustness via Information Bottleneck Distillation NIPS 2023
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension CVPR 2023
You Only Segment Once: Towards Real-Time Panoptic Segmentation CVPR 2023
OMPQ: Orthogonal Mixed Precision Quantization AAAI 2023
CF-ViT: A General Coarse-to-Fine Method for Vision Transformer AAAI 2023
Bi-directional Masks for Efficient N:M Sparse Training ICML 2023
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models NIPS 2023
Neural Architecture Search With Representation Mutual Information CVPR 2022
Training-Free Transformer Architecture Search CVPR 2022
IntraQ: Learning Synthetic Images With Intra-Class Heterogeneity for Zero-Shot Network Quantization CVPR 2022
Learning to Learn Transferable Attack AAAI 2022
DIFNet: Boosting Visual Information Flow for Image Captioning CVPR 2022
Active Teacher for Semi-Supervised Object Detection CVPR 2022
Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack ECCV 2022
ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine Refinement ECCV 2022
Fine-Grained Data Distribution Alignment for Post-Training Quantization ECCV 2022
Privacy-Preserving Face Recognition with Learnable Privacy Budgets in Frequency Domain ECCV 2022
An Information Theoretic Approach for Attention-Driven Face Forgery Detection ECCV 2022
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation ECCV 2022
Dynamic Dual Trainable Bounds for Ultra-Low Precision Super-Resolution Networks ECCV 2022
ARM: Any-Time Super-Resolution Method ECCV 2022
SeqTR: A Simple Yet Universal Network for Visual Grounding ECCV 2022
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining NIPS 2022
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach NIPS 2022
Learning Best Combination for Efficient N:M Sparsity NIPS 2022
Boosting Crowd Counting via Multifaceted Attention CVPR 2022
Dual Contrastive Learning for General Face Forgery Detection AAAI 2022
Aha! Adaptive History-Driven Attack for Decision-Based Black-Box Models ICCV 2021
Parallel Detection-and-Segmentation Learning for Weakly Supervised Instance Segmentation ICCV 2021
Occlude Them All: Occlusion-Aware Attention Network for Occluded Person Re-ID ICCV 2021
Revisiting Discriminator in GAN Compression: A Generator-discriminator Cooperative Compression Scheme NIPS 2021
Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation ICCV 2021
Dual Distribution Alignment Network for Generalizable Person Re-Identification AAAI 2021
Local Relation Learning for Face Forgery Detection AAAI 2021
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network AAAI 2021
Dual-level Collaborative Transformer for Image Captioning AAAI 2021
Toward Joint Thing-and-Stuff Mining for Weakly Supervised Panoptic Segmentation CVPR 2021
HifiFace: 3D Shape and Semantic Prior Guided High Fidelity Face Swapping IJCAI 2021
Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion AAAI 2021
Domain General Face Forgery Detection by Learning to Weight AAAI 2021
Towards Compact CNNs via Collaborative Compression CVPR 2021
Discover Cross-Modality Nuances for Visible-Infrared Person Re-Identification CVPR 2021
Image-to-Image Translation via Hierarchical Style Disentanglement CVPR 2021
Beyond Max-Margin: Class Margin Equilibrium for Few-Shot Object Detection CVPR 2021
Removing the Background by Adding the Background: Towards Background Robust Self-Supervised Video Representation Learning CVPR 2021
RSTNet: Captioning With Adaptive Attention on Visual and Non-Visual Words CVPR 2021
Towards Robustness Against Natural Language Word Substitutions ICLR 2021
Architecture Disentanglement for Deep Neural Networks ICCV 2021
TRAR: Routing the Attention Spans in Transformer for Visual Question Answering ICCV 2021
ReCU: Reviving the Dead Weights in Binary Neural Networks ICCV 2021
EC-DARTS: Inducing Equalized and Consistent Optimization Into DARTS ICCV 2021
AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification CVPR 2020
Filter Grafting for Deep Neural Networks CVPR 2020
Rethinking Performance Estimation in Neural Architecture Search CVPR 2020
Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation CVPR 2020
One-Shot Adversarial Attacks on Visual Tracking With Dual Attention CVPR 2020
Noise-Aware Fully Webly Supervised Object Detection CVPR 2020
Channel Pruning via Automatic Structure Search IJCAI 2020
Asymmetric Co-Teaching for Unsupervised Cross-Domain Person Re-Identification AAAI 2020
Fast Learning of Temporal Action Proposal via Dense Boundary Generator AAAI 2020
Binarized Neural Architecture Search AAAI 2020
Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning AAAI 2020
Multiple Expert Brainstorming for Domain Adaptive Person Re-identification ECCV 2020
Enabling Deep Residual Networks for Weakly Supervised Object Detection ECCV 2020
Anti-Bandit Neural Architecture Search for Model Defense ECCV 2020
API-Net: Robust Generative Classifier via a Single Discriminator ECCV 2020
SSCGAN: Facial Attribute Editing via Style Skip Connections ECCV 2020
Interpretable Neural Network Decoupling ECCV 2020
PAMS: Quantized Super-Resolution via Parameterized Max Scale ECCV 2020
Improving Face Recognition from Hard Samples via Distribution Distillation Loss ECCV 2020
Rotated Binary Neural Network NIPS 2020
UWSOD: Toward Fully-Supervised-Level Capacity Weakly Supervised Object Detection NIPS 2020
HRank: Filter Pruning Using High-Rank Feature Map CVPR 2020
Salience-Guided Cascaded Suppression Network for Person Re-Identification CVPR 2020
Projection & Probability-Driven Black-Box Attack CVPR 2020
Cogradient Descent for Bilinear Optimization CVPR 2020
Siamese Box Adaptive Network for Visual Tracking CVPR 2020
Learning Neural Bag-of-Matrix-Summarization with Riemannian Network AAAI 2019
Scoot: A Perceptual Metric for Facial Sketches ICCV 2019
Bayesian Optimized 1-Bit CNNs ICCV 2019
Universal Adversarial Perturbation via Prior Driven Uncertainty Approximation ICCV 2019
Multinomial Distribution Learning for Effective Neural Architecture Search ICCV 2019
Variational Structured Semantic Inference for Diverse Image Captioning NIPS 2019
Information Competing Process for Learning Diversified Representations NIPS 2019
A Part Power Set Model for Scale-Free Person Retrieval IJCAI 2019
FreeAnchor: Learning to Match Anchors for Visual Object Detection NIPS 2019
Hypergraph Induced Convolutional Manifold Networks IJCAI 2019
Generalized Zero-Shot Vehicle Detection in Remote Sensing Imagery via Coarse-to-Fine Framework IJCAI 2019
Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation CVPR 2019
Circulant Binary Convolutional Networks: Enhancing the Performance of 1-Bit DCNNs With Circulant Back Propagation CVPR 2019
Towards Optimal Structured CNN Pruning via Generative Adversarial Learning CVPR 2019
Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression CVPR 2019
Towards Visual Feature Translation CVPR 2019
Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training CVPR 2019
Dynamic Capsule Attention for Visual Question Answering AAAI 2019
Free VQA Models from Knowledge Inertia by Pairwise Inconformity Learning AAAI 2019
Towards Optimal Fine Grained Retrieval via Decorrelated Centralized Loss with Normalize-Scale Layer AAAI 2019
PVRNet: Point-View Relation Neural Network for 3D Shape Recognition AAAI 2019
Towards Optimal Discrete Online Hashing with Balanced Similarity AAAI 2019
Hypergraph Neural Networks AAAI 2019
Universal Perturbation Attack Against Image Retrieval ICCV 2019
Structured Modeling of Joint Deep Feature and Prediction Refinement for Salient Object Detection ICCV 2019
Centralized Ranking Loss with Weakly Supervised Localization for Fine-Grained Object Retrieval IJCAI 2018
Cross-Modality Person Re-Identification with Generative Adversarial Training IJCAI 2018
Robust Face Sketch Synthesis via Generative Adversarial Fusion of Priors and Parametric Sigmoid IJCAI 2018
GroupCap: Group-Based Image Captioning With Structured Relevance and Diversity Constraints CVPR 2018
Accelerating Convolutional Networks via Global & Dynamic Filter Pruning IJCAI 2018
Generative Adversarial Learning Towards Fast Weakly Supervised Detection CVPR 2018
Modulated Convolutional Networks CVPR 2018
GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition CVPR 2018
Cross-Modality Binary Code Learning via Fusion Similarity Hashing CVPR 2017
Supervised Matrix Factorization for Cross-Modality Hashing IJCAI 2016
Towards Convolutional Neural Networks Compression via Global Error Reconstruction IJCAI 2016
Variational Neural Discourse Relation Recognizer EMNLP 2016
Top Rank Supervised Binary Coding for Visual Search ICCV 2015
Towards 3D Object Detection With Bimodal Deep Boltzmann Machines Over RGBD Imagery CVPR 2015
Modeling Inter- and Intra-Part Deformations for Object Structure Parsing IJCAI 2015
Understanding Image Structure via Hierarchical Shape Parsing CVPR 2015
Visual Reranking through Weakly Supervised Multi-graph Learning ICCV 2013
Semi-Supervised Learning with Manifold Fitted Graphs IJCAI 2013
Label Propagation from ImageNet to 3D Point Clouds CVPR 2013