Jingren Zhou
74 papers · 2012–2026 · 15 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (14) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) π Conference Polyglot (15)
π
Interdisciplinary Bridge
π
Cross-Pollinator
(5)
πΊοΈ
Taxonomy Completionist
(14)
π€
Dynamic Duo
(16)
π
Triple Crown
π§¬
Topic Evolution
π
Grand Slam
π₯
Mega-Team
(25)
π¬
Deep Specialist
(13)
π
Keyword Champion
(4)
π
Conference Pioneer
β‘
Prolific Year
(11)
ποΈ
Keyword Collector
(273)
π
Century Club
(71)
π₯
Unstoppable
(6)
π
Trend Setter
Conferences
ACL (14)
ICLR (11)
ICML (10)
NIPS (8)
CVPR (7)
EMNLP (6)
NSDI (4)
AAAI (3)
ICCV (2)
IJCNLP (2)
NAACL (2)
OSDI (2)
COLING (1)
IJCAI (1)
INTERSPEECH (1)
Top co-authors
Research topics
Keywords
large language model
(9)
model compression
(6)
image generation
(6)
diffusion model
(5)
image synthesis
(4)
image-text retrieval
(4)
contrastive learning
(3)
multimodal learning
(3)
foundation model
(3)
transfer learning
(3)
generative adversarial network
(3)
cross-modal retrieval
(2)
benchmark evaluation
(2)
multi-task learning
(2)
multi-modal learning
(2)
neural architecture search
(2)
instruction following
(2)
semantic segmentation
(2)
representation learning
(2)
image captioning
(2)
Papers
Evidence-Augmented Policy Optimization with Reward Co-Evolution for Long-Context Reasoning
ACL 2026
Nested Browser-Use Learning for Agentic Information Seeking
ACL 2026
Retrieval Heads are Dynamic
ACL 2026
AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations
COLING 2025
GenSim: A General Social Simulation Platform with Large Language Model based Agents
NAACL 2025
Data-Juicer Sandbox: A Feedback-Driven Suite for Multimodal Data-Model Co-development
ICML 2025
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
ICLR 2025
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models
ICLR 2025
Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference
ICLR 2025
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer
ICLR 2025
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
ICLR 2025
RMTBench: Benchmarking LLMs Through Multi-Turn User-Centric Role-Playing
EMNLP 2025
P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
EMNLP 2025
ProcessBench: Identifying Process Errors in Mathematical Reasoning
ACL 2025
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
ACL 2025
Self-Steering Optimization: Autonomous Preference Optimization for Large Language Models
ACL 2025
mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding
ACL 2025
The Lessons of Developing Process Reward Models in Mathematical Reasoning
ACL 2025
FPE2M2: Approaching Lossless and Efficient Quantization with Native Floating Point
ACL 2025
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
CVPR 2024
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition
ACL 2024
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension
ACL 2024
Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment
ACL 2024
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
CVPR 2024
Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
EMNLP 2024
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding
EMNLP 2024
#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models
ICLR 2024
Lipschitz Singularities in Diffusion Models
ICLR 2024
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism
ICML 2024
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models
NAACL 2024
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
ICML 2023
Cones: Concept Neurons in Diffusion Models for Customized Generation
ICML 2023
Dimensionality-Varying Diffusion Process
CVPR 2023
Devil Is in the Queries: Advancing Mask Transformers for Real-World Medical Image Segmentation and Out-of-Distribution Localization
CVPR 2023
Composer: Creative and Controllable Image Synthesis with Composable Conditions
ICML 2023
Neural Dependencies Emerging From Learning Massive Categories
CVPR 2023
ViM: Vision Middleware for Unified Downstream Transferring
ICCV 2023
CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans
ICCV 2023
RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training
CVPR 2023
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models
EMNLP 2023
LipFormer: High-Fidelity and Generalizable Talking Face Generation With a Pre-Learned Facial Codebook
CVPR 2023
Learned Index with Dynamic $\epsilon$
ICLR 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
NIPS 2023
MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for speech recognition
INTERSPEECH 2023
PASS: Patch Automatic Skip Scheme for Efficient Real-Time Video Perception on Edge Devices
AAAI 2023
FaceComposer: A Unified Model for Versatile Facial Content Creation
NIPS 2023
RLEG: Vision-Language Representation Learning with Diffusion-based Embedding Generation
ICML 2023
Customizable Image Synthesis with Multiple Subjects
NIPS 2023
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone
NIPS 2023
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections
EMNLP 2022
Reliable Adversarial Distillation with Unreliable Teachers
ICLR 2022
iFlood: A Stable and Effective Regularizer
ICLR 2022
Principled Knowledge Extrapolation with GANs
ICML 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
ICML 2022
Learning Relation Alignment for Calibrated Cross-modal Retrieval
ACL 2021
Uncertainty Principles of Encoding GANs
ICML 2021
Dynamic Memory based Attention Network for Sequential Recommendation
AAAI 2021
Low-Rank Subspaces in GANs
NIPS 2021
Enhancing E-commerce Recommender System Adaptability with Online Deep Controllable Learning-To-Rank
AAAI 2021
Learning to Rehearse in Long Sequence Memorization
ICML 2021
GAIA: A System for Interactive Analysis on Distributed Graphs Using a High-Level Language
NSDI 2021
Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation
ACL 2021
Learning Relation Alignment for Calibrated Cross-modal Retrieval
IJCNLP 2021
Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation
IJCNLP 2021
UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis
NIPS 2021
AdaBERT: Task-Adaptive BERT Compression with Differentiable Neural Architecture Search
IJCAI 2020
Learning Efficient Parameter Server Synchronization Policies for Distributed SGD
ICLR 2020
Learning to Mutate with Hypergradient Guided Population
NIPS 2020
StreamScope: Continuous Reliable Distributed Processing of Big Data Streams
NSDI 2016
Large-scale L-BFGS using MapReduce
NIPS 2014
Apollo: Scalable and Coordinated Scheduling for Cloud-Scale Computing
OSDI 2014
Optimizing Data Shuffling in Data-Parallel Computation by Understanding User-Defined Functions
NSDI 2012
Spotting Code Optimizations in Data-Parallel Pipelines through PeriSCOPE
OSDI 2012
Reoptimizing Data Parallel Computing
NSDI 2012