Zicheng Zhang
37 papers · 2022–2026 · 9 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (9)
Cross-Pollinator (5)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (5)
Taxonomy Completionist (60)
Grand Slam
Dynamic Duo (17)
Triple Crown
Deep Specialist (12)
The Questioner
Century Club (32)
Prolific Year (16)
Trend Setter
Keyword Collector (140)
Unstoppable (5)
Conferences
CVPR (10)
AAAI (6)
NIPS (5)
ACL (4)
ICML (4)
ICCV (3)
ECCV (2)
ICLR (2)
IJCAI (1)
Top co-authors
Research topics
Keywords
video generation (4)
multimodal large language model (4)
large multimodal model (4)
benchmark evaluation (4)
image quality assessment (3)
video quality assessment (3)
visual perception (2)
multi-modal large language model (2)
neural radiance field (2)
state space model (2)
semantic segmentation (2)
one-shot learning (2)
diffusion model (2)
style transfer (2)
image generation (2)
text-to-image generation (2)
multimodal learning (2)
multi-modal learning (2)
3d vision (2)
instruction tuning (2)
Papers
GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models
AAAI 2026
Market-Bench: Benchmarking Large Language Models on Economic and Trade Competition
ACL 2026
MedOmni-45°: A Safety–Performance Benchmark for Reasoning-Oriented LLMs in Medicine
AAAI 2026
Refine-IQA: Multi-Stage Reinforcement Finetuning for Perceptual Image Quality Assessment
AAAI 2026
Scaling-up Perceptual Video Quality Assessment
AAAI 2026
CamPoint: Boosting Point Cloud Segmentation with Virtual Camera
CVPR 2025
StyO: Stylize Your Face in Only One-Shot
AAAI 2025
Redundancy Principles for MLLMs Benchmarks
ACL 2025
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference
ACL 2025
Beyond Logits: Aligning Feature Dynamics for Effective Knowledge Distillation
ACL 2025
Q-Bench-Video: Benchmark the Video Quality Understanding of LMMs
CVPR 2025
Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing
CVPR 2025
CoSER: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation
CVPR 2025
Image Quality Assessment: From Human to Machine Preference
CVPR 2025
Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content
CVPR 2025
Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads
ICCV 2025
Information Density Principle for MLLM Benchmarks
ICCV 2025
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLMs
ICCV 2025
A-Bench: Are LMMs Masters at Evaluating AI-generated Images?
ICLR 2025
Feature out! Let Raw Image as Your Condition for Blind Face Restoration
ICML 2025
MIRROR: Make Your Object-Level Multi-View Generation More Consistent with Training-Free Rectification
ICML 2025
Towards Open-ended Visual Quality Comparison
ECCV 2024
Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision
ICLR 2024
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
CVPR 2024
Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models
CVPR 2024
Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare
NIPS 2024
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
ICML 2024
Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
ICML 2024
BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering
ECCV 2024
GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
NIPS 2024
Towards Consistent Video Editing with Text-to-Image Diffusion Models
NIPS 2023
MM-PCQA: Multi-Modal Learning for No-reference Point Cloud Quality Assessment
IJCAI 2023
Transforming Radiance Field With Lipschitz Network for Photorealistic 3D Scene Stylization
CVPR 2023
MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos
CVPR 2023
PetsGAN: Rethinking Priors for Single Image Generation
AAAI 2022
CoupAlign: Coupling Word-Pixel with Sentence-Mask Alignments for Referring Image Segmentation
NIPS 2022
Generalized One-shot Domain Adaptation of Generative Adversarial Networks
NIPS 2022