Zhi-Qi Cheng
30 papers · 2017–2026 · 11 top CS/AI conferences
Achievements
Academic Marathon (8)
Interdisciplinary Bridge
Conference Polyglot (10)
Keyword Pioneer
Cross-Pollinator (12)
Renaissance Researcher (5)
Taxonomy Completionist (56)
Topic Evolution
Dynamic Duo (10)
Prolific Year (5)
Trend Setter
Century Club (29)
Keyword Collector (145)
Conferences
CVPR (8)
ICCV (4)
EMNLP (3)
IJCAI (3)
NIPS (3)
AAAI (2)
NAACL (2)
WACV (2)
ACL (1)
ICLR (1)
SEMEVAL (1)
Keywords
multimodal learning (4)
large language model (3)
diffusion model (3)
object detection (3)
emotion cause extraction (2)
video generation (2)
video motion editing (2)
cross-domain retrieval (2)
graph convolutional network (2)
prompt tuning (2)
streaming perception (2)
multimodal emotion recognition (2)
convolutional neural network (2)
question answering (2)
autonomous driving (2)
vision-language model (2)
video understanding (2)
density map (2)
video editing (2)
benchmark evaluation (1)
Papers
Sign-Language Datasets at Scale: A Comprehensive Survey on Resources, Benchmarks, and Annotation Standards
ACL 2026
Large Language Model Agents in Finance: A Survey Bridging Research, Practice, and Real-World Deployment
EMNLP 2025
POPoS: Improving Efficient and Robust Facial Landmark Detection with Parallel Optimal Position Search
AAAI 2025
A Video-grounded Dialogue Dataset and Metric for Event-driven Activities
AAAI 2025
Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
CVPR 2025
StableAnimator: High-Quality Identity-Preserving Human Image Animation
CVPR 2025
MotionFollower: Editing Video Motion via Score-Guided Diffusion
ICCV 2025
MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis
ICLR 2025
ProMQA: Question Answering Dataset for Multimodal Procedural Activity Understanding
NAACL 2025
DyRoNet: Dynamic Routing and Low-Rank Adapters for Autonomous Driving Streaming Perception
WACV 2025
UCDR-Adapter: Exploring Adaptation of Pre-Trained Vision-Language Models for Universal Cross-Domain Retrieval
WACV 2025
ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval
CVPR 2024
MIPS at SemEval-2024 Task 3: Multimodal Emotion-Cause Pair Extraction in Conversations with Multimodal Language Models
NAACL 2024
SHIELD: LLM-Driven Schema Induction for Predictive Analytics in EV Battery Supply Chain Disruptions
EMNLP 2024
Towards Calibrated Robust Fine-Tuning of Vision-Language Models
NIPS 2024
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
NIPS 2024
MIPS at SemEval-2024 Task 3: Multimodal Emotion-Cause Pair Extraction in Conversations with Multimodal Language Models
SEMEVAL 2024
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions
NIPS 2024
MotionEditor: Editing Video Motion via Content-Aware Diffusion
CVPR 2024
BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition
CVPR 2024
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
CVPR 2024
ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules
ICCV 2023
WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models
EMNLP 2023
DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving
IJCAI 2023
HDFormer: High-order Directed Transformer for 3D Human Pose Estimation
IJCAI 2023
Implicit Temporal Modeling with Learnable Alignment for Video Recognition
ICCV 2023
Rethinking Spatial Invariance of Convolutional Networks for Object Counting
CVPR 2022
Generating Person Images with Appearance-aware Pose Stylizer
IJCAI 2020
Learning Spatial Awareness to Improve Crowd Counting
ICCV 2019
Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images
CVPR 2017