conftrace_

Shuhuai Ren

20 papers · 2019–2026 · 9 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+11 more ↓ 🌈 Renaissance Researcher (7) πŸŒ‰ Interdisciplinary Bridge πŸƒ Academic Marathon (6) 🌍 Conference Polyglot (8) πŸ—ΊοΈ Taxonomy Completionist (44)
πŸ—ΊοΈ Taxonomy Completionist (44) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird πŸ† Keyword Champion (2) 🧬 Topic Evolution πŸ‘₯ Mega-Team (21) 🀝 Dynamic Duo (14) πŸ—ƒοΈ Keyword Collector (90) ❓ The Questioner (2) ⚑ Prolific Year (5) πŸ’Ž Century Club (19)

Conferences

ACL (5) EMNLP (5) CVPR (3) NIPS (2) AAAI (1) ECCV (1) ICCV (1) IJCNLP (1) NAACL (1)

Papers

TEMPLE: Incentivizing Temporal Understanding of Video Large Language Models via Progressive Pre-SFT Alignment AAAI 2026 Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation ICCV 2025 RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction EMNLP 2025 Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis CVPR 2025 Parallelized Autoregressive Visual Generation CVPR 2025 VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models ECCV 2024 PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain ACL 2024 TempCompass: Do Video LLMs Really Understand Videos? ACL 2024 TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding CVPR 2024 LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? NAACL 2024 Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition NIPS 2023 TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding EMNLP 2023 FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation NIPS 2023 Delving into the Openness of CLIP ACL 2023 Learning Relation Alignment for Calibrated Cross-modal Retrieval IJCNLP 2021 Learning Relation Alignment for Calibrated Cross-modal Retrieval ACL 2021 Dynamic Knowledge Distillation for Pre-trained Language Models EMNLP 2021 Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification EMNLP 2021 CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade EMNLP 2021 Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency ACL 2019