SHIZHE DIAO

34 papers · 2020–2026 · 10 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🏃 Academic Marathon (5) 🌍 Conference Polyglot (10) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird

🐝 Cross-Pollinator (11) 🌍 Conference Polyglot (10) 🏃 Academic Marathon (5) 🤝 Dynamic Duo (21) 🧬 Topic Evolution 💎 Century Club (33) ⚡ Prolific Year (6) 🗃️ Keyword Collector (135) ❓ The Questioner 🔥 Unstoppable (6)

Conferences

EMNLP (10) ACL (9) ICLR (3) ICML (3) IJCNLP (2) NAACL (2) NIPS (2) EACL (1) ICCV (1) IJCAI (1)

Top co-authors

Tong Zhang (21) Rui Pan (9) Jipeng Zhang (9) Renjie Pi (7) Kashun Shum (6) Yan Song (5) Yong Lin (5) Hanze Dong (4) Wei Xiong (3) Xiang Liu (3)

Keywords

large language model (11) multimodal learning (4) domain adaptation (4) n-gram representation (3) pre-trained language model (3) reasoning chain (2) reinforcement learning (2) chain-of-thought prompting (2) transfer learning (2) parameter-efficient fine-tuning (2) vision-language pre-training (2) continued pretraining (2) text encoder (2) uncertainty quantification (2) prompt engineering (2) benchmark evaluation (2) model compression (2) chain-of-thought reasoning (2) reinforcement learning from human feedback (2) instruction tuning (2)

Papers

Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception ACL 2026 UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models ICML 2025 MA-LoT: Model-Collaboration Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving ICML 2025 Can We Verify Step by Step for Incorrect Answer Detection? IJCAI 2025 Hymba: A Hybrid-head Architecture for Small Language Models ICLR 2025 LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement ICLR 2025 Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models EMNLP 2025 TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts EMNLP 2024 LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning NIPS 2024 Active Prompting with Chain-of-Thought for Large Language Models ACL 2024 Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards ACL 2024 VeraCT Scan: Retrieval-Augmented Fake News Detection with Justifiable Reasoning ACL 2024 Plum: Prompt Learning using Metaheuristics ACL 2024 ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases EACL 2024 Mitigating the Alignment Tax of RLHF EMNLP 2024 SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales EMNLP 2024 FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation EMNLP 2024 The Instinctive Bias: Spurious Images lead to Illusion in MLLMs EMNLP 2024 R-Tuning: Instructing Large Language Models to Say ‘I Don’t Know’ NAACL 2024 LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models NAACL 2024 DetGPT: Detect What You Need via Reasoning EMNLP 2023 Towards Unifying Medical Vision-and-Language Pre-Training via Soft Prompts ICCV 2023 Write and Paint: Generative Vision-Language Models are Unified Modal Learners ICLR 2023 Doolittle: Benchmarks and Corpora for Academic Writing Formalization EMNLP 2023 On the Difference of BERT-style and CLIP-style Text Encoders ACL 2023 Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models’ Memories ACL 2023 Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data EMNLP 2023 VLUE: A Multi-Task Multi-Dimension Benchmark for Evaluating Vision-Language Pre-training ICML 2022 Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation ACL 2021 TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation ACL 2021 TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation IJCNLP 2021 Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation IJCNLP 2021 Efficient Neural Network Training via Forward and Backward Propagation Sparsification NIPS 2021 ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram Representations EMNLP 2020