SHIZHE DIAO
34 papers · 2020–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Academic Marathon (5) π Conference Polyglot (10) π Interdisciplinary Bridge π§ Keyword Pioneer π£ Hot Topic Early Bird
π
Cross-Pollinator
(11)
π
Conference Polyglot
(10)
π
Academic Marathon
(5)
π€
Dynamic Duo
(21)
π§¬
Topic Evolution
π
Century Club
(33)
β‘
Prolific Year
(6)
ποΈ
Keyword Collector
(135)
β
The Questioner
π₯
Unstoppable
(6)
Conferences
EMNLP (10)
ACL (9)
ICLR (3)
ICML (3)
IJCNLP (2)
NAACL (2)
NIPS (2)
EACL (1)
ICCV (1)
IJCAI (1)
Top co-authors
Keywords
large language model
(11)
multimodal learning
(4)
domain adaptation
(4)
n-gram representation
(3)
pre-trained language model
(3)
reasoning chain
(2)
reinforcement learning
(2)
chain-of-thought prompting
(2)
transfer learning
(2)
parameter-efficient fine-tuning
(2)
vision-language pre-training
(2)
continued pretraining
(2)
text encoder
(2)
uncertainty quantification
(2)
prompt engineering
(2)
benchmark evaluation
(2)
model compression
(2)
chain-of-thought reasoning
(2)
reinforcement learning from human feedback
(2)
instruction tuning
(2)
Papers
Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception
ACL 2026
UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models
ICML 2025
MA-LoT: Model-Collaboration Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving
ICML 2025
Can We Verify Step by Step for Incorrect Answer Detection?
IJCAI 2025
Hymba: A Hybrid-head Architecture for Small Language Models
ICLR 2025
LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement
ICLR 2025
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
EMNLP 2025
TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts
EMNLP 2024
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
NIPS 2024
Active Prompting with Chain-of-Thought for Large Language Models
ACL 2024
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
ACL 2024
VeraCT Scan: Retrieval-Augmented Fake News Detection with Justifiable Reasoning
ACL 2024
Plum: Prompt Learning using Metaheuristics
ACL 2024
ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases
EACL 2024
Mitigating the Alignment Tax of RLHF
EMNLP 2024
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
EMNLP 2024
FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation
EMNLP 2024
The Instinctive Bias: Spurious Images lead to Illusion in MLLMs
EMNLP 2024
R-Tuning: Instructing Large Language Models to Say βI Donβt Knowβ
NAACL 2024
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models
NAACL 2024
DetGPT: Detect What You Need via Reasoning
EMNLP 2023
Towards Unifying Medical Vision-and-Language Pre-Training via Soft Prompts
ICCV 2023
Write and Paint: Generative Vision-Language Models are Unified Modal Learners
ICLR 2023
Doolittle: Benchmarks and Corpora for Academic Writing Formalization
EMNLP 2023
On the Difference of BERT-style and CLIP-style Text Encoders
ACL 2023
Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Modelsβ Memories
ACL 2023
Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
EMNLP 2023
VLUE: A Multi-Task Multi-Dimension Benchmark for Evaluating Vision-Language Pre-training
ICML 2022
Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation
ACL 2021
TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation
ACL 2021
TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation
IJCNLP 2021
Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation
IJCNLP 2021
Efficient Neural Network Training via Forward and Backward Propagation Sparsification
NIPS 2021
ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram Representations
EMNLP 2020