Paul Pu Liang

58 papers · 2018–2026 · 16 conferences · across top CS/AI conferences

Achievements

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (12) 🌍 Conference Polyglot (14) 🏃 Academic Marathon (7) 🤝 Dynamic Duo (37) 🏆 Grand Slam 👥 Mega-Team (77) 🌱 Topic Pioneer 🔬 Deep Specialist (19) 🧬 Topic Evolution 🏆 Keyword Champion (2) ⚡ Prolific Year (11) 💎 Century Club (56) 🔥 Unstoppable (8) 🗃️ Keyword Collector (212) 🚀 Conference Pioneer 📈 Trend Setter

Conferences

ACL (14) EMNLP (8) ICLR (8) NIPS (6) CVPR (4) AAAI (3) ICML (3) NAACL (3) ECCV (2) ACML (1) EACL (1) ICCV (1) IJCNLP (1) JMLR (1) MIDL (1) WACV (1)

Top co-authors

Louis-Philippe Morency (37) Ruslan Salakhutdinov (19) Amir Zadeh (5) Yiwei Lyu (4) AmirAli Bagher Zadeh (4) Yao-Hung Hubert Tsai (4) Alex Wilf (3) Barnabás Póczos (3) Leena Mathur (3) Xiang Fan (3)

Keywords

multimodal learning (17) sentiment analysis (8) representation learning (7) emotion recognition (5) vision-language model (5) language model (4) multimodal language (3) text generation (3) multimodal fusion (3) benchmark evaluation (3) self-supervised learning (3) video understanding (3) few-shot learning (2) contrastive learning (2) affective computing (2) federated learning (2) privacy-preserving learning (2) human-computer interaction (2) uncertainty quantification (2) visual reasoning (2)

Papers

RADAR: A Reasoning-Guided Attribution Framework for Explainable Visual Data Analysis EACL 2026
Unpaired Multimodal Learning for Biological Datasets MIDL 2026
Progressive Compositionality in Text-to-Image Generative Models ICLR 2025
VLM2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues ACL 2025
TAMP: Token-Adaptive Layerwise Pruning in Multimodal Large Language Models ACL 2025
Deriving Strategic Market Insights with Large Language Models: A Benchmark for Forward Counterfactual Generation EMNLP 2025
Social Genome: Grounded Social Reasoning Abilities of Multimodal Models EMNLP 2025
VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks ICLR 2025
OS-ATLAS: Foundation Action Model for Generalist GUI Agents ICLR 2025
TeaserGen: Generating Teasers for Long Documentaries ICLR 2025
CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models ICML 2025
Understanding the Emergence of Multimodal Representation Alignment ICML 2025
Comparative Knowledge Distillation WACV 2025
Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities ACL 2024
Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications ICLR 2024
Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions EMNLP 2024
MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts EMNLP 2024
HEMM: Holistic Evaluation of Multimodal Foundation Models NIPS 2024
Modeling Dense Multimodal Interactions Between Biological Pathways and Histology for Survival Prediction CVPR 2024
FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning CVPR 2024
Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions ACL 2023
Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework NIPS 2023
Factorized Contrastive Learning: Going Beyond Multi-view Redundancy NIPS 2023
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control ACL 2023
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals NIPS 2023
Localized Symbolic Knowledge Distillation for Visual Commonsense Models NIPS 2023
Demystify the Gravity Well in the Optimization Landscape (Student Abstract) AAAI 2023
MultiZoo and MultiBench: A Standardized Toolkit for Multimodal Deep Learning JMLR 2023
Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos ICCV 2023
MultiViz: Towards Visualizing and Understanding Multimodal Models ICLR 2023
Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment ACL 2023
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code EMNLP 2022
PACS: A Dataset for Physical Audiovisual Commonsense Reasoning ECCV 2022
Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis EMNLP 2022
Tutorial on Multimodal Machine Learning NAACL 2022
Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning CVPR 2022
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data IJCNLP 2021
Anchor & Transform: Learning Sparse Embeddings for Large Vocabularies ICLR 2021
StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer NAACL 2021
Towards Understanding and Mitigating Social Biases in Language Models ICML 2021
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data ACL 2021
Towards Debiasing Sentence Representations ACL 2020
Diverse and Admissible Trajectory Prediction through Multimodal Context Understanding ECCV 2020
CMU-MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French EMNLP 2020
Words Can Shift: Dynamically Adjusting Word Representations Using Nonverbal Behaviors AAAI 2019
Multimodal Transformer for Unaligned Multimodal Language Sequences ACL 2019
Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence CVPR 2019
Learning Factorized Multimodal Representations ICLR 2019
Deep Gamblers: Learning to Abstain with Portfolio Theory NIPS 2019
Strong and Simple Baselines for Multimodal Utterance Embeddings NAACL 2019
Found in Translation: Learning Robust Joint Representations by Cyclic Translations between Modalities AAAI 2019
Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization ACL 2019
Efficient Low-rank Multimodal Fusion With Modality-Specific Factors ACL 2018
An Empirical Evaluation of Sketched SVD and its Application to Leverage Score Ordering ACML 2018
Proceedings of Grand Challenge and Workshop on Human Multimodal Language (Challenge-HML) ACL 2018
Seq2Seq2Sentiment: Multimodal Sequence to Sequence Models for Sentiment Analysis ACL 2018
Multimodal Language Analysis with Recurrent Multistage Fusion EMNLP 2018
Multimodal Language Analysis in the Wild: CMU-MOSEI Dataset and Interpretable Dynamic Fusion Graph ACL 2018