Paul Pu Liang
58 papers · 2018–2026 · 16 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
π£ Hot Topic Early Bird π§ Keyword Pioneer π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (12) π Conference Polyglot (14)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Academic Marathon
(7)
π€
Dynamic Duo
(37)
π
Grand Slam
π₯
Mega-Team
(77)
π±
Topic Pioneer
π¬
Deep Specialist
(19)
π§¬
Topic Evolution
π
Keyword Champion
(2)
β‘
Prolific Year
(11)
π
Century Club
(56)
π₯
Unstoppable
(8)
ποΈ
Keyword Collector
(212)
π
Conference Pioneer
π
Trend Setter
Conferences
ACL (14)
EMNLP (8)
ICLR (8)
NIPS (6)
CVPR (4)
AAAI (3)
ICML (3)
NAACL (3)
ECCV (2)
ACML (1)
EACL (1)
ICCV (1)
IJCNLP (1)
JMLR (1)
MIDL (1)
WACV (1)
Top co-authors
Keywords
multimodal learning
(17)
sentiment analysis
(8)
representation learning
(7)
emotion recognition
(5)
vision-language model
(5)
language model
(4)
multimodal language
(3)
text generation
(3)
multimodal fusion
(3)
benchmark evaluation
(3)
self-supervised learning
(3)
video understanding
(3)
few-shot learning
(2)
contrastive learning
(2)
affective computing
(2)
federated learning
(2)
privacy-preserving learning
(2)
human-computer interaction
(2)
uncertainty quantification
(2)
visual reasoning
(2)
Papers
RADAR: A Reasoning-Guided Attribution Framework for Explainable Visual Data Analysis
EACL 2026
Unpaired Multimodal Learning for Biological Datasets
MIDL 2026
Progressive Compositionality in Text-to-Image Generative Models
ICLR 2025
VLM2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues
ACL 2025
TAMP: Token-Adaptive Layerwise Pruning in Multimodal Large Language Models
ACL 2025
Deriving Strategic Market Insights with Large Language Models: A Benchmark for Forward Counterfactual Generation
EMNLP 2025
Social Genome: Grounded Social Reasoning Abilities of Multimodal Models
EMNLP 2025
VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks
ICLR 2025
OS-ATLAS: Foundation Action Model for Generalist GUI Agents
ICLR 2025
TeaserGen: Generating Teasers for Long Documentaries
ICLR 2025
CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models
ICML 2025
Understanding the Emergence of Multimodal Representation Alignment
ICML 2025
Comparative Knowledge Distillation
WACV 2025
Think Twice: Perspective-Taking Improves Large Language Modelsβ Theory-of-Mind Capabilities
ACL 2024
Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications
ICLR 2024
Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions
EMNLP 2024
MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts
EMNLP 2024
HEMM: Holistic Evaluation of Multimodal Foundation Models
NIPS 2024
Modeling Dense Multimodal Interactions Between Biological Pathways and Histology for Survival Prediction
CVPR 2024
FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning
CVPR 2024
Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions
ACL 2023
Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework
NIPS 2023
Factorized Contrastive Learning: Going Beyond Multi-view Redundancy
NIPS 2023
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control
ACL 2023
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals
NIPS 2023
Localized Symbolic Knowledge Distillation for Visual Commonsense Models
NIPS 2023
Demystify the Gravity Well in the Optimization Landscape (Student Abstract)
AAAI 2023
MultiZoo and MultiBench: A Standardized Toolkit for Multimodal Deep Learning
JMLR 2023
Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos
ICCV 2023
MultiViz: Towards Visualizing and Understanding Multimodal Models
ICLR 2023
Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment
ACL 2023
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
EMNLP 2022
PACS: A Dataset for Physical Audiovisual Commonsense Reasoning
ECCV 2022
Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
EMNLP 2022
Tutorial on Multimodal Machine Learning
NAACL 2022
Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning
CVPR 2022
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data
IJCNLP 2021
Anchor & Transform: Learning Sparse Embeddings for Large Vocabularies
ICLR 2021
StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer
NAACL 2021
Towards Understanding and Mitigating Social Biases in Language Models
ICML 2021
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data
ACL 2021
Towards Debiasing Sentence Representations
ACL 2020
Diverse and Admissible Trajectory Prediction through Multimodal Context Understanding
ECCV 2020
CMU-MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French
EMNLP 2020
Words Can Shift: Dynamically Adjusting Word Representations Using Nonverbal Behaviors
AAAI 2019
Multimodal Transformer for Unaligned Multimodal Language Sequences
ACL 2019
Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence
CVPR 2019
Learning Factorized Multimodal Representations
ICLR 2019
Deep Gamblers: Learning to Abstain with Portfolio Theory
NIPS 2019
Strong and Simple Baselines for Multimodal Utterance Embeddings
NAACL 2019
Found in Translation: Learning Robust Joint Representations by Cyclic Translations between Modalities
AAAI 2019
Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization
ACL 2019
Efficient Low-rank Multimodal Fusion With Modality-Specific Factors
ACL 2018
An Empirical Evaluation of Sketched SVD and its Application to Leverage Score Ordering
ACML 2018
Proceedings of Grand Challenge and Workshop on Human Multimodal Language (Challenge-HML)
ACL 2018
Seq2Seq2Sentiment: Multimodal Sequence to Sequence Models for Sentiment Analysis
ACL 2018
Multimodal Language Analysis with Recurrent Multistage Fusion
EMNLP 2018
Multimodal Language Analysis in the Wild: CMU-MOSEI Dataset and Interpretable Dynamic Fusion Graph
ACL 2018