Louis-Philippe Morency

92 papers · 2008–2025 · 16 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (13) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (16)

🌟 Keyword Trendsetter Combo (8) 🏠 Conference Loyalist (22) 🌱 Topic Pioneer 🤝 Dynamic Duo (37) 🔬 Deep Specialist (32) 🧬 Topic Evolution 🏆 Grand Slam 🏆 Keyword Champion (4) 🗃️ Keyword Collector (351) ⚡ Prolific Year (8) 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (10) 💎 Century Club (92) ❓ The Questioner

Conferences

ACL (22) EMNLP (18) ICLR (9) NIPS (7) CVPR (6) NAACL (6) ICCV (4) INTERSPEECH (4) AAAI (3) ECCV (3) IJCNLP (3) COLING (2) ICML (2) JMLR (1) UAI (1) WACV (1)

Top co-authors

Paul Pu Liang (37) Ruslan Salakhutdinov (29) Yao-Hung Hubert Tsai (14) Amir Zadeh (11) AmirAli Bagher Zadeh (7) Soujanya Poria (7) Dong Won Lee (6) Martin Q. Ma (6) Chaitanya Ahuja (5) Leena Mathur (5)

Keywords

multimodal learning (30) sentiment analysis (13) representation learning (10) emotion recognition (8) attention mechanism (8) multimodal fusion (6) language model (6) multimodal sentiment analysis (5) video understanding (5) multimodal language (4) humor detection (4) affective computing (4) text generation (3) gesture generation (3) recurrent neural network (3) feature fusion (3) cross-modal learning (3) self-supervised learning (3) vision-language model (3) referring expression recognition (3)

Papers

Social Genome: Grounded Social Reasoning Abilities of Multimodal Models EMNLP 2025
Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models ICLR 2025
Isolated Causal Effects of Natural Language ICML 2025
Aligning Dialogue Agents with Global Feedback via Large Language Model Multimodal Reward Decomposition EMNLP 2025
AV-Flow: Transforming Text to Audio-Visual Human-like Interactions ICCV 2025
ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models ICCV 2025
Comparative Knowledge Distillation WACV 2025
MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts EMNLP 2024
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents ICLR 2024
HEMM: Holistic Evaluation of Multimodal Foundation Models NIPS 2024
Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications ICLR 2024
Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities ACL 2024
Optimizing Language Models for Human Preferences is a Causal Inference Problem UAI 2024
Global Reward to Local Rewards: Multimodal-Guided Decomposition for Improving Dialogue Agents EMNLP 2024
Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions EMNLP 2024
MultiViz: Towards Visualizing and Understanding Multimodal Models ICLR 2023
Continual Learning for Personalized Co-speech Gesture Generation ICCV 2023
Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos ICCV 2023
MultiZoo and MultiBench: A Standardized Toolkit for Multimodal Deep Learning JMLR 2023
Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework NIPS 2023
Factorized Contrastive Learning: Going Beyond Multi-view Redundancy NIPS 2023
Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment ACL 2023
Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions ACL 2023
SenteCon: Leveraging Lexicons to Learn Human-Interpretable Language Representations ACL 2023
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control ACL 2023
Understanding Masked Autoencoders via Hierarchical Latent Variable Models CVPR 2023
Text-Transport: Toward Learning Causal Effects of Natural Language EMNLP 2023
Counterfactual Augmentation for Multimodal Learning Under Presentation Bias EMNLP 2023
Difference-Masking: Choosing What to Mask in Continued Pretraining EMNLP 2023
Low-Resource Adaptation for Personalized Co-Speech Gesture Generation CVPR 2022
Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis EMNLP 2022
Paraphrasing Is All You Need for Novel Object Captioning NIPS 2022
Tutorial on Multimodal Machine Learning NAACL 2022
PACS: A Dataset for Physical Audiovisual Commonsense Reasoning ECCV 2022
Conditional Contrastive Learning with Kernel ICLR 2022
Learning Weakly-supervised Contrastive Representations ICLR 2022
HOLM: Hallucinating Objects with Language Models for Referring Expression Recognition in Partially-Observed Scenes ACL 2022
Beyond Additive Fusion: Learning Non-Additive Multimodal Interactions EMNLP 2022
Self-supervised Representation Learning with Relative Predictive Coding ICLR 2021
MTAG: Modal-Temporal Attention Graph for Unaligned Human Multimodal Language Sequences NAACL 2021
StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer NAACL 2021
Humor Knowledge Enriched Transformer for Understanding Multimodal Humor AAAI 2021
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data ACL 2021
Towards Understanding and Mitigating Social Biases in Language Models ICML 2021
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data IJCNLP 2021
Self-supervised Learning from a Multi-view Perspective ICLR 2021
Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis EMNLP 2020
Neural Methods for Point-wise Dependency Estimation NIPS 2020
Diverse and Admissible Trajectory Prediction through Multimodal Context Understanding ECCV 2020
Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach ECCV 2020
Integrating Multimodal Information in Large Pretrained Transformers ACL 2020
Towards Debiasing Sentence Representations ACL 2020
Language to Network: Conditional Parameter Adaptation with Natural Language Descriptions ACL 2020
Refer360°: A Referring Expression Recognition Dataset in 360° Images ACL 2020
CMU-MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French EMNLP 2020
No Gestures Left Behind: Learning Relationships between Spoken Language and Freeform Gestures EMNLP 2020
UR-FUNNY: A Multimodal Language Dataset for Understanding Humor EMNLP 2019
Deep Gamblers: Learning to Abstain with Portfolio Theory NIPS 2019
Found in Translation: Learning Robust Joint Representations by Cyclic Translations between Modalities AAAI 2019
Words Can Shift: Dynamically Adjusting Word Representations Using Nonverbal Behaviors AAAI 2019
Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization ACL 2019
Multimodal Transformer for Unaligned Multimodal Language Sequences ACL 2019
Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence CVPR 2019
Video Relationship Reasoning Using Gated Spatio-Temporal Energy Graph CVPR 2019
Transformer Dissection: An Unified Understanding for Transformer’s Attention via the Lens of Kernel EMNLP 2019
Learning Factorized Multimodal Representations ICLR 2019
UR-FUNNY: A Multimodal Language Dataset for Understanding Humor IJCNLP 2019
Transformer Dissection: An Unified Understanding for Transformer’s Attention via the Lens of Kernel IJCNLP 2019
Bag-of-Acoustic-Words for Mental Health Assessment: A Deep Autoencoding Approach INTERSPEECH 2019
Strong and Simple Baselines for Multimodal Utterance Embeddings NAACL 2019
Proceedings of Grand Challenge and Workshop on Human Multimodal Language (Challenge-HML) ACL 2018
Conversational Memory Network for Emotion Recognition in Dyadic Dialogue Videos NAACL 2018
Visual Referring Expression Recognition: What Do Systems Actually Learn? NAACL 2018
Multimodal Polynomial Fusion for Detecting Driver Distraction INTERSPEECH 2018
Efficient Low-rank Multimodal Fusion With Modality-Specific Factors ACL 2018
Multimodal Language Analysis in the Wild: CMU-MOSEI Dataset and Interpretable Dynamic Fusion Graph ACL 2018
Multimodal Language Analysis with Recurrent Multistage Fusion EMNLP 2018
Speaker-Follower Models for Vision-and-Language Navigation NIPS 2018
Temporal Attention-Gated Model for Robust Sequence Classification CVPR 2017
Context-Dependent Sentiment Analysis in User-Generated Videos ACL 2017
Affect-LM: A Neural Language Model for Customizable Affective Text Generation ACL 2017
Computational Analysis of Acoustic Descriptors in Psychotic Patients INTERSPEECH 2017
Multimodal Machine Learning: Integrating Language, Vision and Speech ACL 2017
Combating Human Trafficking with Multimodal Deep Models ACL 2017
Tensor Fusion Network for Multimodal Sentiment Analysis EMNLP 2017
Unsupervised Text Recap Extraction for TV Series EMNLP 2016
Representation Learning for Speech Emotion Recognition INTERSPEECH 2016
Action Recognition by Hierarchical Sequence Summarization CVPR 2013
Utterance-Level Multimodal Sentiment Analysis ACL 2013
Modeling Wisdom of Crowds Using Latent Mixture of Discriminative Experts ACL 2011
Latent Mixture of Discriminative Experts for Multimodal Prediction Modeling COLING 2010
Modeling Latent-Dynamic in Shallow Parsing: A Latent Conditional Model with Improved Inference COLING 2008