Louis-Philippe Morency
92 papers · 2008–2025 · 16 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
๐งญ Keyword Pioneer ๐ฃ Hot Topic Early Bird ๐บ๏ธ Taxonomy Completionist (13) ๐ Interdisciplinary Bridge ๐ Conference Polyglot (16)
๐
Conference Polyglot
(16)
๐บ๏ธ
Taxonomy Completionist
(13)
๐ฃ
Hot Topic Early Bird
๐
Keyword Trendsetter Combo
(8)
๐
Conference Loyalist
(22)
๐ฑ
Topic Pioneer
๐ค
Dynamic Duo
(37)
๐ฌ
Deep Specialist
(32)
๐งฌ
Topic Evolution
๐
Grand Slam
๐
Keyword Champion
(4)
๐๏ธ
Keyword Collector
(351)
โก
Prolific Year
(8)
๐
Trend Setter
๐
Conference Pioneer
๐ฅ
Unstoppable
(10)
๐
Century Club
(92)
โ
The Questioner
Conferences
ACL (22)
EMNLP (18)
ICLR (9)
NIPS (7)
CVPR (6)
NAACL (6)
ICCV (4)
INTERSPEECH (4)
AAAI (3)
ECCV (3)
IJCNLP (3)
COLING (2)
ICML (2)
JMLR (1)
UAI (1)
WACV (1)
Top co-authors
Keywords
multimodal learning
(30)
sentiment analysis
(13)
representation learning
(10)
emotion recognition
(8)
attention mechanism
(8)
multimodal fusion
(6)
language model
(6)
multimodal sentiment analysis
(5)
video understanding
(5)
multimodal language
(4)
humor detection
(4)
affective computing
(4)
text generation
(3)
gesture generation
(3)
recurrent neural network
(3)
feature fusion
(3)
cross-modal learning
(3)
self-supervised learning
(3)
vision-language model
(3)
referring expression recognition
(3)
Papers
Social Genome: Grounded Social Reasoning Abilities of Multimodal Models
EMNLP 2025
Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models
ICLR 2025
Isolated Causal Effects of Natural Language
ICML 2025
Aligning Dialogue Agents with Global Feedback via Large Language Model Multimodal Reward Decomposition
EMNLP 2025
AV-Flow: Transforming Text to Audio-Visual Human-like Interactions
ICCV 2025
ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
ICCV 2025
Comparative Knowledge Distillation
WACV 2025
MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts
EMNLP 2024
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
ICLR 2024
HEMM: Holistic Evaluation of Multimodal Foundation Models
NIPS 2024
Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications
ICLR 2024
Think Twice: Perspective-Taking Improves Large Language Modelsโ Theory-of-Mind Capabilities
ACL 2024
Optimizing Language Models for Human Preferences is a Causal Inference Problem
UAI 2024
Global Reward to Local Rewards: Multimodal-Guided Decomposition for Improving Dialogue Agents
EMNLP 2024
Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions
EMNLP 2024
MultiViz: Towards Visualizing and Understanding Multimodal Models
ICLR 2023
Continual Learning for Personalized Co-speech Gesture Generation
ICCV 2023
Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos
ICCV 2023
MultiZoo and MultiBench: A Standardized Toolkit for Multimodal Deep Learning
JMLR 2023
Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework
NIPS 2023
Factorized Contrastive Learning: Going Beyond Multi-view Redundancy
NIPS 2023
Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment
ACL 2023
Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions
ACL 2023
SenteCon: Leveraging Lexicons to Learn Human-Interpretable Language Representations
ACL 2023
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control
ACL 2023
Understanding Masked Autoencoders via Hierarchical Latent Variable Models
CVPR 2023
Text-Transport: Toward Learning Causal Effects of Natural Language
EMNLP 2023
Counterfactual Augmentation for Multimodal Learning Under Presentation Bias
EMNLP 2023
Difference-Masking: Choosing What to Mask in Continued Pretraining
EMNLP 2023
Low-Resource Adaptation for Personalized Co-Speech Gesture Generation
CVPR 2022
Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
EMNLP 2022
Paraphrasing Is All You Need for Novel Object Captioning
NIPS 2022
Tutorial on Multimodal Machine Learning
NAACL 2022
PACS: A Dataset for Physical Audiovisual Commonsense Reasoning
ECCV 2022
Conditional Contrastive Learning with Kernel
ICLR 2022
Learning Weakly-supervised Contrastive Representations
ICLR 2022
HOLM: Hallucinating Objects with Language Models for Referring Expression Recognition in Partially-Observed Scenes
ACL 2022
Beyond Additive Fusion: Learning Non-Additive Multimodal Interactions
EMNLP 2022
Self-supervised Representation Learning with Relative Predictive Coding
ICLR 2021
MTAG: Modal-Temporal Attention Graph for Unaligned Human Multimodal Language Sequences
NAACL 2021
StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer
NAACL 2021
Humor Knowledge Enriched Transformer for Understanding Multimodal Humor
AAAI 2021
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data
ACL 2021
Towards Understanding and Mitigating Social Biases in Language Models
ICML 2021
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data
IJCNLP 2021
Self-supervised Learning from a Multi-view Perspective
ICLR 2021
Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis
EMNLP 2020
Neural Methods for Point-wise Dependency Estimation
NIPS 2020
Diverse and Admissible Trajectory Prediction through Multimodal Context Understanding
ECCV 2020
Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach
ECCV 2020
Integrating Multimodal Information in Large Pretrained Transformers
ACL 2020
Towards Debiasing Sentence Representations
ACL 2020
Language to Network: Conditional Parameter Adaptation with Natural Language Descriptions
ACL 2020
Refer360โ: A Referring Expression Recognition Dataset in 360โ Images
ACL 2020
CMU-MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French
EMNLP 2020
No Gestures Left Behind: Learning Relationships between Spoken Language and Freeform Gestures
EMNLP 2020
UR-FUNNY: A Multimodal Language Dataset for Understanding Humor
EMNLP 2019
Deep Gamblers: Learning to Abstain with Portfolio Theory
NIPS 2019
Found in Translation: Learning Robust Joint Representations by Cyclic Translations between Modalities
AAAI 2019
Words Can Shift: Dynamically Adjusting Word Representations Using Nonverbal Behaviors
AAAI 2019
Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization
ACL 2019
Multimodal Transformer for Unaligned Multimodal Language Sequences
ACL 2019
Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence
CVPR 2019
Video Relationship Reasoning Using Gated Spatio-Temporal Energy Graph
CVPR 2019
Transformer Dissection: An Unified Understanding for Transformerโs Attention via the Lens of Kernel
EMNLP 2019
Learning Factorized Multimodal Representations
ICLR 2019
UR-FUNNY: A Multimodal Language Dataset for Understanding Humor
IJCNLP 2019
Transformer Dissection: An Unified Understanding for Transformerโs Attention via the Lens of Kernel
IJCNLP 2019
Bag-of-Acoustic-Words for Mental Health Assessment: A Deep Autoencoding Approach
INTERSPEECH 2019
Strong and Simple Baselines for Multimodal Utterance Embeddings
NAACL 2019
Proceedings of Grand Challenge and Workshop on Human Multimodal Language (Challenge-HML)
ACL 2018
Conversational Memory Network for Emotion Recognition in Dyadic Dialogue Videos
NAACL 2018
Visual Referring Expression Recognition: What Do Systems Actually Learn?
NAACL 2018
Multimodal Polynomial Fusion for Detecting Driver Distraction
INTERSPEECH 2018
Efficient Low-rank Multimodal Fusion With Modality-Specific Factors
ACL 2018
Multimodal Language Analysis in the Wild: CMU-MOSEI Dataset and Interpretable Dynamic Fusion Graph
ACL 2018
Multimodal Language Analysis with Recurrent Multistage Fusion
EMNLP 2018
Speaker-Follower Models for Vision-and-Language Navigation
NIPS 2018
Temporal Attention-Gated Model for Robust Sequence Classification
CVPR 2017
Context-Dependent Sentiment Analysis in User-Generated Videos
ACL 2017
Affect-LM: A Neural Language Model for Customizable Affective Text Generation
ACL 2017
Computational Analysis of Acoustic Descriptors in Psychotic Patients
INTERSPEECH 2017
Multimodal Machine Learning: Integrating Language, Vision and Speech
ACL 2017
Combating Human Trafficking with Multimodal Deep Models
ACL 2017
Tensor Fusion Network for Multimodal Sentiment Analysis
EMNLP 2017
Unsupervised Text Recap Extraction for TV Series
EMNLP 2016
Representation Learning for Speech Emotion Recognition
INTERSPEECH 2016
Action Recognition by Hierarchical Sequence Summarization
CVPR 2013
Utterance-Level Multimodal Sentiment Analysis
ACL 2013
Modeling Wisdom of Crowds Using Latent Mixture of Discriminative Experts
ACL 2011
Latent Mixture of Discriminative Experts for Multimodal Prediction Modeling
COLING 2010
Modeling Latent-Dynamic in Shallow Parsing: A Latent Conditional Model with Improved Inference
COLING 2008