Maja Pantic

47 papers · 2013–2025 · 9 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🌍 Conference Polyglot (9) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (12)

🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (12) 🌍 Conference Polyglot (9) 🏠 Conference Loyalist (21) 🤝 Dynamic Duo (19) 🏆 Grand Slam 🔬 Deep Specialist (12) 🧬 Topic Evolution 🏆 Keyword Champion (2) ⚡ Prolific Year (5) 📈 Trend Setter 🗃️ Keyword Collector (222) 🔥 Unstoppable (13) 💎 Century Club (47) 🚀 Conference Pioneer

Conferences

CVPR (21) INTERSPEECH (10) ICCV (5) WACV (4) ICLR (2) JMLR (2) AAAI (1) ICML (1) NIPS (1)

Top co-authors

Stavros Petridis (19) Pingchuan Ma (10) Stefanos Zafeiriou (10) Yannis Panagakis (7) Konstantinos Vougioukas (7) Rodrigo Mira (6) Alexandros Haliassos (6) Shiyang Cheng (5) Honglie Chen (5) Jean Kossaifi (5)

Keywords

self-supervised learning (4) generative adversarial network (4) lip reading (4) facial action unit (4) visual speech recognition (3) model compression (3) tensor decomposition (3) automatic speech recognition (3) face alignment (3) facial expression (3) variational autoencoder (3) generative model (2) latent variable model (2) ordinal regression (2) nuclear norm minimization (2) representation learning (2) diffusion model (2) semi-supervised learning (2) feature extraction (2) dynamic time warping (2)

Papers

KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation CVPR 2025 Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation WACV 2024 Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs NIPS 2024 Dynamic Data Pruning for Automatic Speech Recognition INTERSPEECH 2024 MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization INTERSPEECH 2024 RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement INTERSPEECH 2024 EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars CVPR 2024 SparseVSR: Lightweight and Noise Robust Visual Speech Recognition INTERSPEECH 2023 FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection WACV 2023 Streaming Audio-Visual Speech Recognition with Alignment Regularization INTERSPEECH 2023 Jointly Learning Visual and Auditory Speech Representations from Raw Data ICLR 2023 SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision CVPR 2023 Cauchy–Schwarz Regularized Autoencoder JMLR 2022 Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection CVPR 2022 SVTS: Scalable Video-to-Speech Synthesis INTERSPEECH 2022 Lips Don't Lie: A Generalisable and Robust Approach To Face Forgery Detection CVPR 2021 DINO: A Conditional Energy-Based GAN for Domain Translation ICLR 2021 LiRA: Learning Visual Speech Representations from Audio Through Self-Supervision INTERSPEECH 2021 Lip-Reading With Densely Connected Temporal Convolutional Networks WACV 2021 Incremental Multi-Domain Learning with Network Latent Tensor Factorization AAAI 2020 Dynamic Face Video Segmentation via Reinforcement Learning CVPR 2020 Factorized Higher-Order CNNs With an Application to Spatio-Temporal Emotion Estimation CVPR 2020 Shape Constrained Network for Eye Segmentation in the Wild WACV 2020 Multilinear Latent Conditioning for Generating Unseen Attribute Combinations ICML 2020 T-Net: Parametrizing Fully Convolutional Nets With a Single High-Order Tensor CVPR 2019 Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition INTERSPEECH 2019 Video-Driven Speech Reconstruction Using Generative Adversarial Networks INTERSPEECH 2019 TensorLy: Tensor Learning in Python JMLR 2019 4DFAB: A Large Scale 4D Database for Facial Expression Analysis and Biometric Applications CVPR 2018 GAGAN: Geometry-Aware Generative Adversarial Networks CVPR 2018 Deep Structured Learning for Facial Action Unit Intensity Estimation CVPR 2017 DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding ICCV 2017 Spotting Social Signals in Conversational Speech over IP: A Deep Learning Perspective INTERSPEECH 2017 Copula Ordinal Regression for Joint Estimation of Facial Action Unit Intensity CVPR 2016 Joint Unsupervised Deformable Spatio-Temporal Alignment of Sequences CVPR 2016 Robust Statistical Face Frontalization ICCV 2015 Latent Trees for Estimating Intensity of Facial Action Units CVPR 2015 Multi-Conditional Latent Variable Model for Joint Facial Action Unit Detection ICCV 2015 Gauss-Newton Deformable Part Models for Face Alignment in-the-Wild CVPR 2014 Incremental Face Alignment in the Wild CVPR 2014 RAPS: Robust and Efficient Automatic Construction of Person-Specific Deformable Models CVPR 2014 Merging SVMs with Linear Discriminant Analysis: A Combined Model CVPR 2014 Full-Angle Quaternions for Robustly Matching Vectors of 3D Rotations CVPR 2014 Robust Canonical Time Warping for the Alignment of Grossly Corrupted Sequences CVPR 2013 Learning Slow Features for Behaviour Analysis ICCV 2013 Robust Discriminative Response Map Fitting with Constrained Local Models CVPR 2013 Optimization Problems for Fast AAM Fitting in-the-Wild ICCV 2013