Maja Pantic
47 papers · 2013–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π Conference Polyglot (9) π£ Hot Topic Early Bird π Interdisciplinary Bridge π§ Keyword Pioneer π Academic Marathon (12)
π£
Hot Topic Early Bird
π
Cross-Pollinator
(12)
π
Conference Polyglot
(9)
π
Conference Loyalist
(21)
π€
Dynamic Duo
(19)
π
Grand Slam
π¬
Deep Specialist
(12)
π§¬
Topic Evolution
π
Keyword Champion
(2)
β‘
Prolific Year
(5)
π
Trend Setter
ποΈ
Keyword Collector
(222)
π₯
Unstoppable
(13)
π
Century Club
(47)
π
Conference Pioneer
Conferences
CVPR (21)
INTERSPEECH (10)
ICCV (5)
WACV (4)
ICLR (2)
JMLR (2)
AAAI (1)
ICML (1)
NIPS (1)
Top co-authors
Keywords
self-supervised learning
(4)
generative adversarial network
(4)
lip reading
(4)
facial action unit
(4)
visual speech recognition
(3)
model compression
(3)
tensor decomposition
(3)
automatic speech recognition
(3)
face alignment
(3)
facial expression
(3)
variational autoencoder
(3)
generative model
(2)
latent variable model
(2)
ordinal regression
(2)
nuclear norm minimization
(2)
representation learning
(2)
diffusion model
(2)
semi-supervised learning
(2)
feature extraction
(2)
dynamic time warping
(2)
Papers
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
CVPR 2025
Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
WACV 2024
Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs
NIPS 2024
Dynamic Data Pruning for Automatic Speech Recognition
INTERSPEECH 2024
MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization
INTERSPEECH 2024
RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement
INTERSPEECH 2024
EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
CVPR 2024
SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
INTERSPEECH 2023
FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection
WACV 2023
Streaming Audio-Visual Speech Recognition with Alignment Regularization
INTERSPEECH 2023
Jointly Learning Visual and Auditory Speech Representations from Raw Data
ICLR 2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
CVPR 2023
CauchyβSchwarz Regularized Autoencoder
JMLR 2022
Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection
CVPR 2022
SVTS: Scalable Video-to-Speech Synthesis
INTERSPEECH 2022
Lips Don't Lie: A Generalisable and Robust Approach To Face Forgery Detection
CVPR 2021
DINO: A Conditional Energy-Based GAN for Domain Translation
ICLR 2021
LiRA: Learning Visual Speech Representations from Audio Through Self-Supervision
INTERSPEECH 2021
Lip-Reading With Densely Connected Temporal Convolutional Networks
WACV 2021
Incremental Multi-Domain Learning with Network Latent Tensor Factorization
AAAI 2020
Dynamic Face Video Segmentation via Reinforcement Learning
CVPR 2020
Factorized Higher-Order CNNs With an Application to Spatio-Temporal Emotion Estimation
CVPR 2020
Shape Constrained Network for Eye Segmentation in the Wild
WACV 2020
Multilinear Latent Conditioning for Generating Unseen Attribute Combinations
ICML 2020
T-Net: Parametrizing Fully Convolutional Nets With a Single High-Order Tensor
CVPR 2019
Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
INTERSPEECH 2019
Video-Driven Speech Reconstruction Using Generative Adversarial Networks
INTERSPEECH 2019
TensorLy: Tensor Learning in Python
JMLR 2019
4DFAB: A Large Scale 4D Database for Facial Expression Analysis and Biometric Applications
CVPR 2018
GAGAN: Geometry-Aware Generative Adversarial Networks
CVPR 2018
Deep Structured Learning for Facial Action Unit Intensity Estimation
CVPR 2017
DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding
ICCV 2017
Spotting Social Signals in Conversational Speech over IP: A Deep Learning Perspective
INTERSPEECH 2017
Copula Ordinal Regression for Joint Estimation of Facial Action Unit Intensity
CVPR 2016
Joint Unsupervised Deformable Spatio-Temporal Alignment of Sequences
CVPR 2016
Robust Statistical Face Frontalization
ICCV 2015
Latent Trees for Estimating Intensity of Facial Action Units
CVPR 2015
Multi-Conditional Latent Variable Model for Joint Facial Action Unit Detection
ICCV 2015
Gauss-Newton Deformable Part Models for Face Alignment in-the-Wild
CVPR 2014
Incremental Face Alignment in the Wild
CVPR 2014
RAPS: Robust and Efficient Automatic Construction of Person-Specific Deformable Models
CVPR 2014
Merging SVMs with Linear Discriminant Analysis: A Combined Model
CVPR 2014
Full-Angle Quaternions for Robustly Matching Vectors of 3D Rotations
CVPR 2014
Robust Canonical Time Warping for the Alignment of Grossly Corrupted Sequences
CVPR 2013
Learning Slow Features for Behaviour Analysis
ICCV 2013
Robust Discriminative Response Map Fitting with Constrained Local Models
CVPR 2013
Optimization Problems for Fast AAM Fitting in-the-Wild
ICCV 2013