Jesus Villalba
37 papers · 2017–2025 · 3 top CS/AI conferences
Achievements
Taxonomy Completionist (18)
Keyword Pioneer
Renaissance Researcher (6)
Interdisciplinary Bridge
Hot Topic Early Bird
Cross-Pollinator (11)
Academic Marathon (8)
Conference Loyalist (35)
Dynamic Duo (36)
Topic Evolution
Keyword Champion (2)
Deep Specialist (11)
Keyword Collector (54)
Prolific Year (6)
Unstoppable (9)
The Questioner
Century Club (37)
Conferences
INTERSPEECH (35)
EMNLP (1)
NIPS (1)
Keywords
speaker verification (9)
speaker recognition (9)
speaker embedding (8)
domain adaptation (6)
self-supervised learning (6)
adversarial attack (5)
automatic speech recognition (4)
deep neural network (3)
x-vector embedding (3)
representation learning (3)
probabilistic linear discriminant analysis (3)
bandwidth extension (3)
end-to-end learning (2)
residual network (2)
adversarial robustness (2)
black-box attack (2)
speaker identity (2)
multimodal learning (2)
speaker diarization (2)
speech processing (2)
Papers
Paired by the Teacher: Turning Unpaired Data into High-Fidelity Pairs for Low-Resource Text Generation
EMNLP 2025
CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing
NIPS 2024
Exploring the Complementary Nature of Speech and Eye Movements for Profiling Neurological Disorders
INTERSPEECH 2024
Noise-robust Speech Separation with Fast Generative Correction
INTERSPEECH 2024
Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition
INTERSPEECH 2023
Segmental SpeechCLIP: Utilizing Pretrained Image-text Models for Audio-Visual Learning
INTERSPEECH 2023
Advances in Language Recognition in Low Resource African Languages: The JHU-MIT Submission for NIST LRE22
INTERSPEECH 2023
DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
INTERSPEECH 2023
Do Phonatory Features Display Robustness to Characterize Parkinsonian Speech Across Corpora?
INTERSPEECH 2023
End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors
INTERSPEECH 2022
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification
INTERSPEECH 2022
Non-contrastive self-supervised learning of utterance-level speech representations
INTERSPEECH 2022
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser
INTERSPEECH 2022
Chunking Defense for Adversarial Attacks on ASR
INTERSPEECH 2022
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification
INTERSPEECH 2022
Align-Denoise: Single-Pass Non-Autoregressive Speech Recognition
INTERSPEECH 2021
Automatic Detection and Assessment of Alzheimer Disease Using Speech and Language Technologies in Low-Resource Scenarios
INTERSPEECH 2021
Representation Learning to Classify and Detect Adversarial Attacks Against Speaker and Speech Recognition Systems
INTERSPEECH 2021
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
INTERSPEECH 2021
Spine2Net: SpineNet with Res2Net and Time-Squeeze-and-Excitation Blocks for Speaker Recognition
INTERSPEECH 2021
Deep Feature CycleGANs: Speaker Identity Preserving Non-Parallel Microphone-Telephone Domain Adaptation for Speaker Verification
INTERSPEECH 2021
Learning Speaker Embedding from Text-to-Speech
INTERSPEECH 2020
Black-Box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples
INTERSPEECH 2020
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery
INTERSPEECH 2020
x-Vectors Meet Adversarial Attacks: Benchmarking Adversarial Robustness in Speaker Verification
INTERSPEECH 2020
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18
INTERSPEECH 2019
The JHU Speaker Recognition System for the VOiCES 2019 Challenge
INTERSPEECH 2019
Tied Mixture of Factor Analyzers Layer to Combine Frame Level Representations in Neural Speaker Embeddings
INTERSPEECH 2019
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual Networks
INTERSPEECH 2019
An Investigation of Non-linear i-vectors for Speaker Verification
INTERSPEECH 2018
Deep Neural Networks for Emotion Recognition Combining Audio and Transcripts
INTERSPEECH 2018
End-to-end Deep Neural Network Age Estimation
INTERSPEECH 2018
Investigation on Bandwidth Extension for Speaker Recognition
INTERSPEECH 2018
Effectiveness of Single-Channel BLSTM Enhancement for Language Identification
INTERSPEECH 2018
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge
INTERSPEECH 2018
Tied Variational Autoencoder Backends for i-Vector Speaker Recognition
INTERSPEECH 2017
Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering
INTERSPEECH 2017