conftrace_

Jesus Villalba

37 papers · 2017–2025 · 3 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+13 more ↓

🗺️ Taxonomy Completionist (18) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🐝 Cross-Pollinator (11) 🗺️ Taxonomy Completionist (18) 🏃 Academic Marathon (8) 🏠 Conference Loyalist (35) 🤝 Dynamic Duo (36) 🧬 Topic Evolution 🏆 Keyword Champion (2) 🔬 Deep Specialist (11) 🗃️ Keyword Collector (54) ⚡ Prolific Year (6) 🔥 Unstoppable (9) ❓ The Questioner 💎 Century Club (37)

Conferences

INTERSPEECH (35) EMNLP (1) NIPS (1)

Top co-authors

Najim Dehak (36) Laureano Moro-Velazquez (13) Piotr Żelasko (10) Thomas Thebaud (8) Nanxin Chen (7) Saurabh Kataria (7) Sanjeev Khudanpur (6) Sonal Joshi (5) Daniel Povey (4) Jaejin Cho (4)

Keywords

speaker verification (9) speaker recognition (9) speaker embedding (8) domain adaptation (6) self-supervised learning (6) adversarial attack (5) automatic speech recognition (4) deep neural network (3) x-vector embedding (3) representation learning (3) probabilistic linear discriminant analysis (3) bandwidth extension (3) end-to-end learning (2) residual network (2) adversarial robustness (2) black-box attack (2) speaker identity (2) multimodal learning (2) speaker diarization (2) speech processing (2)

Papers

Paired by the Teacher: Turning Unpaired Data into High-Fidelity Pairs for Low-Resource Text Generation EMNLP 2025 CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing NIPS 2024 Exploring the Complementary Nature of Speech and Eye Movements for Profiling Neurological Disorders INTERSPEECH 2024 Noise-robust Speech Separation with Fast Generative Correction INTERSPEECH 2024 Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition INTERSPEECH 2023 Segmental SpeechCLIP: Utilizing Pretrained Image-text Models for Audio-Visual Learning INTERSPEECH 2023 Advances in Language Recognition in Low Resource African Languages: The JHU-MIT Submission for NIST LRE22 INTERSPEECH 2023 DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model INTERSPEECH 2023 Do Phonatory Features Display Robustness to Characterize Parkinsonian Speech Across Corpora? INTERSPEECH 2023 End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors INTERSPEECH 2022 Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification INTERSPEECH 2022 Non-contrastive self-supervised learning of utterance-level speech representations INTERSPEECH 2022 Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser INTERSPEECH 2022 Chunking Defense for Adversarial Attacks on ASR INTERSPEECH 2022 AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification INTERSPEECH 2022 Align-Denoise: Single-Pass Non-Autoregressive Speech Recognition INTERSPEECH 2021 Automatic Detection and Assessment of Alzheimer Disease Using Speech and Language Technologies in Low-Resource Scenarios INTERSPEECH 2021 Representation Learning to Classify and Detect Adversarial Attacks Against Speaker and Speech Recognition Systems INTERSPEECH 2021 Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation INTERSPEECH 2021 Spine2Net: SpineNet with Res2Net and Time-Squeeze-and-Excitation Blocks for Speaker Recognition INTERSPEECH 2021 Deep Feature CycleGANs: Speaker Identity Preserving Non-Parallel Microphone-Telephone Domain Adaptation for Speaker Verification INTERSPEECH 2021 Learning Speaker Embedding from Text-to-Speech INTERSPEECH 2020 Black-Box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples INTERSPEECH 2020 Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery INTERSPEECH 2020 x-Vectors Meet Adversarial Attacks: Benchmarking Adversarial Robustness in Speaker Verification INTERSPEECH 2020 State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18 INTERSPEECH 2019 The JHU Speaker Recognition System for the VOiCES 2019 Challenge INTERSPEECH 2019 Tied Mixture of Factor Analyzers Layer to Combine Frame Level Representations in Neural Speaker Embeddings INTERSPEECH 2019 ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual Networks INTERSPEECH 2019 An Investigation of Non-linear i-vectors for Speaker Verification INTERSPEECH 2018 Deep Neural Networks for Emotion Recognition Combining Audio and Transcripts INTERSPEECH 2018 End-to-end Deep Neural Network Age Estimation INTERSPEECH 2018 Investigation on Bandwidth Extension for Speaker Recognition INTERSPEECH 2018 Effectiveness of Single-Channel BLSTM Enhancement for Language Identification INTERSPEECH 2018 Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge INTERSPEECH 2018 Tied Variational Autoencoder Backends for i-Vector Speaker Recognition INTERSPEECH 2017 Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering INTERSPEECH 2017