Junichi Yamagishi

75 papers · 2016–2026 · 9 conferences · across top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (20) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🌍 Conference Polyglot (8)

🐝 Cross-Pollinator (5) 🏠 Conference Loyalist (60) 🤝 Dynamic Duo (28) 🧬 Topic Evolution 🔬 Deep Specialist (18) 🏆 Keyword Champion (4) 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (10) ⚡ Prolific Year (5) ❓ The Questioner (3) 💎 Century Club (72) 🗃️ Keyword Collector (62)

Conferences

INTERSPEECH (60) ACL (4) COLING (3) EACL (2) IJCNLP (2) AACL (1) EMNLP (1) ICCV (1) WACV (1)

Top co-authors

Xin Wang (28) Erica Cooper (12) Nicholas Evans (11) Shinji Takaki (8) Canasai Kruengkrai (7) Massimiliano Todisco (7) Kong Aik Lee (6) Lauri Juvela (5) Tomi Kinnunen (5) Paavo Alku (5)

Research topics

Privacy (4) Speech & Audio (1)

Keywords

speech synthesis (16) speaker verification (13) text-to-speech synthesis (9) voice conversion (9) neural network (7) recurrent neural network (5) deep neural network (5) mean opinion score (5) listening test (4) neural vocoder (4) equal error rate (4) generative adversarial network (4) speech enhancement (4) speech intelligibility (3) glottal excitation (3) spoofing countermeasure (3) self-supervised learning (3) spoofing detection (3) fundamental frequency (3) speaker identity (3)

Papers

Pushing the Frontiers of Scientific Fact-Checking: The SCINLP Dataset EACL 2026
Prompt-driven Detection of Offensive Urdu Language using Large Language Models EACL 2026
When Bigger Isn’t Better: A Comprehensive Fairness Evaluation of Political Bias in Multi-News Summarisation ACL 2026
QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions ACL 2025
Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis INTERSPEECH 2024
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios INTERSPEECH 2024
Generating Speakers by Prompting Listener Impressions for Pre-trained Multi-Speaker Text-to-Speech Systems INTERSPEECH 2024
Target Speaker Extraction with Curriculum Learning INTERSPEECH 2024
Speaker Detection by the Individual Listener and the Crowd: Parametric Models Applicable to Bonafide and Deepfake Speech INTERSPEECH 2024
To what extent can ASV systems naturally defend against spoofing attacks? INTERSPEECH 2024
Experimental evaluation of MOS, AB and BWS listening test designs INTERSPEECH 2024
Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio INTERSPEECH 2024
Bridging Textual and Tabular Worlds for Fact Verification: A Lightweight, Attention-Based Model COLING 2024
Investigating Range-Equalizing Bias in Mean Opinion Score Ratings of Synthesized Speech INTERSPEECH 2023
Analysis of Master Vein Attacks on Finger Vein Recognition Systems WACV 2023
Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms INTERSPEECH 2023
Revisiting Pathologies of Neural Models under Input Reduction ACL 2023
Range-Based Equal Error Rate for Spoof Localization INTERSPEECH 2023
Towards Single Integrated Spoofing-aware Speaker Verification Embeddings INTERSPEECH 2023
Controlling Multi-Class Human Vocalization Generation via a Simple Segment-based Labeling Scheme INTERSPEECH 2023
The VoiceMOS Challenge 2022 INTERSPEECH 2022
DDS: A new device-degraded speech dataset for speech enhancement INTERSPEECH 2022
Spoofing-Aware Attention based ASV Back-end with Multiple Enrollment Utterances and a Sampling Strategy for the SASV Challenge 2022 INTERSPEECH 2022
Mitigating the Diminishing Effect of Elastic Weight Consolidation COLING 2022
Outlier-Aware Training for Improving Group Accuracy Disparities AACL 2022
Outlier-Aware Training for Improving Group Accuracy Disparities IJCNLP 2022
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions INTERSPEECH 2022
A Multi-Level Attention Model for Evidence-Based Fact Checking IJCNLP 2021
A Multi-Level Attention Model for Evidence-Based Fact Checking ACL 2021
OpenForensics: Large-Scale Challenging Dataset for Multi-Face Forgery Detection and Segmentation In-the-Wild ICCV 2021
A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection INTERSPEECH 2021
An Initial Investigation for Detecting Partially Spoofed Audio INTERSPEECH 2021
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing INTERSPEECH 2021
Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS? INTERSPEECH 2020
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction INTERSPEECH 2020
Introducing the VoicePrivacy Initiative INTERSPEECH 2020
The Privacy ZEBRA: Zero Evidence Biometric Recognition Assessment INTERSPEECH 2020
Design Choices for X-Vector Based Speaker Anonymization INTERSPEECH 2020
Using Cyclic Noise as the Source Signal for Neural Source-Filter-Based Speech Waveform Model INTERSPEECH 2020
Noise Tokens: Learning Neural Noise Templates for Environment-Aware Speech Enhancement INTERSPEECH 2020
Reverberation Modeling for Source-Filter-Based Neural Vocoder INTERSPEECH 2020
Viable Threat on News Reading: Generating Biased News Using Natural Language Models EMNLP 2020
iMetricGAN: Intelligibility Enhancement for Speech-in-Noise Using Generative Adversarial Network-Based Metric Learning INTERSPEECH 2020
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection INTERSPEECH 2019
Does the Lombard Effect Improve Emotional Communication in Noise? — Analysis of Emotional Speech Acted in Noise INTERSPEECH 2019
Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet INTERSPEECH 2019
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram INTERSPEECH 2019
Training Multi-Speaker Neural Text-to-Speech Systems Using Speaker-Imbalanced Speech Corpora INTERSPEECH 2019
MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion INTERSPEECH 2019
Investigating Accuracy of Pitch-accent Annotations in Neural Network-based Speech Synthesis and Denoising Effects INTERSPEECH 2018
Expressive Speech Synthesis Using Sentiment Embeddings INTERSPEECH 2018
Speaker-independent Raw Waveform Model for Glottal Excitation INTERSPEECH 2018
Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion INTERSPEECH 2018
Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation INTERSPEECH 2018
An RNN-Based Quantized F0 Model with Multi-Tier Feedback Links for Text-to-Speech Synthesis INTERSPEECH 2017
Principles for Learning Controllable TTS from Annotated and Latent Variation INTERSPEECH 2017
Generative Adversarial Network-Based Postfilter for STFT Spectrograms INTERSPEECH 2017
Speech Intelligibility in Cars: The Effect of Speaking Style, Noise and Listener Age INTERSPEECH 2017
Reducing Mismatch in Training of DNN-Based Glottal Excitation Models in a Statistical Parametric Text-to-Speech System INTERSPEECH 2017
Direct Modeling of Frequency Spectra and Waveform Generation Based on Phase Recovery for DNN-Based Speech Synthesis INTERSPEECH 2017
Learning Word Vector Representations Based on Acoustic Counts INTERSPEECH 2017
Misperceptions of the Emotional Content of Natural and Vocoded Speech in a Car INTERSPEECH 2017
The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection INTERSPEECH 2017
Complex-Valued Restricted Boltzmann Machine for Direct Learning of Frequency Spectra INTERSPEECH 2017
Enhance the Word Vector with Prosodic Information for the Recurrent Neural Network Based TTS System INTERSPEECH 2016
Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks INTERSPEECH 2016
Applying Spectral Normalisation and Efficient Envelope Estimation and Statistical Transformation for the Voice Conversion Challenge 2016 INTERSPEECH 2016
Analysis of the Voice Conversion Challenge 2016 Evaluation Results INTERSPEECH 2016
The Voice Conversion Challenge 2016 INTERSPEECH 2016
The SIWIS Database: A Multilingual Speech Database with Acted Emphasis INTERSPEECH 2016
Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering INTERSPEECH 2016
Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks INTERSPEECH 2016
A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks INTERSPEECH 2016
Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM COLING 2016
Syllable-Level Representations of Suprasegmental Features for DNN-Based Text-to-Speech Synthesis INTERSPEECH 2016