Junichi Yamagishi
75 papers · 2016–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (20) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (7) π Conference Polyglot (8)
π
Interdisciplinary Bridge
π
Conference Polyglot
(8)
π
Cross-Pollinator
(5)
π
Conference Loyalist
(60)
π€
Dynamic Duo
(28)
π§¬
Topic Evolution
π¬
Deep Specialist
(18)
π
Keyword Champion
(4)
π
Trend Setter
π
Conference Pioneer
π₯
Unstoppable
(10)
β‘
Prolific Year
(5)
β
The Questioner
(3)
π
Century Club
(72)
ποΈ
Keyword Collector
(62)
Conferences
INTERSPEECH (60)
ACL (4)
COLING (3)
EACL (2)
IJCNLP (2)
AACL (1)
EMNLP (1)
ICCV (1)
WACV (1)
Top co-authors
Research topics
Keywords
speech synthesis
(16)
speaker verification
(13)
text-to-speech synthesis
(9)
voice conversion
(9)
neural network
(7)
recurrent neural network
(5)
deep neural network
(5)
mean opinion score
(5)
listening test
(4)
neural vocoder
(4)
equal error rate
(4)
generative adversarial network
(4)
speech enhancement
(4)
speech intelligibility
(3)
glottal excitation
(3)
spoofing countermeasure
(3)
self-supervised learning
(3)
spoofing detection
(3)
fundamental frequency
(3)
speaker identity
(3)
Papers
Pushing the Frontiers of Scientific Fact-Checking: The SCINLP Dataset
EACL 2026
Prompt-driven Detection of Offensive Urdu Language using Large Language Models
EACL 2026
When Bigger Isnβt Better: A Comprehensive Fairness Evaluation of Political Bias in Multi-News Summarisation
ACL 2026
QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions
ACL 2025
Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis
INTERSPEECH 2024
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios
INTERSPEECH 2024
Generating Speakers by Prompting Listener Impressions for Pre-trained Multi-Speaker Text-to-Speech Systems
INTERSPEECH 2024
Target Speaker Extraction with Curriculum Learning
INTERSPEECH 2024
Speaker Detection by the Individual Listener and the Crowd: Parametric Models Applicable to Bonafide and Deepfake Speech
INTERSPEECH 2024
To what extent can ASV systems naturally defend against spoofing attacks?
INTERSPEECH 2024
Experimental evaluation of MOS, AB and BWS listening test designs
INTERSPEECH 2024
Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio
INTERSPEECH 2024
Bridging Textual and Tabular Worlds for Fact Verification: A Lightweight, Attention-Based Model
COLING 2024
Investigating Range-Equalizing Bias in Mean Opinion Score Ratings of Synthesized Speech
INTERSPEECH 2023
Analysis of Master Vein Attacks on Finger Vein Recognition Systems
WACV 2023
Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms
INTERSPEECH 2023
Revisiting Pathologies of Neural Models under Input Reduction
ACL 2023
Range-Based Equal Error Rate for Spoof Localization
INTERSPEECH 2023
Towards Single Integrated Spoofing-aware Speaker Verification Embeddings
INTERSPEECH 2023
Controlling Multi-Class Human Vocalization Generation via a Simple Segment-based Labeling Scheme
INTERSPEECH 2023
The VoiceMOS Challenge 2022
INTERSPEECH 2022
DDS: A new device-degraded speech dataset for speech enhancement
INTERSPEECH 2022
Spoofing-Aware Attention based ASV Back-end with Multiple Enrollment Utterances and a Sampling Strategy for the SASV Challenge 2022
INTERSPEECH 2022
Mitigating the Diminishing Effect of Elastic Weight Consolidation
COLING 2022
Outlier-Aware Training for Improving Group Accuracy Disparities
AACL 2022
Outlier-Aware Training for Improving Group Accuracy Disparities
IJCNLP 2022
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions
INTERSPEECH 2022
A Multi-Level Attention Model for Evidence-Based Fact Checking
IJCNLP 2021
A Multi-Level Attention Model for Evidence-Based Fact Checking
ACL 2021
OpenForensics: Large-Scale Challenging Dataset for Multi-Face Forgery Detection and Segmentation In-the-Wild
ICCV 2021
A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection
INTERSPEECH 2021
An Initial Investigation for Detecting Partially Spoofed Audio
INTERSPEECH 2021
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing
INTERSPEECH 2021
Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS?
INTERSPEECH 2020
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction
INTERSPEECH 2020
Introducing the VoicePrivacy Initiative
INTERSPEECH 2020
The Privacy ZEBRA: Zero Evidence Biometric Recognition Assessment
INTERSPEECH 2020
Design Choices for X-Vector Based Speaker Anonymization
INTERSPEECH 2020
Using Cyclic Noise as the Source Signal for Neural Source-Filter-Based Speech Waveform Model
INTERSPEECH 2020
Noise Tokens: Learning Neural Noise Templates for Environment-Aware Speech Enhancement
INTERSPEECH 2020
Reverberation Modeling for Source-Filter-Based Neural Vocoder
INTERSPEECH 2020
Viable Threat on News Reading: Generating Biased News Using Natural Language Models
EMNLP 2020
iMetricGAN: Intelligibility Enhancement for Speech-in-Noise Using Generative Adversarial Network-Based Metric Learning
INTERSPEECH 2020
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
INTERSPEECH 2019
Does the Lombard Effect Improve Emotional Communication in Noise? β Analysis of Emotional Speech Acted in Noise
INTERSPEECH 2019
Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet
INTERSPEECH 2019
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram
INTERSPEECH 2019
Training Multi-Speaker Neural Text-to-Speech Systems Using Speaker-Imbalanced Speech Corpora
INTERSPEECH 2019
MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion
INTERSPEECH 2019
Investigating Accuracy of Pitch-accent Annotations in Neural Network-based Speech Synthesis and Denoising Effects
INTERSPEECH 2018
Expressive Speech Synthesis Using Sentiment Embeddings
INTERSPEECH 2018
Speaker-independent Raw Waveform Model for Glottal Excitation
INTERSPEECH 2018
Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion
INTERSPEECH 2018
Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation
INTERSPEECH 2018
An RNN-Based Quantized F0 Model with Multi-Tier Feedback Links for Text-to-Speech Synthesis
INTERSPEECH 2017
Principles for Learning Controllable TTS from Annotated and Latent Variation
INTERSPEECH 2017
Generative Adversarial Network-Based Postfilter for STFT Spectrograms
INTERSPEECH 2017
Speech Intelligibility in Cars: The Effect of Speaking Style, Noise and Listener Age
INTERSPEECH 2017
Reducing Mismatch in Training of DNN-Based Glottal Excitation Models in a Statistical Parametric Text-to-Speech System
INTERSPEECH 2017
Direct Modeling of Frequency Spectra and Waveform Generation Based on Phase Recovery for DNN-Based Speech Synthesis
INTERSPEECH 2017
Learning Word Vector Representations Based on Acoustic Counts
INTERSPEECH 2017
Misperceptions of the Emotional Content of Natural and Vocoded Speech in a Car
INTERSPEECH 2017
The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection
INTERSPEECH 2017
Complex-Valued Restricted Boltzmann Machine for Direct Learning of Frequency Spectra
INTERSPEECH 2017
Enhance the Word Vector with Prosodic Information for the Recurrent Neural Network Based TTS System
INTERSPEECH 2016
Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks
INTERSPEECH 2016
Applying Spectral Normalisation and Efficient Envelope Estimation and Statistical Transformation for the Voice Conversion Challenge 2016
INTERSPEECH 2016
Analysis of the Voice Conversion Challenge 2016 Evaluation Results
INTERSPEECH 2016
The Voice Conversion Challenge 2016
INTERSPEECH 2016
The SIWIS Database: A Multilingual Speech Database with Acted Emphasis
INTERSPEECH 2016
Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering
INTERSPEECH 2016
Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks
INTERSPEECH 2016
A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks
INTERSPEECH 2016
Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM
COLING 2016
Syllable-Level Representations of Suprasegmental Features for DNN-Based Text-to-Speech Synthesis
INTERSPEECH 2016