Zhen-Hua Ling
69 papers · 2015–2026 · 12 top CS/AI conferences
Achievements
Taxonomy Completionist (16)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (5)
Conference Polyglot (12)
Keyword Trendsetter Combo (3)
Conference Loyalist (28)
Dynamic Duo (16)
Topic Evolution
Topic Pioneer
Deep Specialist (14)
Keyword Champion (3)
The Questioner
Century Club (65)
Keyword Collector (260)
Trend Setter
Conference Pioneer
Unstoppable (11)
Prolific Year (6)
Conferences
INTERSPEECH (28)
ACL (13)
EMNLP (9)
AAAI (4)
NAACL (4)
IJCNLP (3)
ICLR (2)
IJCAI (2)
AACL (1)
COLING (1)
ICML (1)
SEMEVAL (1)
Keywords
speech synthesis (10)
large language model (6)
neural network (6)
dialogue system (5)
speaker identification (4)
neural vocoder (4)
voice conversion (4)
acoustic model (4)
deep neural network (4)
multi-party conversation (4)
speech bandwidth extension (3)
connectionist temporal classification (3)
graph neural network (3)
named entity recognition (3)
retrieval-augmented generation (3)
knowledge integration (3)
natural language inference (3)
long short-term memory (3)
attention mechanism (3)
matching network (3)
Papers
GenesisFunc: Multi-Agent Data Generation for Accurate and Generalizable Function-Calling
ACL 2026
Say More with Less: Variable-Frame-Rate Speech Tokenization via Adaptive Clustering and Implicit Duration Coding
AAAI 2026
Multiplicative Orthogonal Sequential Editing for Language Models
AAAI 2026
UniVocal: Unified Speech-Singing Code-Switching Synthesis
ACL 2026
UniSpeaker: A Unified Approach for Multimodality-driven Speaker Generation
EMNLP 2025
RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation
ACL 2025
RISE: Reasoning Enhancement via Iterative Self-Exploration in Multi-hop Question Answering
ACL 2025
Constraining Sequential Model Editing with Editing Anchor Compression
NAACL 2025
Perturbation-Restrained Sequential Model Editing
ICLR 2025
Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering
EMNLP 2025
Neighboring Perturbations of Knowledge Editing on Large Language Models
ICML 2024
X-ACE: Explainable and Multi-factor Audio Captioning Evaluation
ACL 2024
Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
EMNLP 2024
Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
EMNLP 2024
Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding
INTERSPEECH 2024
A Low-Bitrate Neural Audio Codec Framework with Bandwidth Reduction and Recovery for High-Sampling-Rate Waveforms
INTERSPEECH 2024
MultiStage Speech Bandwidth Extension with Flexible Sampling Rate Control
INTERSPEECH 2024
Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech
INTERSPEECH 2024
BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation
INTERSPEECH 2024
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation
INTERSPEECH 2023
Learning WHO Saying WHAT to WHOM in Multi-Party Conversations
AACL 2023
MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation
EMNLP 2023
Symbolization, Prompt, and Classification: A Framework for Implicit Speaker Identification in Novels
EMNLP 2023
Is ChatGPT a Good Multi-Party Conversation Solver?
EMNLP 2023
Learning WHO Saying WHAT to WHOM in Multi-Party Conversations
IJCNLP 2023
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
INTERSPEECH 2023
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions
INTERSPEECH 2023
Speech Synthesis with Self-Supervisedly Learnt Prosodic Representations
INTERSPEECH 2023
Who Says What to Whom: A Survey of Multi-Party Conversations
IJCAI 2022
HeterMPC: A Heterogeneous Graph Neural Network for Response Generation in Multi-Party Conversations
ACL 2022
USTC-NELSLIP at SemEval-2022 Task 11: Gazetteer-Adapted Integration Network for Multilingual Complex Named Entity Recognition
SEMEVAL 2022
Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining Unsupervised and Supervised Phonetic Representations
INTERSPEECH 2022
PoNet: Pooling Network for Efficient Token Mixing in Long Sequences
ICLR 2022
USTC-NELSLIP at SemEval-2022 Task 11: Gazetteer-Adapted Integration Network for Multilingual Complex Named Entity Recognition
NAACL 2022
Conversation- and Tree-Structure Losses for Dialogue Disentanglement
ACL 2022
TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge
ACL 2022
Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing
AAAI 2021
TaLNet: Voice Reconstruction from Tongue and Lip Articulation with Transfer Learning from Text-to-Speech Synthesis
AAAI 2021
Adversarial Voice Conversion Against Neural Spoofing Detectors
INTERSPEECH 2021
A Neural-Network-Based Approach to Identifying Speakers in Novels
INTERSPEECH 2021
UnitNet-Based Hybrid Speech Synthesis
INTERSPEECH 2021
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders
INTERSPEECH 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
INTERSPEECH 2020
Unsupervised Regularization-Based Adaptive Training for Speech Recognition
INTERSPEECH 2020
Adaptive Speaker Normalization for CTC-Based Speech Recognition
INTERSPEECH 2020
An Adaptive X-Vector Model for Text-Independent Speaker Verification
INTERSPEECH 2020
Reverberation Modeling for Source-Filter-Based Neural Vocoder
INTERSPEECH 2020
Neural Text Clustering with Document-Level Attention Based on Dynamic Soft Labels
INTERSPEECH 2019
Dually Interactive Matching Network for Personalized Response Selection in Retrieval-Based Chatbots
IJCNLP 2019
Dually Interactive Matching Network for Personalized Response Selection in Retrieval-Based Chatbots
EMNLP 2019
Distant Supervision Relation Extraction with Intra-Bag and Inter-Bag Attentions
NAACL 2019
Multi-Level Matching and Aggregation Network for Few-Shot Relation Classification
ACL 2019
A Chinese Dataset for Identifying Speakers in Novels
INTERSPEECH 2019
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling
INTERSPEECH 2019
Neural Natural Language Inference Models Enhanced with External Knowledge
ACL 2018
Enhancing Sentence Embedding with Generalized Pooling
COLING 2018
Hybrid semi-Markov CRF for Neural Sequence Labeling
ACL 2018
Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis
INTERSPEECH 2018
WaveNet Vocoder with Limited Training Data for Voice Conversion
INTERSPEECH 2018
Cause-Effect Knowledge Acquisition and Neural Association Model for Solving A Set of Winograd Schema Problems
IJCAI 2017
Enhanced LSTM for Natural Language Inference
ACL 2017
Waveform Modeling Using Stacked Dilated Convolutional Neural Networks for Speech Bandwidth Extension
INTERSPEECH 2017
The USTC System for Voice Conversion Challenge 2016: Neural Network Based Approaches for Spectrum, Aperiodicity and F0 Conversion
INTERSPEECH 2016
Speech Bandwidth Extension Using Bottleneck Features and Deep Recurrent Neural Networks
INTERSPEECH 2016
Intra-Topic Variability Normalization based on Linear Projection for Topic Classification
NAACL 2016
Exploring Semantic Representation in Brain Activity Using Word Embeddings
EMNLP 2016
Articulatory-to-Acoustic Conversion with Cascaded Prediction of Spectral and Excitation Features Using Neural Networks
INTERSPEECH 2016
Learning Semantic Word Embeddings based on Ordinal Knowledge Constraints
IJCNLP 2015
Learning Semantic Word Embeddings based on Ordinal Knowledge Constraints
ACL 2015