Co-occurring keywords
Papers
Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding
INTERSPEECH 2023
ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs
INTERSPEECH 2023
ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Speed
INTERSPEECH 2023
Flow-VAE VC: End-to-End Flow Framework with Contrastive Loss for Zero-shot Voice Conversion
INTERSPEECH 2023
Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech
INTERSPEECH 2023
Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation
INTERSPEECH 2023
Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis
INTERSPEECH 2023
ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models
INTERSPEECH 2023
Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations
INTERSPEECH 2023
Query Your Model with Definitions in FrameNet: An Effective Method for Frame Semantic Role Labeling
AAAI 2023