Prasanta Kumar Ghosh
56 papers · 2016–2024 · 1 conference · across top CS/AI conferences
Achievements
Keyword Pioneer
Taxonomy Completionist (14)
Interdisciplinary Bridge
Renaissance Researcher (6)
Hot Topic Early Bird
Conference Loyalist (56)
Dynamic Duo (10)
Keyword Champion (2)
Mega-Team (20)
Deep Specialist (22)
Conference Pioneer
Prolific Year (8)
Unstoppable (9)
Keyword Collector (246)
The Questioner
Trend Setter
Century Club (56)
Conferences
INTERSPEECH (56)
Top co-authors
Keywords
deep neural network (6)
articulatory movement (6)
speech classification (5)
acoustic-to-articulatory inversion (5)
speech synthesis (5)
acoustic feature (5)
mel frequency cepstral coefficient (4)
bidirectional lstm (4)
gaussian mixture model (4)
phoneme recognition (4)
mel-frequency cepstral coefficient (3)
whispered speech (3)
acoustic analysis (3)
image segmentation (3)
second language learning (3)
speech processing (3)
convolutional neural network (3)
medical imaging (3)
self-supervised learning (3)
bidirectional long short-term memory (3)
Papers
A comparative study of the impact of voiceless alveolar and palato-alveolar sibilants in English on lip aperture and protrusion during VCV production
INTERSPEECH 2024
IndicMOS: Multilingual MOS Prediction for 7 Indian languages
INTERSPEECH 2024
Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models
INTERSPEECH 2024
Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss
INTERSPEECH 2024
Exploring Syllable Discriminability during Diadochokinetic Task with Increasing Dysarthria Severity for Patients with Amyotrophic Lateral Sclerosis
INTERSPEECH 2024
Weakly supervised glottis segmentation in high-speed videoendoscopy using bounding box labels
INTERSPEECH 2023
Transfer Learning to Aid Dysarthria Severity Classification for Patients with Amyotrophic Lateral Sclerosis
INTERSPEECH 2023
Classification of Multi-class Vowels and Fricatives From Patients Having Amyotrophic Lateral Sclerosis with Varied Levels of Dysarthria Severity
INTERSPEECH 2023
Do Vocal Breath Sounds Encode Gender Cues for Automatic Gender Classification?
INTERSPEECH 2023
Exploring a classification approach using quantised articulatory movements for acoustic to articulatory inversion
INTERSPEECH 2023
A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence
INTERSPEECH 2023
An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations
INTERSPEECH 2023
Watch Me Speak: 2D Visualization of Human Mouth during Speech
INTERSPEECH 2022
DiCOVA Challenge: Dataset, Task, and Baseline System for COVID-19 Diagnosis Using Acoustics
INTERSPEECH 2021
Web Interface for Estimating Articulatory Movements in Speech Production from Acoustics and Text
INTERSPEECH 2021
Source and Vocal Tract Cues for Speech-Based Classification of Patients with Parkinson’s Disease and Healthy Subjects
INTERSPEECH 2021
MUCS 2021: Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages
INTERSPEECH 2021
Noise Robust Pitch Stylization Using Minimum Mean Absolute Error Criterion
INTERSPEECH 2021
Estimating Articulatory Movements in Speech Production with Transformer Networks
INTERSPEECH 2021
A Comparative Study of Different EMG Features for Acoustics-to-EMG Mapping
INTERSPEECH 2021
Coswara – A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis
INTERSPEECH 2020
Speaker Conditioned Acoustic-to-Articulatory Inversion Using x-Vectors
INTERSPEECH 2020
Air-Tissue Boundary Segmentation in Real Time Magnetic Resonance Imaging Video Using 3-D Convolutional Neural Network
INTERSPEECH 2020
An Investigation of the Virtual Lip Trajectories During the Production of Bilabial Stops and Nasal at Different Speaking Rates
INTERSPEECH 2020
Speech Rate Task-Specific Representation Learning from Acoustic-Articulatory Data
INTERSPEECH 2020
Attention and Encoder-Decoder Based Models for Transforming Articulatory Movements at Different Speaking Rates
INTERSPEECH 2020
Whisper Activity Detection Using CNN-LSTM Based Attention Pooling Network Trained for a Speaker Identification Task
INTERSPEECH 2020
Raw Speech Waveform Based Classification of Patients with ALS, Parkinson’s Disease and Healthy Controls Using CNN-BLSTM
INTERSPEECH 2020
Automatic Glottis Detection and Segmentation in Stroboscopic Videos Using Convolutional Networks
INTERSPEECH 2020
Whisper to Neutral Mapping Using Cosine Similarity Maximization in i-Vector Space for Speaker Verification
INTERSPEECH 2019
An Improved Goodness of Pronunciation (GoP) Measure for Pronunciation Evaluation with DNN-HMM System Considering HMM Transition Probabilities
INTERSPEECH 2019
Comparison of Speech Tasks and Recording Devices for Voice Based Automatic Classification of Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis
INTERSPEECH 2019
ASR Inspired Syllable Stress Detection for Pronunciation Evaluation Without Using a Supervised Classifier and Syllable Level Features
INTERSPEECH 2019
An Investigation on Speaker Specific Articulatory Synthesis with Speaker Independent Articulatory Inversion
INTERSPEECH 2019
Low Resource Automatic Intonation Classification Using Gated Recurrent Unit (GRU) Networks Pre-Trained with Synthesized Pitch Patterns
INTERSPEECH 2019
SPIRE-fluent: A Self-Learning App for Tutoring Oral Fluency to Second Language English Learners
INTERSPEECH 2019
Acoustic and Articulatory Feature Based Speech Rate Estimation Using a Convolutional Dense Neural Network
INTERSPEECH 2019
SPIRE-SST: An Automatic Web-based Self-learning Tool for Syllable Stress Tutoring (SST) to the Second Language Learners
INTERSPEECH 2018
Reconstructing Neutral Speech from Tracheoesophageal Speech
INTERSPEECH 2018
Subband Weighting for Binaural Speech Source Localization
INTERSPEECH 2018
Intonation tutor by SPIRE (In-SPIRE): An Online Tool for an Automatic Feedback to the Second Language Learners in Learning Intonation
INTERSPEECH 2018
Whispered Speech to Neutral Speech Conversion Using Bidirectional LSTMs
INTERSPEECH 2018
Speech Enhancement Using Deep Mixture of Experts Based on Hard Expectation Maximization
INTERSPEECH 2018
Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video Using Semantic Segmentation with Fully Convolutional Networks
INTERSPEECH 2018
Automatic Visual Augmentation for Concatenation Based Synthesized Articulatory Videos from Real-time MRI Data for Spoken Language Training
INTERSPEECH 2018
Low Resource Acoustic-to-articulatory Inversion Using Bi-directional Long Short Term Memory
INTERSPEECH 2018
Automatic Glottis Localization and Segmentation in Stroboscopic Videos Using Deep Neural Network
INTERSPEECH 2018
Relating Articulatory Motions in Different Speaking Rates
INTERSPEECH 2018
PRAV: A Phonetically Rich Audio Visual Corpus
INTERSPEECH 2017
Phoneme State Posteriorgram Features for Speech Based Automatic Classification of Speakers in Cold and Healthy Condition
INTERSPEECH 2017
Subband Selection for Binaural Speech Source Localization
INTERSPEECH 2017
A Robust Voiced/Unvoiced Phoneme Classification from Whispered Speech Using the “Color” of Whispered Phonemes and Deep Neural Network
INTERSPEECH 2017
An Information Theoretic Analysis of the Temporal Synchrony Between Head Gestures and Prosodic Patterns in Spontaneous Speech
INTERSPEECH 2017
A Dual Source-Filter Model of Snore Audio for Snorer Group Classification
INTERSPEECH 2017
A Class-Specific Speech Enhancement for Phoneme Recognition: A Dictionary Learning Approach
INTERSPEECH 2016
Automatic Recognition of Social Roles Using Long Term Role Transitions in Small Group Interactions
INTERSPEECH 2016