Prasanta Kumar Ghosh

56 papers · 2016–2024 · 1 conference · across top CS/AI conferences

Achievements

+14 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (14) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🐣 Hot Topic Early Bird

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🏠 Conference Loyalist (56) 🤝 Dynamic Duo (10) 🏆 Keyword Champion (2) 👥 Mega-Team (20) 🔬 Deep Specialist (22) 🚀 Conference Pioneer ⚡ Prolific Year (8) 🔥 Unstoppable (9) 🗃️ Keyword Collector (246) ❓ The Questioner 📈 Trend Setter 💎 Century Club (56)

Conferences

INTERSPEECH (56)

Top co-authors

Chiranjeevi Yarra (10) Aravind Illa (9) Sathvik Udupa (7) Yamini Belur (6) Ravi Yadav (5) Tanuka Bhattacharjee (4) G. Nisha Meenakshi (4) Renuka Mannem (4) Achuth Rao M.V. (3) Abhayjeet Singh (3)

Keywords

deep neural network (6) articulatory movement (6) speech classification (5) acoustic-to-articulatory inversion (5) speech synthesis (5) acoustic feature (5) mel frequency cepstral coefficient (4) bidirectional lstm (4) gaussian mixture model (4) phoneme recognition (4) mel-frequency cepstral coefficient (3) whispered speech (3) acoustic analysis (3) image segmentation (3) second language learning (3) speech processing (3) convolutional neural network (3) medical imaging (3) self-supervised learning (3) bidirectional long short-term memory (3)

Papers

A comparative study of the impact of voiceless alveolar and palato-alveolar sibilants in English on lip aperture and protrusion during VCV production INTERSPEECH 2024 IndicMOS: Multilingual MOS Prediction for 7 Indian languages INTERSPEECH 2024 Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models INTERSPEECH 2024 Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss INTERSPEECH 2024 Exploring Syllable Discriminability during Diadochokinetic Task with Increasing Dysarthria Severity for Patients with Amyotrophic Lateral Sclerosis INTERSPEECH 2024 Weakly supervised glottis segmentation in high-speed videoendoscopy using bounding box labels INTERSPEECH 2023 Transfer Learning to Aid Dysarthria Severity Classification for Patients with Amyotrophic Lateral Sclerosis INTERSPEECH 2023 Classification of Multi-class Vowels and Fricatives From Patients Having Amyotrophic Lateral Sclerosis with Varied Levels of Dysarthria Severity INTERSPEECH 2023 Do Vocal Breath Sounds Encode Gender Cues for Automatic Gender Classification? INTERSPEECH 2023 Exploring a classification approach using quantised articulatory movements for acoustic to articulatory inversion INTERSPEECH 2023 A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence INTERSPEECH 2023 An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations INTERSPEECH 2023 Watch Me Speak: 2D Visualization of Human Mouth during Speech INTERSPEECH 2022 DiCOVA Challenge: Dataset, Task, and Baseline System for COVID-19 Diagnosis Using Acoustics INTERSPEECH 2021 Web Interface for Estimating Articulatory Movements in Speech Production from Acoustics and Text INTERSPEECH 2021 Source and Vocal Tract Cues for Speech-Based Classification of Patients with Parkinson’s Disease and Healthy Subjects INTERSPEECH 2021 MUCS 2021: Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages INTERSPEECH 2021 Noise Robust Pitch Stylization Using Minimum Mean Absolute Error Criterion INTERSPEECH 2021 Estimating Articulatory Movements in Speech Production with Transformer Networks INTERSPEECH 2021 A Comparative Study of Different EMG Features for Acoustics-to-EMG Mapping INTERSPEECH 2021 Coswara — A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis INTERSPEECH 2020 Speaker Conditioned Acoustic-to-Articulatory Inversion Using x-Vectors INTERSPEECH 2020 Air-Tissue Boundary Segmentation in Real Time Magnetic Resonance Imaging Video Using 3-D Convolutional Neural Network INTERSPEECH 2020 An Investigation of the Virtual Lip Trajectories During the Production of Bilabial Stops and Nasal at Different Speaking Rates INTERSPEECH 2020 Speech Rate Task-Specific Representation Learning from Acoustic-Articulatory Data INTERSPEECH 2020 Attention and Encoder-Decoder Based Models for Transforming Articulatory Movements at Different Speaking Rates INTERSPEECH 2020 Whisper Activity Detection Using CNN-LSTM Based Attention Pooling Network Trained for a Speaker Identification Task INTERSPEECH 2020 Raw Speech Waveform Based Classification of Patients with ALS, Parkinson’s Disease and Healthy Controls Using CNN-BLSTM INTERSPEECH 2020 Automatic Glottis Detection and Segmentation in Stroboscopic Videos Using Convolutional Networks INTERSPEECH 2020 Whisper to Neutral Mapping Using Cosine Similarity Maximization in i-Vector Space for Speaker Verification INTERSPEECH 2019 An Improved Goodness of Pronunciation (GoP) Measure for Pronunciation Evaluation with DNN-HMM System Considering HMM Transition Probabilities INTERSPEECH 2019 Comparison of Speech Tasks and Recording Devices for Voice Based Automatic Classification of Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis INTERSPEECH 2019 ASR Inspired Syllable Stress Detection for Pronunciation Evaluation Without Using a Supervised Classifier and Syllable Level Features INTERSPEECH 2019 An Investigation on Speaker Specific Articulatory Synthesis with Speaker Independent Articulatory Inversion INTERSPEECH 2019 Low Resource Automatic Intonation Classification Using Gated Recurrent Unit (GRU) Networks Pre-Trained with Synthesized Pitch Patterns INTERSPEECH 2019 SPIRE-fluent: A Self-Learning App for Tutoring Oral Fluency to Second Language English Learners INTERSPEECH 2019 Acoustic and Articulatory Feature Based Speech Rate Estimation Using a Convolutional Dense Neural Network INTERSPEECH 2019 SPIRE-SST: An Automatic Web-based Self-learning Tool for Syllable Stress Tutoring (SST) to the Second Language Learners INTERSPEECH 2018 Reconstructing Neutral Speech from Tracheoesophageal Speech INTERSPEECH 2018 Subband Weighting for Binaural Speech Source Localization INTERSPEECH 2018 Intonation tutor by SPIRE (In-SPIRE): An Online Tool for an Automatic Feedback to the Second Language Learners in Learning Intonation INTERSPEECH 2018 Whispered Speech to Neutral Speech Conversion Using Bidirectional LSTMs INTERSPEECH 2018 Speech Enhancement Using Deep Mixture of Experts Based on Hard Expectation Maximization INTERSPEECH 2018 Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video Using Semantic Segmentation with Fully Convolutional Networks INTERSPEECH 2018 Automatic Visual Augmentation for Concatenation Based Synthesized Articulatory Videos from Real-time MRI Data for Spoken Language Training INTERSPEECH 2018 Low Resource Acoustic-to-articulatory Inversion Using Bi-directional Long Short Term Memory INTERSPEECH 2018 Automatic Glottis Localization and Segmentation in Stroboscopic Videos Using Deep Neural Network INTERSPEECH 2018 Relating Articulatory Motions in Different Speaking Rates INTERSPEECH 2018 PRAV: A Phonetically Rich Audio Visual Corpus INTERSPEECH 2017 Phoneme State Posteriorgram Features for Speech Based Automatic Classification of Speakers in Cold and Healthy Condition INTERSPEECH 2017 Subband Selection for Binaural Speech Source Localization INTERSPEECH 2017 A Robust Voiced/Unvoiced Phoneme Classification from Whispered Speech Using the ‘Color’ of Whispered Phonemes and Deep Neural Network INTERSPEECH 2017 An Information Theoretic Analysis of the Temporal Synchrony Between Head Gestures and Prosodic Patterns in Spontaneous Speech INTERSPEECH 2017 A Dual Source-Filter Model of Snore Audio for Snorer Group Classification INTERSPEECH 2017 A Class-Specific Speech Enhancement for Phoneme Recognition: A Dictionary Learning Approach INTERSPEECH 2016 Automatic Recognition of Social Roles Using Long Term Role Transitions in Small Group Interactions INTERSPEECH 2016