Jing Xiao
103 papers · 2004–2026 · 17 top CS/AI conferences
Achievements
Keyword Pioneer · Taxonomy Completionist (23) · Renaissance Researcher (6) · Interdisciplinary Bridge · Hot Topic Early Bird
Taxonomy Completionist
(23)
Keyword Pioneer
Conference Polyglot
(17)
Conference Loyalist
(36)
Dynamic Duo
(31)
Deep Specialist
(13)
Topic Evolution
Conference Pioneer
Unstoppable
(7)
Prolific Year
(9)
Trend Setter
Century Club
(100)
Keyword Collector
(63)
Conferences
INTERSPEECH (36)
CVPR (12)
AAAI (11)
EMNLP (7)
ACL (6)
ECCV (6)
NAACL (5)
COLING (3)
NIPS (3)
IJCNLP (3)
SEMEVAL (2)
IJCAI (2)
ICML (2)
AISTATS (2)
MIDL (1)
EACL (1)
RSS (1)
Keywords
attention mechanism
(14)
self-supervised learning
(6)
automatic speech recognition
(6)
large language model
(5)
medical imaging
(5)
semantic segmentation
(5)
data augmentation
(5)
speaker verification
(5)
neural network
(5)
knowledge distillation
(4)
representation learning
(4)
generative adversarial network
(4)
transfer learning
(4)
semi-supervised learning
(4)
multi-task learning
(4)
semantic representation
(3)
object detection
(3)
uncertainty quantification
(3)
speech recognition
(3)
federated learning
(3)
Papers
Learning to Generate Structured Meshes with In-Context: Toward Generalization in Mesh Generation
AAAI 2026
CHARM: Collaborative Harmonization Across Arbitrary Modalities for Modality-Agnostic Semantic Segmentation
AAAI 2026
Flow-Based Page Unique Semantic Mapping Architecture for Document Visual Question Answering
ACL 2026
ChatSOP: An SOP-Guided MCTS Planning Framework for Controllable LLM Dialogue Agents
ACL 2025
Prefix-Enhanced Large Language Models with Reused Training Data in Multi-Turn Medical Dialogue
NAACL 2025
Co-Speech Gesture Video Generation with Implicit Motion-Audio Entanglement
CVPR 2025
GRASP: Replace Redundant Layers with Adaptive Singular Parameters for Efficient Model Compression
EMNLP 2025
RUNA: Object-Level Out-of-Distribution Detection via Regional Uncertainty Alignment of Multimodal Representations
AAAI 2025
Open-world Radio Frequency Fingerprint Identification via Augmented Semi-supervised Learning
AAAI 2025
ACCon: Angle-Compensated Contrastive Regularizer for Deep Regression
AAAI 2025
Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models
ACL 2025
IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
EMNLP 2024
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning
NAACL 2024
Bidirectional Autoregressive Diffusion Model for Dance Generation
CVPR 2024
Boosting Image Quality Assessment through Efficient Transformer Adaptation with Local Feature Enhancement
CVPR 2024
DFlow: A Generative Model Combining Denoising AutoEncoder and Normalizing Flow for High Fidelity Waveform Generation
ICML 2024
Prior Relational Schema Assists Effective Contrastive Learning for Inductive Knowledge Graph Completion
COLING 2024
Co-speech Gesture Video Generation with 3D Human Meshes
ECCV 2024
E-Paraformer: A Faster and Better Parallel Transformer for Non-autoregressive End-to-End Mandarin Speech Recognition
INTERSPEECH 2024
Improving Multilingual Text-to-Speech with Mixture-of-Language-Experts and Accent Disentanglement
INTERSPEECH 2024
Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment
NIPS 2023
Prompt Guided Copy Mechanism for Conversational Question Answering
INTERSPEECH 2023
FedET: A Communication-Efficient Federated Class-Incremental Learning Framework Based on Enhanced Transformer
IJCAI 2023
P-vectors: A Parallel-coupled TDNN/Transformer Network for Speaker Verification
INTERSPEECH 2023
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism
INTERSPEECH 2023
EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis
INTERSPEECH 2023
Only a Few Classes Confusing: Pixel-Wise Candidate Labels Disambiguation for Foggy Scene Understanding
AAAI 2023
On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models
AAAI 2023
Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective
EACL 2023
Investigation of Music Emotion Recognition Based on Segmented Semi-Supervised Learning
INTERSPEECH 2023
SVVAD: Personal Voice Activity Detection for Speaker Verification
INTERSPEECH 2023
Exploring multi-task learning and data augmentation in dementia detection with self-supervised pretrained models
INTERSPEECH 2023
Improving End-to-End Modeling For Mandarin-English Code-Switching Using Lightweight Switch-Routing Mixture-of-Experts
INTERSPEECH 2023
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter
EMNLP 2023
GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection
NIPS 2023
PINGAN Omini-Sinitic at SemEval-2022 Task 4: Multi-prompt Training for Patronizing and Condescending Language Detection
SEMEVAL 2022
Uncertainty Calibration for Deep Audio Classifiers
INTERSPEECH 2022
FFM: A Frame Filtering Mechanism To Accelerate Inference Speed For Conformer In Speech Recognition
INTERSPEECH 2022
Self-supervised Cross-modal Pretraining for Speech Emotion Recognition and Sentiment Analysis
EMNLP 2022
Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation
INTERSPEECH 2022
A compact transformer-based GAN vocoder
INTERSPEECH 2022
SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning
INTERSPEECH 2022
Towards Efficiently Learning Monotonic Alignments for Attention-based End-to-End Speech Recognition
INTERSPEECH 2022
Spatial-Temporal Space Hand-in-Hand: Spatial-Temporal Video Super-Resolution via Cycle-Projected Mutual Learning
CVPR 2022
Localized Adversarial Domain Generalization
CVPR 2022
ElasticMVS: Learning elastic part representation for self-supervised multi-view stereopsis
NIPS 2022
PINGAN Omini-Sinitic at SemEval-2022 Task 4: Multi-prompt Training for Patronizing and Condescending Language Detection
NAACL 2022
An Augmented Benchmark Dataset for Geometric Question Answering through Dual Parallel Text Encoding
COLING 2022
Adversarial Knowledge Distillation For Robust Spoken Language Understanding
INTERSPEECH 2022
3D Graph Anatomy Geometry-Integrated Network for Pancreatic Mass Segmentation, Diagnosis, and Quantitative Patient Management
CVPR 2021
Window Loss for Bone Fracture Detection and Localization in X-ray Images with Point-based Annotation
AAAI 2021
A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization
ACL 2021
PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check
ACL 2021
PINGAN Omini-Sinitic at SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning
ACL 2021
Understanding Gradient Clipping In Incremental Gradient Methods
AISTATS 2021
Deep Lesion Tracker: Monitoring Lesions in 4D Longitudinal Imaging Studies
CVPR 2021
Image Inpainting Guided by Coherence Priors of Semantics and Textures
CVPR 2021
Automatic Vertebra Localization and Identification in CT by Spine Rectification and Anatomically-Constrained Optimization
CVPR 2021
Leveraging Large-Scale Weakly Labeled Data for Semi-Supervised Mass Detection in Mammograms
CVPR 2021
An Alignment-Agnostic Model for Chinese Text Error Correction
EMNLP 2021
Enhancing Dual-Encoders with Question and Answer Cross-Embeddings for Answer Retrieval
EMNLP 2021
EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture
ICML 2021
A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization
IJCNLP 2021
PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check
IJCNLP 2021
PINGAN Omini-Sinitic at SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning
IJCNLP 2021
ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform
INTERSPEECH 2021
Variational Information Bottleneck for Effective Low-Resource Audio Classification
INTERSPEECH 2021
Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech Representation
INTERSPEECH 2021
Extending Pronunciation Dictionary with Automatically Detected Word Mispronunciations to Improve PAII's System for Interspeech 2021 Non-Native Child English Close Track ASR Challenge
INTERSPEECH 2021
EfficientSing: A Chinese Singing Voice Synthesis System Using Duration-Free Acoustic Model and HiFi-GAN Vocoder
INTERSPEECH 2021
Speech2Video: Cross-Modal Distillation for Speech to Video Generation
INTERSPEECH 2021
Effective Phase Encoding for End-To-End Speaker Verification
INTERSPEECH 2021
Federated Learning with Dynamic Transformer for Text to Speech
INTERSPEECH 2021
An Improved Single Step Non-Autoregressive Transformer for Automatic Speech Recognition
INTERSPEECH 2021
Improving Polyphone Disambiguation for Mandarin Chinese by Combining Mix-Pooling Strategy and Window-Based Attention
INTERSPEECH 2021
Multi-Grained Knowledge Distillation for Named Entity Recognition
NAACL 2021
System Description on Automatic Simultaneous Translation Workshop
NAACL 2021
PINGAN Omini-Sinitic at SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning
SEMEVAL 2021
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit
INTERSPEECH 2020
Nonparallel Emotional Speech Conversion Using VAE-GAN
INTERSPEECH 2020
Large-Scale Transfer Learning for Low-Resource Spoken Language Understanding
INTERSPEECH 2020
Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification
INTERSPEECH 2020
A Real-Time Robot-Based Auxiliary System for Risk Evaluation of COVID-19 Infection
INTERSPEECH 2020
Structured Landmark Detection via Topology-Adapting Deep Graph Learning
ECCV 2020
Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval
AAAI 2020
An Iterative Polishing Framework Based on Quality Aware Masked Language Model for Chinese Poetry Generation
AAAI 2020
Organ at Risk Segmentation for Head and Neck Cancer Using Stratified Learning and Neural Architecture Search
CVPR 2020
Non-Parallel Voice Conversion with Fewer Labeled Data by Conditional Generative Adversarial Networks
INTERSPEECH 2020
Improving Replay Detection System with Channel Consistency DenseNeXt for the ASVspoof 2019 Challenge
INTERSPEECH 2020
MLNET: An Adaptive Multiple Receptive-Field Attention Neural Network for Voice Activity Detection
INTERSPEECH 2020
Generating Reasonable Legal Text through the Combination of Language Modeling and Question Answering
IJCAI 2020
Empirical Studies of Institutional Federated Learning For Natural Language Processing
EMNLP 2020
Guidance and Evaluation: Semantic-Aware Image Inpainting for Mixed Scenes
ECCV 2020
Co-Heterogeneous and Adaptive Segmentation from Multi-Source and Multi-Phase CT Imaging Data: A Study on Pathological Liver and Lesion Segmentation
ECCV 2020
Anatomy-Aware Siamese Network: Exploiting Semantic Asymmetry for Accurate Pelvic Fracture Detection in X-ray Images
ECCV 2020
JSSR: A Joint Synthesis, Segmentation, and Registration System for 3D Multi-Modal Image Alignment of Large-scale Pathological CT Scans
ECCV 2020
Adversarial Discrete Sequence Generation without Explicit Neural Networks as Discriminators
AISTATS 2019
XLSor: A Robust and Accurate Lung Segmentor on Chest X-Rays Using Criss-Cross Attention and Customized Radiorealistic Abnormalities Generation
MIDL 2019
Cross-Lingual, Multi-Speaker Text-To-Speech Synthesis Using Neural Speaker Embedding
INTERSPEECH 2019
CISI-net: Explicit Latent Content Inference and Imitated Style Rendering for Image Inpainting
AAAI 2019
Detection Evolution with Multi-order Contextual Co-occurrence
CVPR 2013
Modeling Complex Contacts Involving Deformable Objects for Haptic and Graphic Rendering
RSS 2005
Cascading Use of Soft and Hard Matching Pattern Rules for Weakly Supervised Information Extraction
COLING 2004