conftrace_

Jing Xiao

103 papers · 2004–2026 · 17 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+13 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (23) 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🗺️ Taxonomy Completionist (23) 🧭 Keyword Pioneer 🌍 Conference Polyglot (17) 🏠 Conference Loyalist (36) 🤝 Dynamic Duo (31) 🔬 Deep Specialist (13) 🧬 Topic Evolution 🚀 Conference Pioneer 🔥 Unstoppable (7) ⚡ Prolific Year (9) 📈 Trend Setter 💎 Century Club (100) 🗃️ Keyword Collector (63)

Conferences

INTERSPEECH (36) CVPR (12) AAAI (11) EMNLP (7) ACL (6) ECCV (6) NAACL (5) COLING (3) NIPS (3) IJCNLP (3) SEMEVAL (2) IJCAI (2) ICML (2) AISTATS (2) MIDL (1) EACL (1) RSS (1)

Top co-authors

Shaojun Wang (31) Jianzong Wang (29) Ning Cheng (18) Jun Ma (11) Minchuan Chen (11) Xiaoyang Qu (10) Le Lu (10) Ye Wang (8) Yanmeng Wang (8) Mei Han (7)

Keywords

attention mechanism (14) self-supervised learning (6) automatic speech recognition (6) large language model (5) medical imaging (5) semantic segmentation (5) data augmentation (5) speaker verification (5) neural network (5) knowledge distillation (4) representation learning (4) generative adversarial network (4) transfer learning (4) semi-supervised learning (4) multi-task learning (4) semantic representation (3) object detection (3) uncertainty quantification (3) speech recognition (3) federated learning (3)

Papers

Learning to Generate Structured Meshes with In-Context: Toward Generalization in Mesh Generation AAAI 2026 CHARM: Collaborative Harmonization Across Arbitrary Modalities for Modality-Agnostic Semantic Segmentation AAAI 2026 Flow-Based Page Unique Semantic Mapping Architecture for Document Visual Question Answering ACL 2026 ChatSOP: An SOP-Guided MCTS Planning Framework for Controllable LLM Dialogue Agents ACL 2025 Prefix-Enhanced Large Language Models with Reused Training Data in Multi-Turn Medical Dialogue NAACL 2025 Co-Speech Gesture Video Generation with Implicit Motion-Audio Entanglement CVPR 2025 GRASP: Replace Redundant Layers with Adaptive Singular Parameters for Efficient Model Compression EMNLP 2025 RUNA: Object-Level Out-of-Distribution Detection via Regional Uncertainty Alignment of Multimodal Representations AAAI 2025 Open-world Radio Frequency Fingerprint Identification via Augmented Semi-supervised Learning AAAI 2025 ACCon: Angle-Compensated Contrastive Regularizer for Deep Regression AAAI 2025 Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models ACL 2025 IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding EMNLP 2024 From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning NAACL 2024 Bidirectional Autoregessive Diffusion Model for Dance Generation CVPR 2024 Boosting Image Quality Assessment through Efficient Transformer Adaptation with Local Feature Enhancement CVPR 2024 DFlow: A Generative Model Combining Denoising AutoEncoder and Normalizing Flow for High Fidelity Waveform Generation ICML 2024 Prior Relational Schema Assists Effective Contrastive Learning for Inductive Knowledge Graph Completion COLING 2024 Co-speech Gesture Video Generation with 3D Human Meshes ECCV 2024 E-Paraformer: A Faster and Better Parallel Transformer for Non-autoregressive End-to-End Mandarin Speech Recognition INTERSPEECH 2024 Improving Multilingual Text-to-Speech with Mixture-of-Language-Experts and Accent Disentanglement INTERSPEECH 2024 Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment NIPS 2023 Prompt Guided Copy Mechanism for Conversational Question Answering INTERSPEECH 2023 FedET: A Communication-Efficient Federated Class-Incremental Learning Framework Based on Enhanced Transformer IJCAI 2023 P-vectors: A Parallel-coupled TDNN/Transformer Network for Speaker Verification INTERSPEECH 2023 Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism INTERSPEECH 2023 EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis INTERSPEECH 2023 Only a Few Classes Confusing: Pixel-Wise Candidate Labels Disambiguation for Foggy Scene Understanding AAAI 2023 On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models AAAI 2023 Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective EACL 2023 Investigation of Music Emotion Recognition Based on Segmented Semi-Supervised Learning INTERSPEECH 2023 SVVAD: Personal Voice Activity Detection for Speaker Verification INTERSPEECH 2023 Exploring multi-task learning and data augmentation in dementia detection with self-supervised pretrained models INTERSPEECH 2023 Improving End-to-End Modeling For Mandarin-English Code-Switching Using Lightweight Switch-Routing Mixture-of-Experts INTERSPEECH 2023 PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter EMNLP 2023 GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection NIPS 2023 PINGAN Omini-Sinitic at SemEval-2022 Task 4: Multi-prompt Training for Patronizing and Condescending Language Detection SEMEVAL 2022 Uncertainty Calibration for Deep Audio Classifiers INTERSPEECH 2022 FFM: A Frame Filtering Mechanism To Accelerate Inference Speed For Conformer In Speech Recognition INTERSPEECH 2022 Self-supervised Cross-modal Pretraining for Speech Emotion Recognition and Sentiment Analysis EMNLP 2022 Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation INTERSPEECH 2022 A compact transformer-based GAN vocoder INTERSPEECH 2022 SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning INTERSPEECH 2022 Towards Efficiently Learning Monotonic Alignments for Attention-based End-to-End Speech Recognition INTERSPEECH 2022 Spatial-Temporal Space Hand-in-Hand: Spatial-Temporal Video Super-Resolution via Cycle-Projected Mutual Learning CVPR 2022 Localized Adversarial Domain Generalization CVPR 2022 ElasticMVS: Learning elastic part representation for self-supervised multi-view stereopsis NIPS 2022 PINGAN Omini-Sinitic at SemEval-2022 Task 4: Multi-prompt Training for Patronizing and Condescending Language Detection NAACL 2022 An Augmented Benchmark Dataset for Geometric Question Answering through Dual Parallel Text Encoding COLING 2022 Adversarial Knowledge Distillation For Robust Spoken Language Understanding INTERSPEECH 2022 3D Graph Anatomy Geometry-Integrated Network for Pancreatic Mass Segmentation, Diagnosis, and Quantitative Patient Management CVPR 2021 Window Loss for Bone Fracture Detection and Localization in X-ray Images with Point-based Annotation AAAI 2021 A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization ACL 2021 PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check ACL 2021 PINGAN Omini-Sinitic at SemEval-2021 Task 4:Reading Comprehension of Abstract Meaning ACL 2021 Understanding Gradient Clipping In Incremental Gradient Methods AISTATS 2021 Deep Lesion Tracker: Monitoring Lesions in 4D Longitudinal Imaging Studies CVPR 2021 Image Inpainting Guided by Coherence Priors of Semantics and Textures CVPR 2021 Automatic Vertebra Localization and Identification in CT by Spine Rectification and Anatomically-Constrained Optimization CVPR 2021 Leveraging Large-Scale Weakly Labeled Data for Semi-Supervised Mass Detection in Mammograms CVPR 2021 An Alignment-Agnostic Model for Chinese Text Error Correction EMNLP 2021 Enhancing Dual-Encoders with Question and Answer Cross-Embeddings for Answer Retrieval EMNLP 2021 EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture ICML 2021 A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization IJCNLP 2021 PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check IJCNLP 2021 PINGAN Omini-Sinitic at SemEval-2021 Task 4:Reading Comprehension of Abstract Meaning IJCNLP 2021 ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform INTERSPEECH 2021 Variational Information Bottleneck for Effective Low-Resource Audio Classification INTERSPEECH 2021 Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech Representation INTERSPEECH 2021 Extending Pronunciation Dictionary with Automatically Detected Word Mispronunciations to Improve PAII’s System for Interspeech 2021 Non-Native Child English Close Track ASR Challenge INTERSPEECH 2021 EfficientSing: A Chinese Singing Voice Synthesis System Using Duration-Free Acoustic Model and HiFi-GAN Vocoder INTERSPEECH 2021 Speech2Video: Cross-Modal Distillation for Speech to Video Generation INTERSPEECH 2021 Effective Phase Encoding for End-To-End Speaker Verification INTERSPEECH 2021 Federated Learning with Dynamic Transformer for Text to Speech INTERSPEECH 2021 An Improved Single Step Non-Autoregressive Transformer for Automatic Speech Recognition INTERSPEECH 2021 Improving Polyphone Disambiguation for Mandarin Chinese by Combining Mix-Pooling Strategy and Window-Based Attention INTERSPEECH 2021 Multi-Grained Knowledge Distillation for Named Entity Recognition NAACL 2021 System Description on Automatic Simultaneous Translation Workshop NAACL 2021 PINGAN Omini-Sinitic at SemEval-2021 Task 4:Reading Comprehension of Abstract Meaning SEMEVAL 2021 Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit INTERSPEECH 2020 Nonparallel Emotional Speech Conversion Using VAE-GAN INTERSPEECH 2020 Large-Scale Transfer Learning for Low-Resource Spoken Language Understanding INTERSPEECH 2020 Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification INTERSPEECH 2020 A Real-Time Robot-Based Auxiliary System for Risk Evaluation of COVID-19 Infection INTERSPEECH 2020 Structured Landmark Detection via Topology-Adapting Deep Graph Learning ECCV 2020 Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval AAAI 2020 An Iterative Polishing Framework Based on Quality Aware Masked Language Model for Chinese Poetry Generation AAAI 2020 Organ at Risk Segmentation for Head and Neck Cancer Using Stratified Learning and Neural Architecture Search CVPR 2020 Non-Parallel Voice Conversion with Fewer Labeled Data by Conditional Generative Adversarial Networks INTERSPEECH 2020 Improving Replay Detection System with Channel Consistency DenseNeXt for the ASVspoof 2019 Challenge INTERSPEECH 2020 MLNET: An Adaptive Multiple Receptive-Field Attention Neural Network for Voice Activity Detection INTERSPEECH 2020 Generating Reasonable Legal Text through the Combination of Language Modeling and Question Answering IJCAI 2020 Empirical Studies of Institutional Federated Learning For Natural Language Processing EMNLP 2020 Guidance and Evaluation: Semantic-Aware Image Inpainting for Mixed Scenes ECCV 2020 Co-Heterogeneous and Adaptive Segmentation from Multi-Source and Multi-Phase CT Imaging Data: A Study on Pathological Liver and Lesion Segmentation ECCV 2020 Anatomy-Aware Siamese Network: Exploiting Semantic Asymmetry for Accurate Pelvic Fracture Detection in X-ray Images ECCV 2020 JSSR: A Joint Synthesis, Segmentation, and Registration System for 3D Multi-Modal Image Alignment of Large-scale Pathological CT Scans ECCV 2020 Adversarial Discrete Sequence Generation without Explicit NeuralNetworks as Discriminators AISTATS 2019 XLSor: A Robust and Accurate Lung Segmentor on Chest X-Rays Using Criss-Cross Attention and Customized Radiorealistic Abnormalities Generation MIDL 2019 Cross-Lingual, Multi-Speaker Text-To-Speech Synthesis Using Neural Speaker Embedding INTERSPEECH 2019 CISI-net: Explicit Latent Content Inference and Imitated Style Rendering for Image Inpainting AAAI 2019 Detection Evolution with Multi-order Contextual Co-occurrence CVPR 2013 Modeling Complex Contacts Involving Deformable Objects for Haptic and Graphic Rendering RSS 2005 Cascading Use of Soft and Hard Matching Pattern Rules for Weakly Supervised Information Extraction COLING 2004