Jianzong Wang

39 papers · 2020–2026 · 9 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (12) 🧭 Keyword Pioneer 🌍 Conference Polyglot (9)

🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (14) 🏃 Academic Marathon (5) 🏠 Conference Loyalist (20) 🤝 Dynamic Duo (29) 🔥 Unstoppable (6) 💎 Century Club (37) ⚡ Prolific Year (7) 🗃️ Keyword Collector (181)

Conferences

INTERSPEECH (20) ACL (5) AAAI (4) EMNLP (4) NAACL (2) ACML (1) ICCV (1) IJCAI (1) NIPS (1)

Top co-authors

Jing Xiao (29) Ning Cheng (18) Xiaoyang Qu (18) Zhitao Li (6) Yong Zhang (5) Jiguang Wan (5) Shijing Si (5) Xulong Zhang (5) Zuheng Kang (4) Junqing Peng (4)

Keywords

attention mechanism (5) federated learning (4) speaker verification (4) large language model (3) knowledge distillation (3) speech emotion recognition (3) out-of-distribution detection (2) audio classification (2) generative adversarial network (2) low-resource learning (2) model compression (2) gaussian process (2) reinforcement learning (2) communication efficiency (2) representation learning (2) automatic speech recognition (2) transfer learning (2) catastrophic forgetting (2) uncertainty quantification (2) self-supervised learning (2)

Papers

From Inheritance to Saturation: Disentangling the Evolution of Visual Redundancy for Architecture-Aware MLLM Inference Acceleration ACL 2026 Vista: Scene-Aware Optimization for Streaming Video Question Answering Under Post-Hoc Queries AAAI 2026 RUNA: Object-Level Out-of-Distribution Detection via Regional Uncertainty Alignment of Multimodal Representations AAAI 2025 MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts ACL 2025 Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning ACL 2025 RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models ACL 2025 EMO-RL: Emotion-Rule-Based Reinforcement Learning Enhanced Audio-Language Model for Generalized Speech Emotion Recognition EMNLP 2025 Federated Domain Generalization with Domain-specific Soft Prompts Generation ICCV 2025 ACCon: Angle-Compensated Contrastive Regularizer for Deep Regression AAAI 2025 From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning NAACL 2024 IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding EMNLP 2024 Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ACL 2024 GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection NIPS 2023 On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models AAAI 2023 PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter EMNLP 2023 FedET: A Communication-Efficient Federated Class-Incremental Learning Framework Based on Enhanced Transformer IJCAI 2023 EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis INTERSPEECH 2023 Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism INTERSPEECH 2023 Prompt Guided Copy Mechanism for Conversational Question Answering INTERSPEECH 2023 SVVAD: Personal Voice Activity Detection for Speaker Verification INTERSPEECH 2023 Investigation of Music Emotion Recognition Based on Segmented Semi-Supervised Learning INTERSPEECH 2023 Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion INTERSPEECH 2022 SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning INTERSPEECH 2022 Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation INTERSPEECH 2022 Pose Guided Human Image Synthesis with Partially Decoupled GAN ACML 2022 Uncertainty Calibration for Deep Audio Classifiers INTERSPEECH 2022 System Description on Automatic Simultaneous Translation Workshop NAACL 2021 ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform INTERSPEECH 2021 Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech Representation INTERSPEECH 2021 Speech2Video: Cross-Modal Distillation for Speech to Video Generation INTERSPEECH 2021 Effective Phase Encoding for End-To-End Speaker Verification INTERSPEECH 2021 Federated Learning with Dynamic Transformer for Text to Speech INTERSPEECH 2021 Variational Information Bottleneck for Effective Low-Resource Audio Classification INTERSPEECH 2021 Empirical Studies of Institutional Federated Learning For Natural Language Processing EMNLP 2020 Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification INTERSPEECH 2020 Large-Scale Transfer Learning for Low-Resource Spoken Language Understanding INTERSPEECH 2020 MLNET: An Adaptive Multiple Receptive-Field Attention Neural Network for Voice Activity Detection INTERSPEECH 2020 Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit INTERSPEECH 2020 A Real-Time Robot-Based Auxiliary System for Risk Evaluation of COVID-19 Infection INTERSPEECH 2020