Jianzong Wang
39 papers · 2020–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π£ Hot Topic Early Bird π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (12) π§ Keyword Pioneer π Conference Polyglot (9)
π£
Hot Topic Early Bird
π
Cross-Pollinator
(14)
π
Academic Marathon
(5)
π
Conference Loyalist
(20)
π€
Dynamic Duo
(29)
π₯
Unstoppable
(6)
π
Century Club
(37)
β‘
Prolific Year
(7)
ποΈ
Keyword Collector
(181)
Conferences
INTERSPEECH (20)
ACL (5)
AAAI (4)
EMNLP (4)
NAACL (2)
ACML (1)
ICCV (1)
IJCAI (1)
NIPS (1)
Top co-authors
Keywords
attention mechanism
(5)
federated learning
(4)
speaker verification
(4)
large language model
(3)
knowledge distillation
(3)
speech emotion recognition
(3)
out-of-distribution detection
(2)
audio classification
(2)
generative adversarial network
(2)
low-resource learning
(2)
model compression
(2)
gaussian process
(2)
reinforcement learning
(2)
communication efficiency
(2)
representation learning
(2)
automatic speech recognition
(2)
transfer learning
(2)
catastrophic forgetting
(2)
uncertainty quantification
(2)
self-supervised learning
(2)
Papers
From Inheritance to Saturation: Disentangling the Evolution of Visual Redundancy for Architecture-Aware MLLM Inference Acceleration
ACL 2026
Vista: Scene-Aware Optimization for Streaming Video Question Answering Under Post-Hoc Queries
AAAI 2026
RUNA: Object-Level Out-of-Distribution Detection via Regional Uncertainty Alignment of Multimodal Representations
AAAI 2025
MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts
ACL 2025
Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning
ACL 2025
RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models
ACL 2025
EMO-RL: Emotion-Rule-Based Reinforcement Learning Enhanced Audio-Language Model for Generalized Speech Emotion Recognition
EMNLP 2025
Federated Domain Generalization with Domain-specific Soft Prompts Generation
ICCV 2025
ACCon: Angle-Compensated Contrastive Regularizer for Deep Regression
AAAI 2025
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning
NAACL 2024
IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
EMNLP 2024
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
ACL 2024
GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection
NIPS 2023
On the Calibration and Uncertainty with PΓ³lya-Gamma Augmentation for Dialog Retrieval Models
AAAI 2023
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter
EMNLP 2023
FedET: A Communication-Efficient Federated Class-Incremental Learning Framework Based on Enhanced Transformer
IJCAI 2023
EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis
INTERSPEECH 2023
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism
INTERSPEECH 2023
Prompt Guided Copy Mechanism for Conversational Question Answering
INTERSPEECH 2023
SVVAD: Personal Voice Activity Detection for Speaker Verification
INTERSPEECH 2023
Investigation of Music Emotion Recognition Based on Segmented Semi-Supervised Learning
INTERSPEECH 2023
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion
INTERSPEECH 2022
SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning
INTERSPEECH 2022
Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation
INTERSPEECH 2022
Pose Guided Human Image Synthesis with Partially
Decoupled GAN
ACML 2022
Uncertainty Calibration for Deep Audio Classifiers
INTERSPEECH 2022
System Description on Automatic Simultaneous Translation Workshop
NAACL 2021
ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform
INTERSPEECH 2021
Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech Representation
INTERSPEECH 2021
Speech2Video: Cross-Modal Distillation for Speech to Video Generation
INTERSPEECH 2021
Effective Phase Encoding for End-To-End Speaker Verification
INTERSPEECH 2021
Federated Learning with Dynamic Transformer for Text to Speech
INTERSPEECH 2021
Variational Information Bottleneck for Effective Low-Resource Audio Classification
INTERSPEECH 2021
Empirical Studies of Institutional Federated Learning For Natural Language Processing
EMNLP 2020
Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification
INTERSPEECH 2020
Large-Scale Transfer Learning for Low-Resource Spoken Language Understanding
INTERSPEECH 2020
MLNET: An Adaptive Multiple Receptive-Field Attention Neural Network for Voice Activity Detection
INTERSPEECH 2020
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit
INTERSPEECH 2020
A Real-Time Robot-Based Auxiliary System for Risk Evaluation of COVID-19 Infection
INTERSPEECH 2020