Jian Wu

108 papers · 2010–2026 · 19 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🗺️ Taxonomy Completionist (25) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🐣 Hot Topic Early Bird

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (8) 🔬 Deep Specialist (15) 🧬 Topic Evolution 🏆 Keyword Champion 🏆 Grand Slam 🤝 Dynamic Duo (20) 💎 Century Club (103) ❓ The Questioner (4) 🚀 Conference Pioneer ⚡ Prolific Year (11) 🔥 Unstoppable (12) 🗃️ Keyword Collector (74) 📈 Trend Setter

Conferences

INTERSPEECH (18) ACL (12) AAAI (10) IJCAI (10) NIPS (9) MICCAI (9) EMNLP (8) ICLR (6) COLING (5) NAACL (5) CVPR (4) ICML (3) IJCNLP (2) ICCV (2) MIDL (1) ECCV (1) AISTATS (1) UAI (1) WACV (1)

Top co-authors

Jintai Chen (21) Zuozhu Liu (20) Yang Feng (12) Hongxia Xu (11) Haochao Ying (10) Zhuo Chen (10) Danny Chen (8) Danny Z. Chen (8) Yan Zhang (7) Jiahuan Yan (7)

Research topics

Science (1)

Keywords

large language model (13) representation learning (7) automatic speech recognition (5) deep learning (5) speech separation (5) multimodal learning (5) speech recognition (5) ordinal regression (5) neural network (4) word error rate (4) bayesian optimization (4) medical imaging (4) semi-supervised learning (3) graph neural network (3) knowledge distillation (3) streaming asr (3) knowledge gradient (3) data augmentation (3) convolutional neural network (3) image classification (2)

Papers

Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling ACL 2026 MT3: A Synergistic Multi-Task RL Framework for Specializing MLLMs in Text Image Machine Translation ACL 2026 Act as you think: Reinforcing Consistent Reasoning in Medical Visual Question Answering ACL 2026 Debate-of-Thoughts: Resolving Knowledge Conflicts in LLMs Through Internal Deliberation ACL 2026 LAMDAS: LLM as an Implicit Classifier for Domain-specific Data Selection AAAI 2026 MedThink: A Rationale-Guided Framework for Explaining Medical Visual Question Answering NAACL 2025 DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation ICML 2025 Dual-level Fuzzy Learning with Patch Guidance for Image Ordinal Regression IJCAI 2025 HSCR: Hierarchical Self-Contrastive Rewarding for Aligning Medical Vision Language Models ACL 2025 LLMs Can Simulate Standardized Patients via Agent Coevolution ACL 2025 Towards Reliable Large Audio Language Model ACL 2025 From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs ACL 2025 Reason from Future: Reverse Thought Chain Enhances LLM Reasoning ACL 2025 Rethinking Neural-based Matrix Inversion: Why can’t, and Where can AISTATS 2025 V2T-CoT: From Vision to Text Chain-of-Thought for Medical Reasoning and Diagnosis MICCAI 2025 Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading MICCAI 2025 RefineNet: Elevating Medical Foundation Models through Quality-Centric Data Curation by MLLM-Annotated Proxy Distillation MICCAI 2025 Knowing or Guessing? Robust Medical Visual Question Answering via Joint Consistency and Contrastive Learning MICCAI 2025 Scalable Autoregressive Monocular Depth Estimation CVPR 2025 Fair-MoE: Medical Fairness-Oriented Mixture of Experts in Vision-Language Models MICCAI 2025 Icon2: Aligning Large Language Models Using Self-Synthetic Preference Data via Inherent Regulation EMNLP 2025 LongWeave: A Long-Form Generation Benchmark Bridging Real-World Relevance and Verifiability EMNLP 2025 MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning EMNLP 2025 Guiding Large Language Models for Biomedical Entity Linking via Restrictive and Contrastive Decoding EMNLP 2025 OrderChain: Towards General Instruct-Tuning for Stimulating the Ordinal Understanding Ability of MLLM ICCV 2025 Small Models are LLM Knowledge Triggers for Medical Tabular Prediction ICLR 2025 MMQA: Evaluating LLMs with Multi-Table Multi-Hop Complex Questions ICLR 2025 CofCA: A STEP-WISE Counterfactual Multi-hop QA benchmark ICLR 2025 Modality-Fair Preference Optimization for Trustworthy MLLM Alignment IJCAI 2025 TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement NAACL 2025 Synergy of GFlowNet and Protein Language Model Makes a Diverse Antibody Designer AAAI 2025 ProtCLIP: Function-Informed Protein Multi-Modal Learning AAAI 2025 Identifying and Mitigating Social Bias Knowledge in Language Models NAACL 2025 M-MAD: Multidimensional Multi-Agent Debate for Advanced Machine Translation Evaluation ACL 2025 Personalized Heart Disease Detection via ECG Digital Twin Generation IJCAI 2024 AI-Enhanced Virtual Reality in Medicine: A Comprehensive Survey IJCAI 2024 MFIF-Net: A Multi-Focal Image Fusion Network for Implantation Outcome Prediction of Blastocyst MIDL 2024 COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning INTERSPEECH 2024 PX2Tooth: Reconstructing the 3D Point Cloud Teeth from a Single Panoramic X-ray MICCAI 2024 VPL: Visual Proxy Learning Framework for Zero-Shot Medical Image Diagnosis EMNLP 2024 Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning AAAI 2024 ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations AAAI 2024 TeleOR: Real-time Telemedicine System for Full-Scene Operating Room MICCAI 2024 Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences COLING 2024 Enhancing Semi-Supervised Learning via Representative and Diverse Sample Selection NIPS 2024 Coarse-to-Fine Latent Diffusion Model for Glaucoma Forecast on Sequential Fundus Images MICCAI 2024 FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data ICLR 2024 MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant CVPR 2024 Making Pre-trained Language Models Great on Tabular Prediction ICLR 2024 DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM ECCV 2024 Bridge-IF: Learning Inverse Protein Folding with Markov Bridges NIPS 2024 LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image Segmentation MICCAI 2024 Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications EMNLP 2024 Mind’s Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models NAACL 2024 T2G-FORMER: Organizing Tabular Features into Relation Graphs Promotes Heterogeneous Feature Interaction AAAI 2023 Fast Model DeBias with Machine Unlearning NIPS 2023 Towards Distribution-Agnostic Generalized Category Discovery NIPS 2023 Fed-GraB: Federated Long-tailed Learning with Self-Adjusting Gradient Balancer NIPS 2023 Sample-efficient Multi-objective Molecular Optimization with GFlowNets NIPS 2023 TACR: A Table Alignment-based Cell Selection Method for HybridQA ACL 2023 Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for Imbalanced Medical Classification EMNLP 2023 Ord2Seq: Regarding Ordinal Regression as Label Sequence Prediction ICCV 2023 TabCaps: A Capsule Neural Network for Tabular Data Classification with BoW Routing ICLR 2023 Robust Image Ordinal Regression with Controllable Image Generation IJCAI 2023 MolHF: A Hierarchical Normalizing Flow for Molecular Graph Generation IJCAI 2023 Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings INTERSPEECH 2022 Streaming Multi-Talker ASR with Token-Level Serialized Output Training INTERSPEECH 2022 ME-GAN: Learning Panoptic Electrocardio Representations for Multi-view ECG Synthesis Conditioned on Heart Diseases ICML 2022 A Synthetic Prediction Market for Estimating Confidence in Published Work AAAI 2022 DialMed: A Dataset for Dialogue-based Medication Recommendation COLING 2022 Sound2Synth: Interpreting Sound via FM Synthesizer Parameters Estimation IJCAI 2022 DANets: Deep Abstract Networks for Tabular Data Classification and Regression AAAI 2022 DeepPatent: Large Scale Patent Drawing Recognition and Retrieval WACV 2022 Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? INTERSPEECH 2022 Ultra Fast Speech Separation Model with Teacher Student Learning INTERSPEECH 2021 Investigation of Practical Aspects of Single Channel Speech Separation for ASR INTERSPEECH 2021 Electrocardio Panorama: Synthesizing New ECG views with Self-supervision IJCAI 2021 Dig into Multi-modal Cues for Video Retrieval with Hierarchical Alignment IJCAI 2021 Extractive Research Slide Generation Using Windowed Labeling Ranking NAACL 2021 AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario INTERSPEECH 2021 A Receptor Skeleton for Capsule Neural Networks ICML 2021 Sequence-Level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models INTERSPEECH 2021 To Choose or to Fuse? Scale Selection for Crowd Counting AAAI 2021 1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM INTERSPEECH 2020 An End-to-End Architecture of Online Multi-Channel Speech Separation INTERSPEECH 2020 Speaker Attribution with Voice Profiles by Graph-Based Semi-Supervised Learning INTERSPEECH 2020 Fast and Slow Acoustic Model INTERSPEECH 2020 Channel-Wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music INTERSPEECH 2020 Bandpass Noise Generation and Augmentation for Unified ASR INTERSPEECH 2020 DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement INTERSPEECH 2020 NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge INTERSPEECH 2020 Acknowledgement Entity Recognition in CORD-19 Papers EMNLP 2020 A Hierarchical Graph Network for 3D Object Detection on Point Clouds CVPR 2020 X2CT-GAN: Reconstructing CT From Biplanar X-Rays With Generative Adversarial Networks CVPR 2019 A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation INTERSPEECH 2019 Practical Multi-fidelity Bayesian Optimization for Hyperparameter Tuning UAI 2019 Improved Speaker-Dependent Separation for CHiME-5 Challenge INTERSPEECH 2019 Cleaning Noisy and Heterogeneous Metadata for Record Linking across Scholarly Big Datasets AAAI 2019 Practical Two-Step Lookahead Bayesian Optimization NIPS 2019 Sequential Recommender System based on Hierarchical Attention Networks IJCAI 2018 Bayesian Optimization with Gradients NIPS 2017 The Parallel Knowledge Gradient Method for Batch Bayesian Optimization NIPS 2016 Tibetan Unknown Word Identification from News Corpora for Supporting Lexicon-based Tibetan Word Segmentation ACL 2015 Tibetan Unknown Word Identification from News Corpora for Supporting Lexicon-based Tibetan Word Segmentation IJCNLP 2015 Zipf’s Law and Statistical Data on Modern Tibetan COLING 2014 Tibetan Base Noun Phrase Identification Framework Based on Chinese-Tibetan Sentence Aligned Corpus COLING 2012 Compression Methods by Code Mapping and Code Dividing for Chinese Dictionary Stored in a Double-Array Trie IJCNLP 2011 Tibetan Number Identification Based on Classification of Number Components in Tibetan Word Segmentation COLING 2010