Jian Wu
108 papers · 2010–2026 · 19 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (25)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (7)
Hot Topic Early Bird
Cross-Pollinator (8)
Deep Specialist (15)
Topic Evolution
Keyword Champion
Grand Slam
Dynamic Duo (20)
Century Club (103)
The Questioner (4)
Conference Pioneer
Prolific Year (11)
Unstoppable (12)
Keyword Collector (74)
Trend Setter
Conferences
INTERSPEECH (18)
ACL (12)
AAAI (10)
IJCAI (10)
NIPS (9)
MICCAI (9)
EMNLP (8)
ICLR (6)
COLING (5)
NAACL (5)
CVPR (4)
ICML (3)
IJCNLP (2)
ICCV (2)
MIDL (1)
ECCV (1)
AISTATS (1)
UAI (1)
WACV (1)
Keywords
large language model (13)
representation learning (7)
automatic speech recognition (5)
deep learning (5)
speech separation (5)
multimodal learning (5)
speech recognition (5)
ordinal regression (5)
neural network (4)
word error rate (4)
bayesian optimization (4)
medical imaging (4)
semi-supervised learning (3)
graph neural network (3)
knowledge distillation (3)
streaming asr (3)
knowledge gradient (3)
data augmentation (3)
convolutional neural network (3)
image classification (2)
Papers
Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling
ACL 2026
MT3: A Synergistic Multi-Task RL Framework for Specializing MLLMs in Text Image Machine Translation
ACL 2026
Act as you think: Reinforcing Consistent Reasoning in Medical Visual Question Answering
ACL 2026
Debate-of-Thoughts: Resolving Knowledge Conflicts in LLMs Through Internal Deliberation
ACL 2026
LAMDAS: LLM as an Implicit Classifier for Domain-specific Data Selection
AAAI 2026
MedThink: A Rationale-Guided Framework for Explaining Medical Visual Question Answering
NAACL 2025
DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation
ICML 2025
Dual-level Fuzzy Learning with Patch Guidance for Image Ordinal Regression
IJCAI 2025
HSCR: Hierarchical Self-Contrastive Rewarding for Aligning Medical Vision Language Models
ACL 2025
LLMs Can Simulate Standardized Patients via Agent Coevolution
ACL 2025
Towards Reliable Large Audio Language Model
ACL 2025
From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs
ACL 2025
Reason from Future: Reverse Thought Chain Enhances LLM Reasoning
ACL 2025
Rethinking Neural-based Matrix Inversion: Why can't, and Where can
AISTATS 2025
V2T-CoT: From Vision to Text Chain-of-Thought for Medical Reasoning and Diagnosis
MICCAI 2025
Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading
MICCAI 2025
RefineNet: Elevating Medical Foundation Models through Quality-Centric Data Curation by MLLM-Annotated Proxy Distillation
MICCAI 2025
Knowing or Guessing? Robust Medical Visual Question Answering via Joint Consistency and Contrastive Learning
MICCAI 2025
Scalable Autoregressive Monocular Depth Estimation
CVPR 2025
Fair-MoE: Medical Fairness-Oriented Mixture of Experts in Vision-Language Models
MICCAI 2025
Icon2: Aligning Large Language Models Using Self-Synthetic Preference Data via Inherent Regulation
EMNLP 2025
LongWeave: A Long-Form Generation Benchmark Bridging Real-World Relevance and Verifiability
EMNLP 2025
MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning
EMNLP 2025
Guiding Large Language Models for Biomedical Entity Linking via Restrictive and Contrastive Decoding
EMNLP 2025
OrderChain: Towards General Instruct-Tuning for Stimulating the Ordinal Understanding Ability of MLLM
ICCV 2025
Small Models are LLM Knowledge Triggers for Medical Tabular Prediction
ICLR 2025
MMQA: Evaluating LLMs with Multi-Table Multi-Hop Complex Questions
ICLR 2025
CofCA: A STEP-WISE Counterfactual Multi-hop QA benchmark
ICLR 2025
Modality-Fair Preference Optimization for Trustworthy MLLM Alignment
IJCAI 2025
TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement
NAACL 2025
Synergy of GFlowNet and Protein Language Model Makes a Diverse Antibody Designer
AAAI 2025
ProtCLIP: Function-Informed Protein Multi-Modal Learning
AAAI 2025
Identifying and Mitigating Social Bias Knowledge in Language Models
NAACL 2025
M-MAD: Multidimensional Multi-Agent Debate for Advanced Machine Translation Evaluation
ACL 2025
Personalized Heart Disease Detection via ECG Digital Twin Generation
IJCAI 2024
AI-Enhanced Virtual Reality in Medicine: A Comprehensive Survey
IJCAI 2024
MFIF-Net: A Multi-Focal Image Fusion Network for Implantation Outcome Prediction of Blastocyst
MIDL 2024
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning
INTERSPEECH 2024
PX2Tooth: Reconstructing the 3D Point Cloud Teeth from a Single Panoramic X-ray
MICCAI 2024
VPL: Visual Proxy Learning Framework for Zero-Shot Medical Image Diagnosis
EMNLP 2024
Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning
AAAI 2024
ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations
AAAI 2024
TeleOR: Real-time Telemedicine System for Full-Scene Operating Room
MICCAI 2024
Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences
COLING 2024
Enhancing Semi-Supervised Learning via Representative and Diverse Sample Selection
NIPS 2024
Coarse-to-Fine Latent Diffusion Model for Glaucoma Forecast on Sequential Fundus Images
MICCAI 2024
FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data
ICLR 2024
MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant
CVPR 2024
Making Pre-trained Language Models Great on Tabular Prediction
ICLR 2024
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
ECCV 2024
Bridge-IF: Learning Inverse Protein Folding with Markov Bridges
NIPS 2024
LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image Segmentation
MICCAI 2024
Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications
EMNLP 2024
Mind's Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models
NAACL 2024
T2G-FORMER: Organizing Tabular Features into Relation Graphs Promotes Heterogeneous Feature Interaction
AAAI 2023
Fast Model DeBias with Machine Unlearning
NIPS 2023
Towards Distribution-Agnostic Generalized Category Discovery
NIPS 2023
Fed-GraB: Federated Long-tailed Learning with Self-Adjusting Gradient Balancer
NIPS 2023
Sample-efficient Multi-objective Molecular Optimization with GFlowNets
NIPS 2023
TACR: A Table Alignment-based Cell Selection Method for HybridQA
ACL 2023
Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for Imbalanced Medical Classification
EMNLP 2023
Ord2Seq: Regarding Ordinal Regression as Label Sequence Prediction
ICCV 2023
TabCaps: A Capsule Neural Network for Tabular Data Classification with BoW Routing
ICLR 2023
Robust Image Ordinal Regression with Controllable Image Generation
IJCAI 2023
MolHF: A Hierarchical Normalizing Flow for Molecular Graph Generation
IJCAI 2023
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings
INTERSPEECH 2022
Streaming Multi-Talker ASR with Token-Level Serialized Output Training
INTERSPEECH 2022
ME-GAN: Learning Panoptic Electrocardio Representations for Multi-view ECG Synthesis Conditioned on Heart Diseases
ICML 2022
A Synthetic Prediction Market for Estimating Confidence in Published Work
AAAI 2022
DialMed: A Dataset for Dialogue-based Medication Recommendation
COLING 2022
Sound2Synth: Interpreting Sound via FM Synthesizer Parameters Estimation
IJCAI 2022
DANets: Deep Abstract Networks for Tabular Data Classification and Regression
AAAI 2022
DeepPatent: Large Scale Patent Drawing Recognition and Retrieval
WACV 2022
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?
INTERSPEECH 2022
Ultra Fast Speech Separation Model with Teacher Student Learning
INTERSPEECH 2021
Investigation of Practical Aspects of Single Channel Speech Separation for ASR
INTERSPEECH 2021
Electrocardio Panorama: Synthesizing New ECG views with Self-supervision
IJCAI 2021
Dig into Multi-modal Cues for Video Retrieval with Hierarchical Alignment
IJCAI 2021
Extractive Research Slide Generation Using Windowed Labeling Ranking
NAACL 2021
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
INTERSPEECH 2021
A Receptor Skeleton for Capsule Neural Networks
ICML 2021
Sequence-Level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models
INTERSPEECH 2021
To Choose or to Fuse? Scale Selection for Crowd Counting
AAAI 2021
1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM
INTERSPEECH 2020
An End-to-End Architecture of Online Multi-Channel Speech Separation
INTERSPEECH 2020
Speaker Attribution with Voice Profiles by Graph-Based Semi-Supervised Learning
INTERSPEECH 2020
Fast and Slow Acoustic Model
INTERSPEECH 2020
Channel-Wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music
INTERSPEECH 2020
Bandpass Noise Generation and Augmentation for Unified ASR
INTERSPEECH 2020
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement
INTERSPEECH 2020
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge
INTERSPEECH 2020
Acknowledgement Entity Recognition in CORD-19 Papers
EMNLP 2020
A Hierarchical Graph Network for 3D Object Detection on Point Clouds
CVPR 2020
X2CT-GAN: Reconstructing CT From Biplanar X-Rays With Generative Adversarial Networks
CVPR 2019
A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation
INTERSPEECH 2019
Practical Multi-fidelity Bayesian Optimization for Hyperparameter Tuning
UAI 2019
Improved Speaker-Dependent Separation for CHiME-5 Challenge
INTERSPEECH 2019
Cleaning Noisy and Heterogeneous Metadata for Record Linking across Scholarly Big Datasets
AAAI 2019
Practical Two-Step Lookahead Bayesian Optimization
NIPS 2019
Sequential Recommender System based on Hierarchical Attention Networks
IJCAI 2018
Bayesian Optimization with Gradients
NIPS 2017
The Parallel Knowledge Gradient Method for Batch Bayesian Optimization
NIPS 2016
Tibetan Unknown Word Identification from News Corpora for Supporting Lexicon-based Tibetan Word Segmentation
ACL 2015
Tibetan Unknown Word Identification from News Corpora for Supporting Lexicon-based Tibetan Word Segmentation
IJCNLP 2015
Zipf's Law and Statistical Data on Modern Tibetan
COLING 2014
Tibetan Base Noun Phrase Identification Framework Based on Chinese-Tibetan Sentence Aligned Corpus
COLING 2012
Compression Methods by Code Mapping and Code Dividing for Chinese Dictionary Stored in a Double-Array Trie
IJCNLP 2011
Tibetan Number Identification Based on Classification of Number Components in Tibetan Word Segmentation
COLING 2010