Jun Zhang
115 papers · 2009–2026 · 22 conferences · across top CS/AI conferences
Achievements
Keyword Pioneer
Taxonomy Completionist (17)
Interdisciplinary Bridge
Renaissance Researcher (7)
Conference Polyglot (21)
Grand Slam
Triple Crown
Mega-Team (21)
Deep Specialist (11)
Keyword Champion (2)
Prolific Year (9)
Conference Pioneer
Keyword Collector (414)
The Questioner (3)
Century Club (99)
Unstoppable (6)
Conferences
AAAI (17)
ACL (17)
ICLR (13)
NIPS (11)
INTERSPEECH (8)
CVPR (8)
EMNLP (8)
ICML (7)
ICCV (6)
ECCV (4)
IJCAI (3)
EACL (2)
WACV (2)
AISTATS (1)
IJCNLP (1)
ACML (1)
JMLR (1)
MICCAI (1)
NAACL (1)
AACL (1)
RSS (1)
UAI (1)
Keywords
large language model (11)
question answering (5)
domain adaptation (5)
representation learning (4)
contrastive learning (4)
multiple instance learning (4)
multimodal large language model (4)
whole slide image (3)
weakly supervised learning (3)
multi-agent reinforcement learning (3)
mathematical reasoning (3)
self-supervised learning (3)
speech recognition (3)
efficient inference (3)
model compression (3)
reinforcement learning (3)
multimodal learning (3)
text classification (3)
speculative decoding (3)
whole-slide image (3)
Papers
Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
EACL 2026
SHARP: Self-adaptive Harmful Category-aware Prompt Generation for Black-box Jailbreaking
ACL 2026
See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video LLMs
ACL 2026
ReFL: Reflective Feedback Learning for Hallucination Detection of Large Language Models
ACL 2026
DisCal: Distribution-Aware Calibration for Mathematical Reasoning Under Character-Level Noisy Inputs
ACL 2026
HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference
ACL 2026
Interleaved Tool-Call Reasoning for Protein Function Understanding
ACL 2026
Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing
ACL 2026
On the Feasibility of Using MultiModal LLMs to Execute AR Social Engineering Attacks
AAAI 2026
Global-Local Confidence Fusion for Hallucination Detection in Mathematical Reasoning Task
AAAI 2026
VIL2C: Value-of-Information Aware Low-Latency Communication for Multi-Agent Reinforcement Learning
AAAI 2026
PepCCD: A Contrastive Conditioned Diffusion Framework for Target-Specific Peptide Generation
AAAI 2026
DisCo DETR: Distance-aware Multi-view Contrastive Learning for DETR Pre-training
AAAI 2026
Cross-Scale Collaboration between LLMs and Lightweight Sequential Recommenders with Domain-Specific Latent Reasoning
AAAI 2026
GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting
AAAI 2026
KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization
EACL 2026
GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering
ICLR 2025
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
AAAI 2025
Semi-Supervised Clustering Framework for Fine-grained Scene Graph Generation
AAAI 2025
CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression
AAAI 2025
Learn How to Query from Unlabeled Data Streams in Federated Learning
AAAI 2025
Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data Generation
AACL 2025
CodeDPO: Aligning Code Models with Self Generated and Verified Source Code
ACL 2025
QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions
ACL 2025
A Parallelized Framework for Simulating Large-Scale LLM Agents with Realistic Environments and Interactions
ACL 2025
Dynamic Evil Score-Guided Decoding: An Efficient Decoding Framework For Red-Team Model
ACL 2025
Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control
AISTATS 2025
SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning
EMNLP 2025
Complex Numerical Reasoning with Numerical Semantic Pre-training Framework
EMNLP 2025
Long Chain-of-Thought Fine-tuning via Understanding-to-Reasoning Transition
EMNLP 2025
SafeConf: A Confidence-Calibrated Safety Self-Evaluation Method for Large Language Models
EMNLP 2025
MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes
ICCV 2025
FinMMR: Make Financial Numerical Reasoning More Multimodal, Comprehensive, and Challenging
ICCV 2025
p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
ICCV 2025
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Decoupled Video Diffusion
ICCV 2025
Ensembling Diffusion Models via Adaptive Feature Aggregation
ICLR 2025
Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning
ICLR 2025
Why Does the Effective Context Length of LLMs Fall Short?
ICLR 2025
SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
ICLR 2025
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models
ICLR 2025
Let the Code LLM Edit Itself When You Edit the Code
ICLR 2025
AdaWorld: Learning Adaptable World Models with Latent Actions
ICML 2025
HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration
ICML 2025
C2IQL: Constraint-Conditioned Implicit Q-learning for Safe Offline Reinforcement Learning
ICML 2025
FloE: On-the-Fly MoE Inference on Memory-constrained GPU
ICML 2025
Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data Generation
IJCNLP 2025
A Novel ED Triage Framework Using Conditional Imputation, Multi-Scale Semantic Learning, and Cross-Modal Fusion
MICCAI 2025
Graph Neural Network Enhanced Retrieval for Question Answering of Large Language Models
NAACL 2025
Neural Graph Map: Dense Mapping with Efficient Loop Closure Integration
WACV 2025
ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLMs
NIPS 2024
Differentially Private Deep Learning with Importance-based Adaptive Gradient Processing
ACML 2024
Training-Free Long-Context Scaling of Large Language Models
ICML 2024
Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning
ICML 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
NIPS 2024
Semi-Open 3D Object Retrieval via Hierarchical Equilibrium on Hypergraph
NIPS 2024
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
NIPS 2024
Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning
NIPS 2024
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding
ACL 2024
On the Convergence of an Adaptive Momentum Method for Adversarial Attacks
AAAI 2024
Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model
ECCV 2024
TransLoc4D: Transformer-based 4D Radar Place Recognition
CVPR 2024
Boosting Neural Representations for Videos with a Conditional Decoder
CVPR 2024
Generalized Predictive Model for Autonomous Driving
CVPR 2024
Task-Aware Encoder Control for Deep Video Compression
CVPR 2024
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
ICLR 2024
VersVideo: Leveraging Enhanced Temporal Diffusion Models for Versatile Video Generation
ICLR 2024
GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting
ECCV 2024
Can Large Language Models Understand Spatial Audio?
INTERSPEECH 2024
Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR
INTERSPEECH 2024
L-Eval: Instituting Standardized Evaluation for Long Context Language Models
ACL 2024
Assembly Fuzzy Representation on Hypergraph for Open-Set 3D Object Retrieval
NIPS 2024
Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?
ACL 2024
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training
ACL 2023
Graph-Based Self-Learning for Robust Person Re-Identification
WACV 2023
Transferable Post-hoc Calibration on Pretrained Transformers in Noisy Text Classification
AAAI 2023
LDMIC: Learning-based Distributed Multi-view Image Coding
ICLR 2023
Exploring Low-Rank Property in Multiple Instance Learning for Whole Slide Image Classification
ICLR 2023
Sparse Mixture-of-Experts are Domain Generalizable Learners
ICLR 2023
MIMT: Masked Image Modeling Transformer for Video Compression
ICLR 2023
RLogist: Fast Observation Strategy on Whole-Slide Images with Deep Reinforcement Learning
AAAI 2023
CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling
ICML 2023
Locate, Refine and Restore: A Progressive Enhancement Network for Camouflaged Object Detection
IJCAI 2023
KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained Language Model
EMNLP 2023
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer
INTERSPEECH 2023
Language-specific Boundary Learning for Improving Mandarin-English Code-switching Speech Recognition
INTERSPEECH 2023
Generalized Relation Modeling for Transformer Tracking
CVPR 2023
Bring dialogue-context into RNN-T for streaming ASR
INTERSPEECH 2022
BMInf: An Efficient Toolkit for Big Model Inference and Tuning
ACL 2022
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining
NIPS 2022
Multi-dataset Training of Transformers for Robust Action Recognition
NIPS 2022
Node-Aligned Graph Convolutional Network for Whole-Slide Image Representation and Classification
CVPR 2022
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval
NIPS 2022
DReS-FL: Dropout-Resilient Secure Federated Learning for Non-IID Clients via Secret Data Sharing
NIPS 2022
Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire
INTERSPEECH 2022
SCL-WC: Cross-Slide Contrastive Learning for Weakly-Supervised Whole-Slide Image Classification
NIPS 2022
HMM-Free Encoder Pre-Training for Streaming RNN Transducer
INTERSPEECH 2021
Diagnose Like A Pathologist: Weakly-Supervised Pathologist-Tree Network for Slide-Level Immunohistochemical Scoring
AAAI 2021
kFolden: k-Fold Ensemble for Out-Of-Distribution Detection
EMNLP 2021
Minimizing Labeling Cost for Nuclei Instance Segmentation and Classification with Cross-domain Images and Weak Labels
AAAI 2021
Exploiting Behavioral Consistence for Universal User Representation
AAAI 2021
Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification
CVPR 2021
KERS: A Knowledge-Enhanced Framework for Recommendation Dialog Systems with Multiple Subgoals
EMNLP 2021
Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition
ICCV 2021
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval With Partial Query
ICCV 2021
A Comprehensive Survey on Image Dehazing Based on Deep Learning
IJCAI 2021
Do Not Disturb Me: Person Re-identification Under the Interference of Other Pedestrians
ECCV 2020
Complete Dictionary Learning via $\ell_p$-norm Maximization
UAI 2020
Predicting Lymph Node Metastasis Using Histopathological Images Based on Multiple Instance Learning With Deep Graph Convolution
CVPR 2020
Zero-shot Text Classification via Reinforced Self-training
ACL 2020
GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering
ECCV 2020
Session-level Language Modeling for Conversational Speech
EMNLP 2018
Three-Dimensional Hysteresis Modeling of Robotic Artificial Muscles with Application to Shape Memory Alloy Actuators
RSS 2017
Transfer Learning for Speaker Verification on Short Utterances
INTERSPEECH 2016
Saliency Detection with a Deeper Investigation of Light Field
IJCAI 2015
Reproducing Kernel Banach Spaces for Machine Learning
JMLR 2009