Jun Zhang

115 papers · 2009–2026 · 22 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (17) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🌍 Conference Polyglot (21)

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (17) 🧭 Keyword Pioneer 🏆 Grand Slam 👑 Triple Crown 👥 Mega-Team (21) 🔬 Deep Specialist (11) 🏆 Keyword Champion (2) ⚡ Prolific Year (9) 🚀 Conference Pioneer 🗃️ Keyword Collector (414) ❓ The Questioner (3) 💎 Century Club (99) 🔥 Unstoppable (6)

Conferences

AAAI (17) ACL (17) ICLR (13) NIPS (11) INTERSPEECH (8) CVPR (8) EMNLP (8) ICML (7) ICCV (6) ECCV (4) IJCAI (3) EACL (2) WACV (2) AISTATS (1) IJCNLP (1) ACML (1) JMLR (1) MICCAI (1) NAACL (1) AACL (1) RSS (1) UAI (1)

Top co-authors

Xinjie Zhang (9) Xiao Han (8) Lu Lu (7) Yan Wang (6) Lidan Shou (6) Dailan He (6) Zejun Ma (6) Xinran Li (6) Xingtong Ge (6) Tongda Xu (6)

Keywords

large language model (11) question answering (5) domain adaptation (5) representation learning (4) contrastive learning (4) multiple instance learning (4) multimodal large language model (4) whole slide image (3) weakly supervised learning (3) multi-agent reinforcement learning (3) mathematical reasoning (3) self-supervised learning (3) speech recognition (3) efficient inference (3) model compression (3) reinforcement learning (3) multimodal learning (3) text classification (3) speculative decoding (3) whole-slide image (3)

Papers

Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning EACL 2026 SHARP: Self-adaptive Harmful Category-aware Prompt Generation for Black-box Jailbreaking ACL 2026 See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video LLMs ACL 2026 ReFL: Reflective Feedback Learning for Hallucination Detection of Large Language Models ACL 2026 DisCal: Distribution-Aware Calibration for Mathematical Reasoning Under Character-Level Noisy Inputs ACL 2026 HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference ACL 2026 Interleaved Tool-Call Reasoning for Protein Function Understanding ACL 2026 Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing ACL 2026 On the Feasibility of Using MultiModal LLMs to Execute AR Social Engineering Attacks AAAI 2026 Global-Local Confidence Fusion for Hallucination Detection in Mathematical Reasoning Task AAAI 2026 VIL2C: Value-of-Information Aware Low-Latency Communication for Multi-Agent Reinforcement Learning AAAI 2026 PepCCD: A Contrastive Conditioned Diffusion Framework for Target-Specific Peptide Generation AAAI 2026 DisCo DETR: Distance-aware Multi-view Contrastive Learning for DETR Pre-training AAAI 2026 Cross-Scale Collaboration between LLMs and Lightweight Sequential Recommenders with Domain-Specific Latent Reasoning AAAI 2026 GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting AAAI 2026 KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization EACL 2026 GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering ICLR 2025 Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models AAAI 2025 Semi-Supervised Clustering Framework for Fine-grained Scene Graph Generation AAAI 2025 CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression AAAI 2025 Learn How to Query from Unlabeled Data Streams in Federated Learning AAAI 2025 Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data Generation AACL 2025 CodeDPO: Aligning Code Models with Self Generated and Verified Source Code ACL 2025 QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions ACL 2025 A Parallelized Framework for Simulating Large-Scale LLM Agents with Realistic Environments and Interactions ACL 2025 Dynamic Evil Score-Guided Decoding: An Efficient Decoding Framework For Red-Team Model ACL 2025 Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control AISTATS 2025 SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning EMNLP 2025 Complex Numerical Reasoning with Numerical Semantic Pre-training Framework EMNLP 2025 Long Chain-of-Thought Fine-tuning via Understanding-to-Reasoning Transition EMNLP 2025 SafeConf: A Confidence-Calibrated Safety Self-Evaluation Method for Large Language Models EMNLP 2025 MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes ICCV 2025 FinMMR: Make Financial Numerical Reasoning More Multimodal, Comprehensive, and Challenging ICCV 2025 p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay ICCV 2025 DimensionX: Create Any 3D and 4D Scenes from a Single Image with Decoupled Video Diffusion ICCV 2025 Ensembling Diffusion Models via Adaptive Feature Aggregation ICLR 2025 Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning ICLR 2025 Why Does the Effective Context Length of LLMs Fall Short? ICLR 2025 SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration ICLR 2025 Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models ICLR 2025 Let the Code LLM Edit Itself When You Edit the Code ICLR 2025 AdaWorld: Learning Adaptable World Models with Latent Actions ICML 2025 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration ICML 2025 C2IQL: Constraint-Conditioned Implicit Q-learning for Safe Offline Reinforcement Learning ICML 2025 FloE: On-the-Fly MoE Inference on Memory-constrained GPU ICML 2025 Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data Generation IJCNLP 2025 A Novel ED Triage Framework Using Conditional Imputation, Multi-Scale Semantic Learning, and Cross-Modal Fusion MICCAI 2025 Graph Neural Network Enhanced Retrieval for Question Answering of Large Language Models NAACL 2025 Neural Graph Map: Dense Mapping with Efficient Loop Closure Integration WACV 2025 $\texttt{ConflictBank}$: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLMs NIPS 2024 Differentially Private Deep Learning with Importance-based Adaptive Gradient Processing ACML 2024 Training-Free Long-Context Scaling of Large Language Models ICML 2024 Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning ICML 2024 Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability NIPS 2024 Semi-Open 3D Object Retrieval via Hierarchical Equilibrium on Hypergraph NIPS 2024 SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words NIPS 2024 Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning NIPS 2024 Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding ACL 2024 On the Convergence of an Adaptive Momentum Method for Adversarial Attacks AAAI 2024 Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model ECCV 2024 TransLoc4D: Transformer-based 4D Radar Place Recognition CVPR 2024 Boosting Neural Representations for Videos with a Conditional Decoder CVPR 2024 Generalized Predictive Model for Autonomous Driving CVPR 2024 Task-Aware Encoder Control for Deep Video Compression CVPR 2024 Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models ICLR 2024 VersVideo: Leveraging Enhanced Temporal Diffusion Models for Versatile Video Generation ICLR 2024 GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting ECCV 2024 Can Large Language Models Understand Spatial Audio? INTERSPEECH 2024 Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR INTERSPEECH 2024 L-Eval: Instituting Standardized Evaluation for Long Context Language Models ACL 2024 Assembly Fuzzy Representation on Hypergraph for Open-Set 3D Object Retrieval NIPS 2024 Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning? ACL 2024 CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training ACL 2023 Graph-Based Self-Learning for Robust Person Re-Identification WACV 2023 Transferable Post-hoc Calibration on Pretrained Transformers in Noisy Text Classification AAAI 2023 LDMIC: Learning-based Distributed Multi-view Image Coding ICLR 2023 Exploring Low-Rank Property in Multiple Instance Learning for Whole Slide Image Classification ICLR 2023 Sparse Mixture-of-Experts are Domain Generalizable Learners ICLR 2023 MIMT: Masked Image Modeling Transformer for Video Compression ICLR 2023 RLogist: Fast Observation Strategy on Whole-Slide Images with Deep Reinforcement Learning AAAI 2023 CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling ICML 2023 Locate, Refine and Restore: A Progressive Enhancement Network for Camouflaged Object Detection IJCAI 2023 KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained Language Model EMNLP 2023 Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer INTERSPEECH 2023 Language-specific Boundary Learning for Improving Mandarin-English Code-switching Speech Recognition INTERSPEECH 2023 Generalized Relation Modeling for Transformer Tracking CVPR 2023 Bring dialogue-context into RNN-T for streaming ASR INTERSPEECH 2022 BMInf: An Efficient Toolkit for Big Model Inference and Tuning ACL 2022 PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining NIPS 2022 Multi-dataset Training of Transformers for Robust Action Recognition NIPS 2022 Node-Aligned Graph Convolutional Network for Whole-Slide Image Representation and Classification CVPR 2022 Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval NIPS 2022 DReS-FL: Dropout-Resilient Secure Federated Learning for Non-IID Clients via Secret Data Sharing NIPS 2022 Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire INTERSPEECH 2022 SCL-WC: Cross-Slide Contrastive Learning for Weakly-Supervised Whole-Slide Image Classification NIPS 2022 HMM-Free Encoder Pre-Training for Streaming RNN Transducer INTERSPEECH 2021 Diagnose Like A Pathologist: Weakly-Supervised Pathologist-Tree Network for Slide-Level Immunohistochemical Scoring AAAI 2021 kFolden: k-Fold Ensemble for Out-Of-Distribution Detection EMNLP 2021 Minimizing Labeling Cost for Nuclei Instance Segmentation and Classification with Cross-domain Images and Weak Labels AAAI 2021 Exploiting Behavioral Consistence for Universal User Representation AAAI 2021 Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification CVPR 2021 KERS: A Knowledge-Enhanced Framework for Recommendation Dialog Systems with Multiple Subgoals EMNLP 2021 Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition ICCV 2021 Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval With Partial Query ICCV 2021 A Comprehensive Survey on Image Dehazing Based on Deep Learning IJCAI 2021 Do Not Disturb Me: Person Re-identification Under the Interference of Other Pedestrians ECCV 2020 Complete Dictionary Learning via $\ell_p$-norm Maximization UAI 2020 Predicting Lymph Node Metastasis Using Histopathological Images Based on Multiple Instance Learning With Deep Graph Convolution CVPR 2020 Zero-shot Text Classification via Reinforced Self-training ACL 2020 GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering ECCV 2020 Session-level Language Modeling for Conversational Speech EMNLP 2018 Three-Dimensional Hysteresis Modeling of Robotic Artificial Muscles with Application to Shape Memory Alloy Actuators RSS 2017 Transfer Learning for Speaker Verification on Short Utterances INTERSPEECH 2016 Saliency Detection with a Deeper Investigation of Light Field IJCAI 2015 Reproducing Kernel Banach Spaces for Machine Learning JMLR 2009