Jindong Gu

55 papers · 2020–2026 · 10 top CS/AI conferences

Achievements

🏃 Academic Marathon (5) 🐝 Cross-Pollinator (12) 🌍 Conference Polyglot (10) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (9) 🧭 Keyword Pioneer 🤝 Dynamic Duo (21) 👑 Triple Crown 🏆 Grand Slam 🧬 Topic Evolution 🏆 Keyword Champion (2) ❓ The Questioner (8) ⚡ Prolific Year (20) 🗃️ Keyword Collector (193) 🔥 Unstoppable (6) 💎 Century Club (52)

Conferences

CVPR (10) ECCV (10) AAAI (7) ACL (6) EMNLP (6) ICCV (5) ICLR (5) ICML (2) NIPS (2) WACV (2)

Top co-authors

Volker Tresp (22) Philip Torr (18) Haokun Chen (6) Gengyuan Zhang (5) Shuo Chen (5) Yao Zhang (5) Xiaojun Jia (5) Denis Krompass (4) Yao Qin (4) Xiaochun Cao (4)

Keywords

large language model (6) text-to-image generation (4) federated learning (4) vision-language model (4) adversarial attack (3) diffusion model (3) safety alignment (3) multimodal learning (3) capsule network (3) multimodal large language model (3) adversarial robustness (3) data heterogeneity (2) adversarial perturbation (2) convolutional neural network (2) game theory (2) dynamic routing (2) direct preference optimization (2) image generation (2) knowledge editing (2) visual question answering (2)

Papers

Can Editing LLMs Inject Harm? · AAAI 2026
Knowledge Control for Responsible Generative AI: Bridging Academia, Industry, and Society · ACL 2026
AUVIC: Adversarial Unlearning of Visual Concepts for Multi-modal Large Language Models · AAAI 2026
AlignGuard: Scalable Safety Alignment for Text-to-Image Generation · ICCV 2025
FedPop: Federated Population-based Hyperparameter Tuning · AAAI 2025
Multimodal Pragmatic Jailbreak on Text-to-image Models · ACL 2025
Benchmarking Open-ended Audio Dialogue Understanding for Large Audio-Language Models · ACL 2025
Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation · ACL 2025
FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings · ACL 2025
Localizing Events in Videos with Multimodal Queries · CVPR 2025
FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models · CVPR 2025
Not Just Text: Uncovering Vision Modality Typographic Threats in Image Generation Models · CVPR 2025
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos · CVPR 2025
Reimagining Safety Alignment with An Image · EMNLP 2025
Can an Individual Manipulate the Collective Decisions of Multi-Agents? · EMNLP 2025
Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs · EMNLP 2025
PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving · EMNLP 2025
LLM Jailbreak Detection for (Almost) Free! · EMNLP 2025
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention · ICCV 2025
Improved Techniques for Optimization-Based Jailbreaking on Large Language Models · ICLR 2025
Primitive Vision: Improving Diagram Understanding in MLLMs · ICML 2025
Can Multimodal Large Language Models Truly Perform Multimodal In-Context Learning? · WACV 2025
CL-Cross VQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering · WACV 2025
Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation · CVPR 2024
Initialization Matters for Adversarial Transfer Learning · CVPR 2024
Hide in Thicket: Generating Imperceptible and Rational Adversarial Perturbations on 3D Point Clouds · CVPR 2024
Latent Guard: a Safety Framework for Text-to-image Generation · ECCV 2024
MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models · ECCV 2024
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models · ECCV 2024
Improving Adversarial Transferability via Model Alignment · ECCV 2024
Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution · ECCV 2024
Dataset Distillation by Automatic Training Trajectories · ECCV 2024
Visual Question Decomposition on Multimodal Large Language Models · EMNLP 2024
Influencer Backdoor Attack on Semantic Segmentation · ICLR 2024
Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images · ICLR 2024
Can Large Language Model Agents Simulate Human Trust Behavior? · NIPS 2024
FedDAT: An Approach for Foundation Model Finetuning in Multi-Modal Heterogeneous Federated Learning · AAAI 2024
Does Few-Shot Learning Suffer from Backdoor Attacks? · AAAI 2024
Provably Better Explanations with Optimized Aggregation of Feature Attributions · ICML 2024
Discretization-Induced Dirichlet Posterior for Robust Uncertainty Quantification on Regression · AAAI 2024
An Image Is Worth 1000 Lies: Transferability of Adversarial Images across Prompts on Vision-Language Models · ICLR 2024
Do DALL-E and Flamingo Understand Each Other? · ICCV 2023
FRAug: Tackling Federated Learning with Non-IID Features via Representation Augmentation · ICCV 2023
Multi-Event Video-Text Retrieval · ICCV 2023
Backdoor Defense via Adaptively Splitting Poisoned Dataset · CVPR 2023
ECOLA: Enhancing Temporal Knowledge Embeddings with Contextualized Language Representations · ACL 2023
Benchmarking Robustness of Adaptation Methods on Pre-trained Vision-Language Models · NIPS 2023
Are Vision Transformers Robust to Patch Perturbations? · ECCV 2022
Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal · ECCV 2022
SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness · ECCV 2022
Towards Efficient Adversarial Training on Vision Transformers · ECCV 2022
Capsule Network Is Not More Robust Than Convolutional Network · CVPR 2021
Effective and Efficient Vote Attack on Capsule Networks · ICLR 2021
Interpretable Graph Capsule Networks for Object Recognition · AAAI 2021
Improving the Robustness of Capsule Networks to Image Affine Transformations · CVPR 2020