model unlearning

29 papers

Explore in graph

Co-occurring keywords

large language model (12755) machine unlearning (270) knowledge editing (283) privacy preservation (376) backdoor attack (377) adversarial attack (1599) diffusion model (3720) selective forgetting (30) backdoor defense (54) adversarial robustness (1335)

Papers

Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning? AAAI 2026

Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization ICCV 2025

SUA: Stealthy Multimodal Large Language Model Unlearning Attack EMNLP 2025

Model Unlearning via Sparse Autoencoder Subspace Guided Projections EMNLP 2025

REVIVING YOUR MNEME: Predicting The Side Effects of LLM Unlearning and Fine-Tuning via Sparse Model Diffing EMNLP 2025

Knowledge Decoupling via Orthogonal Projection for Lifelong Editing of Large Language Models ACL 2025

Atyaephyra at SemEval-2025 Task 4: Low-Rank Negative Preference Optimization ACL 2025

ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging. ACL 2025

Human-Inspired Obfuscation for Model Unlearning: Local and Global Strategies with Hyperbolic Representations EMNLP 2025

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense NAACL 2025

Adaptive Graph Unlearning IJCAI 2025

AUTE: Peer-Alignment and Self-Unlearning Boost Adversarial Robustness for Training Ensemble Models AAAI 2025

CURE: Controlled Unlearning for Robust Embeddings — Mitigating Conceptual Shortcuts in Pre-Trained Language Models EMNLP 2025

Targeted Forgetting of Image Subgroups in CLIP Models CVPR 2025

NoT: Federated Unlearning via Weight Negation CVPR 2025

ESC: Erasing Space Concept for Knowledge Deletion CVPR 2025

On Effects of Steering Latent Representation for Large Language Model Unlearning AAAI 2025

Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Operation AAAI 2024

Continual Forgetting for Pre-trained Vision Models CVPR 2024

Demystifying Verbatim Memorization in Large Language Models EMNLP 2024

Leveraging Catastrophic Forgetting to Develop Safe Diffusion Models against Malicious Finetuning NIPS 2024

MACE: Mass Concept Erasure in Diffusion Models CVPR 2024

SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning EMNLP 2024

Unveiling and Mitigating Backdoor Vulnerabilities based on Unlearning Weight Changes and Backdoor Activeness NIPS 2024

Mitigating Backdoor Attack by Injecting Proactive Defensive Backdoor NIPS 2024