conftrace_

Sherif Saad

3 papers · 2023–2025 · 3 conferences · across top CS/AI conferences

Achievements

🐝 Cross-Pollinator (15) 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (3) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10)

🧭 Keyword Pioneer 🌱 Topic Pioneer 📈 Trend Setter

Conferences

EACL (1) EMNLP (1) NAACL (1)

Top co-authors

Omar Mahmoud (2) Aly Kassem (2) Hyunwoo Kim (1) Yulia Tsvetkov (1) Santu Rana (1) Yejin Choi (1) Niloofar Mireshghallah (1) Aly M. Kassem (1)

Keywords

proximal policy optimization (2) language model (2) machine unlearning (1) instruction tuning (1) adversarial attack (1) reinforcement learning feedback (1) prompt optimization (1) privacy leakage (1) memorization mitigation (1) large language model (1) unlearning technique (1) language model memorization (1) paraphrasing policy (1) memorization risk (1) targeted paraphrasing (1) reinforcement learning (1) mutual implication score (1) privacy preservation (1)

Papers

ALPACA AGAINST VICUNA: Using LLMs to Uncover Memorization of LLMs · NAACL 2025

Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion · EACL 2024

Preserving Privacy Through Dememorization: An Unlearning Technique For Mitigating Memorization Risks In Language Models · EMNLP 2023