conftrace_

Reshmi Ghosh

4 papers · 2023–2026 · 3 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+3 more ↓

🐝 Cross-Pollinator (14) 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (2) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (12)

🧭 Keyword Pioneer 👥 Mega-Team (21) ❓ The Questioner

Conferences

EMNLP (2) EACL (1) NIPS (1)

Top co-authors

Yuval Lemberg (1) Mario Fritz (1) Lea Schönherr (1) Soundararajan Srinivasan (1) Dmitrii Petrov (1) Yun Huang (1) Tiffany Knearem (1) Samyadeep Basu (1) Fineas Silaghi (1) Abhilasha Lodha (1)

Keywords

large language model (2) fisher information (1) adversarial attack (1) value alignment (1) prompt optimization (1) security evaluation (1) prompt injection (1) language encoder (1) defense mechanism (1) security vulnerability (1) attack success rate (1) layer selection (1) selective fine-tuning (1) reward poisoning (1) ethical alignment (1) parameter-efficient method (1) model defense (1) human-ai alignment (1) contextual evaluation (1) societal value (1)

Papers

Are My Optimized Prompts Compromised? Exploring Vulnerabilities of LLM-based Optimizers EACL 2026 ValueCompass: A Framework for Measuring Contextual Value Alignment Between Human and LLMs EMNLP 2025 Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition NIPS 2024 On Surgical Fine-tuning for Language Encoders EMNLP 2023