conftrace_

Michael Shieh

8 papers · 2024–2025 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+2 more ↓

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (4) 🐝 Cross-Pollinator (12) 🌈 Renaissance Researcher (5) 🗺️ Taxonomy Completionist (17)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird

Conferences

ICLR (3) ACL (2) EMNLP (2) AAAI (1)

Top co-authors

Yuxi Xie (4) Kenji Kawaguchi (4) Yiran Zhao (3) Junxian He (2) Hannah Brown (2) Anirudh Goyal (2) James Xu Zhao (2) Xin Li (1) Qisheng Hu (1) Leon Lin (1)

Keywords

large language model (4) model alignment (2) transfer learning (2) prompt engineering (1) code generation (1) instruction tuning (1) adversarial attack (1) jailbreak attack (1) prompt optimization (1) red teaming (1) prompt perturbation (1) code optimization (1) llm alignment (1) code editing (1) adversarial suffix (1) harmful output (1) code refactoring (1) adversarial learning (1) greedy coordinate gradient (1) game theory (1)

Papers

Single Character Perturbations Break LLM Alignment AAAI 2025 Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron ICLR 2025 MixEval-X: Any-to-any Evaluations from Real-world Data Mixture ICLR 2025 LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ICLR 2025 Reasoning Robustness of LLMs to Adversarial Typographical Errors EMNLP 2024 Prompt Optimization via Adversarial In-Context Learning ACL 2024 InstructCoder: Instruction Tuning Large Language Models for Code Editing ACL 2024 Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models EMNLP 2024