Michael Shieh
8 papers · 2024–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (4) π Cross-Pollinator (12) π Renaissance Researcher (5) πΊοΈ Taxonomy Completionist (17)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
Conferences
ICLR (3)
ACL (2)
EMNLP (2)
AAAI (1)
Top co-authors
Keywords
large language model
(4)
model alignment
(2)
transfer learning
(2)
prompt engineering
(1)
code generation
(1)
instruction tuning
(1)
adversarial attack
(1)
jailbreak attack
(1)
prompt optimization
(1)
red teaming
(1)
prompt perturbation
(1)
code optimization
(1)
llm alignment
(1)
code editing
(1)
adversarial suffix
(1)
harmful output
(1)
code refactoring
(1)
adversarial learning
(1)
greedy coordinate gradient
(1)
game theory
(1)
Papers
Single Character Perturbations Break LLM Alignment
AAAI 2025
Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron
ICLR 2025
MixEval-X: Any-to-any Evaluations from Real-world Data Mixture
ICLR 2025
LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
ICLR 2025
Reasoning Robustness of LLMs to Adversarial Typographical Errors
EMNLP 2024
Prompt Optimization via Adversarial In-Context Learning
ACL 2024
InstructCoder: Instruction Tuning Large Language Models for Code Editing
ACL 2024
Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models
EMNLP 2024