Sander Schulhoff
3 papers · 2023–2023 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
๐ Conference Polyglot (2) ๐ Renaissance Researcher (6) ๐ Interdisciplinary Bridge ๐บ๏ธ Taxonomy Completionist (15) ๐งญ Keyword Pioneer
๐ฃ
Hot Topic Early Bird
๐
Cross-Pollinator
(15)
Conferences
EMNLP (2)
NIPS (1)
Top co-authors
Keywords
large language model
(2)
imitation learning
(1)
sentiment analysis
(1)
text classification
(1)
text analysis
(1)
reward learning
(1)
human feedback
(1)
adversarial attack
(1)
pairwise comparison
(1)
benchmark dataset
(1)
adversarial prompt
(1)
prompt injection
(1)
security vulnerability
(1)
financial text
(1)
monetary policy
(1)
prompt hacking
(1)
reinforcement learning
(1)
dissent quantification
(1)
Papers
BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
NIPS 2023
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition
EMNLP 2023
GPT Deciphering Fedspeak: Quantifying Dissent Among Hawks and Doves
EMNLP 2023