Eugene Jang
6 papers · 2021–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (14) π Conference Polyglot (3) π Renaissance Researcher (6) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (22)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π₯
Unstoppable
(5)
Conferences
NAACL (3)
ACL (2)
EMNLP (1)
Top co-authors
Keywords
large language model
(2)
dark web
(2)
domain adaptation
(2)
response generation
(1)
text analysis
(1)
text representation
(1)
bias detection
(1)
responsible ai
(1)
token classification
(1)
document analysis
(1)
language model
(1)
domain-specific pretraining
(1)
adversarial input
(1)
masked language modeling
(1)
pretrained language model
(1)
fairness evaluation
(1)
negative sampling
(1)
model debiasing
(1)
web mining
(1)
linguistic analysis
(1)
Papers
Improbable Bigrams Expose Vulnerabilities of Incomplete Tokens in Byte-Level Tokenizers
EMNLP 2025
Ignore Me But Donβt Replace Me: Utilizing Non-Linguistic Elements for Pretraining on the Cybersecurity Domain
NAACL 2024
DarkBERT: A Language Model for the Dark Side of the Internet
ACL 2023
WinoQueer: A Community-in-the-Loop Benchmark for Anti-LGBTQ+ Bias in Large Language Models
ACL 2023
Shedding New Light on the Language of the Dark Web
NAACL 2022
Generating Negative Samples by Manipulating Golden Responses for Unsupervised Learning of a Response Evaluation Model
NAACL 2021