Papers

101 papers found
Toxicity Detection for Free
Zhanhao Hu, Julien Piet, Geng Zhao et al.
2024 NIPS
Soft-Label Integration for Robust Toxicity Classification
Zelei Cheng, Xian Wu, Jiahao Yu et al.
2024 NIPS
WLV-RIT at SemEval-2021 Task 5: A Neural Transformer Framework for Detecting Toxic Spans
Tharindu Ranasinghe, Diptanu Sarkar, Marcos Zampieri et al.
2021 ACL
WLV-RIT at SemEval-2021 Task 5: A Neural Transformer Framework for Detecting Toxic Spans
Tharindu Ranasinghe, Diptanu Sarkar, Marcos Zampieri et al.
2021 IJCNLP
2025 NAACL
WLV-RIT at SemEval-2021 Task 5: A Neural Transformer Framework for Detecting Toxic Spans
Tharindu Ranasinghe, Diptanu Sarkar, Marcos Zampieri et al.
2021 SEMEVAL
A Hybrid Confidence-Aware Framework for Arabic Toxicity Detection in Social Media
Fawzia Zaal Alanazi, Asma Mohammed Alamri, Arwa Bin Saleh et al.
2026 EACL
A Multi-Labeled Dataset for Indonesian Discourse: Examining Toxicity, Polarization, and Demographics Information
Lucky Susanto, Musa Izzanardi Wijanarko, Prasetia Anugrah Pratama et al.
2025 ACL
2024 EMNLP
Quantifying the Ethical Dilemma of Using Culturally Toxic Training Data in AI Tools for Indigenous Languages
Pedro Henrique Domingues, Claudio Santos Pinhanez, Paulo Cavalin et al.
2024 COLING
2021 ICML