Co-occurring keywords
Papers
Consolidating and Developing Benchmarking Datasets for the Nepali Natural Language Understanding Tasks
IJCNLP 2025
Beyond Single-Value Metrics: Evaluating and Enhancing LLM Unlearning with Cognitive Diagnosis
ACL 2025
Acquiescence Bias in Large Language Models
EMNLP 2025
Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond
EMNLP 2025