Co-occurring keywords
Papers
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
EMNLP 2024
LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments
ACL 2024
E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models
ACL 2024
NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Queries
ACL 2024