Papers
nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow
ACL 2025
Teaching an Old LLM Secure Coding: Localized Preference Optimization on Distilled Preferences
ACL 2025
mHumanEval - A Multilingual Benchmark to Evaluate Large Language Models for Code Generation
NAACL 2025
HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task
ACL 2025
Core Intelligence at SemEval-2025 Task 8: Multi-hop LLM Agent for Tabular Question Answering
SemEval 2025