Papers
Evaluating Open-Domain Dialogues in Latent Space with Next Sentence Prediction and Mutual Information
Kun Zhao, Bohao Yang, Chenghua Lin et al.
Evaluating Open-Domain Question Answering in the Era of Large Language Models
Ehsan Kamalloo, Nouha Dziri, Charles Clarke et al.
Evaluating Paraphrastic Robustness in Textual Entailment Models
Dhruv Verma, Yash Kumar Lal, Shreyashee Sinha et al.
Evaluating pragmatic abilities of image captioners on A3DS
Polina Tsvilodub, Michael Franke
Evaluating Reading Comprehension Exercises Generated by LLMs: A Showcase of ChatGPT in Education Applications
Changrong Xiao, Sean Xin Xu, Kunpeng Zhang et al.
Evaluating the Effectiveness of Natural Language Inference for Hate Speech Detection in Languages with Limited Labeled Data
Janis Goldzycher, Moritz Preisig, Chantal Amrhein et al.
Evaluating the Factual Consistency of Large Language Models Through News Summarization
Derek Tam, Anisha Mascarenhas, Shiyue Zhang et al.
Evaluating Zero-Shot Event Structures: Recommendations for Automatic Content Extraction (ACE) Annotations
Erica Cai, Brendan O’Connor
Evaluation for Change
Rishi Bommasani
Evaluation Metrics for Depth and Flow of Knowledge in Non-fiction Narrative Texts
Sachin Pawar, Girish Palshikar, Ankita Jain et al.
Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers
Israt Jahan, Md Tahmid Rahman Laskar, Chun Peng et al.
Evaluation of Question Generation Needs More References
Shinhyeok Oh, Hyojun Go, Hyeongdon Moon et al.
Event-Centric Query Expansion in Web Search
Yanan Zhang, Weijie Cui, Yangfan Zhang et al.
Event Extraction as Question Generation and Answering
Di Lu, Shihao Ran, Joel Tetreault et al.
Event-independent temporal positioning: application to French clinical text
Nesrine Bannour, Bastien Rance, Xavier Tannier et al.
EventOA: An Event Ontology Alignment Benchmark Based on FrameNet and Wikidata
Shaoru Guo, Chenhao Wang, Yubo Chen et al.
Event Semantic Knowledge in Procedural Text Understanding
Ghazaleh Kazeminejad, Martha Palmer
Everything you need to know about Multilingual LLMs: Towards fair, performant and reliable models for languages of the world
Sunayana Sitaram, Monojit Choudhury, Barun Patra et al.
EvolveMT: an Ensemble MT Engine Improving Itself with Usage Only
Kamer Yüksel, Ahmet Gunduz, Mohamed Al-badrashiny et al.
Examining Bias in Opinion Summarisation through the Perspective of Opinion Diversity
Nannan Huang, Lin Tian, Haytham Fayek et al.
Examining the Causal Impact of First Names on Language Models: The Case of Social Commonsense Reasoning
Sullam Jeoung, Jana Diesner, Halil Kilicoglu
ExASAG: Explainable Framework for Automatic Short Answer Grading
Maximilian Tornqvist, Mosleh Mahamud, Erick Mendez Guzman et al.
Exclusive Supermask Subnetwork Training for Continual Learning
Prateek Yadav, Mohit Bansal
Expanding Scope: Adapting English Adversarial Attacks to Chinese
Hanyu Liu, Chengyuan Cai, Yanjun Qi
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering
Yung-Sung Chuang, Wei Fang, Shang-Wen Li et al.