Papers
CLEAR: Can Language Models Really Understand Causal Graphs?
Sirui Chen, Mengying Xu, Kun Wang et al.
CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios
Zetian Ouyang, Yishuai Qiu, Linlin Wang et al.
ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures
Tobias Schimanski, Jingwei Ni, Roberto Spacey Martín et al.
C-LLM: Learn to Check Chinese Spelling Errors Character by Character
Kunting Li, Yong Hu, Liang He et al.
CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models
Zexuan Qiu, Jingjing Li, Shijue Huang et al.
Closing the Loop: Learning to Generate Writing Feedback via Language Model Simulated Student Revisions
Inderjeet Jayakumar Nair, Jiaye Tan, Xiaotian Su et al.
CloudSheep System for WMT24 Discourse-Level Literary Translation
Lisa Liu, Ryan Liu, Angela Tsai et al.
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
Yuan Ge, Yilun Liu, Chi Hu et al.
Cluster-Norm for Unsupervised Probing of Knowledge
Walter Laurito, Sharan Maiya, Grégoire Dhimoïla et al.
CMD: a framework for Context-aware Model self-Detoxification
Zecheng Tang, Keyan Zhou, Juntao Li et al.
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research
Sian-Yao Huang, Cheng-Lin Yang, Che-Yu Lin et al.
CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models
Jiawei Gu, Zacc Yang, Chuanghao Ding et al.
CNEQ: Incorporating numbers into Knowledge Graph Reasoning
Xianshu Peng, Wei Wei, Kaihe Xu et al.
CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models
Zi Gong, Hang Yu, Cong Liao et al.
Cochrane-auto: An Aligned Dataset for the Simplification of Biomedical Abstracts
Jan Bakker, Jaap Kamps
CoCoHD: Congress Committee Hearing Dataset
Arnav Hiray, Yunsong Liu, Mingxiao Song et al.
CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds
Min-Hsuan Yeh, Ruyuan Wan, Ting-Hao Kenneth Huang
CoCoST: Automatic Complex Code Generation with Online Searching and Correctness Testing
Xinyi He, Jiaru Zou, Yun Lin et al.
CodeAgent: Autonomous Communicative Agents for Code Review
Xunzhu Tang, Kisub Kim, Yewei Song et al.
CodeFort: Robust Training for Code Generation Models
Yuhao Zhang, Shiqi Wang, Haifeng Qian et al.
CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
Batu Guan, Yao Wan, Zhangqian Bi et al.
CodeJudge: Evaluating Code Generation with Large Language Models
Weixi Tong, Tianyi Zhang
Code Membership Inference for Detecting Unauthorized Data Use in Code Pre-trained Language Models
Sheng Zhang, Hui Li, Rongrong Ji
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs
Haritz Puerto, Martin Tutek, Somak Aditya et al.
Code Representation Pre-training with Complements from Program Executions
Jiabo Huang, Jianyu Zhao, Yuyang Rong et al.