Papers
RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian
Adrian Cosma, Ioan-Bogdan Iordache, Paolo Rosso
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions
Yuansen Zhang, Xiao Wang, Zhiheng Xi et al.
RT-VQ2A2: Real Time Vector Quantized Question Answering with ASR
Kyungho Kim, Seongmin Park, Jihwa Lee
RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict
Yirong Zeng, Xiao Ding, Yi Zhao et al.
RuBia: A Russian Language Bias Detection Dataset
Veronika Grigoreva, Anastasiia Ivanova, Ilseyar Alimova et al.
Russian Learner Corpus: Towards Error-Cause Annotation for L2 Russian
Daniil Kosakin, Sergei Obiedkov, Ivan Smirnov et al.
SaGE: Evaluating Moral Consistency in Large Language Models
Vamshi Krishna Bonagiri, Sreeram Vennam, Priyanshul Govil et al.
Saliency-Aware Interpolative Augmentation for Multimodal Financial Prediction
Samyak Jain, Parth Chhabra, Atula Tejaswi Neerkaje et al.
Samayik: A Benchmark and Dataset for English-Sanskrit Translation
Ayush Maheshwari, Ashim Gupta, Amrith Krishna et al.
SamróMur MilljóN: An ASR Corpus of One Million Verified Read Prompts in Icelandic
Carlos Daniel Hernandez Mena, Þorsteinn Daði Gunnarsson, Jon Gudnason
Sarcasm Detection in a Disaster Context
Tiberiu Sosea, Junyi Jessy Li, Cornelia Caragea
SarcNet: A Multilingual Multimodal Sarcasm Detection Dataset
Tan Yue, Xuzhao Shi, Rui Mao et al.
Scalable Patent Classification with Aggregated Multi-View Ranking
Dan Li, Vikrant Yadav, Zi Long Zhu et al.
ScalarLab@TRAC2024: Exploring Machine Learning Techniques for Identifying Potential Offline Harm in Multilingual Commentaries
Anagha H C, Saatvik M. Krishna, Soumya Sangam Jha et al.
Scale-VAE: Preventing Posterior Collapse in Variational Autoencoder
Tianbao Song, Jingbo Sun, Xin Liu et al.
Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment
Feifan Song, Bowen Yu, Hao Lang et al.
Scansion-based Lyrics Generation
Yiwen Chen, Simone Teufel
Schema-based Data Augmentation for Event Extraction
Xiaomeng Jin, Heng Ji
Schema Learning Corpus: Data and Annotation Focused on Complex Events
Song Chen, Jennifer Tracey, Ann Bies et al.
SciDMT: A Large-Scale Corpus for Detecting Scientific Mentions
Huitong Pan, Qi Zhang, Cornelia Caragea et al.
SciMRC: Multi-perspective Scientific Machine Reading Comprehension
Xiao Zhang, Heqi Zheng, Yuxiang Nie et al.
SciNews: From Scholarly Complexities to Public Narratives – a Dataset for Scientific News Report Generation
Dongqi Liu, Yifan Wang, Jia Loy et al.