Papers
1,072 papers found
Clinical Text Anonymization, its Influence on Downstream NLP Tasks and the Risk of Re-Identification
Iyadh Ben Cheikh Larbi, Aljoscha Burchardt, Roland Roller
What to Pre-Train on? Efficient Intermediate Task Selection
Clifton Poth, Jonas Pfeiffer, Andreas Rücklé et al.
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi et al.
Knowledge Distillation Transfer Sets and their Impact on Downstream NLU Tasks
Charith Peris, Lizhen Tan, Thomas Gueudre et al.
Algorithmic Diversity and Tiny Models: Comparing Binary Networks and the Fruit Fly Algorithm on Document Representation Tasks
Tanise Ceron, Nhut Truong, Aurelie Herbelot
Evaluating Large Language Models on Controlled Generation Tasks
Jiao Sun, Yufei Tian, Wangchunshu Zhou et al.
Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks
Xianzhi Li, Samuel Chan, Xiaodan Zhu et al.
Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks
Kaiser Sun, Peng Qi, Yuhao Zhang et al.
Dunamu-ml’s Submissions on AVERITEC Shared Task
Heesoo Park, Dongjun Lee, Jaehyuk Kim et al.
Findings of the WMT 2024 Biomedical Translation Shared Task: Test Sets on Abstract Level
Mariana Neves, Cristian Grozea, Philippe Thomas et al.
Tokenization and Representation Biases in Multilingual Models on Dialectal NLP Tasks
Vani Kanjirangat, Tanja Samardzic, Ljiljana Dolamic et al.
SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks
Adamenko Pavel, Ivanov Mikhail, Aidar Valeev et al.
Unsupervised Multi-Task Feature Learning on Point Clouds
Kaveh Hassani, Mike Haley
Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task
Kenneth Li, Aspen K Hopkins, David Bau et al.
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks
Samyak Jain, Robert Kirk, Ekdeep Singh Lubana et al.
BEND: Benchmarking DNA Language Models on Biologically Meaningful Tasks
Frederikke Isa Marin, Felix Teufel, Marc Horlacher et al.
Training on the Test Task Confounds Evaluation and Emergence
Ricardo Dominguez-Olmedo, Florian E. Dorner, Moritz Hardt
Online Multi-Task Learning for Policy Gradient Methods
Haitham Bou Ammar, Eric Eaton, Paul Ruvolo et al.
Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization
Stone Tao, Xiaochen Li, Tongzhou Mu et al.
DeCo: Defect-Aware Modeling with Contrasting Matching for Optimizing Task Assignment in Online IC Testing
Lo Pang-Yun Ting, Yu-Hao Chiang, Yi-Tung Tsai et al.