Papers
AutoTriggER: Label-Efficient and Robust Named Entity Recognition with Auxiliary Trigger Extraction
Dong-Ho Lee, Ravi Kiran Selvam, Sheikh Muhammad Sarwar et al.
A weakly supervised textual entailment approach to zero-shot text classification
Marc Pàmies, Joan Llop, Francesco Multari et al.
Bag of Tricks for In-Distribution Calibration of Pretrained Transformers
Jaeyoung Kim, Dongbin Na, Sungchul Choi et al.
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla
Abhik Bhattacharjee, Tahmid Hasan, Wasi Uddin Ahmad et al.
Behavior Cloned Transformers are Neurosymbolic Reasoners
Ruoyao Wang, Peter Jansen, Marc-Alexandre Côté et al.
BENCHić-lang: A Benchmark for Discriminating between Bosnian, Croatian, Montenegrin and Serbian
Peter Rupnik, Taja Kuzman, Nikola Ljubešić
Benchmark Data and Evaluation Framework for Intent Discovery Around COVID-19 Vaccine Hesitancy
Shai Gretz, Assaf Toledo, Roni Friedman et al.
Benchmarking Long-tail Generalization with Likelihood Splits
Ameya Godbole, Robin Jia
BERT Is Not The Count: Learning to Match Mathematical Statements with Proofs
Weixian Waylon Li, Yftah Ziser, Maximin Coavoux et al.
BERT Shows Garden Path Effects
Tovah Irwin, Kyra Wilson, Alec Marantz
Better Pre-Training by Reducing Representation Confusion
Haojie Zhang, Mingfei Liang, Ruobing Xie et al.
BEVERS: A General, Simple, and Performant Framework for Automatic Fact Verification
Mitchell DeHaven, Stephen Scott
Bias assessment for experts in discrimination, not in computer science
Laura Alonso Alemany, Luciana Benotti, Hernán Maina et al.
BLM-AgrF: A New French Benchmark to Investigate Generalization of Agreement in Neural Networks
Aixiu An, Chunyang Jiang, Maria A. Rodriguez et al.
Bootstrapping Multilingual Semantic Parsers using Large Language Models
Abhijeet Awasthi, Nitish Gupta, Bidisha Samanta et al.
Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints
Albert Lu, Hongxin Zhang, Yanzhe Zhang et al.
Bridging Argument Quality and Deliberative Quality Annotations with Adapters
Neele Falk, Gabriella Lapesa
Bridging the Gap Between BabelNet and HowNet: Unsupervised Sense Alignment and Sememe Prediction
Xiang Zhang, Ning Shi, Bradley Hauer et al.
Bridging the Gap between Native Text and Translated Text through Adversarial Learning: A Case Study on Cross-Lingual Event Extraction
Pengfei Yu, Jonathan May, Heng Ji
Bridging the Gap between Pre-Training and Fine-Tuning for Commonsense Generation
Haoran Yang, Yan Wang, Piji Li et al.
Building Stereotype Repositories with Complementary Approaches for Scale and Depth
Sunipa Dev, Akshita Jha, Jaya Goyal et al.
CALM-Bench: A Multi-task Benchmark for Evaluating Causality-Aware Language Models
Dhairya Dalal, Paul Buitelaar, Mihael Arcan
Can BERT eat RuCoLA? Topological Data Analysis to Explain
Irina Proskurina, Ekaterina Artemova, Irina Piontkovskaya
Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers
Chia-Chien Hung, Anne Lauscher, Dirk Hovy et al.