Papers
Beyond Black Box AI generated Plagiarism Detection: From Sentence to Document Level
Ali Quidwai, Chunhui Li, Parijat Dube
Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval
John Wieting, Jonathan Clark, William Cohen et al.
Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning
Barun Patra, Saksham Singhal, Shaohan Huang et al.
Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models
Yuhui Zhang, Michihiro Yasunaga, Zhengping Zhou et al.
Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation
Yaoming Zhu, Zewei Sun, Shanbo Cheng et al.
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Momchil Hardalov, Pepa Atanasova, Todor Mihaylov et al.
Bhasa-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages
Yash Madhani, Mitesh M. Khapra, Anoop Kunchukuttan
Bhattacharya_Lab at SemEval-2023 Task 12: A Transformer-based Language Model for Sentiment Classification for Low Resource African Languages: Nigerian Pidgin and Yoruba
Nathaniel Hughes, Kevan Baker, Aditya Singh et al.
Bias Beyond English: Counterfactual Tests for Bias in Sentiment Analysis in Four Languages
Seraphina Goldfarb-Tarrant, Adam Lopez, Roi Blanco et al.
BIC: Twitter Bot Detection with Text-Graph Interaction and Semantic Consistency
Zhenyu Lei, Herun Wan, Wenqian Zhang et al.
Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment Analysis
Yue Deng, Wenxuan Zhang, Sinno Jialin Pan et al.
Bidirectional Transformer Reranker for Grammatical Error Correction
Ying Zhang, Hidetaka Kamigaito, Manabu Okumura
BIG-C: a Multimodal Multi-Purpose Dataset for Bemba
Claytone Sikasote, Eunice Mukonde, Md Mahfuz Ibn Alam et al.
BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation
Liyan Kang, Luyang Huang, Ningxin Peng et al.
Bi-level Finetuning with Task-dependent Similarity Structure for Low-resource Training
Sai Ashish Somayajula, Lifeng Jin, Linfeng Song et al.
Billy-Batson at SemEval-2023 Task 5: An Information Condensation based System for Clickbait Spoiling
Anubhav Sharma, Sagar Joshi, Tushar Abhishek et al.
Binary and Ternary Natural Language Generation
Zechun Liu, Barlas Oguz, Aasish Pappu et al.
Biomedical Document Classification with Literature Graph Representations of Bibliographies and Entities
Ryuki Ida, Makoto Miwa, Yutaka Sasaki
Biomedical Language Models are Robust to Sub-optimal Tokenization
Bernal Jimenez Gutierrez, Huan Sun, Yu Su
Biomedical Relation Extraction with Entity Type Markers and Relation-specific Question Answering
Koshi Yamada, Makoto Miwa, Yutaka Sasaki
BioNART: A Biomedical Non-AutoRegressive Transformer for Natural Language Generation
Masaki Asada, Makoto Miwa
BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity Recognition
Vera Pavlova, Mohammed Makhlouf
Bi-Phone: Modeling Inter Language Phonetic Influences in Text
Abhirut Gupta, Ananya B. Sai, Richard Sproat et al.