conftrace_

Thomas Wang

7 papers · 2022–2023 · 5 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+5 more ↓

🐝 Cross-Pollinator (15) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (16) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird

🌍 Conference Polyglot (5) 👑 Triple Crown 👥 Mega-Team (54) 🏆 Keyword Champion (3) ❓ The Questioner (2)

Conferences

EMNLP (2) NIPS (2) ACL (1) ICLR (1) ICML (1)

Top co-authors

Teven Le Scao (5) Stella Biderman (4) Colin Raffel (4) Lintang Sutawika (3) Niklas Muennighoff (3) Victor Sanh (3) M Saiful Bari (3) Zheng Xin Yong (3) Sheng Shen (3) Zaid Alyafeai (3)

Keywords

large language model (5) zero-shot generalization (3) multilingual model (2) multilingual language model (2) transformer architecture (2) language model training (1) multimodal learning (1) responsible ai (1) model training (1) vision language model (1) model scaling (1) pretraining corpus (1) data curation (1) data filtering (1) scaling behavior (1) text corpus (1) multilingual dataset (1) multilingual corpus (1) ablation study (1) crosslingual transfer (1)

Papers

FinGPT: Large Generative Models for a Small Language EMNLP 2023 Crosslingual Generalization through Multitask Finetuning ACL 2023 OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents NIPS 2023 What Language Model Architecture and Pretraining Objective Works Best for Zero-Shot Generalization? ICML 2022 What Language Model to Train if You Have One Million GPU Hours? EMNLP 2022 The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset NIPS 2022 Multitask Prompted Training Enables Zero-Shot Task Generalization ICLR 2022