Thomas Wang
7 papers · 2022–2023 · 5 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (15)
Interdisciplinary Bridge
Taxonomy Completionist (16)
Keyword Pioneer
Hot Topic Early Bird
Conference Polyglot (5)
Triple Crown
Mega-Team (54)
Keyword Champion (3)
The Questioner (2)
Conferences
EMNLP (2)
NIPS (2)
ACL (1)
ICLR (1)
ICML (1)
Keywords
large language model (5)
zero-shot generalization (3)
multilingual model (2)
multilingual language model (2)
transformer architecture (2)
language model training (1)
multimodal learning (1)
responsible ai (1)
model training (1)
vision language model (1)
model scaling (1)
pretraining corpus (1)
data curation (1)
data filtering (1)
scaling behavior (1)
text corpus (1)
multilingual dataset (1)
multilingual corpus (1)
ablation study (1)
crosslingual transfer (1)
Papers
FinGPT: Large Generative Models for a Small Language
EMNLP 2023
Crosslingual Generalization through Multitask Finetuning
ACL 2023
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
NIPS 2023
What Language Model Architecture and Pretraining Objective Works Best for Zero-Shot Generalization?
ICML 2022
What Language Model to Train if You Have One Million GPU Hours?
EMNLP 2022
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
NIPS 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization
ICLR 2022