Zihang Dai
23 papers · 2016–2022 · 7 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (7)
Hot Topic Early Bird
Academic Marathon (6)
Keyword Pioneer
Taxonomy Completionist (47)
Interdisciplinary Bridge
Topic Evolution
Grand Slam
Topic Pioneer
Keyword Collector (86)
Conference Pioneer
Century Club (23)
Unstoppable (7)
Trend Setter
Prolific Year (5)
Conferences
NIPS (10)
ACL (4)
ICLR (3)
CVPR (2)
EMNLP (2)
AAAI (1)
ICML (1)
Keywords
transformer architecture (5)
language model (4)
language modeling (4)
image classification (3)
efficient computing (3)
semi-supervised learning (3)
sequence modeling (2)
text classification (2)
data augmentation (2)
sparse attention (2)
neural network (2)
knowledge distillation (2)
vision transformer (2)
machine translation (1)
domain adaptation (1)
image captioning (1)
transfer learning (1)
text generation (1)
self-attention mechanism (1)
neural machine translation (1)
Papers
Transformer Quality in Linear Time
ICML 2022
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
ICLR 2022
Combiner: Full Attention Transformer with Sparse Computation Cost
NIPS 2021
Meta Pseudo Labels
CVPR 2021
CoAtNet: Marrying Convolution and Attention for All Data Sizes
NIPS 2021
Searching for Efficient Transformers for Language Modeling
NIPS 2021
Pay Attention to MLPs
NIPS 2021
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
NIPS 2020
Unsupervised Data Augmentation for Consistency Training
NIPS 2020
A Mutual Information Maximization Perspective of Language Representation Learning
ICLR 2020
Transformer-XL: Attentive Language Models beyond a Fixed-Length Context
ACL 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
NIPS 2019
Fast and Simple Mixture of Softmaxes with BPE and Hybrid-LightRNN for Language Generation
AAAI 2019
Re-examination of the Role of Latent Variables in Sequence Modeling
NIPS 2019
Characterizing and Avoiding Negative Transfer
CVPR 2019
Large-scale Cloze Test Dataset Created by Teachers
EMNLP 2018
From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction
ACL 2018
SwitchOut: an Efficient Data Augmentation Algorithm for Neural Machine Translation
EMNLP 2018
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model
ICLR 2018
Good Semi-supervised Learning That Requires a Bad GAN
NIPS 2017
Controllable Invariance through Adversarial Feature Learning
NIPS 2017
An Interpretable Knowledge Transfer Model for Knowledge Base Completion
ACL 2017
CFO: Conditional Focused Neural Question Answering with Large-scale Knowledge Bases
ACL 2016