Zihang Dai

23 papers · 2016–2022 · 7 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (6)

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (47) 🌉 Interdisciplinary Bridge 🧬 Topic Evolution 🏆 Grand Slam 🌱 Topic Pioneer 🗃️ Keyword Collector (86) 🚀 Conference Pioneer 💎 Century Club (23) 🔥 Unstoppable (7) 📈 Trend Setter ⚡ Prolific Year (5)

Conferences

NIPS (10) ACL (4) ICLR (3) CVPR (2) EMNLP (2) AAAI (1) ICML (1)

Top co-authors

Qizhe Xie (7) Quoc V Le (6) Eduard Hovy (6) Yiming Yang (4) Ruslan Salakhutdinov (4) Hanxiao Liu (4) Zhilin Yang (4) Jaime Carbonell (3) Quoc V. Le (3) Guokun Lai (3)

Keywords

transformer architecture (5) language model (4) language modeling (4) image classification (3) efficient computing (3) semi-supervised learning (3) sequence modeling (2) text classification (2) data augmentation (2) sparse attention (2) neural network (2) knowledge distillation (2) vision transformer (2) machine translation (1) domain adaptation (1) image captioning (1) transfer learning (1) text generation (1) self-attention mechanism (1) neural machine translation (1)

Papers

Transformer Quality in Linear Time ICML 2022 SimVLM: Simple Visual Language Model Pretraining with Weak Supervision ICLR 2022 Combiner: Full Attention Transformer with Sparse Computation Cost NIPS 2021 Meta Pseudo Labels CVPR 2021 CoAtNet: Marrying Convolution and Attention for All Data Sizes NIPS 2021 Searching for Efficient Transformers for Language Modeling NIPS 2021 Pay Attention to MLPs NIPS 2021 Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing NIPS 2020 Unsupervised Data Augmentation for Consistency Training NIPS 2020 A Mutual Information Maximization Perspective of Language Representation Learning ICLR 2020 Transformer-XL: Attentive Language Models beyond a Fixed-Length Context ACL 2019 XLNet: Generalized Autoregressive Pretraining for Language Understanding NIPS 2019 Fast and Simple Mixture of Softmaxes with BPE and Hybrid-LightRNN for Language Generation AAAI 2019 Re-examination of the Role of Latent Variables in Sequence Modeling NIPS 2019 Characterizing and Avoiding Negative Transfer CVPR 2019 Large-scale Cloze Test Dataset Created by Teachers EMNLP 2018 From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction ACL 2018 SwitchOut: an Efficient Data Augmentation Algorithm for Neural Machine Translation EMNLP 2018 Breaking the Softmax Bottleneck: A High-Rank RNN Language Model ICLR 2018 Good Semi-supervised Learning That Requires a Bad GAN NIPS 2017 Controllable Invariance through Adversarial Feature Learning NIPS 2017 An Interpretable Knowledge Transfer Model for Knowledge Base Completion ACL 2017 CFO: Conditional Focused Neural Question Answering with Large-scale Knowledge Bases ACL 2016