Yikang Shen

41 papers · 2017–2025 · 12 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🌈 Renaissance Researcher (8) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (12) 🏃 Academic Marathon (8) 🗺️ Taxonomy Completionist (51)

🐝 Cross-Pollinator (15) 🌍 Conference Polyglot (12) 🏃 Academic Marathon (8) 🤝 Dynamic Duo (16) 👑 Triple Crown 🏆 Grand Slam 🧬 Topic Evolution 🔥 Unstoppable (9) 💎 Century Club (41) 🗃️ Keyword Collector (113) 🚀 Conference Pioneer ⚡ Prolific Year (6)

Conferences

ICLR (12) NIPS (7) ACL (6) EMNLP (4) ICML (4) CVPR (2) AAAI (1) ECCV (1) ICCV (1) IJCAI (1) IJCNLP (1) NAACL (1)

Top co-authors

Chuang Gan (16) Aaron Courville (11) Zhenfang Chen (10) Shawn Tan (7) Alessandro Sordoni (6) Shun Zhang (5) Zhiqing Sun (5) Yiming Yang (4) Yi Tay (4) Zhouhan Lin (4)

Keywords

masked language modeling (4) unsupervised learning (4) constituency parsing (4) large language model (3) attention mechanism (3) dependency parsing (3) language modeling (3) neural network (3) ordered memory (2) reinforcement learning (2) multi-task learning (2) syntactic structure (2) language model (2) syntactic distance (2) mixture of expert (2) multimodal learning (2) tree structure (2) recurrent neural network (2) extractive summarization (1) question answering (1)

Papers

Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study ICLR 2025 API Pack: A Massive Multi-Programming Language Dataset for API Call Generation ICLR 2025 LaMAGIC2: Advanced Circuit Formulations for Language Model-Based Analog Topology Generation ICML 2025 The Consensus Game: Language Model Generation via Equilibrium Search ICLR 2024 Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training NIPS 2024 Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision NIPS 2024 Parallelizing Linear Transformers with the Delta Rule over Sequence Length NIPS 2024 Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning AAAI 2024 Gated Linear Attention Transformers with Hardware-Efficient Training ICML 2024 LaMAGIC: Language-Model-based Topology Generation for Analog Integrated Circuits ICML 2024 SALMON: Self-Alignment with Instructable Reward Models ICLR 2024 Aligning Large Multimodal Models with Factually Augmented RLHF ACL 2024 CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding ICLR 2024 FlexAttention for Efficient High-Resolution Vision-Language Models ECCV 2024 Visual Dependency Transformers: Dependency Tree Emerges From Reversed Attention CVPR 2023 Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision NIPS 2023 Adaptive Online Replanning with Diffusion Models NIPS 2023 Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners CVPR 2023 Sparse Universal Transformer EMNLP 2023 TextPSG: Panoptic Scene Graph Generation from Textual Descriptions ICCV 2023 Hyper-Decision Transformer for Efficient Online Policy Adaptation ICLR 2023 Planning with Large Language Models for Code Generation ICLR 2023 Transformer-Patcher: One Mistake Worth One Neuron ICLR 2023 Prompting Decision Transformer for Few-Shot Policy Generalization ICML 2022 Mixture of Attention Heads: Selecting Attention Heads Per Token EMNLP 2022 Phrase-aware Unsupervised Constituency Parsing ACL 2022 Unsupervised Dependency Graph Network ACL 2022 Self-Instantiated Recurrent Units with Dynamic Soft Recursion NIPS 2021 Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle NAACL 2021 Long Range Arena : A Benchmark for Efficient Transformers ICLR 2021 StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling IJCNLP 2021 Learning Task Decomposition with Ordered Memory Policy Network ICLR 2021 StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling ACL 2021 Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance Approach ACL 2020 Recursive Top-Down Production for Sentence Generation with Latent Trees EMNLP 2020 Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks ICLR 2019 Ordered Memory NIPS 2019 BanditSum: Extractive Summarization as a Contextual Bandit EMNLP 2018 Straight to the Tree: Constituency Parsing with Neural Syntactic Distance ACL 2018 Neural Language Modeling by Jointly Learning Syntax and Lexicon ICLR 2018 Exploration of Tree-based Hierarchical Softmax for Recurrent Language Models IJCAI 2017