Yikang Shen
41 papers · 2017–2025 · 12 top CS/AI conferences
Achievements
Renaissance Researcher (8)
Interdisciplinary Bridge
Taxonomy Completionist (51)
Cross-Pollinator (15)
Conference Polyglot (12)
Academic Marathon (8)
Dynamic Duo (16)
Triple Crown
Grand Slam
Topic Evolution
Unstoppable (9)
Century Club (41)
Keyword Collector (113)
Conference Pioneer
Prolific Year (6)
Conferences
ICLR (12)
NIPS (7)
ACL (6)
EMNLP (4)
ICML (4)
CVPR (2)
AAAI (1)
ECCV (1)
ICCV (1)
IJCAI (1)
IJCNLP (1)
NAACL (1)
Keywords
masked language modeling (4)
unsupervised learning (4)
constituency parsing (4)
large language model (3)
attention mechanism (3)
dependency parsing (3)
language modeling (3)
neural network (3)
ordered memory (2)
reinforcement learning (2)
multi-task learning (2)
syntactic structure (2)
language model (2)
syntactic distance (2)
mixture of expert (2)
multimodal learning (2)
tree structure (2)
recurrent neural network (2)
extractive summarization (1)
question answering (1)
Papers
Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study
ICLR 2025
API Pack: A Massive Multi-Programming Language Dataset for API Call Generation
ICLR 2025
LaMAGIC2: Advanced Circuit Formulations for Language Model-Based Analog Topology Generation
ICML 2025
The Consensus Game: Language Model Generation via Equilibrium Search
ICLR 2024
Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training
NIPS 2024
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
NIPS 2024
Parallelizing Linear Transformers with the Delta Rule over Sequence Length
NIPS 2024
Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning
AAAI 2024
Gated Linear Attention Transformers with Hardware-Efficient Training
ICML 2024
LaMAGIC: Language-Model-based Topology Generation for Analog Integrated Circuits
ICML 2024
SALMON: Self-Alignment with Instructable Reward Models
ICLR 2024
Aligning Large Multimodal Models with Factually Augmented RLHF
ACL 2024
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
ICLR 2024
FlexAttention for Efficient High-Resolution Vision-Language Models
ECCV 2024
Visual Dependency Transformers: Dependency Tree Emerges From Reversed Attention
CVPR 2023
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
NIPS 2023
Adaptive Online Replanning with Diffusion Models
NIPS 2023
Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners
CVPR 2023
Sparse Universal Transformer
EMNLP 2023
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
ICCV 2023
Hyper-Decision Transformer for Efficient Online Policy Adaptation
ICLR 2023
Planning with Large Language Models for Code Generation
ICLR 2023
Transformer-Patcher: One Mistake Worth One Neuron
ICLR 2023
Prompting Decision Transformer for Few-Shot Policy Generalization
ICML 2022
Mixture of Attention Heads: Selecting Attention Heads Per Token
EMNLP 2022
Phrase-aware Unsupervised Constituency Parsing
ACL 2022
Unsupervised Dependency Graph Network
ACL 2022
Self-Instantiated Recurrent Units with Dynamic Soft Recursion
NIPS 2021
Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle
NAACL 2021
Long Range Arena: A Benchmark for Efficient Transformers
ICLR 2021
StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling
IJCNLP 2021
Learning Task Decomposition with Ordered Memory Policy Network
ICLR 2021
StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling
ACL 2021
Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance Approach
ACL 2020
Recursive Top-Down Production for Sentence Generation with Latent Trees
EMNLP 2020
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
ICLR 2019
Ordered Memory
NIPS 2019
BanditSum: Extractive Summarization as a Contextual Bandit
EMNLP 2018
Straight to the Tree: Constituency Parsing with Neural Syntactic Distance
ACL 2018
Neural Language Modeling by Jointly Learning Syntax and Lexicon
ICLR 2018
Exploration of Tree-based Hierarchical Softmax for Recurrent Language Models
IJCAI 2017