Kaitao Song
31 papers · 2018–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Conference Polyglot (12) π Academic Marathon (7) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (13)
π
Cross-Pollinator
(13)
π
Renaissance Researcher
(6)
πΊοΈ
Taxonomy Completionist
(55)
π€
Dynamic Duo
(21)
π
Triple Crown
π§¬
Topic Evolution
π
Grand Slam
ποΈ
Keyword Collector
(131)
π
Century Club
(27)
β‘
Prolific Year
(6)
π₯
Unstoppable
(8)
β
The Questioner
π
Trend Setter
Conferences
ACL (7)
NIPS (5)
ICML (4)
AAAI (3)
INTERSPEECH (3)
ICLR (2)
NAACL (2)
COLING (1)
EMNLP (1)
ICCV (1)
IJCAI (1)
IJCNLP (1)
Top co-authors
Keywords
large language model
(6)
autonomous agent
(4)
neural machine translation
(3)
language modeling
(3)
transfer learning
(3)
text generation
(3)
rhythm modeling
(2)
multi-agent system
(2)
rhyme modeling
(2)
error correction
(2)
task planning
(2)
agent system
(2)
convolutional neural network
(2)
rap generation
(2)
representation learning
(2)
automatic speech recognition
(2)
music generation
(2)
speech recognition
(1)
knowledge editing
(1)
model selection
(1)
Papers
Improving Long-Context Summarization with Multi-Granularity Retrieval Optimization
AAAI 2026
Foresight Optimization for Strategic Reasoning in Large Language Models
ACL 2026
UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization
ACL 2026
Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term HumanβAgent Interaction
ACL 2026
EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms
NAACL 2025
EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction
NAACL 2025
PromptTTS 2: Describing and Generating Voices with Text Prompt
ICLR 2024
TaskBench: Benchmarking Large Language Models for Task Automation
NIPS 2024
Can Graph Learning Improve Planning in LLM-based Agents?
NIPS 2024
Improving Large Language Models in Event Relation Logical Prediction
ACL 2024
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
ICLR 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
ICML 2024
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
NIPS 2023
DiffusionNER: Boundary Diffusion for Named Entity Recognition
ACL 2023
MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
EMNLP 2023
Towards Understanding Omission in Dialogue Summarization
ACL 2023
SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
AAAI 2023
End-to-End Word-Level Pronunciation Assessment with MASK Pre-training
INTERSPEECH 2023
CircuitNet: A Generic Neural Network to Realize Universal Circuit Motif Modeling
ICML 2023
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
INTERSPEECH 2022
Analyzing and Mitigating Interference in Neural Architecture Search
ICML 2022
Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling
NIPS 2022
Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech
INTERSPEECH 2022
DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling
IJCNLP 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions
ICCV 2021
DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling
ACL 2021
SongMASS: Automatic Song Writing with Pre-training and Alignment Constraint
AAAI 2021
MPNet: Masked and Permuted Pre-training for Language Understanding
NIPS 2020
Neural Machine Translation with Error Correction
IJCAI 2020
MASS: Masked Sequence to Sequence Pre-training for Language Generation
ICML 2019
Double Path Networks for Sequence to Sequence Learning
COLING 2018