Yu Sun

106 papers · 2008–2026 · 19 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🗺️ Taxonomy Completionist (20) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (19)

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (20) 🐣 Hot Topic Early Bird 🌟 Keyword Trendsetter Combo (3) 🤝 Dynamic Duo (37) 👑 Triple Crown 🏆 Keyword Champion 🏆 Grand Slam 🔬 Deep Specialist (16) 🧬 Topic Evolution 🗃️ Keyword Collector (64) 🚀 Conference Pioneer ⚡ Prolific Year (5) 📈 Trend Setter 💎 Century Club (102) 🔥 Unstoppable (11)

Conferences

ACL (22) EMNLP (11) SEMEVAL (8) NIPS (7) ICML (7) ICLR (7) CVPR (7) AAAI (7) IJCNLP (6) IJCAI (5) COLING (4) NAACL (4) ICCV (3) AISTATS (2) RSS (2) ECCV (1) CORL (1) JMLR (1) MICCAI (1)

Top co-authors

Shuohuan Wang (38) Hua Wu (28) Haifeng Wang (17) Shikun Feng (16) Jiaxiang Liu (15) Hao Tian (13) Zhenyu Zhang (11) Yekun Chai (10) Xuan Ouyang (10) Junyuan Shang (10)

Research topics

Differential Privacy (1)

Keywords

large language model (12) pre-trained language model (10) language model (9) transfer learning (7) multimodal learning (7) knowledge distillation (6) transformer model (5) text classification (5) model compression (5) test-time training (5) graph neural network (4) human pose estimation (4) pre-trained model (4) adversarial training (4) transformer architecture (4) document understanding (4) ensemble learning (4) question answering (4) image classification (3) neural network optimization (3)

Papers

Zo3T: Zero-Shot 3D-Aware Trajectory-Guided Image-to-Video Generation via Test-Time Training AAAI 2026 Uncertainty-Aware Routing for Principled Alignment with MoE Dynamics ACL 2026 AttnPO: Attention-Guided Process Supervision for Efficient Reasoning ACL 2026 IPS: In-Prompt Process Supervision for Short Video Content Moderation ACL 2026 BeamLoRA: Beam-Constraint Low-Rank Adaptation ACL 2025 Test-Time Training on Video Streams JMLR 2025 Learning to (Learn at Test Time): RNNs with Expressive Hidden States ICML 2025 Mixture of Hidden-Dimensions: Not All Hidden-States’ Dimensions are Needed in Transformer ICML 2025 InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences ICLR 2025 Graph Structure Learning for Spatial-Temporal Imputation: Adapting to Node and Feature Scales AAAI 2025 MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions ICLR 2025 Reasoning-Enhanced Domain-Adaptive Pretraining of Multimodal Large Language Models for Short Video Content Governance EMNLP 2025 ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning EMNLP 2025 One-Minute Video Generation with Test-Time Training CVPR 2025 PromptHMR: Promptable Human Mesh Recovery CVPR 2025 Curiosity-Driven Reinforcement Learning from Human Feedback ACL 2025 Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking ACL 2025 CritiQ: Mining Data Quality Criteria from Human Preferences ACL 2025 Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging ACL 2025 HFT: Half Fine-Tuning for Large Language Models ACL 2025 F-Eval: Asssessing Fundamental Abilities with Refined Evaluation Methods ACL 2024 DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion NIPS 2024 Frequency-aware Generative Models for Multivariate Time Series Imputation NIPS 2024 Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors NIPS 2024 Generalizing End-To-End Autonomous Driving In Real-World Environments Using Zero-Shot LLMs CORL 2024 NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time ACL 2024 LEMON: Reviving Stronger and Smaller LMs from Larger LMs with Linear Parameter Fusion ACL 2024 TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation CVPR 2024 ChatPose: Chatting about 3D Human Pose CVPR 2024 Autoregressive Pre-Training on Pixels and Texts EMNLP 2024 On Training Data Influence of GPT Models EMNLP 2024 LOCR: Location-Guided Transformer for Optical Character Recognition EMNLP 2024 Tool-Augmented Reward Modeling ICLR 2024 Test-Time Training on Nearest Neighbors for Large Language Models ICLR 2024 High-Order Contrastive Learning with Fine-grained Comparative Levels for Sparse Ordinal Tensor Completion ICML 2024 Cardiac Copilot: Automatic Probe Guidance for Echocardiography with World Model MICCAI 2024 ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages ACL 2023 Unleashing the Power of Gradient Signal-to-Noise Ratio for Zero-Shot NAS ICCV 2023 UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction ACL 2023 Instance-wise Batch Label Restoration via Gradients in Federated Learning ICLR 2023 ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models IJCNLP 2023 End-to-End Pipeline for Trigger Detection on Hit and Track Graphs AAAI 2023 Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation AAAI 2023 Retrieval-Augmented Domain Adaptation of Language Models ACL 2023 ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model With Knowledge-Enhanced Mixture-of-Denoising-Experts CVPR 2023 CoLLiE: Collaborative Training of Large Language Models in an Efficient Way EMNLP 2023 TRACE: 5D Temporal Regression of Avatars With Dynamic Cameras in 3D Environments CVPR 2023 An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition ACL 2023 Uncertainty-aware Unsupervised Video Hashing AISTATS 2023 Learning Cross-Video Neural Representations for High-Quality Frame Interpolation ECCV 2022 X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection SEMEVAL 2022 X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection NAACL 2022 Test-Time Training with Masked Autoencoders NIPS 2022 Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards EMNLP 2022 ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding EMNLP 2022 Simple and Effective Relation-based Embedding Propagation for Knowledge Representation Learning IJCAI 2022 X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications SEMEVAL 2022 X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications NAACL 2022 Putting People in Their Place: Monocular Regression of 3D People in Depth CVPR 2022 Alpha at SemEval-2021 Task 6: Transformer Based Propaganda Classification ACL 2021 Alpha at SemEval-2021 Task 6: Transformer Based Propaganda Classification SEMEVAL 2021 Latent Reasoning for Low-Resource Question Generation ACL 2021 ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora EMNLP 2021 CVAE-based Re-anchoring for Implicit Discourse Relation Classification EMNLP 2021 abcbpc at SemEval-2021 Task 7: ERNIE-based Multi-task Model for Detecting and Rating Humor and Offense SEMEVAL 2021 Correcting Chinese Spelling Errors with Phonetic Pre-training ACL 2021 ERNIE-Doc: A Retrospective Long-Document Modeling Transformer ACL 2021 Monocular, One-Stage, Regression of Multiple 3D People ICCV 2021 Self-Supervised Policy Adaptation during Deployment ICLR 2021 Async-RED: A Provably Convergent Asynchronous Block Parallel Stochastic Method using Deep Denoising Priors ICLR 2021 ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs AAAI 2021 Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classification IJCAI 2021 ERNIE-Doc: A Retrospective Long-Document Modeling Transformer IJCNLP 2021 Correcting Chinese Spelling Errors with Phonetic Pre-training IJCNLP 2021 Latent Reasoning for Low-Resource Question Generation IJCNLP 2021 Alpha at SemEval-2021 Task 6: Transformer Based Propaganda Classification IJCNLP 2021 abcbpc at SemEval-2021 Task 7: ERNIE-based Multi-task Model for Detecting and Rating Humor and Offense IJCNLP 2021 ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding NAACL 2021 Parallel sentences mining with transfer learning in an unsupervised setting NAACL 2021 abcbpc at SemEval-2021 Task 7: ERNIE-based Multi-task Model for Detecting and Rating Humor and Offense ACL 2021 Generalizable and Explainable Dialogue Generation via Explicit Action Learning EMNLP 2020 Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification Using Pre-trained Language Models COLING 2020 ERNIE at SemEval-2020 Task 10: Learning Word Emphasis Selection by Pre-trained Language Model SEMEVAL 2020 PGL at TextGraphs 2020 Shared Task: Explanation Regeneration using Language and Graph Learning Methods COLING 2020 Kk2018 at SemEval-2020 Task 9: Adversarial Training for Code-Mixing Sentiment Classification COLING 2020 Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation ACL 2020 ERNIE 2.0: A Continual Pre-Training Framework for Language Understanding AAAI 2020 ERNIE at SemEval-2020 Task 10: Learning Word Emphasis Selection by Pre-trained Language Model COLING 2020 MALA: Cross-Domain Dialogue Generation with Action Learning AAAI 2020 A Motion Taxonomy for Manipulation Embedding RSS 2020 Kk2018 at SemEval-2020 Task 9: Adversarial Training for Code-Mixing Sentiment Classification SEMEVAL 2020 Test-Time Training with Self-Supervision for Generalization under Distribution Shifts ICML 2020 Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification Using Pre-trained Language Models SEMEVAL 2020 ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation IJCAI 2020 Human Mesh Recovery From Monocular Images via a Skeleton-Disentangled Representation ICCV 2019 RLTM: An Efficient Neural IR Framework for Long Documents IJCAI 2019 Doubly Robust Joint Learning for Recommendation on Data Missing Not at Random ICML 2019 OleNet at SemEval-2019 Task 9: BERT based Multi-Perspective Models for Suggestion Mining SEMEVAL 2019 Block Coordinate Regularization by Denoising NIPS 2019 KDGAN: Knowledge Distillation with Generative Adversarial Networks NIPS 2018 App Download Forecasting: An Evolutionary Hierarchical Competition Approach IJCAI 2017 On Calibration of Modern Neural Networks ICML 2017 Supervised Word Mover's Distance NIPS 2016 Private Causal Inference AISTATS 2016 From Word Embeddings To Document Distances ICML 2015 NanoNewton Force Sensing and Control in Microrobotic Cell Manipulation RSS 2008