Yu Sun
106 papers · 2008–2026 · 19 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (20)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (5)
Conference Polyglot (19)
Hot Topic Early Bird
Keyword Trendsetter Combo (3)
Dynamic Duo (37)
Triple Crown
Keyword Champion
Grand Slam
Deep Specialist (16)
Topic Evolution
Keyword Collector (64)
Conference Pioneer
Prolific Year (5)
Trend Setter
Century Club (102)
Unstoppable (11)
Conferences
ACL (22)
EMNLP (11)
SEMEVAL (8)
NIPS (7)
ICML (7)
ICLR (7)
CVPR (7)
AAAI (7)
IJCNLP (6)
IJCAI (5)
COLING (4)
NAACL (4)
ICCV (3)
AISTATS (2)
RSS (2)
ECCV (1)
CORL (1)
JMLR (1)
MICCAI (1)
Research topics
Keywords
large language model (12)
pre-trained language model (10)
language model (9)
transfer learning (7)
multimodal learning (7)
knowledge distillation (6)
transformer model (5)
text classification (5)
model compression (5)
test-time training (5)
graph neural network (4)
human pose estimation (4)
pre-trained model (4)
adversarial training (4)
transformer architecture (4)
document understanding (4)
ensemble learning (4)
question answering (4)
image classification (3)
neural network optimization (3)
Papers
Zo3T: Zero-Shot 3D-Aware Trajectory-Guided Image-to-Video Generation via Test-Time Training
AAAI 2026
Uncertainty-Aware Routing for Principled Alignment with MoE Dynamics
ACL 2026
AttnPO: Attention-Guided Process Supervision for Efficient Reasoning
ACL 2026
IPS: In-Prompt Process Supervision for Short Video Content Moderation
ACL 2026
BeamLoRA: Beam-Constraint Low-Rank Adaptation
ACL 2025
Test-Time Training on Video Streams
JMLR 2025
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
ICML 2025
Mixture of Hidden-Dimensions: Not All Hidden-States' Dimensions are Needed in Transformer
ICML 2025
InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences
ICLR 2025
Graph Structure Learning for Spatial-Temporal Imputation: Adapting to Node and Feature Scales
AAAI 2025
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
ICLR 2025
Reasoning-Enhanced Domain-Adaptive Pretraining of Multimodal Large Language Models for Short Video Content Governance
EMNLP 2025
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning
EMNLP 2025
One-Minute Video Generation with Test-Time Training
CVPR 2025
PromptHMR: Promptable Human Mesh Recovery
CVPR 2025
Curiosity-Driven Reinforcement Learning from Human Feedback
ACL 2025
Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking
ACL 2025
CritiQ: Mining Data Quality Criteria from Human Preferences
ACL 2025
Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging
ACL 2025
HFT: Half Fine-Tuning for Large Language Models
ACL 2025
F-Eval: Assessing Fundamental Abilities with Refined Evaluation Methods
ACL 2024
DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion
NIPS 2024
Frequency-aware Generative Models for Multivariate Time Series Imputation
NIPS 2024
Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors
NIPS 2024
Generalizing End-To-End Autonomous Driving In Real-World Environments Using Zero-Shot LLMs
CORL 2024
NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time
ACL 2024
LEMON: Reviving Stronger and Smaller LMs from Larger LMs with Linear Parameter Fusion
ACL 2024
TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation
CVPR 2024
ChatPose: Chatting about 3D Human Pose
CVPR 2024
Autoregressive Pre-Training on Pixels and Texts
EMNLP 2024
On Training Data Influence of GPT Models
EMNLP 2024
LOCR: Location-Guided Transformer for Optical Character Recognition
EMNLP 2024
Tool-Augmented Reward Modeling
ICLR 2024
Test-Time Training on Nearest Neighbors for Large Language Models
ICLR 2024
High-Order Contrastive Learning with Fine-grained Comparative Levels for Sparse Ordinal Tensor Completion
ICML 2024
Cardiac Copilot: Automatic Probe Guidance for Echocardiography with World Model
MICCAI 2024
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
ACL 2023
Unleashing the Power of Gradient Signal-to-Noise Ratio for Zero-Shot NAS
ICCV 2023
UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction
ACL 2023
Instance-wise Batch Label Restoration via Gradients in Federated Learning
ICLR 2023
ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
IJCNLP 2023
End-to-End Pipeline for Trigger Detection on Hit and Track Graphs
AAAI 2023
Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation
AAAI 2023
Retrieval-Augmented Domain Adaptation of Language Models
ACL 2023
ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model With Knowledge-Enhanced Mixture-of-Denoising-Experts
CVPR 2023
CoLLiE: Collaborative Training of Large Language Models in an Efficient Way
EMNLP 2023
TRACE: 5D Temporal Regression of Avatars With Dynamic Cameras in 3D Environments
CVPR 2023
An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition
ACL 2023
Uncertainty-aware Unsupervised Video Hashing
AISTATS 2023
Learning Cross-Video Neural Representations for High-Quality Frame Interpolation
ECCV 2022
X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection
SEMEVAL 2022
X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection
NAACL 2022
Test-Time Training with Masked Autoencoders
NIPS 2022
Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards
EMNLP 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding
EMNLP 2022
Simple and Effective Relation-based Embedding Propagation for Knowledge Representation Learning
IJCAI 2022
X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications
SEMEVAL 2022
X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications
NAACL 2022
Putting People in Their Place: Monocular Regression of 3D People in Depth
CVPR 2022
Alpha at SemEval-2021 Task 6: Transformer Based Propaganda Classification
ACL 2021
Alpha at SemEval-2021 Task 6: Transformer Based Propaganda Classification
SEMEVAL 2021
Latent Reasoning for Low-Resource Question Generation
ACL 2021
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora
EMNLP 2021
CVAE-based Re-anchoring for Implicit Discourse Relation Classification
EMNLP 2021
abcbpc at SemEval-2021 Task 7: ERNIE-based Multi-task Model for Detecting and Rating Humor and Offense
SEMEVAL 2021
Correcting Chinese Spelling Errors with Phonetic Pre-training
ACL 2021
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
ACL 2021
Monocular, One-Stage, Regression of Multiple 3D People
ICCV 2021
Self-Supervised Policy Adaptation during Deployment
ICLR 2021
Async-RED: A Provably Convergent Asynchronous Block Parallel Stochastic Method using Deep Denoising Priors
ICLR 2021
ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs
AAAI 2021
Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classification
IJCAI 2021
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
IJCNLP 2021
Correcting Chinese Spelling Errors with Phonetic Pre-training
IJCNLP 2021
Latent Reasoning for Low-Resource Question Generation
IJCNLP 2021
Alpha at SemEval-2021 Task 6: Transformer Based Propaganda Classification
IJCNLP 2021
abcbpc at SemEval-2021 Task 7: ERNIE-based Multi-task Model for Detecting and Rating Humor and Offense
IJCNLP 2021
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
NAACL 2021
Parallel sentences mining with transfer learning in an unsupervised setting
NAACL 2021
abcbpc at SemEval-2021 Task 7: ERNIE-based Multi-task Model for Detecting and Rating Humor and Offense
ACL 2021
Generalizable and Explainable Dialogue Generation via Explicit Action Learning
EMNLP 2020
Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification Using Pre-trained Language Models
COLING 2020
ERNIE at SemEval-2020 Task 10: Learning Word Emphasis Selection by Pre-trained Language Model
SEMEVAL 2020
PGL at TextGraphs 2020 Shared Task: Explanation Regeneration using Language and Graph Learning Methods
COLING 2020
Kk2018 at SemEval-2020 Task 9: Adversarial Training for Code-Mixing Sentiment Classification
COLING 2020
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
ACL 2020
ERNIE 2.0: A Continual Pre-Training Framework for Language Understanding
AAAI 2020
ERNIE at SemEval-2020 Task 10: Learning Word Emphasis Selection by Pre-trained Language Model
COLING 2020
MALA: Cross-Domain Dialogue Generation with Action Learning
AAAI 2020
A Motion Taxonomy for Manipulation Embedding
RSS 2020
Kk2018 at SemEval-2020 Task 9: Adversarial Training for Code-Mixing Sentiment Classification
SEMEVAL 2020
Test-Time Training with Self-Supervision for Generalization under Distribution Shifts
ICML 2020
Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification Using Pre-trained Language Models
SEMEVAL 2020
ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation
IJCAI 2020
Human Mesh Recovery From Monocular Images via a Skeleton-Disentangled Representation
ICCV 2019
RLTM: An Efficient Neural IR Framework for Long Documents
IJCAI 2019
Doubly Robust Joint Learning for Recommendation on Data Missing Not at Random
ICML 2019
OleNet at SemEval-2019 Task 9: BERT based Multi-Perspective Models for Suggestion Mining
SEMEVAL 2019
Block Coordinate Regularization by Denoising
NIPS 2019
KDGAN: Knowledge Distillation with Generative Adversarial Networks
NIPS 2018
App Download Forecasting: An Evolutionary Hierarchical Competition Approach
IJCAI 2017
On Calibration of Modern Neural Networks
ICML 2017
Supervised Word Mover's Distance
NIPS 2016
Private Causal Inference
AISTATS 2016
From Word Embeddings To Document Distances
ICML 2015
NanoNewton Force Sensing and Control in Microrobotic Cell Manipulation
RSS 2008