Tianyu Yang
33 papers · 2018–2026 · 7 top CS/AI conferences
Achievements
🏃 Academic Marathon (7)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🌍 Conference Polyglot (7)
🐝 Cross-Pollinator (14)
🐣 Hot Topic Early Bird
🧬 Topic Evolution
🌱 Topic Pioneer
🔥 Unstoppable (6)
🚀 Conference Pioneer
💎 Century Club (30)
⚡ Prolific Year (5)
🗃️ Keyword Collector (128)
Conferences
ACL (8)
CVPR (8)
ECCV (5)
EMNLP (5)
ICLR (4)
ICCV (2)
AAAI (1)
Research topics
Keywords
large language model (5)
multimodal learning (4)
contrastive learning (4)
video object segmentation (3)
video understanding (3)
mathematical reasoning (2)
reinforcement learning (2)
object tracking (2)
temporal action localization (2)
multimodal large language model (2)
data collection (2)
machine unlearning (2)
feature matching (2)
self-supervised learning (2)
variational autoencoder (2)
video representation learning (2)
vision transformer (2)
semantic segmentation (1)
video segmentation (1)
feature extraction (1)
Papers
ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents
ACL 2026
ReEx-SQL: Reasoning with Execution-Aware Reinforcement Learning for Text-to-SQL
ACL 2026
A Survey of Multimodal Mathematical Reasoning: From Perception, Alignment to Reasoning
ACL 2026
CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP
ACL 2025
Physics: Benchmarking Foundation Models on University-Level Physics Problem Solving
ACL 2025
Beyond Single-Value Metrics: Evaluating and Enhancing LLM Unlearning with Cognitive Diagnosis
ACL 2025
Self-Improvement in Multimodal Large Language Models: A Survey
EMNLP 2025
StableDepth: Scene-Consistent and Scale-Invariant Monocular Depth
ICCV 2025
Quest2DataAgent: Automating End-to-End Scientific Data Collection
EMNLP 2025
Robust Utility-Preserving Text Anonymization Based on Large Language Models
ACL 2025
Compress3D: a Compressed Latent Space for 3D Generation from a Single Image
ECCV 2024
SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark
ACL 2024
A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
CVPR 2024
AddMe: Zero-shot Group-photo Synthesis by Inserting People into Scenes
ECCV 2024
OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models
ECCV 2024
SaSR-Net: Source-Aware Semantic Representation Network for Enhancing Audio-Visual Question Answering
EMNLP 2024
TOSS: High-quality Text-guided Novel View Synthesis from a Single Image
ICLR 2024
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
ICLR 2024
Symbol as Points: Panoptic Symbol Spotting via Point-based Representation
ICLR 2024
GPAvatar: Generalizable and Precise Head Avatar from Image(s)
ICLR 2024
Dior-CVAE: Pre-trained Language Models and Diffusion Priors for Variational Dialog Generation
EMNLP 2023
Scalable Video Object Segmentation with Simplified Framework
ICCV 2023
DropMAE: Masked Autoencoders With Spatial-Attention Dropout for Tracking Tasks
CVPR 2023
Learning Deep Hierarchical Features with Spatial Regularization for One-Class Facial Expression Recognition
AAAI 2023
UniMath: A Foundational and Multimodal Mathematical Reasoner
EMNLP 2023
Exploring Denoised Cross-Video Contrast for Weakly-Supervised Temporal Action Localization
CVPR 2022
SWEM: Towards Real-Time Video Object Segmentation With Sequential Weighted Expectation-Maximization
CVPR 2022
Motion-Aware Contrastive Video Representation Learning via Foreground-Background Merging
CVPR 2022
LocVTP: Video-Text Pre-training for Temporal Localization
ECCV 2022
Unsupervised Pre-Training for Temporal Action Localization Tasks
CVPR 2022
VideoMoCo: Contrastive Video Representation Learning With Temporally Adversarial Examples
CVPR 2021
ROAM: Recurrently Optimizing Tracking Model
CVPR 2020
Learning Dynamic Memory Networks for Object Tracking
ECCV 2018