Yuliang Liu
35 papers · 2017–2025 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Conference Polyglot (10) π Academic Marathon (8) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (11)
π
Cross-Pollinator
(11)
π
Renaissance Researcher
(7)
πΊοΈ
Taxonomy Completionist
(74)
π
Grand Slam
π§¬
Topic Evolution
π€
Dynamic Duo
(17)
π
Conference Pioneer
π
Century Club
(35)
ποΈ
Keyword Collector
(163)
β
The Questioner
β‘
Prolific Year
(8)
Conferences
CVPR (12)
ICCV (6)
NIPS (4)
AAAI (3)
ACL (3)
EMNLP (2)
ICML (2)
ECCV (1)
ICLR (1)
IJCAI (1)
Top co-authors
Keywords
text detection
(6)
text recognition
(6)
multimodal learning
(5)
object detection
(5)
document understanding
(4)
semantic segmentation
(4)
scene text detection
(4)
visual question answering
(3)
image inpainting
(3)
diffusion model
(3)
image restoration
(3)
scene text
(3)
text spotting
(3)
domain generalization
(2)
large multi-modal model
(2)
multimodal large language model
(2)
large multimodal model
(2)
evaluation metric
(2)
benchmark evaluation
(2)
image segmentation
(2)
Papers
Theorem-Validated Reverse Chain-of-Thought Problem Generation for Geometric Reasoning
EMNLP 2025
WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?
EMNLP 2025
Multi-scenario Overlapping Text Segmentation with Depth Awareness
ICCV 2025
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
ICML 2025
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
ACL 2025
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering
ACL 2025
Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid
ICLR 2025
LIRA: Inferring Segmentation in Large Multi-modal Models with Local Interleaved Region Assistance
ICCV 2025
Training-free Geometric Image Editing on Diffusion Models
ICCV 2025
DocThinker: Explainable Multimodal Large Language Models with Rule-based Reinforcement Learning for Document Understanding
ICCV 2025
Towards Comprehensive Lecture Slides Understanding: Large-scale Dataset and Effective Method
ICCV 2025
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
CVPR 2025
Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models
CVPR 2024
Deciphering Oracle Bone Language with Diffusion Models
ACL 2024
MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks
NIPS 2024
Bridging the Gap Between End-to-End and Two-Step Text Spotting
CVPR 2024
OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition
CVPR 2024
AP-Adapter: Improving Generalization of Automatic Prompts on Unseen Text-to-Image Diffusion Models
NIPS 2024
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
ICML 2024
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining
AAAI 2024
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
ICCV 2023
Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution
CVPR 2023
Turning a CLIP Model Into a Scene Text Detector
CVPR 2023
SAPA: Similarity-Aware Point Affiliation for Feature Upsampling
NIPS 2022
MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification
NIPS 2022
SwinTextSpotter: Scene Text Spotting via Better Synergy Between Text Detection and Text Recognition
CVPR 2022
Donβt Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context
ECCV 2022
ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network
CVPR 2020
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
CVPR 2020
Aggregation Cross-Entropy for Sequence Recognition
CVPR 2019
DeRPN: Taking a Further Step toward More General Object Detection
AAAI 2019
EnsNet: Ensconce Text in the Wild
AAAI 2019
Omnidirectional Scene Text Detection with Sequential-free Box Discretization
IJCAI 2019
Tightness-Aware Evaluation Protocol for Scene Text Detection
CVPR 2019
Deep Matching Prior Network: Toward Tighter Multi-Oriented Text Detection
CVPR 2017