Yuliang Liu

35 papers · 2017–2025 · 10 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🌍 Conference Polyglot (10) 🏃 Academic Marathon (8) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (11)

🐝 Cross-Pollinator (11) 🌈 Renaissance Researcher (7) 🗺️ Taxonomy Completionist (74) 🏆 Grand Slam 🧬 Topic Evolution 🤝 Dynamic Duo (17) 🚀 Conference Pioneer 💎 Century Club (35) 🗃️ Keyword Collector (163) ❓ The Questioner ⚡ Prolific Year (8)

Conferences

CVPR (12) ICCV (6) NIPS (4) AAAI (3) ACL (3) EMNLP (2) ICML (2) ECCV (1) ICLR (1) IJCAI (1)

Top co-authors

LianWen Jin (17) Xiang Bai (17) Mingxin Huang (4) Lele Xie (4) Chongyu Liu (4) Canjie Luo (3) Dezhi Peng (3) Can Huang (3) Wenwen Yu (3) dingkang liang (3)

Keywords

text detection (6) text recognition (6) multimodal learning (5) object detection (5) document understanding (4) semantic segmentation (4) scene text detection (4) visual question answering (3) image inpainting (3) diffusion model (3) image restoration (3) scene text (3) text spotting (3) domain generalization (2) large multi-modal model (2) multimodal large language model (2) large multimodal model (2) evaluation metric (2) benchmark evaluation (2) image segmentation (2)

Papers

Theorem-Validated Reverse Chain-of-Thought Problem Generation for Geometric Reasoning EMNLP 2025 WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild? EMNLP 2025 Multi-scenario Overlapping Text Segmentation with Depth Awareness ICCV 2025 AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence ICML 2025 LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models ACL 2025 MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering ACL 2025 Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid ICLR 2025 LIRA: Inferring Segmentation in Large Multi-modal Models with Local Interleaved Region Assistance ICCV 2025 Training-free Geometric Image Editing on Diffusion Models ICCV 2025 DocThinker: Explainable Multimodal Large Language Models with Rule-based Reinforcement Learning for Document Understanding ICCV 2025 Towards Comprehensive Lecture Slides Understanding: Large-scale Dataset and Effective Method ICCV 2025 SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting CVPR 2025 Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models CVPR 2024 Deciphering Oracle Bone Language with Diffusion Models ACL 2024 MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks NIPS 2024 Bridging the Gap Between End-to-End and Two-Step Text Spotting CVPR 2024 OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition CVPR 2024 AP-Adapter: Improving Generalization of Automatic Prompts on Unseen Text-to-Image Diffusion Models NIPS 2024 Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization ICML 2024 ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining AAAI 2024 ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer ICCV 2023 Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution CVPR 2023 Turning a CLIP Model Into a Scene Text Detector CVPR 2023 SAPA: Similarity-Aware Point Affiliation for Feature Upsampling NIPS 2022 MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification NIPS 2022 SwinTextSpotter: Scene Text Spotting via Better Synergy Between Text Detection and Text Recognition CVPR 2022 Don’t Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context ECCV 2022 ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network CVPR 2020 On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering CVPR 2020 Aggregation Cross-Entropy for Sequence Recognition CVPR 2019 DeRPN: Taking a Further Step toward More General Object Detection AAAI 2019 EnsNet: Ensconce Text in the Wild AAAI 2019 Omnidirectional Scene Text Detection with Sequential-free Box Discretization IJCAI 2019 Tightness-Aware Evaluation Protocol for Scene Text Detection CVPR 2019 Deep Matching Prior Network: Toward Tighter Multi-Oriented Text Detection CVPR 2017