LianWen Jin

65 papers · 2017–2026 · 11 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🌍 Conference Polyglot (11) 🏃 Academic Marathon (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (13)

🌈 Renaissance Researcher (8) 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (11) 🤝 Dynamic Duo (17) 🏆 Grand Slam 🔬 Deep Specialist (12) 🧬 Topic Evolution 🏆 Keyword Champion (2) 📈 Trend Setter 🗃️ Keyword Collector (291) 🚀 Conference Pioneer 🔥 Unstoppable (7) 💎 Century Club (60) ⚡ Prolific Year (6)

Conferences

AAAI (17) CVPR (17) ACL (10) IJCAI (6) ICCV (4) NIPS (4) ECCV (2) EMNLP (2) ICLR (1) ICML (1) NAACL (1)

Top co-authors

Yuliang Liu (17) Dezhi Peng (14) Jiapeng Wang (12) Chongyu Liu (11) Kai Ding (11) Yongxin Shi (8) Peirong Zhang (8) Jun Huang (8) Canjie Luo (8) Fengjun Guo (7)

Research topics

Digital Humanities (2) Computer Vision (1)

Keywords

text recognition (10) text detection (7) optical character recognition (7) object detection (6) multimodal large language model (6) image restoration (6) attention mechanism (6) document understanding (5) convolutional neural network (5) diffusion model (4) neural network (4) scene text detection (4) image generation (4) large language model (4) benchmark evaluation (3) representation learning (3) multimodal learning (3) semantic segmentation (3) end-to-end learning (3) multi-task learning (3)

Papers

TextShield-R1: Reinforced Reasoning for Tampered Text Detection AAAI 2026 Draft, Verify, Restore: Self-Refining Historical Inscription Restoration with a Unified MLLM ACL 2026 Frequency Mining Empowered by Text Aggregation: A New Perspective on Document Image Tampering Detection AAAI 2026 PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography AAAI 2026 URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding AAAI 2026 DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding CVPR 2025 Large-Scale Corpus Construction and Retrieval-Augmented Generation for Ancient Chinese Poetry: New Method and Data Insights NAACL 2025 Hallucination-Aware Prompt Optimization for Text-to-Video Synthesis IJCAI 2025 Revisiting Tampered Scene Text Detection in the Era of Generative AI AAAI 2025 Predicting the Original Appearance of Damaged Historical Documents AAAI 2025 DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming AAAI 2025 MCS-Bench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in Chinese Classical Studies ACL 2025 Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration ACL 2025 RedundancyLens: Revealing and Exploiting Visual Token Processing Redundancy for Efficient Decoder-Only MLLMs ACL 2025 Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid ICLR 2025 CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy ICCV 2025 TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models EMNLP 2024 UPOCR: Towards Unified Pixel-Level OCR Interface ICML 2024 DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks CVPR 2024 WenMind: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Classical Literature and Language Arts NIPS 2024 Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods CVPR 2024 Bridging the Gap Between End-to-End and Two-Step Text Spotting CVPR 2024 Deciphering Oracle Bone Language with Diffusion Models ACL 2024 DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation ACL 2024 ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining AAAI 2024 DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations AAAI 2024 FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning AAAI 2024 M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis AAAI 2024 PPTSER: A Plug-and-Play Tag-guided Method for Few-shot Semantic Entity Recognition on Visually-rich Documents ACL 2024 VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models EMNLP 2024 Revisiting Scene Text Recognition: A Data Perspective ICCV 2023 M5HisDoc: A Large-scale Multi-style Chinese Historical Document Analysis Benchmark NIPS 2023 CocaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval ACL 2023 Rapid Diffusion: Building Domain-Specific Text-to-Image Synthesizers with Fast Inference Speed ACL 2023 Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution CVPR 2023 M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis CVPR 2023 Scale-Aware Modulation Meet Transformer ICCV 2023 ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer ICCV 2023 SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization CVPR 2022 Look Closer To Supervise Better: One-Shot Font Generation via Component-Based Discriminator CVPR 2022 Don’t Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context ECCV 2022 SwinTextSpotter: Scene Text Spotting via Better Synergy Between Text Detection and Text Recognition CVPR 2022 MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification NIPS 2022 LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding ACL 2022 Fourier Contour Embedding for Arbitrary-Shaped Text Detection CVPR 2021 MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction IJCAI 2021 Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences IJCAI 2021 Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution AAAI 2021 Implicit Feature Alignment: Learn To Convert Text Recognizer to Text Spotter CVPR 2021 On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering CVPR 2020 ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network CVPR 2020 Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition CVPR 2020 SynSig2Vec: Learning Representations from Synthetic Dynamic Signatures for Real-World Verification AAAI 2020 Decoupled Attention Network for Text Recognition AAAI 2020 RD-GAN: Few/Zero-Shot Chinese Character Style Transfer via Radical Decomposition and Rendering ECCV 2020 Skeleton-Based Gesture Recognition Using Several Fully Connected Layers with Path Signature Features and Temporal Transformer Module AAAI 2019 Tightness-Aware Evaluation Protocol for Scene Text Detection CVPR 2019 EnsNet: Ensconce Text in the Wild AAAI 2019 Adaptive GNN for Image Analysis and Editing NIPS 2019 Aggregation Cross-Entropy for Sequence Recognition CVPR 2019 Attribute-Aware Convolutional Neural Networks for Facial Beauty Prediction IJCAI 2019 Omnidirectional Scene Text Detection with Sequential-free Box Discretization IJCAI 2019 DeRPN: Taking a Further Step toward More General Object Detection AAAI 2019 Deep Matching Prior Network: Toward Tighter Multi-Oriented Text Detection CVPR 2017 Multi-Task Deep Reinforcement Learning for Continuous Action Control IJCAI 2017