LianWen Jin
65 papers · 2017–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Conference Polyglot (11) π Academic Marathon (8) π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (13)
π
Renaissance Researcher
(8)
π£
Hot Topic Early Bird
π
Conference Polyglot
(11)
π€
Dynamic Duo
(17)
π
Grand Slam
π¬
Deep Specialist
(12)
π§¬
Topic Evolution
π
Keyword Champion
(2)
π
Trend Setter
ποΈ
Keyword Collector
(291)
π
Conference Pioneer
π₯
Unstoppable
(7)
π
Century Club
(60)
β‘
Prolific Year
(6)
Conferences
AAAI (17)
CVPR (17)
ACL (10)
IJCAI (6)
ICCV (4)
NIPS (4)
ECCV (2)
EMNLP (2)
ICLR (1)
ICML (1)
NAACL (1)
Top co-authors
Research topics
Keywords
text recognition
(10)
text detection
(7)
optical character recognition
(7)
object detection
(6)
multimodal large language model
(6)
image restoration
(6)
attention mechanism
(6)
document understanding
(5)
convolutional neural network
(5)
diffusion model
(4)
neural network
(4)
scene text detection
(4)
image generation
(4)
large language model
(4)
benchmark evaluation
(3)
representation learning
(3)
multimodal learning
(3)
semantic segmentation
(3)
end-to-end learning
(3)
multi-task learning
(3)
Papers
TextShield-R1: Reinforced Reasoning for Tampered Text Detection
AAAI 2026
Draft, Verify, Restore: Self-Refining Historical Inscription Restoration with a Unified MLLM
ACL 2026
Frequency Mining Empowered by Text Aggregation: A New Perspective on Document Image Tampering Detection
AAAI 2026
PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography
AAAI 2026
URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
AAAI 2026
DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
CVPR 2025
Large-Scale Corpus Construction and Retrieval-Augmented Generation for Ancient Chinese Poetry: New Method and Data Insights
NAACL 2025
Hallucination-Aware Prompt Optimization for Text-to-Video Synthesis
IJCAI 2025
Revisiting Tampered Scene Text Detection in the Era of Generative AI
AAAI 2025
Predicting the Original Appearance of Damaged Historical Documents
AAAI 2025
DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming
AAAI 2025
MCS-Bench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in Chinese Classical Studies
ACL 2025
Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration
ACL 2025
RedundancyLens: Revealing and Exploiting Visual Token Processing Redundancy for Efficient Decoder-Only MLLMs
ACL 2025
Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid
ICLR 2025
CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy
ICCV 2025
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models
EMNLP 2024
UPOCR: Towards Unified Pixel-Level OCR Interface
ICML 2024
DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
CVPR 2024
WenMind: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Classical Literature and Language Arts
NIPS 2024
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods
CVPR 2024
Bridging the Gap Between End-to-End and Two-Step Text Spotting
CVPR 2024
Deciphering Oracle Bone Language with Diffusion Models
ACL 2024
DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation
ACL 2024
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining
AAAI 2024
DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations
AAAI 2024
FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
AAAI 2024
M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis
AAAI 2024
PPTSER: A Plug-and-Play Tag-guided Method for Few-shot Semantic Entity Recognition on Visually-rich Documents
ACL 2024
VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models
EMNLP 2024
Revisiting Scene Text Recognition: A Data Perspective
ICCV 2023
M5HisDoc: A Large-scale Multi-style Chinese Historical Document Analysis Benchmark
NIPS 2023
CocaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval
ACL 2023
Rapid Diffusion: Building Domain-Specific Text-to-Image Synthesizers with Fast Inference Speed
ACL 2023
Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution
CVPR 2023
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis
CVPR 2023
Scale-Aware Modulation Meet Transformer
ICCV 2023
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
ICCV 2023
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization
CVPR 2022
Look Closer To Supervise Better: One-Shot Font Generation via Component-Based Discriminator
CVPR 2022
Donβt Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context
ECCV 2022
SwinTextSpotter: Scene Text Spotting via Better Synergy Between Text Detection and Text Recognition
CVPR 2022
MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification
NIPS 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding
ACL 2022
Fourier Contour Embedding for Arbitrary-Shaped Text Detection
CVPR 2021
MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
IJCAI 2021
Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
IJCAI 2021
Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
AAAI 2021
Implicit Feature Alignment: Learn To Convert Text Recognizer to Text Spotter
CVPR 2021
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
CVPR 2020
ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network
CVPR 2020
Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition
CVPR 2020
SynSig2Vec: Learning Representations from Synthetic Dynamic Signatures for Real-World Verification
AAAI 2020
Decoupled Attention Network for Text Recognition
AAAI 2020
RD-GAN: Few/Zero-Shot Chinese Character Style Transfer via Radical Decomposition and Rendering
ECCV 2020
Skeleton-Based Gesture Recognition Using Several Fully Connected Layers with Path Signature Features and Temporal Transformer Module
AAAI 2019
Tightness-Aware Evaluation Protocol for Scene Text Detection
CVPR 2019
EnsNet: Ensconce Text in the Wild
AAAI 2019
Adaptive GNN for Image Analysis and Editing
NIPS 2019
Aggregation Cross-Entropy for Sequence Recognition
CVPR 2019
Attribute-Aware Convolutional Neural Networks for Facial Beauty Prediction
IJCAI 2019
Omnidirectional Scene Text Detection with Sequential-free Box Discretization
IJCAI 2019
DeRPN: Taking a Further Step toward More General Object Detection
AAAI 2019
Deep Matching Prior Network: Toward Tighter Multi-Oriented Text Detection
CVPR 2017
Multi-Task Deep Reinforcement Learning for Continuous Action Control
IJCAI 2017