Liang Lin
186 papers · 2012–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
π Conference Polyglot (12) π Academic Marathon (14) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (9)
π
Cross-Pollinator
(9)
π§
Keyword Pioneer
π
Academic Marathon
(14)
π
Keyword Trendsetter Combo
(3)
π
Conference Loyalist
(21)
π€
Dynamic Duo
(52)
π¬
Deep Specialist
(26)
π§¬
Topic Evolution
π
Keyword Champion
(3)
π
Grand Slam
β
The Questioner
(2)
ποΈ
Keyword Collector
(713)
β‘
Prolific Year
(21)
π
Century Club
(178)
π₯
Unstoppable
(15)
π
Trend Setter
π
Conference Pioneer
Conferences
CVPR (60)
ICCV (33)
AAAI (24)
ACL (16)
ECCV (11)
IJCAI (11)
NIPS (10)
EMNLP (9)
ICML (5)
ICLR (3)
IJCNLP (3)
WACV (1)
Top co-authors
Keywords
convolutional neural network
(21)
object detection
(17)
representation learning
(14)
semantic segmentation
(11)
knowledge graph
(10)
vision-language model
(9)
graph neural network
(9)
domain adaptation
(9)
contrastive learning
(8)
large language model
(8)
knowledge distillation
(7)
transfer learning
(7)
neural network
(7)
diffusion model
(6)
multimodal learning
(6)
adversarial learning
(6)
unsupervised learning
(5)
semi-supervised learning
(5)
image restoration
(5)
person re-identification
(5)
Papers
Stable Language Guidance for VisionβLanguageβAction Models
ACL 2026
Human-Centric Open-Future Task Discovery: Formulation, Benchmark, and Scalable Tree-Based Search
AAAI 2026
Pre-Trained Video Generative Models as World Simulators
AAAI 2026
Similarity-aware Probabilistic Embeddings Modeling for Video-Text Retrieval
WACV 2026
PAM: Enhancing General Alignment of Large Reasoning Models through Priority-Aware Metacognition
ACL 2026
SEE: Signal Embedding Energy for Quantifying Noise Interference in Large Audio Language Models
ACL 2026
Visually-Guided Policy Optimization for Multimodal Reasoning
ACL 2026
Hidden in the Noise: Unveiling Backdoors in Audio LLMs Alignment Through Latent Acoustic Pattern Triggers
AAAI 2026
Backdoor Collapse: Eliminating Unknown Threats Via Known Backdoor Aggregation In Language Models
ACL 2026
Thinking Before You Speak: A Proactive Test-time Scaling Approach
EMNLP 2025
PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention
CVPR 2025
Boosting the Dual-Stream Architecture in Ultra-High Resolution Segmentation with Resolution-Biased Uncertainty Estimation
CVPR 2025
Are High-Quality AI-Generated Images More Difficult for Models to Detect?
ICML 2025
Language Models as Implicit Tree Search
ICML 2025
Cross-modal Causal Relation Alignment for Video Question Grounding
CVPR 2025
DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh
CVPR 2025
DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering
CVPR 2025
VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction
CVPR 2025
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
CVPR 2025
RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs
EMNLP 2025
No Pains, More Gains: Recycling Sub-Salient Patches for Efficient High-Resolution Image Recognition
CVPR 2025
SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks
AAAI 2025
Monitoring Primitive Interactions During the Training of DNNs
AAAI 2025
Towards Understanding the Robustness of Diffusion-Based Purification: A Stochastic Perspective
ICLR 2025
Can We Achieve Efficient Diffusion Without Self-Attention? Distilling Self-Attention into Convolutions
ICCV 2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
ICCV 2025
Reproducible Vision-Language Models Meet Concepts Out of Pre-Training
CVPR 2025
Cool-Fusion: Fuse Large Language Models without Training
ACL 2025
Chain of Methodologies: Scaling Test Time Computation without Training
ACL 2025
MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language Models
ACL 2025
HyperCRS: Hypergraph-Aware Multi-Grained Preference Learning to Burst Filter Bubbles in Conversational Recommendation System
ACL 2025
IntelliCockpitBench: A Comprehensive Benchmark to Evaluate VLMs for Intelligent Cockpit
ACL 2025
Why Multi-Interest Fairness Matters: Hypergraph Contrastive Multi-Interest Learning for Fair Conversational Recommender System
ACL 2025
DreamFuse: Adaptive Image Fusion with Diffusion Transformer
ICCV 2025
RoboPearls: Editable Video Simulation for Robot Manipulation
ICCV 2025
RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation
ICCV 2025
Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference
ICCV 2025
Sim-DETR: Unlock DETR for Temporal Sentence Grounding
ICCV 2025
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
CVPR 2024
AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis
CVPR 2024
Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation
CVPR 2024
Kepler codebook
ICML 2024
AttNS: Attention-Inspired Numerical Solving For Limited Data Scenarios
ICML 2024
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
AAAI 2024
FacetCRS: Multi-Faceted Preference Learning for Pricking Filter Bubbles in Conversational Recommender System
AAAI 2024
Diagnosing and Rectifying Fake OOD Invariance: A Restructured Causal Approach
AAAI 2024
HyCoRec: Hypergraph-Enhanced Multi-Preference Learning for Alleviating Matthew Effect in Conversational Recommendation
ACL 2024
VisDiaHalBench: A Visual Dialogue Benchmark For Diagnosing Hallucination in Large Vision-Language Models
ACL 2024
Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation
EMNLP 2024
Stripe Observation Guided Inference Cost-free Attention Mechanism
ECCV 2024
WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models
ECCV 2024
MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection
ECCV 2024
Learning Adaptive Spatial Coherent Correlations for Speech-Preserving Facial Expression Manipulation
CVPR 2024
Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos
ICCV 2023
SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training
ICCV 2023
Enhanced Soft Label for Semi-Supervised Semantic Segmentation
ICCV 2023
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
ICCV 2023
Scene Graph to Image Synthesis via Knowledge Consensus
AAAI 2023
ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection
NIPS 2023
HutCRS: Hierarchical User-Interest Tracking for Conversational Recommender System
EMNLP 2023
Masked Images Are Counterfactual Samples for Robust Fine-Tuning
CVPR 2023
Identity-Preserving Talking Face Generation With Landmark and Appearance Priors
CVPR 2023
Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training
CVPR 2023
De-biased Teacher: Rethinking IoU Matching for Semi-supervised Object Detection
AAAI 2023
Adapting Object Size Variance and Class Imbalance for Semi-supervised Object Detection
AAAI 2023
Actional Atomic-Concept Learning for Demystifying Vision-Language Navigation
AAAI 2023
DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback
IJCAI 2023
Long-term Wind Power Forecasting with Hierarchical Spatial-Temporal Transformer
IJCAI 2023
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment
ICCV 2023
Understanding Self-attention Mechanism via Dynamical System Perspective
ICCV 2023
Towards Real-World Burst Image Super-Resolution: Benchmark and Method
ICCV 2023
RankMatch: Fostering Confidence and Consistency in Learning with Noisy Labels
ICCV 2023
A Retrospect to Multi-prompt Learning across Vision and Language
ICCV 2023
UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression
EMNLP 2022
Divide and Contrast: Source-free Domain Adaptation via Adaptive Contrastive Learning
NIPS 2022
Structure-Preserving 3D Garment Modeling with Neural Sewing Machines
NIPS 2022
Structured Semantic Transfer for Multi-Label Recognition with Partial Labels
AAAI 2022
Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels
AAAI 2022
Unsupervised Domain Adaptive Salient Object Detection through Uncertainty-Aware Pseudo-Label Learning
AAAI 2022
Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism
CVPR 2022
Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution
CVPR 2022
Semantic-Aware Auto-Encoders for Self-Supervised Representation Learning
CVPR 2022
Adversarially-Aware Robust Object Detector
ECCV 2022
LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning
EMNLP 2022
Double-Check Soft Teacher for Semi-Supervised Object Detection
IJCAI 2022
Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting
CVPR 2021
Rethinking the Pruning Criteria for Convolutional Neural Network
NIPS 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
EMNLP 2021
Linguistically Routing Capsule Network for Out-of-Distribution Visual Question Answering
ICCV 2021
Trash To Treasure: Harvesting OOD Data With Cross-Modal Matching for Open-Set Semi-Supervised Learning
ICCV 2021
Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift
ICCV 2021
Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video
IJCAI 2021
Solving Inefficiency of Self-Supervised Representation Learning
ICCV 2021
Deductive Learning for Weakly-Supervised 3D Human Pose Estimation via Uncalibrated Cameras
AAAI 2021
Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation
AAAI 2021
Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition
AAAI 2021
Towards Quantifiable Dialogue Coherence Evaluation
IJCNLP 2021
Towards Quantifiable Dialogue Coherence Evaluation
ACL 2021
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks
IJCNLP 2021
GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning
IJCNLP 2021
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks
ACL 2021
GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning
ACL 2021
Transferable, Controllable, and Inconspicuous Adversarial Attacks on Person Re-identification With Deep Mis-Ranking
CVPR 2020
Tree-Structured Policy Based Progressive Reinforcement Learning for Temporally Language Grounding in Video
AAAI 2020
An Adversarial Perturbation Oriented Domain Adaptation Approach for Semantic Segmentation
AAAI 2020
Component Divide-and-Conquer for Real-World Image Super-Resolution
ECCV 2020
Collaborative Training between Region Proposal Localization and Classification for Domain Adaptive Object Detection
ECCV 2020
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation
NIPS 2020
Semantically-Aligned Universal Tree-Structured Solver for Math Word Problems
EMNLP 2020
GRADE: Automatic Graph-Enhanced Coherence Metric for Evaluating Open-Domain Dialogue Systems
EMNLP 2020
Bidirectional Graph Reasoning Network for Panoptic Segmentation
CVPR 2020
Block-Wisely Supervised Neural Architecture Search With Knowledge Distillation
CVPR 2020
Knowledge Graph Transfer Network for Few-Shot Recognition
AAAI 2020
Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition
ICCV 2019
Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation
ICCV 2019
Weakly-Supervised Discovery of Geometry-Aware Representation for 3D Human Pose Estimation
CVPR 2019
Spatially Variant Linear Representation Models for Joint Filtering
CVPR 2019
Blending-Target Domain Adaptation by Adversarial Meta-Adaptation Networks
CVPR 2019
Adaptively Connected Neural Networks
CVPR 2019
Crowd Counting With Deep Structured Scale Integration Network
ICCV 2019
Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid
ICCV 2019
Meta R-CNN: Towards General Solver for Instance-Level Low-Shot Learning
ICCV 2019
FRAME Revisited: An Interpretation View Based on Particle Evolution
AAAI 2019
End-to-End Knowledge-Routed Relational Dialogue System for Automatic Diagnosis
AAAI 2019
Semantic Relationships Guided Representation Learning for Facial Action Unit Recognition
AAAI 2019
SNAS: stochastic neural architecture search
ICLR 2019
Graphonomy: Universal Human Parsing via Graph Transfer Learning
CVPR 2019
Reasoning-RCNN: Unifying Adaptive Global Reasoning Into Large-Scale Object Detection
CVPR 2019
NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning
ICLR 2019
Multivariate-Information Adversarial Ensemble for Scalable Joint Distribution Matching
ICML 2019
Knowledge-Embedded Routing Network for Scene Graph Generation
CVPR 2019
ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis
CVPR 2019
Layout-Graph Reasoning for Fashion Landmark Detection
CVPR 2019
Semi-Supervised Video Salient Object Detection Using Pseudo-Labels
ICCV 2019
Symbolic Graph Reasoning Meets Convolutions
NIPS 2018
Visual Question Reasoning on General Dependency Tree
CVPR 2018
Interpretable Video Captioning via Trajectory Structured Localization
CVPR 2018
LSTM Pose Machines
CVPR 2018
Deep Cocktail Network: Multi-Source Unsupervised Domain Adaptation With Category Shift
CVPR 2018
Flow Guided Recurrent Neural Encoder for Video Salient Object Detection
CVPR 2018
Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning
CVPR 2018
Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains
CVPR 2018
Instance-level Human Parsing via Part Grouping Network
ECCV 2018
Monocular Depth Estimation with Affinity, Vertical Pooling, and Label Enhancement
ECCV 2018
Learning Warped Guidance for Blind Face Restoration
ECCV 2018
Toward Characteristic-Preserving Image-based Virtual Try-On Network
ECCV 2018
Generative Semantic Manipulation with Mask-Contrasting GAN
ECCV 2018
Kalman Normalization: Normalizing Internal Representations Across Network Layers
NIPS 2018
Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection
CVPR 2018
Single View Stereo Matching
CVPR 2018
Hybrid Knowledge Routed Modules for Large-scale Object Detection
NIPS 2018
Knowledge-Embedded Representation Learning for Fine-Grained Image Recognition
IJCAI 2018
Crowd Counting using Deep Recurrent Spatial-Aware Network
IJCAI 2018
DRPose3D: Depth Ranking in 3D Human Pose Estimation
IJCAI 2018
Deep Reasoning with Knowledge Graph for Social Relationship Understanding
IJCAI 2018
Convolutional Memory Blocks for Depth Data Representation Learning
IJCAI 2018
Recurrent 3D Pose Sequence Machines
CVPR 2017
Deep Dual Learning for Semantic Image Segmentation
ICCV 2017
Interpretable Structure-Evolving LSTM
CVPR 2017
Instance-Level Salient Object Segmentation
CVPR 2017
Joint Detection and Identification Feature Learning for Person Search
CVPR 2017
Look Into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing
CVPR 2017
Learning Object Interactions and Descriptions for Semantic Image Segmentation
CVPR 2017
Attention-Aware Face Hallucination via Deep Reinforcement Learning
CVPR 2017
Multi-Label Image Recognition by Recurrently Discovering Attentional Regions
ICCV 2017
Reversible Recursive Instance-Level Object Segmentation
CVPR 2016
Joint Learning of Single-Image and Cross-Image Representations for Person Re-Identification
CVPR 2016
Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection
CVPR 2016
Deep Structured Scene Parsing by Learning With Image Descriptions
CVPR 2016
Semantic Object Parsing With Local-Global Long Short-Term Memory
CVPR 2016
A Stochastic Image Grammar for Fine-Grained 3D Scene Reconstruction
IJCAI 2016
Geometric Scene Parsing with Hierarchical LSTM
IJCAI 2016
Discriminative Learning of Iteration-Wise Priors for Blind Deconvolution
CVPR 2015
SOLD: Sub-Optimal Low-rank Decomposition for Efficient Video Segmentation
CVPR 2015
Human Parsing With Contextualized Convolutional Neural Network
ICCV 2015
Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection
ICCV 2015
Matching-CNN Meets KNN: Quasi-Parametric Human Parsing
CVPR 2015
Deep Joint Task Learning for Generic Object Extraction
NIPS 2014
Clothing Co-Parsing by Joint Image Segmentation and Labeling
CVPR 2014
Correntropy Induced L2 Graph for Robust Subspace Clustering
ICCV 2013
Human Re-identification by Matching Compositional Template with Cluster Sampling
ICCV 2013
Incorporating Structural Alternatives and Sharing into Hierarchy for Multiclass Object Recognition and Detection
CVPR 2013
PISA: Pixelwise Image Saliency by Aggregating Complementary Appearance Contrast Measures with Spatial Priors
CVPR 2013
Robust Region Grouping via Internal Patch Statistics
CVPR 2013
SYM-FISH: A Symmetry-Aware Flip Invariant Sketch Histogram Shape Descriptor
ICCV 2013
Dynamical And-Or Graph Learning for Object Shape Modeling and Detection
NIPS 2012