Liang Lin

186 papers · 2012–2026 · 12 top CS/AI conferences

Achievements

🌍 Conference Polyglot (12) 🏃 Academic Marathon (14) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (9) 🌟 Keyword Trendsetter Combo (3) 🏠 Conference Loyalist (21) 🤝 Dynamic Duo (52) 🔬 Deep Specialist (26) 🧬 Topic Evolution 🏆 Keyword Champion (3) 🏆 Grand Slam ❓ The Questioner (2) 🗃️ Keyword Collector (713) ⚡ Prolific Year (21) 💎 Century Club (178) 🔥 Unstoppable (15) 📈 Trend Setter 🚀 Conference Pioneer

Conferences

CVPR (60) ICCV (33) AAAI (24) ACL (16) ECCV (11) IJCAI (11) NIPS (10) EMNLP (9) ICML (5) ICLR (3) IJCNLP (3) WACV (1)

Top co-authors

Xiaodan Liang (52) Guanbin Li (47) Guangrun Wang (17) Pengxu Wei (15) Ziliang Chen (15) Jinghui Qin (13) Keze Wang (12) Tianshui Chen (11) Yang Liu (10) Shuicheng Yan (10)

Keywords

convolutional neural network (21) object detection (17) representation learning (14) semantic segmentation (11) knowledge graph (10) vision-language model (9) graph neural network (9) domain adaptation (9) contrastive learning (8) large language model (8) knowledge distillation (7) transfer learning (7) neural network (7) diffusion model (6) multimodal learning (6) adversarial learning (6) unsupervised learning (5) semi-supervised learning (5) image restoration (5) person re-identification (5)

Papers

Stable Language Guidance for Vision–Language–Action Models ACL 2026
Human-Centric Open-Future Task Discovery: Formulation, Benchmark, and Scalable Tree-Based Search AAAI 2026
Pre-Trained Video Generative Models as World Simulators AAAI 2026
Similarity-aware Probabilistic Embeddings Modeling for Video-Text Retrieval WACV 2026
PAM: Enhancing General Alignment of Large Reasoning Models through Priority-Aware Metacognition ACL 2026
SEE: Signal Embedding Energy for Quantifying Noise Interference in Large Audio Language Models ACL 2026
Visually-Guided Policy Optimization for Multimodal Reasoning ACL 2026
Hidden in the Noise: Unveiling Backdoors in Audio LLMs Alignment Through Latent Acoustic Pattern Triggers AAAI 2026
Backdoor Collapse: Eliminating Unknown Threats Via Known Backdoor Aggregation In Language Models ACL 2026
Thinking Before You Speak: A Proactive Test-time Scaling Approach EMNLP 2025
PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention CVPR 2025
Boosting the Dual-Stream Architecture in Ultra-High Resolution Segmentation with Resolution-Biased Uncertainty Estimation CVPR 2025
Are High-Quality AI-Generated Images More Difficult for Models to Detect? ICML 2025
Language Models as Implicit Tree Search ICML 2025
Cross-modal Causal Relation Alignment for Video Question Grounding CVPR 2025
DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh CVPR 2025
DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering CVPR 2025
VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction CVPR 2025
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method CVPR 2025
RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs EMNLP 2025
No Pains, More Gains: Recycling Sub-Salient Patches for Efficient High-Resolution Image Recognition CVPR 2025
SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks AAAI 2025
Monitoring Primitive Interactions During the Training of DNNs AAAI 2025
Towards Understanding the Robustness of Diffusion-Based Purification: A Stochastic Perspective ICLR 2025
Can We Achieve Efficient Diffusion Without Self-Attention? Distilling Self-Attention into Convolutions ICCV 2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering ICCV 2025
Reproducible Vision-Language Models Meet Concepts Out of Pre-Training CVPR 2025
Cool-Fusion: Fuse Large Language Models without Training ACL 2025
Chain of Methodologies: Scaling Test Time Computation without Training ACL 2025
MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language Models ACL 2025
HyperCRS: Hypergraph-Aware Multi-Grained Preference Learning to Burst Filter Bubbles in Conversational Recommendation System ACL 2025
IntelliCockpitBench: A Comprehensive Benchmark to Evaluate VLMs for Intelligent Cockpit ACL 2025
Why Multi-Interest Fairness Matters: Hypergraph Contrastive Multi-Interest Learning for Fair Conversational Recommender System ACL 2025
DreamFuse: Adaptive Image Fusion with Diffusion Transformer ICCV 2025
RoboPearls: Editable Video Simulation for Robot Manipulation ICCV 2025
RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation ICCV 2025
Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference ICCV 2025
Sim-DETR: Unlock DETR for Temporal Sentence Grounding ICCV 2025
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection CVPR 2024
AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis CVPR 2024
Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation CVPR 2024
Kepler codebook ICML 2024
AttNS: Attention-Inspired Numerical Solving For Limited Data Scenarios ICML 2024
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE AAAI 2024
FacetCRS: Multi-Faceted Preference Learning for Pricking Filter Bubbles in Conversational Recommender System AAAI 2024
Diagnosing and Rectifying Fake OOD Invariance: A Restructured Causal Approach AAAI 2024
HyCoRec: Hypergraph-Enhanced Multi-Preference Learning for Alleviating Matthew Effect in Conversational Recommendation ACL 2024
VisDiaHalBench: A Visual Dialogue Benchmark For Diagnosing Hallucination in Large Vision-Language Models ACL 2024
Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation EMNLP 2024
Stripe Observation Guided Inference Cost-free Attention Mechanism ECCV 2024
WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models ECCV 2024
MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection ECCV 2024
Learning Adaptive Spatial Coherent Correlations for Speech-Preserving Facial Expression Manipulation CVPR 2024
Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos ICCV 2023
SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training ICCV 2023
Enhanced Soft Label for Semi-Supervised Semantic Segmentation ICCV 2023
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts ICCV 2023
Scene Graph to Image Synthesis via Knowledge Consensus AAAI 2023
ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection NIPS 2023
HutCRS: Hierarchical User-Interest Tracking for Conversational Recommender System EMNLP 2023
Masked Images Are Counterfactual Samples for Robust Fine-Tuning CVPR 2023
Identity-Preserving Talking Face Generation With Landmark and Appearance Priors CVPR 2023
Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training CVPR 2023
De-biased Teacher: Rethinking IoU Matching for Semi-supervised Object Detection AAAI 2023
Adapting Object Size Variance and Class Imbalance for Semi-supervised Object Detection AAAI 2023
Actional Atomic-Concept Learning for Demystifying Vision-Language Navigation AAAI 2023
DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback IJCAI 2023
Long-term Wind Power Forecasting with Hierarchical Spatial-Temporal Transformer IJCAI 2023
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment ICCV 2023
Understanding Self-attention Mechanism via Dynamical System Perspective ICCV 2023
Towards Real-World Burst Image Super-Resolution: Benchmark and Method ICCV 2023
RankMatch: Fostering Confidence and Consistency in Learning with Noisy Labels ICCV 2023
A Retrospect to Multi-prompt Learning across Vision and Language ICCV 2023
UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression EMNLP 2022
Divide and Contrast: Source-free Domain Adaptation via Adaptive Contrastive Learning NIPS 2022
Structure-Preserving 3D Garment Modeling with Neural Sewing Machines NIPS 2022
Structured Semantic Transfer for Multi-Label Recognition with Partial Labels AAAI 2022
Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels AAAI 2022
Unsupervised Domain Adaptive Salient Object Detection through Uncertainty-Aware Pseudo-Label Learning AAAI 2022
Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism CVPR 2022
Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution CVPR 2022
Semantic-Aware Auto-Encoders for Self-Supervised Representation Learning CVPR 2022
Adversarially-Aware Robust Object Detector ECCV 2022
LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning EMNLP 2022
Double-Check Soft Teacher for Semi-Supervised Object Detection IJCAI 2022
Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting CVPR 2021
Rethinking the Pruning Criteria for Convolutional Neural Network NIPS 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition EMNLP 2021
Linguistically Routing Capsule Network for Out-of-Distribution Visual Question Answering ICCV 2021
Trash To Treasure: Harvesting OOD Data With Cross-Modal Matching for Open-Set Semi-Supervised Learning ICCV 2021
Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift ICCV 2021
Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video IJCAI 2021
Solving Inefficiency of Self-Supervised Representation Learning ICCV 2021
Deductive Learning for Weakly-Supervised 3D Human Pose Estimation via Uncalibrated Cameras AAAI 2021
Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation AAAI 2021
Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition AAAI 2021
Towards Quantifiable Dialogue Coherence Evaluation IJCNLP 2021
Towards Quantifiable Dialogue Coherence Evaluation ACL 2021
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks IJCNLP 2021
GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning IJCNLP 2021
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks ACL 2021
GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning ACL 2021
Transferable, Controllable, and Inconspicuous Adversarial Attacks on Person Re-identification With Deep Mis-Ranking CVPR 2020
Tree-Structured Policy Based Progressive Reinforcement Learning for Temporally Language Grounding in Video AAAI 2020
An Adversarial Perturbation Oriented Domain Adaptation Approach for Semantic Segmentation AAAI 2020
Component Divide-and-Conquer for Real-World Image Super-Resolution ECCV 2020
Collaborative Training between Region Proposal Localization and Classification for Domain Adaptive Object Detection ECCV 2020
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation NIPS 2020
Semantically-Aligned Universal Tree-Structured Solver for Math Word Problems EMNLP 2020
GRADE: Automatic Graph-Enhanced Coherence Metric for Evaluating Open-Domain Dialogue Systems EMNLP 2020
Bidirectional Graph Reasoning Network for Panoptic Segmentation CVPR 2020
Block-Wisely Supervised Neural Architecture Search With Knowledge Distillation CVPR 2020
Knowledge Graph Transfer Network for Few-Shot Recognition AAAI 2020
Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition ICCV 2019
Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation ICCV 2019
Weakly-Supervised Discovery of Geometry-Aware Representation for 3D Human Pose Estimation CVPR 2019
Spatially Variant Linear Representation Models for Joint Filtering CVPR 2019
Blending-Target Domain Adaptation by Adversarial Meta-Adaptation Networks CVPR 2019
Adaptively Connected Neural Networks CVPR 2019
Crowd Counting With Deep Structured Scale Integration Network ICCV 2019
Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid ICCV 2019
Meta R-CNN: Towards General Solver for Instance-Level Low-Shot Learning ICCV 2019
FRAME Revisited: An Interpretation View Based on Particle Evolution AAAI 2019
End-to-End Knowledge-Routed Relational Dialogue System for Automatic Diagnosis AAAI 2019
Semantic Relationships Guided Representation Learning for Facial Action Unit Recognition AAAI 2019
SNAS: stochastic neural architecture search ICLR 2019
Graphonomy: Universal Human Parsing via Graph Transfer Learning CVPR 2019
Reasoning-RCNN: Unifying Adaptive Global Reasoning Into Large-Scale Object Detection CVPR 2019
NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning ICLR 2019
Multivariate-Information Adversarial Ensemble for Scalable Joint Distribution Matching ICML 2019
Knowledge-Embedded Routing Network for Scene Graph Generation CVPR 2019
ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis CVPR 2019
Layout-Graph Reasoning for Fashion Landmark Detection CVPR 2019
Semi-Supervised Video Salient Object Detection Using Pseudo-Labels ICCV 2019
Symbolic Graph Reasoning Meets Convolutions NIPS 2018
Visual Question Reasoning on General Dependency Tree CVPR 2018
Interpretable Video Captioning via Trajectory Structured Localization CVPR 2018
LSTM Pose Machines CVPR 2018
Deep Cocktail Network: Multi-Source Unsupervised Domain Adaptation With Category Shift CVPR 2018
Flow Guided Recurrent Neural Encoder for Video Salient Object Detection CVPR 2018
Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning CVPR 2018
Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains CVPR 2018
Instance-level Human Parsing via Part Grouping Network ECCV 2018
Monocular Depth Estimation with Affinity, Vertical Pooling, and Label Enhancement ECCV 2018
Learning Warped Guidance for Blind Face Restoration ECCV 2018
Toward Characteristic-Preserving Image-based Virtual Try-On Network ECCV 2018
Generative Semantic Manipulation with Mask-Contrasting GAN ECCV 2018
Kalman Normalization: Normalizing Internal Representations Across Network Layers NIPS 2018
Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection CVPR 2018
Single View Stereo Matching CVPR 2018
Hybrid Knowledge Routed Modules for Large-scale Object Detection NIPS 2018
Knowledge-Embedded Representation Learning for Fine-Grained Image Recognition IJCAI 2018
Crowd Counting using Deep Recurrent Spatial-Aware Network IJCAI 2018
DRPose3D: Depth Ranking in 3D Human Pose Estimation IJCAI 2018
Deep Reasoning with Knowledge Graph for Social Relationship Understanding IJCAI 2018
Convolutional Memory Blocks for Depth Data Representation Learning IJCAI 2018
Recurrent 3D Pose Sequence Machines CVPR 2017
Deep Dual Learning for Semantic Image Segmentation ICCV 2017
Interpretable Structure-Evolving LSTM CVPR 2017
Instance-Level Salient Object Segmentation CVPR 2017
Joint Detection and Identification Feature Learning for Person Search CVPR 2017
Look Into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing CVPR 2017
Learning Object Interactions and Descriptions for Semantic Image Segmentation CVPR 2017
Attention-Aware Face Hallucination via Deep Reinforcement Learning CVPR 2017
Multi-Label Image Recognition by Recurrently Discovering Attentional Regions ICCV 2017
Reversible Recursive Instance-Level Object Segmentation CVPR 2016
Joint Learning of Single-Image and Cross-Image Representations for Person Re-Identification CVPR 2016
Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection CVPR 2016
Deep Structured Scene Parsing by Learning With Image Descriptions CVPR 2016
Semantic Object Parsing With Local-Global Long Short-Term Memory CVPR 2016
A Stochastic Image Grammar for Fine-Grained 3D Scene Reconstruction IJCAI 2016
Geometric Scene Parsing with Hierarchical LSTM IJCAI 2016
Discriminative Learning of Iteration-Wise Priors for Blind Deconvolution CVPR 2015
SOLD: Sub-Optimal Low-rank Decomposition for Efficient Video Segmentation CVPR 2015
Human Parsing With Contextualized Convolutional Neural Network ICCV 2015
Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection ICCV 2015
Matching-CNN Meets KNN: Quasi-Parametric Human Parsing CVPR 2015
Deep Joint Task Learning for Generic Object Extraction NIPS 2014
Clothing Co-Parsing by Joint Image Segmentation and Labeling CVPR 2014
Correntropy Induced L2 Graph for Robust Subspace Clustering ICCV 2013
Human Re-identification by Matching Compositional Template with Cluster Sampling ICCV 2013
Incorporating Structural Alternatives and Sharing into Hierarchy for Multiclass Object Recognition and Detection CVPR 2013
PISA: Pixelwise Image Saliency by Aggregating Complementary Appearance Contrast Measures with Spatial Priors CVPR 2013
Robust Region Grouping via Internal Patch Statistics CVPR 2013
SYM-FISH: A Symmetry-Aware Flip Invariant Sketch Histogram Shape Descriptor ICCV 2013
Dynamical And-Or Graph Learning for Object Shape Modeling and Detection NIPS 2012