Lianli Gao
60 papers · 2015–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Conference Polyglot (11) π§ Keyword Pioneer π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (14) π Academic Marathon (10)
π
Academic Marathon
(10)
π
Cross-Pollinator
(13)
π
Renaissance Researcher
(10)
π
Keyword Champion
(2)
π§¬
Topic Evolution
π€
Dynamic Duo
(48)
π¬
Deep Specialist
(10)
π
Grand Slam
π
Trend Setter
π
Century Club
(59)
π₯
Unstoppable
(9)
β‘
Prolific Year
(11)
π
Conference Pioneer
ποΈ
Keyword Collector
(270)
Conferences
IJCAI (15)
CVPR (13)
AAAI (10)
NIPS (6)
ECCV (5)
ICCV (4)
ACL (3)
CORL (1)
EMNLP (1)
ICLR (1)
ICML (1)
Top co-authors
Keywords
multimodal learning
(7)
attention mechanism
(5)
scene graph generation
(4)
visual question answering
(4)
image retrieval
(4)
video understanding
(4)
generative adversarial network
(4)
graph neural network
(4)
adversarial attack
(4)
convolutional neural network
(4)
metric learning
(3)
cross-modal retrieval
(3)
image captioning
(2)
unsupervised learning
(2)
semi-supervised learning
(2)
visual grounding
(2)
graph learning
(2)
prototype learning
(2)
probabilistic modeling
(2)
image generation
(2)
Papers
Debiased Orthogonal Boundary-Driven Efficient Noise Mitigation
ACL 2026
DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction
AAAI 2025
Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach
EMNLP 2025
OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction
ACL 2025
Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves
CVPR 2025
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
ACL 2025
Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation
CORL 2025
DePT: Decoupled Prompt Tuning
CVPR 2024
CoIN: A Benchmark of Continual Instruction Tuning for Multimodel Large Language Models
NIPS 2024
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization
NIPS 2024
FΒ³-Pruning: A Training-Free and Generalized Pruning Strategy towards Faster and Finer Text-to-Video Synthesis
AAAI 2024
RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception
ECCV 2024
Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection
ECCV 2024
ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval
CVPR 2024
Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval
NIPS 2023
Prototype-Based Embedding Network for Scene Graph Generation
CVPR 2023
Part-Aware Transformer for Generalizable Person Re-identification
ICCV 2023
DETA: Denoised Task Adaptation for Few-Shot Learning
ICCV 2023
A Closer Look at Few-shot Classification Again
ICML 2023
Towards Open-Vocabulary Scene Graph Generation with Prompt-Based Finetuning
ECCV 2022
Natural Color Fool: Towards Boosting Black-box Unrestricted Attacks
NIPS 2022
A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval
NIPS 2022
Practical Evaluation of Adversarial Robustness via Adaptive Auto Attack
CVPR 2022
Fine-Grained Predicates Learning for Scene Graph Generation
CVPR 2022
Beyond ImageNet Attack: Towards Crafting Adversarial Examples for Black-box Domains
ICLR 2022
Frequency Domain Model Augmentation for Adversarial Attack
ECCV 2022
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks
CVPR 2022
S2 Transformer for Image Captioning
IJCAI 2022
A Lower Bound of Hash Codes' Performance
NIPS 2022
Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression
CVPR 2022
RSGNet: Relation based Skeleton Graph Network for Crowded Scenes Pose Estimation
AAAI 2021
Exploiting Scene Graphs for Human-Object Interaction Detection
ICCV 2021
From General to Specific: Informative Scene Graph Generation via Balance Adjustment
ICCV 2021
Feature Space Targeted Attacks by Statistic Alignment
IJCAI 2021
Towards Unsupervised Deformable-Instances Image-to-Image Translation
IJCAI 2021
PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation
IJCAI 2021
Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentences
CVPR 2020
Label-Attended Hashing for Multi-Label Image Retrieval
IJCAI 2020
SNEQ: Semi-Supervised Attributed Network Embedding with Attention-Based Quantisation
AAAI 2020
Learning Cross-Aligned Latent Embeddings for Zero-Shot Cross-Modal Retrieval
AAAI 2020
What Machines See Is Not What They Get: Fooling Scene Text Recognition Models With Adversarial Text Images
CVPR 2020
Patch-wise Attack for Fooling Deep Neural Network
ECCV 2020
Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation
IJCAI 2020
Bottom-up and Top-down: Bidirectional Additive Net for Edge Detection
IJCAI 2020
Matching User with Item Set: Collaborative Bundle Recommendation with Deep Attention Network
IJCAI 2019
One Network for Multi-Domains: Domain Adaptive Hashing with Intersectant Generative Adversarial Networks
IJCAI 2019
Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks
CVPR 2019
Structured Two-Stream Attention Network for Video Question Answering
AAAI 2019
Template-Based Math Word Problem Solvers with Recursive Neural Networks
AAAI 2019
Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering
AAAI 2019
Deliberate Attention Networks for Image Captioning
AAAI 2019
Perceptual Pyramid Adversarial Networks for Text-to-Image Synthesis
AAAI 2019
Beyond Product Quantization: Deep Progressive Quantization for Image Retrieval
IJCAI 2019
Deep Recurrent Quantization for Generating Sequential Binary Codes
IJCAI 2019
Social Relation Recognition From Videos via Multi-Scale Spatial-Temporal Reasoning
CVPR 2019
Dual Conditional GANs for Face Aging and Rejuvenation
IJCAI 2018
Coarse-to-fine Image Co-segmentation with Intra and Inter Rank Constraints
IJCAI 2018
From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
IJCAI 2018
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
IJCAI 2017
Optimal Graph Learning With Partial Tags and Multiple Features for Image and Video Annotation
CVPR 2015