Dongfang Liu
44 papers · 2021–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
🌍 Conference Polyglot (13) 🐝 Cross-Pollinator (14) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (5)
🧭
Keyword Pioneer
🌈
Renaissance Researcher
(7)
🐣
Hot Topic Early Bird
🤝
Dynamic Duo
(25)
👑
Triple Crown
🏆
Grand Slam
🧬
Topic Evolution
🏆
Keyword Champion
(3)
🗃️
Keyword Collector
(135)
⚡
Prolific Year
(10)
❓
The Questioner
🔥
Unstoppable
(5)
💎
Century Club
(42)
Conferences
ICLR (9)
EMNLP (5)
CVPR (4)
ECCV (4)
ICCV (4)
NIPS (4)
ACL (3)
ICML (3)
AAAI (2)
IJCAI (2)
WACV (2)
AACL (1)
IJCNLP (1)
Top co-authors
Keywords
prompt tuning
(5)
instance segmentation
(4)
representation learning
(3)
semantic segmentation
(3)
modality gap
(3)
feature aggregation
(2)
vision-language model
(2)
visual prompt tuning
(2)
optical flow
(2)
transformer architecture
(2)
depth estimation
(2)
transfer learning
(2)
object detection
(2)
embedding space
(2)
image classification
(2)
attribute extraction
(2)
few-shot learning
(2)
video captioning
(2)
object tracking
(2)
parameter-efficient fine-tuning
(2)
Papers
Make LVLMs Focus: Context-Aware Attention Modulation for Better Multimodal In-Context Learning
AAAI 2026
On-the-Fly VLA Adaptation via Test-Time Reinforcement Learning
ACL 2026
MEPT: Mixture of Expert Prompt Tuning as a Manifold Mapper
EMNLP 2025
Item-Language Model: Improving Large Language Model for Recommendation via Item-Language Representation Learning
AACL 2025
Item-Language Model: Improving Large Language Model for Recommendation via Item-Language Representation Learning
IJCNLP 2025
Re-Imagining Multimodal Instruction Tuning: A Representation View
ICLR 2025
Visual Agents as Fast and Slow Thinkers
ICLR 2025
Diff-PIC: Revolutionizing Particle-In-Cell Nuclear Fusion Simulation with Diffusion Models
ICLR 2025
DS-LLM: Leveraging Dynamical Systems to Enhance Both Training and Inference of Large Language Models
ICLR 2025
Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
ICCV 2025
Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval
CVPR 2024
Diffusion-Inspired Truncated Sampler for Text-Video Retrieval
NIPS 2024
Visual Fourier Prompt Tuning
NIPS 2024
ProMotion: Prototypes As Motion Learners
CVPR 2024
UAV First-Person Viewers Are Radiance Field Learners
ECCV 2024
AMD: Automatic Multi-step Distillation of Large-scale Vision Models
ECCV 2024
M2PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning
EMNLP 2024
EAVE: Efficient Product Attribute Value Extraction via Lightweight Sparse-layer Interaction
EMNLP 2024
Fusion Is Not Enough: Single Modal Attacks on Fusion Models for 3D Object Detection
ICLR 2024
Facing the Elephant in the Room: Visual Prompt Tuning or Full finetuning?
ICLR 2024
Image Translation as Diffusion Visual Programmers
ICLR 2024
BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks
ICML 2024
Prototypical Transformer As Unified Motion Learners
ICML 2024
E^2VPT: An Effective and Efficient Approach for Visual Prompt Tuning
ICCV 2023
Adversarial Training of Self-supervised Monocular Depth Estimation against Physical-World Attacks
ICLR 2023
Visual Recognition with Deep Nearest Centroids
ICLR 2023
MUSTIE: Multimodal Structural Transformer for Web Information Extraction
ACL 2023
TransFlow: Transformer As Flow Learner
CVPR 2023
MixPAVE: Mix-Prompt Tuning for Few-shot Product Attribute Value Extraction
ACL 2023
CLUSTSEG: Clustering for Universal Segmentation
ICML 2023
ClusterFomer: Clustering As A Universal Visual Learner
NIPS 2023
APrompt: Attention Prompt Tuning for Efficient Adaptation of Pre-trained Language Models
EMNLP 2023
Prompt Learns Prompt: Exploring Knowledge-Aware Generative Prompt Collaboration For Video Captioning
IJCAI 2023
GL-RG: Global-Local Representation Granularity for Video Captioning
IJCAI 2022
Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian
ECCV 2022
Physical Attack on Monocular Depth Estimation with Optimal Adversarial Patches
ECCV 2022
Learning to Generate Question by Asking Question: A Primal-Dual Approach with Uncommon Word Generation
EMNLP 2022
DG-Labeler and DGL-MOTS Dataset: Boost the Autonomous Driving Perception
WACV 2022
Learning Equivariant Segmentation with Instance-Unique Querying
NIPS 2022
Semantic Aware Data Augmentation for Cell Nuclei Microscopical Images With Artificial Neural Networks
ICCV 2021
A Vector-Based Representation to Enhance Head Pose Estimation
WACV 2021
DenserNet: Weakly Supervised Visual Localization Using Multi-Scale Feature Aggregation
AAAI 2021
TF-Blender: Temporal Feature Blender for Video Object Detection
ICCV 2021
SG-Net: Spatial Granularity Network for One-Stage Video Instance Segmentation
CVPR 2021