Feng Li

51 papers · 2018–2026 · 11 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🏃 Academic Marathon (7) 🌍 Conference Polyglot (10) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (14)

🐝 Cross-Pollinator (14) 🌈 Renaissance Researcher (8) 🗺️ Taxonomy Completionist (70) 🤝 Dynamic Duo (21) 🧬 Topic Evolution ⚡ Prolific Year (11) 🚀 Conference Pioneer 🔥 Unstoppable (8) 💎 Century Club (49) 🗃️ Keyword Collector (182) ❓ The Questioner

Conferences

CVPR (12) AAAI (7) ECCV (6) ICLR (6) ICCV (5) NIPS (5) EMNLP (3) MICCAI (3) IJCAI (2) ACL (1) NAACL (1)

Top co-authors

Lei Zhang (21) Shilong Liu (21) Hao Zhang (20) Tianhe Ren (9) Jianwei Yang (8) Chunyuan Li (7) Xueyan Zou (7) Hongyang Li (7) Huihui Bai (6) Zhaoyang Zeng (6)

Keywords

object detection (5) image segmentation (4) convolutional neural network (4) deformable attention (3) transformer architecture (3) semantic segmentation (3) instance segmentation (3) attention mechanism (3) open-vocabulary segmentation (3) panoptic segmentation (2) representation learning (2) image restoration (2) transfer learning (2) visual prompting (2) image super-resolution (2) depth estimation (2) multimodal learning (2) knowledge transfer (2) efficient computing (2) dialogue system (2)

Papers

Can Large Language Models Grasp 3D Medical Anatomy Shapes? (Student Abstract) AAAI 2026 Scaling Law for Multimodal Large Language Model Supervised Fine-Tuning ACL 2026 EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events CVPR 2025 Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaptation ICLR 2025 CLEAR: A Clinically Grounded Tabular Framework for Radiology Report Evaluation EMNLP 2025 Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production AAAI 2025 Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues AAAI 2025 MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents AAAI 2025 ResMAP: Restoring MRIs of Mixed Artifacts by Prompt Cascading Retrieval MICCAI 2025 Intelligent Virtual Sonographer (IVS): Enhancing Physician-Robot-Patient Communication MICCAI 2025 MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? ICLR 2025 LLaVA-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models ICLR 2025 Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning AAAI 2025 Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection ECCV 2024 LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents ECCV 2024 Segment and Recognize Anything at Any Granularity ECCV 2024 MoCo-Diff: Adaptive Conditional Prior on Diffusion Network for MRI Motion Correction MICCAI 2024 TAPTR: Tracking Any Point with Transformers as Detection ECCV 2024 Visual In-Context Prompting CVPR 2024 Interfacing Foundation Models' Embeddings NIPS 2024 TAPTRv2: Attention-based Position Update Improves Tracking Any Point NIPS 2024 LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models ECCV 2024 T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy ECCV 2024 DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection ICLR 2023 Segment Everything Everywhere All at Once NIPS 2023 DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding AAAI 2023 Exploring Data Geometry for Continual Learning CVPR 2023 Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation CVPR 2023 MP-Former: Mask-Piloted Transformer for Image Segmentation CVPR 2023 Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning CVPR 2023 Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR CVPR 2023 A Simple Framework for Open-Vocabulary Segmentation and Detection ICCV 2023 DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting ICCV 2023 Detection Transformer with Stable Matching ICCV 2023 Neural Interactive Keypoint Detection ICCV 2023 Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation ICLR 2023 Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation NAACL 2022 A Token-pair Framework for Information Extraction from Dialog Transcripts in SereTOD Challenge EMNLP 2022 DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR ICLR 2022 APG: Adaptive Parameter Generation Network for Click-Through Rate Prediction NIPS 2022 Toward the Limitation of Code-Switching in Cross-Lingual Transfer EMNLP 2022 Upright-Net: Learning Upright Orientation for 3D Point Cloud CVPR 2022 DN-DETR: Accelerate DETR Training by Introducing Query DeNoising CVPR 2022 SARG: A Novel Semi Autoregressive Generator for Multi-turn Incomplete Utterance Restoration AAAI 2021 Encoding Spatial Distribution of Convolutional Features for Texture Representation NIPS 2021 Deep Texture Recognition via Exploiting Cross-Layer Statistical Self-Similarity CVPR 2021 Towards Complete Scene and Regular Shape for Distortion Rectification by Curve-Aware Extrapolation ICCV 2021 Towards Fast and Accurate Real-World Depth Super-Resolution: Benchmark Dataset and Baseline CVPR 2021 Deep Interleaved Network for Single Image Super-Resolution with Asymmetric Co-Attention IJCAI 2020 High Performance Gesture Recognition via Effective and Efficient Temporal Modeling IJCAI 2019 Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking CVPR 2018