Feng Li
51 papers · 2018–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Academic Marathon (7) π Conference Polyglot (10) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (14)
π
Cross-Pollinator
(14)
π
Renaissance Researcher
(8)
πΊοΈ
Taxonomy Completionist
(70)
π€
Dynamic Duo
(21)
π§¬
Topic Evolution
β‘
Prolific Year
(11)
π
Conference Pioneer
π₯
Unstoppable
(8)
π
Century Club
(49)
ποΈ
Keyword Collector
(182)
β
The Questioner
Conferences
CVPR (12)
AAAI (7)
ECCV (6)
ICLR (6)
ICCV (5)
NIPS (5)
EMNLP (3)
MICCAI (3)
IJCAI (2)
ACL (1)
NAACL (1)
Top co-authors
Keywords
object detection
(5)
image segmentation
(4)
convolutional neural network
(4)
deformable attention
(3)
transformer architecture
(3)
semantic segmentation
(3)
instance segmentation
(3)
attention mechanism
(3)
open-vocabulary segmentation
(3)
panoptic segmentation
(2)
representation learning
(2)
image restoration
(2)
transfer learning
(2)
visual prompting
(2)
image super-resolution
(2)
depth estimation
(2)
multimodal learning
(2)
knowledge transfer
(2)
efficient computing
(2)
dialogue system
(2)
Papers
Can Large Language Models Grasp 3D Medical Anatomy Shapes? (Student Abstract)
AAAI 2026
Scaling Law for Multimodal Large Language Model Supervised Fine-Tuning
ACL 2026
EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events
CVPR 2025
Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaptation
ICLR 2025
CLEAR: A Clinically Grounded Tabular Framework for Radiology Report Evaluation
EMNLP 2025
Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production
AAAI 2025
Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues
AAAI 2025
MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents
AAAI 2025
ResMAP: Restoring MRIs of Mixed Artifacts by Prompt Cascading Retrieval
MICCAI 2025
Intelligent Virtual Sonographer (IVS): Enhancing Physician-Robot-Patient Communication
MICCAI 2025
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
ICLR 2025
LLaVA-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
ICLR 2025
Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning
AAAI 2025
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
ECCV 2024
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
ECCV 2024
Segment and Recognize Anything at Any Granularity
ECCV 2024
MoCo-Diff: Adaptive Conditional Prior on Diffusion Network for MRI Motion Correction
MICCAI 2024
TAPTR: Tracking Any Point with Transformers as Detection
ECCV 2024
Visual In-Context Prompting
CVPR 2024
Interfacing Foundation Models' Embeddings
NIPS 2024
TAPTRv2: Attention-based Position Update Improves Tracking Any Point
NIPS 2024
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
ECCV 2024
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
ECCV 2024
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
ICLR 2023
Segment Everything Everywhere All at Once
NIPS 2023
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
AAAI 2023
Exploring Data Geometry for Continual Learning
CVPR 2023
Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation
CVPR 2023
MP-Former: Mask-Piloted Transformer for Image Segmentation
CVPR 2023
Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning
CVPR 2023
Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR
CVPR 2023
A Simple Framework for Open-Vocabulary Segmentation and Detection
ICCV 2023
DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting
ICCV 2023
Detection Transformer with Stable Matching
ICCV 2023
Neural Interactive Keypoint Detection
ICCV 2023
Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation
ICLR 2023
Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation
NAACL 2022
A Token-pair Framework for Information Extraction from Dialog Transcripts in SereTOD Challenge
EMNLP 2022
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
ICLR 2022
APG: Adaptive Parameter Generation Network for Click-Through Rate Prediction
NIPS 2022
Toward the Limitation of Code-Switching in Cross-Lingual Transfer
EMNLP 2022
Upright-Net: Learning Upright Orientation for 3D Point Cloud
CVPR 2022
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
CVPR 2022
SARG: A Novel Semi Autoregressive Generator for Multi-turn Incomplete Utterance Restoration
AAAI 2021
Encoding Spatial Distribution of Convolutional Features for Texture Representation
NIPS 2021
Deep Texture Recognition via Exploiting Cross-Layer Statistical Self-Similarity
CVPR 2021
Towards Complete Scene and Regular Shape for Distortion Rectification by Curve-Aware Extrapolation
ICCV 2021
Towards Fast and Accurate Real-World Depth Super-Resolution: Benchmark Dataset and Baseline
CVPR 2021
Deep Interleaved Network for Single Image Super-Resolution with Asymmetric Co-Attention
IJCAI 2020
High Performance Gesture Recognition via Effective and Efficient Temporal Modeling
IJCAI 2019
Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking
CVPR 2018