Le Xue
11 papers · 2022–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (5) π Renaissance Researcher (7) πΊοΈ Taxonomy Completionist (30)
π
Conference Polyglot
(8)
π
Century Club
(10)
π₯
Unstoppable
(5)
ποΈ
Keyword Collector
(63)
Conferences
CVPR (3)
AAAI (1)
COLING (1)
ECCV (1)
EMNLP (1)
ICCV (1)
ICLR (1)
MICCAI (1)
NIPS (1)
Top co-authors
Research topics
Keywords
multimodal learning
(4)
vision-language model
(3)
point cloud
(2)
zero-shot classification
(2)
pet imaging
(2)
video understanding
(2)
3d understanding
(2)
large language model
(2)
medical imaging
(1)
visual question answering
(1)
benchmark evaluation
(1)
audio-visual learning
(1)
document understanding
(1)
human-object interaction
(1)
promptable segmentation
(1)
temporal modeling
(1)
universal segmentation
(1)
language modeling
(1)
medical image segmentation
(1)
machine reading comprehension
(1)
Papers
PET2Rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission Tomography
AAAI 2026
Towards Multi-Scenario Generalization: Text-Guided Unified Framework for Low-Dose CT and Total-Body PET Reconstruction
MICCAI 2025
Contra4: Evaluating Contrastive Cross-Modal Reasoning in Audio, Video, Image, and 3D
EMNLP 2025
SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images
ICCV 2025
LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living
CVPR 2025
"X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning"
ECCV 2024
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
CVPR 2024
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens
NIPS 2024
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
ICLR 2024
ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding
CVPR 2023
DocQueryNet: Value Retrieval with Arbitrary Queries for Form-like Documents
COLING 2022