conftrace_

Le Xue

11 papers · 2022–2026 · 9 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+4 more ↓

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (5) 🌈 Renaissance Researcher (7) 🗺️ Taxonomy Completionist (30)

🌍 Conference Polyglot (8) 💎 Century Club (10) 🔥 Unstoppable (5) 🗃️ Keyword Collector (63)

Conferences

CVPR (3) AAAI (1) COLING (1) ECCV (1) EMNLP (1) ICCV (1) ICLR (1) MICCAI (1) NIPS (1)

Top co-authors

Ran Xu (7) Caiming Xiong (7) Silvio Savarese (6) Juan Carlos Niebles (5) Artemis Panagopoulou (3) Chen Xing (2) Jiajun Wu (2) Yuchen Liu (2) Yuan Cheng (2) Yuan Qi (2)

Research topics

Domain-Specific (1) Applications (1)

Keywords

multimodal learning (4) vision-language model (3) point cloud (2) zero-shot classification (2) pet imaging (2) video understanding (2) 3d understanding (2) large language model (2) medical imaging (1) visual question answering (1) benchmark evaluation (1) audio-visual learning (1) document understanding (1) human-object interaction (1) promptable segmentation (1) temporal modeling (1) universal segmentation (1) language modeling (1) medical image segmentation (1) machine reading comprehension (1)

Papers

PET2Rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission Tomography AAAI 2026 Towards Multi-Scenario Generalization: Text-Guided Unified Framework for Low-Dose CT and Total-Body PET Reconstruction MICCAI 2025 Contra4: Evaluating Contrastive Cross-Modal Reasoning in Audio, Video, Image, and 3D EMNLP 2025 SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images ICCV 2025 LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living CVPR 2025 "X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning" ECCV 2024 ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding CVPR 2024 MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens NIPS 2024 Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization ICLR 2024 ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding CVPR 2023 DocQueryNet: Value Retrieval with Arbitrary Queries for Form-like Documents COLING 2022