conftrace_

Shijie Geng

14 papers · 2018–2025 · 9 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+7 more ↓

🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (9) 🏃 Academic Marathon (7) 🗺️ Taxonomy Completionist (28)

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (28) 👥 Mega-Team (20) 🏆 Grand Slam 🚀 Conference Pioneer 🔥 Unstoppable (6) 💎 Century Club (14)

Conferences

ECCV (4) AAAI (2) ICLR (2) ACL (1) CORL (1) CVPR (1) EMNLP (1) ICML (1) NIPS (1)

Top co-authors

Yongfeng Zhang (4) hongsheng Li (4) peng gao (4) Gerard de Melo (3) Zuohui Fu (3) Yu Tian (3) Yuxiao Chen (3) Yu Qiao (3) Jianbo Yuan (3) Renrui Zhang (3)

Keywords

multimodal learning (3) scene graph (2) adversarial learning (1) cross-lingual transfer (1) question answering (1) language grounding (1) multi-modal learning (1) code generation (1) visual grounding (1) adversarial training (1) object localization (1) scene understanding (1) benchmark evaluation (1) low-resource language (1) dynamic graph (1) cross-modal alignment (1) foundation model (1) generative adversarial network (1) vision-language model (1) contrastive learning (1)

Papers

Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation ICLR 2025 InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models NIPS 2024 SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models ICML 2024 HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention ICLR 2023 Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens CVPR 2023 Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs CORL 2023 VIP5: Towards Multimodal Foundation Models for Recommendation EMNLP 2023 Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning ECCV 2022 Improving Personalized Explanation Generation through Visualization ACL 2022 COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality ECCV 2022 Frozen CLIP Models Are Efficient Video Learners ECCV 2022 Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers AAAI 2021 ABSent: Cross-Lingual Sentence Representation Mapping with Bidirectional GANs AAAI 2020 Quantized Densely Connected U-Nets for Efficient Landmark Localization ECCV 2018