Shijie Geng
14 papers · 2018–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
π Renaissance Researcher (6) π Interdisciplinary Bridge π Conference Polyglot (9) π Academic Marathon (7) πΊοΈ Taxonomy Completionist (28)
π§
Keyword Pioneer
πΊοΈ
Taxonomy Completionist
(28)
π₯
Mega-Team
(20)
π
Grand Slam
π
Conference Pioneer
π₯
Unstoppable
(6)
π
Century Club
(14)
Conferences
ECCV (4)
AAAI (2)
ICLR (2)
ACL (1)
CORL (1)
CVPR (1)
EMNLP (1)
ICML (1)
NIPS (1)
Top co-authors
Keywords
multimodal learning
(3)
scene graph
(2)
adversarial learning
(1)
cross-lingual transfer
(1)
question answering
(1)
language grounding
(1)
multi-modal learning
(1)
code generation
(1)
visual grounding
(1)
adversarial training
(1)
object localization
(1)
scene understanding
(1)
benchmark evaluation
(1)
low-resource language
(1)
dynamic graph
(1)
cross-modal alignment
(1)
foundation model
(1)
generative adversarial network
(1)
vision-language model
(1)
contrastive learning
(1)
Papers
Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation
ICLR 2025
InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models
NIPS 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
ICML 2024
HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention
ICLR 2023
Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens
CVPR 2023
Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs
CORL 2023
VIP5: Towards Multimodal Foundation Models for Recommendation
EMNLP 2023
Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning
ECCV 2022
Improving Personalized Explanation Generation through Visualization
ACL 2022
COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality
ECCV 2022
Frozen CLIP Models Are Efficient Video Learners
ECCV 2022
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers
AAAI 2021
ABSent: Cross-Lingual Sentence Representation Mapping with Bidirectional GANs
AAAI 2020
Quantized Densely Connected U-Nets for Efficient Landmark Localization
ECCV 2018