Peng Jin

41 papers · 2007–2026 · 13 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🧭 Keyword Pioneer 🌍 Conference Polyglot (13) 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (18)

🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (6) 🏆 Grand Slam 👑 Triple Crown 🌱 Topic Pioneer 🤝 Dynamic Duo (17) 🔬 Deep Specialist (11) 🧬 Topic Evolution 🏆 Keyword Champion (4) ⚡ Prolific Year (8) 🗃️ Keyword Collector (118) 🚀 Conference Pioneer 💎 Century Club (40) 📈 Trend Setter 🔥 Unstoppable (5)

Conferences

SEMEVAL (6) ICML (5) AAAI (4) IJCAI (4) NIPS (4) CVPR (3) ECCV (3) ICCV (3) ACL (2) COLING (2) EMNLP (2) ICLR (2) NAACL (1)

Top co-authors

Li Yuan (18) Jie Chen (10) Chang Liu (10) Zesen Cheng (9) Hao Li (7) Kehan Li (7) Xiangyang Ji (6) Jinfa Huang (6) Yinpeng Chen (5) Yunfang Wu (5)

Keywords

contrastive learning (4) multimodal learning (4) text-video retrieval (4) seismic datum (3) diffusion model (3) knowledge triple (2) language representation (2) full waveform inversion (2) cross-modal alignment (2) unsupervised learning (2) representation learning (2) vision-language model (2) generative model (2) large language model (2) visual representation (2) commonsense reasoning (2) deep learning (2) knowledge graph (2) video understanding (2) question answering (1)

Papers

Next Patch Prediction for AutoRegressive Visual Generation AAAI 2026 LLaVA-CoT: Let Vision Language Models Reason Step-by-Step ICCV 2025 VSNet: Focusing on the Linguistic Characteristics of Sign Language CVPR 2025 MUSE: Mamba Is Efficient Multi-scale Learner for Text-video Retrieval AAAI 2025 Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation AAAI 2025 MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts ICLR 2025 Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection ICML 2025 MoH: Multi-Head Attention as Mixture-of-Head Attention ICML 2025 Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation ECCV 2024 Video-LLaVA: Learning United Visual Representation by Alignment Before Projection EMNLP 2024 Parallel Vertex Diffusion for Unified Visual Grounding AAAI 2024 LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference EMNLP 2024 SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models ICML 2024 Auto-Linear Phenomenon in Subsurface Imaging ICML 2024 RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter ACL 2024 Towards Multi-Relational Multi-Hop Reasoning over Dense Temporal Knowledge Graphs ACL 2024 Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding CVPR 2024 FreestyleRet: Retrieving Images from Style-Diversified Queries ECCV 2024 Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable Repainting ECCV 2024 WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation IJCAI 2023 Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs NIPS 2023 $\mathbf{\mathbb{E}^{FWI}}$: Multiparameter Benchmark Datasets for Elastic Full Waveform Inversion of Geophysical Properties NIPS 2023 Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning CVPR 2023 Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation ICCV 2023 DiffusionRet: Generative Text-Video Retrieval with Diffusion Model ICCV 2023 Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment IJCAI 2023 TG-VQA: Ternary Game of Video Question Answering IJCAI 2023 Unsupervised Learning of Full-Waveform Inversion: Connecting CNN and Partial Differential Equation in a Loop ICLR 2022 Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations NIPS 2022 An Intriguing Property of Geophysics Inversion ICML 2022 OpenFWI: Large-scale Multi-structural Benchmark Datasets for Full Waveform Inversion NIPS 2022 CN-HIT-IT.NLP at SemEval-2020 Task 4: Enhanced Language Representation with Multiple Knowledge Triples COLING 2020 CN-HIT-IT.NLP at SemEval-2020 Task 4: Enhanced Language Representation with Multiple Knowledge Triples SEMEVAL 2020 Bag-of-Embeddings for Text Classification IJCAI 2016 Multi-view Chinese Treebanking COLING 2014 SemEval-2012 Task 4: Evaluating Chinese Word Similarity SEMEVAL 2012 SemEval-2 Task 15: Infrequent Sense Identification for Mandarin Text to Speech Systems SEMEVAL 2010 SemEval-2010 Task 18: Disambiguating Sentiment Ambiguous Adjectives SEMEVAL 2010 Estimating and Exploiting the Entropy of Sense Distributions NAACL 2009 PKU: Combining Supervised Classifiers with Features Selection SEMEVAL 2007 SemEval-2007 Task 05: Multilingual Chinese-English Lexical Sample SEMEVAL 2007