Peng Jin
41 papers · 2007–2026 · 13 top CS/AI conferences
Achievements
🧭 Keyword Pioneer
🌍 Conference Polyglot (13)
🌈 Renaissance Researcher (6)
🌉 Interdisciplinary Bridge
🏃 Academic Marathon (18)
🐝 Cross-Pollinator (6)
🏆 Grand Slam
👑 Triple Crown
🌱 Topic Pioneer
🤝 Dynamic Duo (17)
🔬 Deep Specialist (11)
🧬 Topic Evolution
🏆 Keyword Champion (4)
⚡ Prolific Year (8)
🗃️ Keyword Collector (118)
🚀 Conference Pioneer
💎 Century Club (40)
📈 Trend Setter
🔥 Unstoppable (5)
Conferences
SEMEVAL (6)
ICML (5)
AAAI (4)
IJCAI (4)
NIPS (4)
CVPR (3)
ECCV (3)
ICCV (3)
ACL (2)
COLING (2)
EMNLP (2)
ICLR (2)
NAACL (1)
Keywords
contrastive learning (4)
multimodal learning (4)
text-video retrieval (4)
seismic data (3)
diffusion model (3)
knowledge triple (2)
language representation (2)
full waveform inversion (2)
cross-modal alignment (2)
unsupervised learning (2)
representation learning (2)
vision-language model (2)
generative model (2)
large language model (2)
visual representation (2)
commonsense reasoning (2)
deep learning (2)
knowledge graph (2)
video understanding (2)
question answering (1)
Papers
Next Patch Prediction for AutoRegressive Visual Generation
AAAI 2026
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step
ICCV 2025
VSNet: Focusing on the Linguistic Characteristics of Sign Language
CVPR 2025
MUSE: Mamba Is Efficient Multi-scale Learner for Text-video Retrieval
AAAI 2025
Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation
AAAI 2025
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts
ICLR 2025
Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection
ICML 2025
MoH: Multi-Head Attention as Mixture-of-Head Attention
ICML 2025
Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation
ECCV 2024
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
EMNLP 2024
Parallel Vertex Diffusion for Unified Visual Grounding
AAAI 2024
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference
EMNLP 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
ICML 2024
Auto-Linear Phenomenon in Subsurface Imaging
ICML 2024
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter
ACL 2024
Towards Multi-Relational Multi-Hop Reasoning over Dense Temporal Knowledge Graphs
ACL 2024
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
CVPR 2024
FreestyleRet: Retrieving Images from Style-Diversified Queries
ECCV 2024
Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable Repainting
ECCV 2024
WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation
IJCAI 2023
Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs
NIPS 2023
$\mathbf{\mathbb{E}^{FWI}}$: Multiparameter Benchmark Datasets for Elastic Full Waveform Inversion of Geophysical Properties
NIPS 2023
Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
CVPR 2023
Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation
ICCV 2023
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
ICCV 2023
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment
IJCAI 2023
TG-VQA: Ternary Game of Video Question Answering
IJCAI 2023
Unsupervised Learning of Full-Waveform Inversion: Connecting CNN and Partial Differential Equation in a Loop
ICLR 2022
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
NIPS 2022
An Intriguing Property of Geophysics Inversion
ICML 2022
OpenFWI: Large-scale Multi-structural Benchmark Datasets for Full Waveform Inversion
NIPS 2022
CN-HIT-IT.NLP at SemEval-2020 Task 4: Enhanced Language Representation with Multiple Knowledge Triples
COLING 2020
CN-HIT-IT.NLP at SemEval-2020 Task 4: Enhanced Language Representation with Multiple Knowledge Triples
SEMEVAL 2020
Bag-of-Embeddings for Text Classification
IJCAI 2016
Multi-view Chinese Treebanking
COLING 2014
SemEval-2012 Task 4: Evaluating Chinese Word Similarity
SEMEVAL 2012
SemEval-2 Task 15: Infrequent Sense Identification for Mandarin Text to Speech Systems
SEMEVAL 2010
SemEval-2010 Task 18: Disambiguating Sentiment Ambiguous Adjectives
SEMEVAL 2010
Estimating and Exploiting the Entropy of Sense Distributions
NAACL 2009
PKU: Combining Supervised Classifiers with Features Selection
SEMEVAL 2007
SemEval-2007 Task 05: Multilingual Chinese-English Lexical Sample
SEMEVAL 2007