Jinpeng Wang
54 papers · 2013–2026 · 13 top CS/AI conferences
Achievements
🏃 Academic Marathon (12)
🌍 Conference Polyglot (13)
🧭 Keyword Pioneer
🌉 Interdisciplinary Bridge
🐣 Hot Topic Early Bird
🤝 Dynamic Duo (15)
🔬 Deep Specialist (14)
🏆 Keyword Champion (3)
📈 Trend Setter
🚀 Conference Pioneer
⚡ Prolific Year (6)
🗃️ Keyword Collector (220)
❓ The Questioner
💎 Century Club (47)
🔥 Unstoppable (11)
Conferences
AAAI (19)
CVPR (7)
EMNLP (7)
ACL (5)
ICCV (3)
IJCNLP (3)
COLING (2)
NIPS (2)
WACV (2)
ACML (1)
CONLL (1)
ECCV (1)
ICLR (1)
Keywords
contrastive learning (10)
multimodal learning (7)
video retrieval (6)
self-supervised learning (5)
attention mechanism (5)
transfer learning (4)
data-to-text generation (4)
domain adaptation (4)
video representation learning (3)
representation learning (3)
video hashing (3)
video-text retrieval (3)
vision-language model (3)
multimodal large language model (3)
vision-language retrieval (3)
metric learning (2)
cross-modal learning (2)
temporal modeling (2)
text representation (2)
3d vision (2)
Papers
Suit the Remedy to the Retriever: Interpretable Query Optimization with Retriever Preference Alignment for Vision-Language Retrieval
AAAI 2026
Beyond Fully Random Masking: Attention-Guided Denoising and Optimization for Diffusion Language Models
ACL 2026
From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents
ACL 2026
HALoRA: Low-Rank Adaptation with Hierarchical Budget Allocation for Efficient Vision-Language Alignment
AAAI 2026
Towards Efficient Low-rate Image Compression with Frequency-aware Diffusion Prior Refinement
AAAI 2026
Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning
AAAI 2026
Imagine with Layout and Sketch: Enhancing Vision-Language Retrieval with Dual-Stream Multi-Modal Query Refinement
AAAI 2026
What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning
COLING 2025
Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval
AAAI 2025
EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models
AAAI 2025
Efficient Self-Supervised Video Hashing with Selective State Spaces
AAAI 2025
Modeling Uncertainty in Composed Image Retrieval via Probabilistic Embeddings
ACL 2025
PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter
CVPR 2025
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing
CVPR 2025
Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning
CVPR 2025
MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds
EMNLP 2025
Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression
ICCV 2025
Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning
ICCV 2025
DiffPC: Diffusion-based High Perceptual Fidelity Image Compression with Semantic Refinement
ICLR 2025
Multi-Energy Guided Image Translation with Stochastic Differential Equations for Near-Infrared Facial Expression Recognition
AAAI 2024
BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping
NIPS 2024
Hypergraph-Guided Disentangled Spectrum Transformer Networks for Near-Infrared Facial Expression Recognition
AAAI 2024
PREFER: Prompt Ensemble Learning via Feedback-Reflect-Refine
AAAI 2024
GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
AAAI 2024
Contrastive Masked Autoencoders for Self-Supervised Video Hashing
AAAI 2023
Evaluating Object Hallucination in Large Vision-Language Models
EMNLP 2023
Position-Guided Text Prompt for Vision-Language Pre-Training
CVPR 2023
All in One: Exploring Unified Video-Language Pre-Training
CVPR 2023
Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models
ICCV 2023
Video-Text Pre-training with Learned Regions for Retrieval
AAAI 2023
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval
ECCV 2022
Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning
AAAI 2022
Object-Aware Video-Language Pre-Training for Retrieval
CVPR 2022
Egocentric Video-Language Pretraining
NIPS 2022
Contrastive Quantization with Code Memory for Unsupervised Image Retrieval
AAAI 2022
ChartOCR: Data Extraction From Charts Images via a Deep Hybrid Framework
WACV 2021
Removing the Background by Adding the Background: Towards Background Robust Self-Supervised Video Representation Learning
CVPR 2021
Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion
AAAI 2021
Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval
AAAI 2021
Multi-Scale Adversarial Cross-Domain Detection with Robust Discriminative Learning
WACV 2020
Learning Semantic Correspondences from Noisy Data-text Pairs by Local-to-Global Alignments
COLING 2020
Improving Entity Linking by Modeling Latent Entity Type Information
AAAI 2020
A Simple Recipe towards Reducing Hallucination in Neural Surface Realisation
ACL 2019
Enhancing Neural Data-To-Text Generation Models with External Background Knowledge
IJCNLP 2019
Enhancing Neural Data-To-Text Generation Models with External Background Knowledge
EMNLP 2019
Aggregated Semantic Matching for Short Text Entity Linking
CONLL 2018
Data2Text Studio: Automated Text Generation from Structured Data
EMNLP 2018
Operation-guided Neural Networks for High Fidelity Data-To-Text Generation
EMNLP 2018
Learning Latent Semantic Annotations for Grounding Natural Language to Structured Data
EMNLP 2018
A Statistical Framework for Product Description Generation
IJCNLP 2017
Non-Linear Smoothed Transductive Network Embedding with Text Information
ACML 2016
User Based Aggregation for Biterm Topic Model
ACL 2015
User Based Aggregation for Biterm Topic Model
IJCNLP 2015
Mining New Business Opportunities: Identifying Trend related Products by Leveraging Commercial Intents from Microblogs
EMNLP 2013