Jinpeng Wang

54 papers · 2013–2026 · 13 top CS/AI conferences

Achievements

🏃 Academic Marathon (12) 🌍 Conference Polyglot (13) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🤝 Dynamic Duo (15) 🔬 Deep Specialist (14) 🏆 Keyword Champion (3) 📈 Trend Setter 🚀 Conference Pioneer ⚡ Prolific Year (6) 🗃️ Keyword Collector (220) ❓ The Questioner 💎 Century Club (47) 🔥 Unstoppable (11)

Conferences

AAAI (19) CVPR (7) EMNLP (7) ACL (5) ICCV (3) IJCNLP (3) COLING (2) NIPS (2) WACV (2) ACML (1) CONLL (1) ECCV (1) ICLR (1)

Top co-authors

Bin Chen (17) Shu-Tao Xia (15) Chin-Yew Lin (11) Tao Dai (7) Yaowei Wang (7) GuangHao Meng (6) Mike Zheng Shou (5) Niu Lian (4) Rui Yan (4) Feng Nie (4)

Keywords

contrastive learning (10) multimodal learning (7) video retrieval (6) self-supervised learning (5) attention mechanism (5) transfer learning (4) data-to-text generation (4) domain adaptation (4) video representation learning (3) representation learning (3) video hashing (3) video-text retrieval (3) vision-language model (3) multimodal large language model (3) vision-language retrieval (3) metric learning (2) cross-modal learning (2) temporal modeling (2) text representation (2) 3d vision (2)

Papers

Suit the Remedy to the Retriever: Interpretable Query Optimization with Retriever Preference Alignment for Vision-Language Retrieval AAAI 2026
Beyond Fully Random Masking: Attention-Guided Denoising and Optimization for Diffusion Language Models ACL 2026
From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents ACL 2026
HALoRA: Low-Rank Adaptation with Hierarchical Budget Allocation for Efficient Vision-Language Alignment AAAI 2026
Towards Efficient Low-rate Image Compression with Frequency-aware Diffusion Prior Refinement AAAI 2026
Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning AAAI 2026
Imagine with Layout and Sketch: Enhancing Vision-Language Retrieval with Dual-Stream Multi-Modal Query Refinement AAAI 2026
What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning COLING 2025
Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval AAAI 2025
EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models AAAI 2025
Efficient Self-Supervised Video Hashing with Selective State Spaces AAAI 2025
Modeling Uncertainty in Composed Image Retrieval via Probabilistic Embeddings ACL 2025
PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter CVPR 2025
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing CVPR 2025
Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning CVPR 2025
MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds EMNLP 2025
Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression ICCV 2025
Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning ICCV 2025
DiffPC: Diffusion-based High Perceptual Fidelity Image Compression with Semantic Refinement ICLR 2025
Multi-Energy Guided Image Translation with Stochastic Differential Equations for Near-Infrared Facial Expression Recognition AAAI 2024
BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping NIPS 2024
Hypergraph-Guided Disentangled Spectrum Transformer Networks for Near-Infrared Facial Expression Recognition AAAI 2024
PREFER: Prompt Ensemble Learning via Feedback-Reflect-Refine AAAI 2024
GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval AAAI 2024
Contrastive Masked Autoencoders for Self-Supervised Video Hashing AAAI 2023
Evaluating Object Hallucination in Large Vision-Language Models EMNLP 2023
Position-Guided Text Prompt for Vision-Language Pre-Training CVPR 2023
All in One: Exploring Unified Video-Language Pre-Training CVPR 2023
Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models ICCV 2023
Video-Text Pre-training with Learned Regions for Retrieval AAAI 2023
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval ECCV 2022
Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning AAAI 2022
Object-Aware Video-Language Pre-Training for Retrieval CVPR 2022
Egocentric Video-Language Pretraining NIPS 2022
Contrastive Quantization with Code Memory for Unsupervised Image Retrieval AAAI 2022
ChartOCR: Data Extraction From Charts Images via a Deep Hybrid Framework WACV 2021
Removing the Background by Adding the Background: Towards Background Robust Self-Supervised Video Representation Learning CVPR 2021
Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion AAAI 2021
Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval AAAI 2021
Multi-Scale Adversarial Cross-Domain Detection with Robust Discriminative Learning WACV 2020
Learning Semantic Correspondences from Noisy Data-text Pairs by Local-to-Global Alignments COLING 2020
Improving Entity Linking by Modeling Latent Entity Type Information AAAI 2020
A Simple Recipe towards Reducing Hallucination in Neural Surface Realisation ACL 2019
Enhancing Neural Data-To-Text Generation Models with External Background Knowledge IJCNLP 2019
Enhancing Neural Data-To-Text Generation Models with External Background Knowledge EMNLP 2019
Aggregated Semantic Matching for Short Text Entity Linking CONLL 2018
Data2Text Studio: Automated Text Generation from Structured Data EMNLP 2018
Operation-guided Neural Networks for High Fidelity Data-To-Text Generation EMNLP 2018
Learning Latent Semantic Annotations for Grounding Natural Language to Structured Data EMNLP 2018
A Statistical Framework for Product Description Generation IJCNLP 2017
Non-Linear Smoothed Transductive Network Embedding with Text Information ACML 2016
User Based Aggregation for Biterm Topic Model ACL 2015
User Based Aggregation for Biterm Topic Model IJCNLP 2015
Mining New Business Opportunities: Identifying Trend related Products by Leveraging Commercial Intents from Microblogs EMNLP 2013