Pan Zhou
114 papers · 2017–2026 · 17 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (16) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (7) π£ Hot Topic Early Bird
π£
Hot Topic Early Bird
π
Conference Polyglot
(16)
π
Academic Marathon
(8)
π
Conference Loyalist
(21)
π¬
Deep Specialist
(19)
π
Grand Slam
π€
Dynamic Duo
(26)
π₯
Mega-Team
(20)
π
Triple Crown
π
Keyword Champion
ποΈ
Keyword Collector
(405)
β
The Questioner
(4)
β‘
Prolific Year
(6)
π
Conference Pioneer
π
Century Club
(108)
π₯
Unstoppable
(9)
Conferences
AAAI (21)
CVPR (19)
NIPS (16)
ICLR (10)
EMNLP (10)
ICML (7)
ICCV (7)
ECCV (7)
ACL (7)
COLING (2)
IJCAI (2)
EACL (1)
AISTATS (1)
INTERSPEECH (1)
JMLR (1)
NAACL (1)
UAI (1)
Top co-authors
Research topics
Keywords
large language model
(12)
video understanding
(11)
temporal sentence grounding
(8)
adversarial attack
(8)
adversarial learning
(7)
cross-modal learning
(6)
diffusion model
(6)
multimodal learning
(6)
unsupervised learning
(5)
graph neural network
(5)
backdoor attack
(5)
few-shot learning
(5)
semantic segmentation
(4)
vision transformer
(4)
image classification
(4)
stochastic gradient
(3)
object detection
(3)
stochastic gradient descent
(3)
convergence analysis
(3)
contrastive learning
(3)
Papers
CrowdSelect: SyntheticInstruction Data Selection with Multi-LLM Wisdom
EACL 2026
SafeAgent: Safeguarding LLM Agents via an Automated Risk Simulator
ACL 2026
LearnerCoMPASS: Intelligent Tutoring System with Dynamic Cognitive Diagnosis and Multi-Model Path Planning
ACL 2026
DRFGD: Disentangled Representation-Focused Generative Defense for Attack-Tolerant Cross-Modal Hashing
AAAI 2026
ReAlign: Text-to-Motion Generation via Step-Aware Reward-Guided Alignment
AAAI 2026
Revisiting the Canonicalization for Fast and Accurate Crystal Tensor Property Prediction
AAAI 2026
Graph Agent Network: Empowering Nodes with Inference Capabilities for Adversarial Resilience
AAAI 2025
Grimm: A Plug-and-Play Perturbation Rectifier for Graph Neural Networks Defending Against Poisoning Attacks
AAAI 2025
Misalignment Attack on Text-to-Image Models via Text Embedding Optimization and Inversion
EMNLP 2025
Merger-as-a-Stealer: Stealing Targeted PII from Aligned LLMs with Model Merging
EMNLP 2025
Stealing Training Data from Large Language Models in Decentralized Training through Activation Inversion Attack
ACL 2025
Merge Hijacking: Backdoor Attacks to Model Merging of Large Language Models
ACL 2025
The Impact of Large Language Models in Academia: from Writing to Speaking
ACL 2025
Learning from Few Samples: A Novel Approach for High-Quality Malcode Generation
EMNLP 2025
BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models
CVPR 2025
HPS: Hard Preference Sampling for Human Preference Alignment
ICML 2025
GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding
ICLR 2025
Interleaved Scene Graphs for Interleaved Text-and-Image Generation Assessment
ICLR 2025
Probabilistic Prototype Calibration of Vision-language Models for Generalized Few-shot Semantic Segmentation
ICCV 2025
Memory-Efficient 4-bit Preconditioned Stochastic Optimization
ICCV 2025
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces
ICCV 2025
Probabilistic Interactive 3D Segmentation with Hierarchical Neural Processes
ICML 2025
Collaborative Tree Search for Enhancing Embodied Multi-Agent Collaboration
CVPR 2025
Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
ICLR 2025
CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation
ICLR 2025
Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network
AAAI 2025
Sparse Enhanced Network: An Adversarial Generation Method for Robust Augmentation in Sequential Recommendation
AAAI 2024
MVGamba: Unify 3D Content Generation as State Space Sequence Modeling
NIPS 2024
Pandora's Box: Towards Building Universal Attackers against Real-World Large Vision-Language Models
NIPS 2024
Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation
NIPS 2024
LOVA3: Learning to Visual Question Answering, Asking and Assessment
NIPS 2024
4-bit Shampoo for Memory-Efficient Network Training
NIPS 2024
Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language
AAAI 2024
Unsupervised Domain Adaptative Temporal Sentence Localization with Mutual Information Maximization
AAAI 2024
Manifold Constraints for Imperceptible Adversarial Attacks on Point Clouds
AAAI 2024
Towards Inductive Robustness: Distilling and Fostering Wave-Induced Resonance in Transductive GCNs against Graph Adversarial Attacks
AAAI 2024
What Makes Good Collaborative Views? Contrastive Mutual Information Maximization for Multi-Agent Perception
AAAI 2024
Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?
ACL 2024
MoExtend: Tuning New Experts for Modality and Task Extension
ACL 2024
Towards Robust Temporal Activity Localization Learning with Noisy Labels
COLING 2024
Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation
CVPR 2024
Friendly Sharpness-Aware Minimization
CVPR 2024
Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior
CVPR 2024
Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World
CVPR 2024
MetaCloak: Preventing Unauthorized Subject-driven Text-to-image Diffusion-based Synthesis via Meta-learning
CVPR 2024
Few-shot Learner Parameterization by Diffusion Time-steps
CVPR 2024
InceptionNeXt: When Inception Meets ConvNeXt
CVPR 2024
Diffusion Time-step Curriculum for One Image to 3D Generation
CVPR 2024
Efficient Cascaded Multiscale Adaptive Network for Image Restoration
ECCV 2024
GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator
ECCV 2024
Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack
ECCV 2024
Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective
ECCV 2024
CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
EMNLP 2024
Virtual Context Enhancing Jailbreak Attacks with Special Token Injection
EMNLP 2024
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
ICLR 2024
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark
ICML 2024
Position: Exploring the Robustness of Pipeline-Parallelism-Based Decentralized Training
ICML 2024
Win: Weight-Decay-Integrated Nesterov Acceleration for Faster Network Training
JMLR 2024
ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection
NIPS 2023
Annotations Are Not All You Need: A Cross-modal Knowledge Transfer Network for Unsupervised Temporal Sentence Grounding
EMNLP 2023
3DHacker: Spectrum-based Decision Boundary Generation for Hard-label 3D Point Cloud Attack
ICCV 2023
STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition
ICCV 2023
Hypotheses Tree Building for One-Shot Temporal Sentence Localization
AAAI 2023
Distantly-Supervised Named Entity Recognition with Adaptive Teacher Learning and Fine-Grained Student Ensemble
AAAI 2023
Win: Weight-Decay-Integrated Nesterov Acceleration for Adaptive Gradient Algorithms
ICLR 2023
You Can Ground Earlier Than See: An Effective and Efficient Pipeline for Temporal Sentence Grounding in Compressed Videos
CVPR 2023
You Are Catching My Attention: Are Vision Transformers Bad Learners Under Backdoor Attacks?
CVPR 2023
LPT: Long-tailed Prompt Tuning for Image Classification
ICLR 2023
Towards Understanding Why Mask Reconstruction Pretraining Helps in Downstream Tasks
ICLR 2023
Position-Guided Text Prompt for Vision-Language Pre-Training
CVPR 2023
Masked Diffusion Transformer is a Strong Image Synthesizer
ICCV 2023
Inception Transformer
NIPS 2022
Self-Promoted Supervision for Few-Shot Transformer
ECCV 2022
MetaFormer Is Actually What You Need for Vision
CVPR 2022
DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition
ECCV 2022
Video Graph Transformer for Video Question Answering
ECCV 2022
Rethinking the Video Sampling and Reasoning Strategies for Temporal Sentence Grounding
EMNLP 2022
Unsupervised Temporal Video Grounding with Deep Semantic Clustering
AAAI 2022
Exploring Motion and Appearance Information for Temporal Sentence Grounding
AAAI 2022
Memory-Guided Semantic Learning Network for Temporal Sentence Grounding
AAAI 2022
Bandits for Structure Perturbation-Based Black-Box Attacks To Graph Neural Networks With Theoretical Guarantees
CVPR 2022
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
EMNLP 2021
Context-Aware Biaffine Localizing Network for Temporal Sentence Grounding
CVPR 2021
Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition
AAAI 2021
Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation
AAAI 2021
Progressively Guide to Attend: An Iterative Alignment Framework for Temporal Sentence Grounding
EMNLP 2021
Prototypical Contrastive Learning of Unsupervised Representations
ICLR 2021
Adaptive Proposal Generation Network for Temporal Sentence Localization in Videos
EMNLP 2021
Task similarity aware meta learning: theory-inspired improvement on MAML
UAI 2021
How Important is the Train-Validation Split in Meta-Learning?
ICML 2021
Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond
NIPS 2021
F2Net: Learning to Focus on the Foreground for Unsupervised Video Object Segmentation
AAAI 2021
Spatiotemporal Graph Neural Network based Mask Reconstruction for Video Object Segmentation
AAAI 2021
TRS: Transferability Reduced Ensemble via Promoting Gradient Diversity and Model Smoothness
NIPS 2021
A Theory-Driven Self-Labeling Refinement Method for Contrastive Representation Learning
NIPS 2021
Generating Robust Audio Adversarial Examples with Temporal Dependency
IJCAI 2020
Towards Theoretically Understanding Why Sgd Generalizes Better Than Adam in Deep Learning
NIPS 2020
Theory-Inspired Path-Regularized Differential Network Architecture Search
NIPS 2020
Reasoning Step-by-Step: Temporal Sentence Localization in Videos via Deep Rectification-Modulation Network
COLING 2020
Hybrid Stochastic-Deterministic Minibatch Proximal Gradient: Less-Than-Single-Pass Optimization with Nearly Optimal Generalization
ICML 2020
Improving GAN Training with Probability Ratio Clipping and Sample Reweighting
NIPS 2020
An Online Attention-Based Model for Speech Recognition
INTERSPEECH 2019
Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds
AISTATS 2019
Generalized Majorization-Minimization for Non-Convex Optimization
IJCAI 2019
Efficient Meta Learning via Minibatch Proximal Update
NIPS 2019
MHP-VOS: Multiple Hypotheses Propagation for Video Object Segmentation
CVPR 2019
Adversarial Category Alignment Network for Cross-domain Sentiment Classification
NAACL 2019
New Insight into Hybrid Stochastic Gradient Descent: Beyond With-Replacement Sampling and Convexity
NIPS 2018
Understanding Generalization and Optimization Performance of Deep CNNs
ICML 2018
Empirical Risk Landscape Analysis for Understanding Deep Neural Networks
ICLR 2018
Efficient Stochastic Gradient Hard Thresholding
NIPS 2018
Deep Adversarial Subspace Clustering
CVPR 2018
Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-Identification
ICCV 2017
Outlier-Robust Tensor PCA
CVPR 2017