conftrace_

Pan Zhou

114 papers · 2017–2026 · 17 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+16 more ↓

🗺️ Taxonomy Completionist (16) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🐣 Hot Topic Early Bird

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (16) 🏃 Academic Marathon (8) 🏠 Conference Loyalist (21) 🔬 Deep Specialist (19) 🏆 Grand Slam 🤝 Dynamic Duo (26) 👥 Mega-Team (20) 👑 Triple Crown 🏆 Keyword Champion 🗃️ Keyword Collector (405) ❓ The Questioner (4) ⚡ Prolific Year (6) 🚀 Conference Pioneer 💎 Century Club (108) 🔥 Unstoppable (9)

Conferences

AAAI (21) CVPR (19) NIPS (16) ICLR (10) EMNLP (10) ICML (7) ICCV (7) ECCV (7) ACL (7) COLING (2) IJCAI (2) EACL (1) AISTATS (1) INTERSPEECH (1) JMLR (1) NAACL (1) UAI (1)

Top co-authors

Daizong Liu (27) Shuicheng Yan (17) Xiaoye Qu (14) Yu Cheng (13) Jiashi Feng (13) Lichao Sun (12) Xiang Fang (9) Yao Wan (9) Jianfeng Dong (7) Keke Tang (7)

Research topics

Keywords

large language model (12) video understanding (11) temporal sentence grounding (8) adversarial attack (8) adversarial learning (7) cross-modal learning (6) diffusion model (6) multimodal learning (6) unsupervised learning (5) graph neural network (5) backdoor attack (5) few-shot learning (5) semantic segmentation (4) vision transformer (4) image classification (4) stochastic gradient (3) object detection (3) stochastic gradient descent (3) convergence analysis (3) contrastive learning (3)

Papers

CrowdSelect: SyntheticInstruction Data Selection with Multi-LLM Wisdom EACL 2026 SafeAgent: Safeguarding LLM Agents via an Automated Risk Simulator ACL 2026 LearnerCoMPASS: Intelligent Tutoring System with Dynamic Cognitive Diagnosis and Multi-Model Path Planning ACL 2026 DRFGD: Disentangled Representation-Focused Generative Defense for Attack-Tolerant Cross-Modal Hashing AAAI 2026 ReAlign: Text-to-Motion Generation via Step-Aware Reward-Guided Alignment AAAI 2026 Revisiting the Canonicalization for Fast and Accurate Crystal Tensor Property Prediction AAAI 2026 Graph Agent Network: Empowering Nodes with Inference Capabilities for Adversarial Resilience AAAI 2025 Grimm: A Plug-and-Play Perturbation Rectifier for Graph Neural Networks Defending Against Poisoning Attacks AAAI 2025 Misalignment Attack on Text-to-Image Models via Text Embedding Optimization and Inversion EMNLP 2025 Merger-as-a-Stealer: Stealing Targeted PII from Aligned LLMs with Model Merging EMNLP 2025 Stealing Training Data from Large Language Models in Decentralized Training through Activation Inversion Attack ACL 2025 Merge Hijacking: Backdoor Attacks to Model Merging of Large Language Models ACL 2025 The Impact of Large Language Models in Academia: from Writing to Speaking ACL 2025 Learning from Few Samples: A Novel Approach for High-Quality Malcode Generation EMNLP 2025 BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models CVPR 2025 HPS: Hard Preference Sampling for Human Preference Alignment ICML 2025 GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding ICLR 2025 Interleaved Scene Graphs for Interleaved Text-and-Image Generation Assessment ICLR 2025 Probabilistic Prototype Calibration of Vision-language Models for Generalized Few-shot Semantic Segmentation ICCV 2025 Memory-Efficient 4-bit Preconditioned Stochastic Optimization ICCV 2025 Zeroth-Order Fine-Tuning of LLMs in Random Subspaces ICCV 2025 Probabilistic Interactive 3D Segmentation with Hierarchical Neural Processes ICML 2025 Collaborative Tree Search for Enhancing Embodied Multi-Agent Collaboration CVPR 2025 Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning ICLR 2025 CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation ICLR 2025 Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network AAAI 2025 Sparse Enhanced Network: An Adversarial Generation Method for Robust Augmentation in Sequential Recommendation AAAI 2024 MVGamba: Unify 3D Content Generation as State Space Sequence Modeling NIPS 2024 Pandora's Box: Towards Building Universal Attackers against Real-World Large Vision-Language Models NIPS 2024 Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation NIPS 2024 LOVA3: Learning to Visual Question Answering, Asking and Assessment NIPS 2024 4-bit Shampoo for Memory-Efficient Network Training NIPS 2024 Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language AAAI 2024 Unsupervised Domain Adaptative Temporal Sentence Localization with Mutual Information Maximization AAAI 2024 Manifold Constraints for Imperceptible Adversarial Attacks on Point Clouds AAAI 2024 Towards Inductive Robustness: Distilling and Fostering Wave-Induced Resonance in Transductive GCNs against Graph Adversarial Attacks AAAI 2024 What Makes Good Collaborative Views? Contrastive Mutual Information Maximization for Multi-Agent Perception AAAI 2024 Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning? ACL 2024 MoExtend: Tuning New Experts for Modality and Task Extension ACL 2024 Towards Robust Temporal Activity Localization Learning with Noisy Labels COLING 2024 Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation CVPR 2024 Friendly Sharpness-Aware Minimization CVPR 2024 Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior CVPR 2024 Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World CVPR 2024 MetaCloak: Preventing Unauthorized Subject-driven Text-to-image Diffusion-based Synthesis via Meta-learning CVPR 2024 Few-shot Learner Parameterization by Diffusion Time-steps CVPR 2024 InceptionNeXt: When Inception Meets ConvNeXt CVPR 2024 Diffusion Time-step Curriculum for One Image to 3D Generation CVPR 2024 Efficient Cascaded Multiscale Adaptive Network for Image Restoration ECCV 2024 GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator ECCV 2024 Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack ECCV 2024 Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective ECCV 2024 CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code EMNLP 2024 Virtual Context Enhancing Jailbreak Attacks with Special Token Injection EMNLP 2024 MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use ICLR 2024 MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark ICML 2024 Position: Exploring the Robustness of Pipeline-Parallelism-Based Decentralized Training ICML 2024 Win: Weight-Decay-Integrated Nesterov Acceleration for Faster Network Training JMLR 2024 ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection NIPS 2023 Annotations Are Not All You Need: A Cross-modal Knowledge Transfer Network for Unsupervised Temporal Sentence Grounding EMNLP 2023 3DHacker: Spectrum-based Decision Boundary Generation for Hard-label 3D Point Cloud Attack ICCV 2023 STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition ICCV 2023 Hypotheses Tree Building for One-Shot Temporal Sentence Localization AAAI 2023 Distantly-Supervised Named Entity Recognition with Adaptive Teacher Learning and Fine-Grained Student Ensemble AAAI 2023 Win: Weight-Decay-Integrated Nesterov Acceleration for Adaptive Gradient Algorithms ICLR 2023 You Can Ground Earlier Than See: An Effective and Efficient Pipeline for Temporal Sentence Grounding in Compressed Videos CVPR 2023 You Are Catching My Attention: Are Vision Transformers Bad Learners Under Backdoor Attacks? CVPR 2023 LPT: Long-tailed Prompt Tuning for Image Classification ICLR 2023 Towards Understanding Why Mask Reconstruction Pretraining Helps in Downstream Tasks ICLR 2023 Position-Guided Text Prompt for Vision-Language Pre-Training CVPR 2023 Masked Diffusion Transformer is a Strong Image Synthesizer ICCV 2023 Inception Transformer NIPS 2022 Self-Promoted Supervision for Few-Shot Transformer ECCV 2022 MetaFormer Is Actually What You Need for Vision CVPR 2022 DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition ECCV 2022 Video Graph Transformer for Video Question Answering ECCV 2022 Rethinking the Video Sampling and Reasoning Strategies for Temporal Sentence Grounding EMNLP 2022 Unsupervised Temporal Video Grounding with Deep Semantic Clustering AAAI 2022 Exploring Motion and Appearance Information for Temporal Sentence Grounding AAAI 2022 Memory-Guided Semantic Learning Network for Temporal Sentence Grounding AAAI 2022 Bandits for Structure Perturbation-Based Black-Box Attacks To Graph Neural Networks With Theoretical Guarantees CVPR 2022 Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition EMNLP 2021 Context-Aware Biaffine Localizing Network for Temporal Sentence Grounding CVPR 2021 Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition AAAI 2021 Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation AAAI 2021 Progressively Guide to Attend: An Iterative Alignment Framework for Temporal Sentence Grounding EMNLP 2021 Prototypical Contrastive Learning of Unsupervised Representations ICLR 2021 Adaptive Proposal Generation Network for Temporal Sentence Localization in Videos EMNLP 2021 Task similarity aware meta learning: theory-inspired improvement on MAML UAI 2021 How Important is the Train-Validation Split in Meta-Learning? ICML 2021 Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond NIPS 2021 F2Net: Learning to Focus on the Foreground for Unsupervised Video Object Segmentation AAAI 2021 Spatiotemporal Graph Neural Network based Mask Reconstruction for Video Object Segmentation AAAI 2021 TRS: Transferability Reduced Ensemble via Promoting Gradient Diversity and Model Smoothness NIPS 2021 A Theory-Driven Self-Labeling Refinement Method for Contrastive Representation Learning NIPS 2021 Generating Robust Audio Adversarial Examples with Temporal Dependency IJCAI 2020 Towards Theoretically Understanding Why Sgd Generalizes Better Than Adam in Deep Learning NIPS 2020 Theory-Inspired Path-Regularized Differential Network Architecture Search NIPS 2020 Reasoning Step-by-Step: Temporal Sentence Localization in Videos via Deep Rectification-Modulation Network COLING 2020 Hybrid Stochastic-Deterministic Minibatch Proximal Gradient: Less-Than-Single-Pass Optimization with Nearly Optimal Generalization ICML 2020 Improving GAN Training with Probability Ratio Clipping and Sample Reweighting NIPS 2020 An Online Attention-Based Model for Speech Recognition INTERSPEECH 2019 Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds AISTATS 2019 Generalized Majorization-Minimization for Non-Convex Optimization IJCAI 2019 Efficient Meta Learning via Minibatch Proximal Update NIPS 2019 MHP-VOS: Multiple Hypotheses Propagation for Video Object Segmentation CVPR 2019 Adversarial Category Alignment Network for Cross-domain Sentiment Classification NAACL 2019 New Insight into Hybrid Stochastic Gradient Descent: Beyond With-Replacement Sampling and Convexity NIPS 2018 Understanding Generalization and Optimization Performance of Deep CNNs ICML 2018 Empirical Risk Landscape Analysis for Understanding Deep Neural Networks ICLR 2018 Efficient Stochastic Gradient Hard Thresholding NIPS 2018 Deep Adversarial Subspace Clustering CVPR 2018 Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-Identification ICCV 2017 Outlier-Robust Tensor PCA CVPR 2017