Boqing Gong
76 papers · 2013–2025 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
🐣 Hot Topic Early Bird 🌍 Conference Polyglot (10) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (12)
🌉
Interdisciplinary Bridge
🏃
Academic Marathon
(12)
🧭
Keyword Pioneer
🏠
Conference Loyalist
(25)
🤝
Dynamic Duo
(12)
👑
Triple Crown
🏆
Grand Slam
🌱
Topic Pioneer
🔬
Deep Specialist
(11)
🔥
Unstoppable
(10)
🚀
Conference Pioneer
❓
The Questioner
(2)
⚡
Prolific Year
(14)
🗃️
Keyword Collector
(256)
💎
Century Club
(76)
📈
Trend Setter
Conferences
CVPR (25)
ICCV (14)
ICLR (10)
NIPS (9)
ECCV (6)
ICML (6)
EMNLP (2)
WACV (2)
AAAI (1)
AISTATS (1)
Top co-authors
Research topics
Keywords
domain adaptation
(9)
semantic segmentation
(9)
video understanding
(8)
domain generalization
(6)
transfer learning
(6)
object detection
(6)
contrastive learning
(5)
action recognition
(5)
neural network
(4)
few-shot learning
(4)
self-supervised learning
(4)
video representation learning
(3)
data augmentation
(3)
vision-language model
(3)
adversarial example
(3)
knowledge distillation
(3)
model calibration
(2)
zero-shot learning
(2)
model compression
(2)
curriculum learning
(2)
Papers
Scaling Up Temporal Domain Generalization via Temporal Experts Averaging
EMNLP 2025
Attention to Neural Plagiarism: Diffusion Models Can Plagiarize Your Copyrighted Images!
ICCV 2025
The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise
ICLR 2025
OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities
ICLR 2025
Epsilon-VAE: Denoising as Visual Decoding
ICML 2025
HypDAE: Hyperbolic Diffusion Autoencoders for Hierarchical Few-shot Image Generation
ICCV 2025
SITE: towards Spatial Intelligence Thorough Evaluation
ICCV 2025
VideoAds for Fast-Paced Video Understanding
ICCV 2025
BabyVLM: Data-Efficient Pretraining of VLMs Inspired by Infant Learning
ICCV 2025
When and How do negative prompts take effect?
ECCV 2024
Extending Video Masked Autoencoders to 128 frames
NIPS 2024
VideoPrism: A Foundational Visual Encoder for Video Understanding
ICML 2024
On Discrete Prompt Optimization for Diffusion Models
ICML 2024
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding
ICLR 2024
Language Model Beats Diffusion - Tokenizer is key to visual generation
ICLR 2024
Distilling Vision-Language Models on Millions of Videos
CVPR 2024
Instruct-Imagen: Image Generation with Multi-modal Instruction
CVPR 2024
Module-wise Adaptive Distillation for Multimodality Foundation Models
NIPS 2023
On Calibrating Semantic Segmentation Models: Analyses and an Algorithm
CVPR 2023
Video Timeline Modeling For News Story Understanding
NIPS 2023
Unified Visual Relationship Detection with Vision and Language Models
ICCV 2023
Surrogate Gap Minimization Improves Sharpness-Aware Training
ICLR 2022
Federated Multi-Target Domain Adaptation
WACV 2022
Contextualized Spatio-Temporal Contrastive Learning With Self-Supervision
CVPR 2022
Anti-Neuron Watermarking: Protecting Personal Data against Unauthorized Neural Networks
ECCV 2022
LESS: Label-Efficient Semantic Segmentation for LiDAR Point Clouds
ECCV 2022
When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations
ICLR 2022
Robust and Accurate Object Detection via Adversarial Learning
CVPR 2021
Complete & Label: A Domain Adaptation Approach to Semantic Segmentation of LiDAR Point Clouds
CVPR 2021
Adversarially Adaptive Normalization for Single Domain Generalization
CVPR 2021
Spatiotemporal Contrastive Video Representation Learning
CVPR 2021
MoViNets: Mobile Video Networks for Efficient Video Recognition
CVPR 2021
Large-Scale Meta-Learning with Continual Trajectory Shifting
ICML 2021
Analyzing Deep Neural Network's Transferability via Frechet Distance
WACV 2021
CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization
EMNLP 2021
A Lazy Approach to Long-Horizon Gradient-Based Meta-Learning
ICCV 2021
MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection
ICCV 2021
Contrastive Learning for Label Efficient Semantic Segmentation
ICCV 2021
On Model Calibration for Long-Tailed Object Detection and Instance Segmentation
NIPS 2021
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
NIPS 2021
Ranking Neural Checkpoints
CVPR 2021
Improving Object Detection with Selective Self-Supervised Self-Training
ECCV 2020
Neural Networks Are More Productive Teachers Than Human Raters: Active Mixup for Data-Efficient Knowledge Distillation From a Blackbox Model
CVPR 2020
Adversarial Examples Improve Image Recognition
CVPR 2020
Rethinking Class-Balanced Methods for Long-Tailed Visual Recognition From a Domain Adaptation Perspective
CVPR 2020
PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation
CVPR 2020
Open Compound Domain Adaptation
CVPR 2020
MACER: Attack-free and Scalable Robust Training via Maximizing Certified Radius
ICLR 2020
Constructing Self-Motivated Pyramid Curriculums for Cross-Domain Semantic Segmentation: A Non-Adversarial Approach
ICCV 2019
Domain Randomization and Pyramid Consistency: Simulation-to-Real Generalization Without Accessing Target Domain Data
ICCV 2019
Not All Frames Are Equal: Weakly-Supervised Video Grounding With Contextual Similarity and Visual Clustering Losses
CVPR 2019
Large-Scale Long-Tailed Recognition in an Open World
CVPR 2019
DHER: Hindsight Experience Replay for Dynamic Goals
ICLR 2019
CAMOU: Learning Physical Vehicle Camouflages to Adversarially Attack Detectors in the Wild
ICLR 2019
NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks
ICML 2019
A Robust Zero-Sum Game Framework for Pool-based Active Learning
AISTATS 2019
Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation
AAAI 2019
A Fast and Accurate One-Stage Approach to Visual Grounding
ICCV 2019
Improving the Improved Training of Wasserstein GANs: A Consistency Term and Its Dual Effect
ICLR 2018
Synthesized Policies for Transfer and Adaptation across Tasks and Environments
NIPS 2018
Deep Face Detector Adaptation Without Negative Transfer or Catastrophic Forgetting
CVPR 2018
Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning
CVPR 2018
How Local is the Local Diversity? Reinforcing Sequential Determinantal Point Processes with Dynamic Ground Sets for Supervised Video Summarization
ECCV 2018
Improving Sequential Determinantal Point Processes for Supervised Video Summarization
ECCV 2018
End-to-End Learning of Motion Representation for Video Understanding
CVPR 2018
Query-Focused Video Summarization: Dataset, Evaluation, and a Memory Network Based Approach
CVPR 2017
Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes
ICCV 2017
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
ICCV 2017
Improving Facial Attribute Prediction Using Semantic Segmentation
CVPR 2017
Fast Zero-Shot Image Tagging
CVPR 2016
Learning Attributes Equals Multi-Source Domain Generalization
CVPR 2016
Synthesized Classifiers for Zero-Shot Learning
CVPR 2016
Improved Dropout for Shallow and Deep Learning
NIPS 2016
Diverse Sequential Subset Selection for Supervised Video Summarization
NIPS 2014
Reshaping Visual Datasets for Domain Adaptation
NIPS 2013
Connecting the Dots with Landmarks: Discriminatively Learning Domain-Invariant Features for Unsupervised Domain Adaptation
ICML 2013