Bernard Ghanem
147 papers · 2013–2026 · 15 top CS/AI conferences
Achievements
Taxonomy Completionist (16)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (5)
Conference Polyglot (14)
Keyword Trendsetter Combo (3)
Conference Loyalist (52)
Triple Crown
Grand Slam
Deep Specialist (21)
Keyword Champion (2)
Dynamic Duo (22)
Mega-Team (100)
Century Club (144)
Trend Setter
Conference Pioneer
Prolific Year (29)
Unstoppable (11)
The Questioner (7)
Keyword Collector (571)
Conferences
CVPR (52)
ICCV (31)
ECCV (18)
NIPS (11)
AAAI (9)
ICLR (7)
WACV (4)
EMNLP (3)
ICML (3)
ACL (2)
EACL (2)
RSS (2)
CORL (1)
MICCAI (1)
UAI (1)
Research topics
Keywords
video understanding (13)
continual learning (8)
point cloud (7)
action recognition (7)
semantic segmentation (6)
large language model (6)
neural network (6)
generative adversarial network (5)
object detection (5)
sparse representation (5)
3d reconstruction (5)
autonomous driving (5)
diffusion model (5)
online learning (4)
transfer learning (4)
catastrophic forgetting (4)
image generation (4)
video generation (4)
adversarial attack (4)
multimodal learning (4)
Papers
Multimodal Safety Evaluation in Generative Agent Social Simulations
ACL 2026
AraLingBench: A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models
EACL 2026
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale
EACL 2026
Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization
CVPR 2025
BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
CVPR 2025
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes
CVPR 2025
HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction
ICCV 2025
Test-Time Adaptation for Combating Missing Modalities in Egocentric Videos
ICLR 2025
UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields
ICCV 2025
Diffusion-Based Imaginative Coordination for Bimanual Manipulation
ICCV 2025
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding
ICCV 2025
ResidualViT for Efficient Temporally Dense Video Encoding
ICCV 2025
SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
ICCV 2025
MatchDiffusion: Training-free Generation of Match-Cuts
ICCV 2025
MOLE: Metadata Extraction and Validation in Scientific Papers Using LLMs
EMNLP 2025
RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything
ICLR 2025
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
CVPR 2025
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models
AAAI 2025
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents
ACL 2025
Boundary Denoising for Video Activity Localization
ICLR 2024
SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material Estimation
NIPS 2024
Can Large Language Model Agents Simulate Human Trust Behavior?
NIPS 2024
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
NIPS 2024
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning
NIPS 2024
SimCS: Simulation for Domain Incremental Online Continual Segmentation
AAAI 2024
Privacy-Preserving Optics for Enhancing Protection in Face De-Identification
CVPR 2024
SPAD: Spatially Aware Multi-View Diffusers
CVPR 2024
Tune-An-Ellipse: CLIP Has Potential to Find What You Want
CVPR 2024
Dr2Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning
CVPR 2024
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
CVPR 2024
Towards Automated Movie Trailer Generation
CVPR 2024
GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering
CVPR 2024
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
CVPR 2024
DATENeRF: Depth-Aware Text-based Editing of NeRFs
ECCV 2024
TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks
ECCV 2024
ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders
ECCV 2024
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders
ECCV 2024
On Pretraining Data Diversity for Self-Supervised Learning
ECCV 2024
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
ECCV 2024
Model Merging and Safety Alignment: One Bad Model Spoils the Bunch
EMNLP 2024
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
ICLR 2024
Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation
ICLR 2024
Evaluation of Test-Time Adaptation Under Computational Time Constraints
ICML 2024
Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
ICML 2024
FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
MICCAI 2024
Learning to Read Analog Gauges from Synthetic Data
WACV 2024
Active Learning for Single-Stage Object Detection in UAV Images
WACV 2024
StyleAvatar: Stylizing Animatable Head Avatars
WACV 2024
FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model
ICCV 2023
Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right?
ICCV 2023
Re-ReND: Real-Time Rendering of NeRFs across Devices
ICCV 2023
Combating Mode Collapse via Offline Manifold Entropy Estimation
AAAI 2023
A Unified Continual Learning Framework with General Parameter-Efficient Tuning
ICCV 2023
Learning to Identify Critical States for Reinforcement Learning from Videos
ICCV 2023
Automatic Animation of Hair Blowing in Still Portrait Photos
ICCV 2023
Localizing Moments in Long Video Via Multimodal Guidance
ICCV 2023
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries
ICCV 2023
Exploring Open-Vocabulary Semantic Segmentation from CLIP Vision Encoder Distillation Only
ICCV 2023
AdaptiveMix: Improving GAN Training via Feature Space Shrinkage
CVPR 2023
Computationally Budgeted Continual Learning: What Does Matter?
CVPR 2023
Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
CVPR 2023
Where Is My Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
CVPR 2023
How To Not Train Your Dragon: Training-free Embodied Object Goal Navigation with Semantic Frontiers
RSS 2023
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society
NIPS 2023
Large-Capacity and Flexible Video Steganography via Invertible Neural Network
CVPR 2023
Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding
ICLR 2023
Real-Time Evaluation in Online Continual Learning: A New Hope
CVPR 2023
NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation
CVPR 2023
PIVOT: Prompting for Video Continual Learning
CVPR 2023
Dynamically Masked Discriminator for GANs
NIPS 2023
PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies
NIPS 2022
Egocentric Video-Language Pretraining
NIPS 2022
vCLIMB: A Novel Video Class Incremental Learning Benchmark
CVPR 2022
Spatio-Temporal Relation Modeling for Few-Shot Action Recognition
CVPR 2022
Real-Time Hyperspectral Imaging in Hardware via Trained Metasurface Encoders
CVPR 2022
Robust Optimization As Data Augmentation for Large-Scale Graphs
CVPR 2022
SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation
AAAI 2022
Combating Adversaries with Anti-adversaries
AAAI 2022
Data Dependent Randomized Smoothing
UAI 2022
MAD: A Scalable Dataset for Language Grounding in Videos From Movie Audio Descriptions
CVPR 2022
3DeformRS: Certifying Spatial Deformations on Point Clouds
CVPR 2022
DeformRS: Certifying Input Deformations with Randomized Smoothing
AAAI 2022
MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
ECCV 2022
On the Robustness of Quality Measures for GANs
ECCV 2022
R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning
ECCV 2022
End-to-End Active Speaker Detection
ECCV 2022
Ego4D: Around the World in 3,000 Hours of Egocentric Video
CVPR 2022
Learning To Cut by Watching Movies
ICCV 2021
RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization
WACV 2021
Training Graph Neural Networks with 1000 Layers
ICML 2021
ASSANet: An Anisotropic Separable Set Abstraction for Efficient Point Cloud Representation Learning
NIPS 2021
Low-Fidelity Video Encoder Optimization for Temporal Action Localization
NIPS 2021
Relation-aware Video Reading Comprehension for Temporal Language Grounding
EMNLP 2021
Video Self-Stitching Graph Network for Temporal Action Localization
ICCV 2021
MVTN: Multi-View Transformation Network for 3D Shape Recognition
ICCV 2021
MAAS: Multi-Modal Assignation for Active Speaker Detection
ICCV 2021
Boundary-Sensitive Pre-Training for Temporal Localization in Videos
ICCV 2021
High Quality Disparity Remapping With Two-Stage Warping
ICCV 2021
PU-GCN: Point Cloud Upsampling Using Graph Convolutional Networks
CVPR 2021
G-TAD: Sub-Graph Localization for Temporal Action Detection
CVPR 2020
SADA: Semantic Adversarial Diagnostic Attacks for Autonomous Applications
AAAI 2020
A Stochastic Derivative-Free Optimization Method with Importance Sampling: Theory and Learning to Control
AAAI 2020
Gabor Layers Enhance Network Robustness
ECCV 2020
SGAS: Sequential Greedy Architecture Search
CVPR 2020
A Context-Aware Loss Function for Action Spotting in Soccer Videos
CVPR 2020
Active Speakers in Context
CVPR 2020
AdvPC: Transferable Adversarial Perturbations on 3D Point Clouds
ECCV 2020
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
NIPS 2020
3D Instance Segmentation via Multi-Task Metric Learning
ICCV 2019
A Novel Framework for Robustness Analysis of Visual QA Models
AAAI 2019
OIL: Observational Imitation Learning
RSS 2019
Leveraging Shape Completion for 3D Siamese Tracking
CVPR 2019
DeepGCNs: Can GCNs Go As Deep As CNNs?
ICCV 2019
Deep Layers as Stochastic Solvers
ICLR 2019
Tagging Like Humans: Diverse and Distinct Image Annotation
CVPR 2018
W2F: A Weakly-Supervised to Fully-Supervised Framework for Object Detection
CVPR 2018
Finding Tiny Faces in the Wild With Generative Adversarial Network
CVPR 2018
Driving Policy Transfer via Modularity and Abstraction
CORL 2018
Diagnosing Error in Temporal Action Detectors
ECCV 2018
Face Super-resolution Guided by Facial Component Heatmaps
ECCV 2018
Action Search: Spotting Actions in Videos and Its Application to Temporal Action Localization
ECCV 2018
TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild
ECCV 2018
What do I Annotate Next? An Empirical Study of Active Learning for Action Localization
ECCV 2018
SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial Network
ECCV 2018
Analytic Expressions for Probabilistic Moments of PL-DNN With Gaussian Input
CVPR 2018
ISTA-Net: Interpretable Optimization-Inspired Deep Network for Image Compressive Sensing
CVPR 2018
A Matrix Splitting Method for Composite Function Minimization
CVPR 2017
Constrained Convolutional Sparse Coding for Parametric Based Reconstruction of Line Drawings
ICCV 2017
2D-Driven 3D Object Detection in RGB-D Images
ICCV 2017
Context-Aware Correlation Filter Tracking
CVPR 2017
SCC: Semantic Context Cascade for Efficient Action Detection
CVPR 2017
FFTLasso: Large-Scale LASSO in the Fourier Domain
CVPR 2017
Diverse Image Annotation
CVPR 2017
SST: Single-Stream Temporal Action Proposals
CVPR 2017
High Order Tensor Formulation for Convolutional Sparse Coding
ICCV 2017
In Defense of Sparse Tracking: Circulant Sparse Tracker
CVPR 2016
3D Part-Based Sparse Tracker With Automatic Synchronization and Registration
CVPR 2016
Fast Temporal Activity Proposals for Efficient Detection of Human Actions in Untrimmed Videos
CVPR 2016
On the Relationship Between Visual Attributes and Convolutional Networks
CVPR 2015
ActivityNet: A Large-Scale Video Benchmark for Human Activity Understanding
CVPR 2015
Structural Sparse Tracking
CVPR 2015
Intrinsic Scene Decomposition From RGB-D Images
ICCV 2015
What Makes an Object Memorable?
ICCV 2015
ML-MG: Multi-Label Learning With Missing Labels Using a Mixed Graph
ICCV 2015
L0TV: A New Method for Image Restoration in the Presence of Impulse Noise
CVPR 2015
Robust Manhattan Frame Estimation From a Single RGB-D Image
CVPR 2015
Low-Rank Sparse Coding for Image Classification
ICCV 2013