Xilin Chen
123 papers · 2013–2026 · 12 top CS/AI conferences
Achievements
🌍 Conference Polyglot (12)
🏃 Academic Marathon (13)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🐝 Cross-Pollinator (12)
🐣 Hot Topic Early Bird
🌟 Keyword Trendsetter Combo (5)
🏠 Conference Loyalist (27)
🏆 Grand Slam
🔬 Deep Specialist (18)
🏆 Keyword Champion
🤝 Dynamic Duo (84)
🗃️ Keyword Collector (458)
🚀 Conference Pioneer
⚡ Prolific Year (10)
🔥 Unstoppable (14)
💎 Century Club (121)
📈 Trend Setter
Conferences
CVPR (40)
ICCV (27)
ECCV (18)
NIPS (10)
WACV (10)
ICLR (5)
AAAI (4)
ACL (3)
ICML (2)
IJCAI (2)
EMNLP (1)
IJCNLP (1)
Research topics
Keywords
representation learning (9)
face recognition (9)
domain adaptation (9)
video understanding (7)
metric learning (7)
self-supervised learning (7)
multimodal learning (6)
image classification (6)
unsupervised learning (6)
convolutional neural network (5)
attention mechanism (5)
vision-language model (5)
object detection (5)
person re-identification (5)
few-shot learning (5)
adversarial learning (4)
feature extraction (4)
feature learning (4)
image retrieval (4)
feature embedding (4)
Papers
INFACT: A Diagnostic Benchmark for Induced Faithfulness and Factuality Hallucinations in Video-LLMs
ACL 2026
Tell as You Want: Customizing Image Narrative with Knowledge and Thoughts
AAAI 2026
M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models
WACV 2026
Feature Decomposition-Recomposition in Large Vision-Language Model for Few-Shot Class-Incremental Learning
ICCV 2025
CogCM: Cognition-Inspired Contextual Modeling for Audio-Visual Speech Enhancement
ICCV 2025
OV3D-CG: Open-vocabulary 3D Instance Segmentation with Contextual Guidance
ICCV 2025
Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs
ICLR 2025
MATS: An Audio Language Model under Text-only Supervision
ICML 2025
CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation
ICLR 2025
R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner
CVPR 2025
UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
CVPR 2025
Wavelet-Driven Masked Image Modeling: A Path to Efficient Visual Representation
AAAI 2025
G2PDiffusion: Cross-Species Genotype-to-Phenotype Prediction via Evolutionary Diffusion
ICCV 2025
EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
ICCV 2025
Not Only Vision: Evolve Visual Speech Recognition via Peripheral Information
ICCV 2025
UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language Models
NIPS 2024
Point2Real: Bridging the Gap between Point Cloud and Realistic Image for Open-World 3D Recognition
AAAI 2024
PreLAR: World Model Pre-training with Learnable Action Representation
ECCV 2024
Visual Alignment Pre-training for Sign Language Translation
ECCV 2024
An Information Theoretical View for Out-Of-Distribution Detection
ECCV 2024
HiFi-Score: Fine-grained Image Description Evaluation with Hierarchical Parsing Graphs
ECCV 2024
Think before Placement: Common Sense Enhanced Transformer for Object Placement
ECCV 2024
T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models
ECCV 2024
Interpretable Object Recognition by Semantic Prototype Analysis
WACV 2024
Deep Subdomain Alignment for Cross-Domain Image Classification
WACV 2024
A Simple Romance Between Multi-Exit Vision Transformer and Token Reduction
ICLR 2024
Scalable Modular Network: A Framework for Adaptive Learning via Agreement Routing
ICLR 2024
Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox
NIPS 2024
HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention
CVPR 2024
ES3: Evolving Self-Supervised Learning of Robust Audio-Visual Speech Representations
CVPR 2024
Understanding Few-Shot Learning: Measuring Task Relatedness and Adaptation Difficulty via Attributes
NIPS 2023
Glance and Focus: Memory Prompting for Multi-Event Video Question Answering
NIPS 2023
Generalized Semi-Supervised Learning via Self-Supervised Feature Adaptation
NIPS 2023
DISC: Learning From Noisy Labels via Dynamic Instance-Specific Selection and Correction
CVPR 2023
Function-Consistent Feature Distillation
ICLR 2023
CoSign: Exploring Co-occurrence Signals in Skeleton-based Continuous Sign Language Recognition
ICCV 2023
DandelionNet: Domain Composition with Instance Adaptive Classification for Domain Generalization
ICCV 2023
Diversity-Measurable Anomaly Detection
CVPR 2023
Source-Free Adaptive Gaze Estimation by Uncertainty Reduction
CVPR 2023
Semantic Guided Latent Parts Embedding for Few-Shot Learning
WACV 2023
From Node To Graph: Joint Reasoning on Visual-Semantic Relational Graph for Zero-Shot Detection
WACV 2022
SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot Learning
WACV 2022
Learning Temporal Video Procedure Segmentation From an Automatically Collected Large Dataset
WACV 2022
Optimal Positive Generation via Latent Transformation for Contrastive Learning
NIPS 2022
Clothes-Changing Person Re-Identification With RGB Modality Only
CVPR 2022
Enhancing Face Recognition With Self-Supervised 3D Reconstruction
CVPR 2022
Salient-to-Broad Transition for Video Person Re-Identification
CVPR 2022
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework
ECCV 2022
Deep Radial Embedding for Visual Sequence Learning
ECCV 2022
Mutual Learning of Joint and Separate Domain Alignments for Multi-Source Domain Adaptation
WACV 2022
Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments
ICCV 2021
Visual Alignment Constraint for Continuous Sign Language Recognition
ICCV 2021
Self-Mutual Distillation Learning for Continuous Sign Language Recognition
ICCV 2021
Cross-Encoder for Unsupervised Gaze Representation Learning
ICCV 2021
FAIEr: Fidelity and Adequacy Ensured Image Caption Evaluation
CVPR 2021
Hierarchical Context-aware Network for Dense Video Event Captioning
ACL 2021
Hierarchical Context-aware Network for Dense Video Event Captioning
IJCNLP 2021
Holistic Pose Graph: Modeling Geometric Structure Among Objects in a Scene Using Graph Inference for 3D Object Prediction
ICCV 2021
Topic Scene Graph Generation by Attention Distillation From Caption
ICCV 2021
HRFormer: High-Resolution Vision Transformer for Dense Prediction
NIPS 2021
SegFix: Model-Agnostic Boundary Refinement for Segmentation
ECCV 2020
Functionality Discovery and Prediction of Physical Objects
AAAI 2020
Unsupervised Domain Adaptation With Hierarchical Gradient Synchronization
CVPR 2020
Cross-Domain Face Presentation Attack Detection via Multi-Domain Disentangled Representation Learning
CVPR 2020
An Efficient PointLSTM for Point Clouds Based Gesture Recognition
CVPR 2020
Single-Side Domain Generalization for Face Anti-Spoofing
CVPR 2020
Self-Supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation
CVPR 2020
TCTS: A Task-Consistent Two-Stage Framework for Person Search
CVPR 2020
Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text
CVPR 2020
Appearance-Preserving 3D Convolution for Video-based Person Re-identification
ECCV 2020
Object-Contextual Representations for Semantic Segmentation
ECCV 2020
Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation
ECCV 2020
Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training
ECCV 2020
Temporal Complementary Learning for Video Person Re-Identification
ECCV 2020
Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval
WACV 2020
Deep Position-Aware Hashing for Semantic Continuous Image Retrieval
WACV 2020
VRSTC: Occlusion-Free Video Person Re-Identification
CVPR 2019
S2GAN: Share Aging Factors Across Ages and Share Aging Trends Among Individuals
ICCV 2019
Temporal Knowledge Propagation for Image-to-Video Person Re-Identification
ICCV 2019
Weakly Supervised Object Detection With Segmentation Collaboration
ICCV 2019
Transferable Contrastive Network for Generalized Zero-Shot Learning
ICCV 2019
Weakly Supervised Image Classification Through Noise Regularization
CVPR 2019
Self-Supervised Representation Learning From Videos for Facial Action Unit Detection
CVPR 2019
Interaction-And-Aggregation Network for Person Re-Identification
CVPR 2019
Fully Learnable Group Convolution for Acceleration of Deep Neural Networks
CVPR 2019
Exploring Context and Visual Pattern of Relationship for Scene Graph Generation
CVPR 2019
Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition
NIPS 2019
Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation
ACL 2019
Cross Attention Network for Few-shot Classification
NIPS 2019
Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships
CVPR 2018
Mean-Variance Loss for Deep Age Estimation From a Face
CVPR 2018
Learning Class Prototypes via Structure Alignment for Zero-Shot Recognition
ECCV 2018
Greedy Search with Probabilistic N-gram Matching for Neural Machine Translation
EMNLP 2018
Duplex Generative Adversarial Network for Unsupervised Domain Adaptation
CVPR 2018
Real-Time Rotation-Invariant Face Detection With Progressive Calibration Networks
CVPR 2018
Facial Expression Recognition with Inconsistently Annotated Datasets
ECCV 2018
Face Recognition with Contrastive Convolution
ECCV 2018
Generative Adversarial Network with Spatial Attention for Face Attribute Editing
ECCV 2018
Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition
ICCV 2017
Catadioptric HyperSpectral Light Field Imaging
ICCV 2017
Learning Discriminative Latent Attributes for Zero-Shot Classification
ICCV 2017
Discriminative Covariance Oriented Representation Learning for Face Recognition With Image Sets
CVPR 2017
Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval Tasks
CVPR 2017
Incomplete Attribute Learning with Auxiliary Labels
IJCAI 2017
Occlusion-Free Face Alignment: Deep Regression Networks Coupled With De-Corrupt AutoEncoders
CVPR 2016
Deep Supervised Hashing for Fast Image Retrieval
CVPR 2016
Multi-View Deep Network for Cross-View Classification
CVPR 2016
Leveraging Datasets With Varying Annotations for Face Alignment via Deep Regression Network
ICCV 2015
Log-Euclidean Metric Learning on Symmetric Positive Definite Manifold with Application to Image Set Classification
ICML 2015
Bi-Shifting Auto-Encoder for Unsupervised Domain Adaptation
ICCV 2015
Two Birds, One Stone: Jointly Learning Binary Code for Large-Scale Face Image Retrieval and Attributes Prediction
ICCV 2015
Face Video Retrieval With Image Query via Hashing Across Euclidean Space and Riemannian Manifold
CVPR 2015
Discriminant Analysis on Riemannian Manifold of Gaussian Distributions for Face Recognition With Image Sets
CVPR 2015
Projection Metric Learning on Grassmann Manifold With Application to Video Based Face Recognition
CVPR 2015
A Unified Multiplicative Framework for Attribute Learning
ICCV 2015
Generalized Unsupervised Manifold Alignment
NIPS 2014
Stacked Progressive Auto-Encoders (SPAE) for Face Recognition Across Poses
CVPR 2014
Learning Expressionlets on Spatio-Temporal Manifold for Dynamic Facial Expression Recognition
CVPR 2014
Learning Euclidean-to-Riemannian Metric for Point-to-Set Classification
CVPR 2014
Fusing Robust Face Region Descriptors via Multiple Metric Learning for Face Recognition in the Wild
CVPR 2013
Cascaded Shape Space Pruning for Robust Facial Landmark Detection
ICCV 2013
Coupling Alignments with Recognition for Still-to-Video Face Recognition
ICCV 2013
Parametric Local Multimodal Hashing for Cross-View Similarity Search
IJCAI 2013