Xu Yang
80 papers · 2016–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π§ Keyword Pioneer π Conference Polyglot (10) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (12) π Academic Marathon (9)
π
Academic Marathon
(9)
π
Cross-Pollinator
(11)
π
Renaissance Researcher
(11)
π
Conference Loyalist
(20)
π§¬
Topic Evolution
π€
Dynamic Duo
(21)
π
Keyword Champion
(2)
π¬
Deep Specialist
(13)
ποΈ
Keyword Collector
(284)
π
Century Club
(69)
π
Trend Setter
π₯
Unstoppable
(8)
β‘
Prolific Year
(22)
π
Conference Pioneer
Conferences
CVPR (20)
AAAI (19)
IJCAI (10)
NIPS (10)
ICCV (6)
ICML (6)
ACL (3)
ECCV (3)
WACV (2)
MICCAI (1)
Top co-authors
Keywords
knowledge distillation
(13)
model compression
(8)
in-context learning
(7)
continual learning
(7)
transfer learning
(7)
catastrophic forgetting
(6)
representation learning
(6)
diffusion model
(6)
model initialization
(5)
contrastive learning
(5)
image captioning
(5)
vision transformer
(5)
domain adaptation
(5)
neural network
(4)
multimodal learning
(4)
attention mechanism
(4)
vision-language model
(4)
incremental learning
(3)
visual question answering
(3)
class-incremental learning
(3)
Papers
Semantically Comprehensive Token Pruning in LVLMs via Maximizing Concept Coverage
ACL 2026
GraphIC: A Graph-Based In-Context Example Retrieval Model for Multi-Step Reasoning
AAAI 2026
Adaptive-Learngene: Continual Expansion and Task-Aware Selection of Learngenes for Dynamic Environments
AAAI 2026
Diffusion-calibrated Continual Test-time Adaptation
AAAI 2026
Towards Robust Edge Model Adaptation via Elastic Architecture Search
AAAI 2026
Extracting Multimodal Learngene in CLIP: Unveiling the Multimodal Generalizable Knowledge
AAAI 2026
Efficient and Effective In-context Demonstration Selection with Coreset
AAAI 2026
SΒ²Flow: Towards Fast and Authentic Training-Free High-Resolution Video Generation
AAAI 2026
Mix-QSAM2: Mixed-Precision Quantization for High Fidelity Segmentation in Resource Constrained Scenarios
AAAI 2026
Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys
AAAI 2026
Learngene: Inheritable βGenesβ in Intelligent Agents (Abstract Reprint)
AAAI 2026
Learngene Tells You How to Customize: Task-Aware Parameter Initialization at Flexible Scales
ICML 2025
Fast Large Language Model Collaborative Decoding via Speculation
ICML 2025
Democratizing High-Fidelity Co-Speech Gesture Video Generation
ICCV 2025
Inheriting Generalized Learngene for Efficient Knowledge Transfer across Multiple Tasks
AAAI 2025
Dynamic Adapter Tuning for Long-Tailed Class-Incremental Learning
WACV 2025
Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion
MICCAI 2025
DTA: Dual Temporal-Channel-Wise Attention for Spiking Neural Networks
WACV 2025
Outstanding Orthodontist: No More Artifactual Teeth in Talking Face
IJCAI 2025
Q-MiniSAM2: A Quantization-based Benchmark for Resource-Efficient Video Segmentation
IJCAI 2025
Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark
AAAI 2025
Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient
AAAI 2025
Number it: Temporal Grounding Videos like Flipping Manga
CVPR 2025
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
CVPR 2025
Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
CVPR 2025
Mimic In-Context Learning for Multimodal Tasks
CVPR 2025
Tackling Long-Tailed Data Challenges in Spiking Neural Networks via Heterogeneous Knowledge Distillation
IJCAI 2025
Three-Dimensional Trajectory Prediction with 3DMoTraj Dataset
ICML 2025
VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception
ICML 2024
LIVE: Learnable In-Context Vector for Visual Question Answering
NIPS 2024
Cluster-Learngene: Inheriting Adaptive Clusters for Vision Transformers
NIPS 2024
Linearly Decomposing and Recomposing Vision Transformers for Diverse-Scale Models
NIPS 2024
Initializing Variable-sized Vision Transformers from Learngene with Learnable Transformation
NIPS 2024
BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning
NIPS 2024
Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models
NIPS 2024
Mixture of Adversarial LoRAs: Boosting Robust Generalization in Meta-Tuning
NIPS 2024
Building Variable-Sized Models via Learngene Pool
AAAI 2024
Transformer as Linear Expansion of Learngene
AAAI 2024
Dynamic Reactive Spiking Graph Neural Network
AAAI 2024
Exploiting Intrinsic Multilateral Logical Rules for Weakly Supervised Natural Language Video Localization
ACL 2024
Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On
CVPR 2024
Long-Tail Class Incremental Learning via Independent Sub-prototype Construction
CVPR 2024
A Versatile Framework for Continual Test-Time Domain Adaptation: Balancing Discriminability and Generalizability
CVPR 2024
How to Configure Good In-Context Sequence for Visual Question Answering
CVPR 2024
MemoNav: Working Memory Model for Visual Navigation
CVPR 2024
Unveiling the Unknown: Unleashing the Power of Unknown to Known in Open-Set Source-Free Domain Adaptation
CVPR 2024
Vision Transformers as Probabilistic Expansion from Learngene
ICML 2024
One Meta-tuned Transformer is What You Need for Few-shot Learning
ICML 2024
Exploring Learngene via Stage-wise Weight Sharing for Initializing Variable-sized Models
IJCAI 2024
Navigating Continual Test-time Adaptation with Symbiosis Knowledge
IJCAI 2024
Exploring Safety Supervision for Continual Test-time Domain Adaptation
IJCAI 2023
Learning Trajectory-Word Alignments for Video-Language Tasks
ICCV 2023
Learning From Biased Soft Labels
NIPS 2023
Exploring Diverse In-Context Configurations for Image Captioning
NIPS 2023
Transforming Visual Scene Graphs to Image Captions
ACL 2023
Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning
CVPR 2022
Towards End-to-End Image Compression and Analysis with Transformers
AAAI 2022
Attention-guided Contrastive Hashing for Long-tailed Image Retrieval
IJCAI 2022
Learning Universal Adversarial Perturbation by Adversarial Example
AAAI 2022
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching
CVPR 2022
Show, Deconfound and Tell: Image Captioning With Causal Inference
CVPR 2022
Not Just Selection, but Exploration: Online Class-Incremental Continual Learning via Dual View Consistency
CVPR 2022
Nearest Neighbor Matching for Deep Clustering
CVPR 2021
Auto-Parsing Network for Image Captioning and Visual Question Answering
ICCV 2021
Graph Debiased Contrastive Learning with Joint Representation Clustering
IJCAI 2021
Causal Attention for Vision-Language Tasks
CVPR 2021
Incremental Embedding Learning via Zero-Shot Translation
AAAI 2021
SelfSAGCN: Self-Supervised Semantic Alignment for Graph Convolution Network
CVPR 2021
Adversarial Learning for Robust Deep Clustering
NIPS 2020
Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning
ECCV 2020
Learning Progressive Joint Propagation for Human Motion Prediction
ECCV 2020
Lifelong Zero-Shot Learning
IJCAI 2020
Multi-Scale Fusion Subspace Clustering Using Similarity Constraint
CVPR 2020
Unpaired Image Captioning via Scene Graph Alignments
ICCV 2019
Auto-Encoding Scene Graphs for Image Captioning
CVPR 2019
Learning to Collocate Neural Modules for Image Captioning
ICCV 2019
Weakly Aligned Cross-Modal Learning for Multispectral Pedestrian Detection
ICCV 2019
Deep Spectral Clustering Using Dual Autoencoder Network
CVPR 2019
Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship Features
ECCV 2018
Sparsity Conditional Energy Label Distribution Learning for Age Estimation
IJCAI 2016