Xing Xu
30 papers · 2012–2026 · 8 top CS/AI conferences
Achievements
🌍 Conference Polyglot (8)
🐣 Hot Topic Early Bird
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🏃 Academic Marathon (13)
🤝 Dynamic Duo (21)
🔬 Deep Specialist (10)
🧬 Topic Evolution
📈 Trend Setter
🚀 Conference Pioneer
⚡ Prolific Year (6)
🗃️ Keyword Collector (174)
💎 Century Club (28)
🔥 Unstoppable (7)
Conferences
AAAI (11)
CVPR (10)
ICCV (3)
IJCAI (2)
ACL (1)
AISTATS (1)
EMNLP (1)
ICLR (1)
Keywords
representation learning (5)
multimodal learning (5)
contrastive learning (4)
attention mechanism (3)
adversarial attack (3)
multi-modal learning (3)
zero-shot learning (3)
multimodal fusion (2)
video understanding (2)
cross-modal matching (2)
cross-modal learning (2)
feature selection (2)
deep neural network (2)
cross-modal alignment (2)
domain adaptation (2)
transfer learning (2)
metric learning (2)
feature alignment (2)
self-supervised learning (2)
large language model (2)
Papers
De-biased Natural Language Egocentric Task Verification via Prototypical Evidence Learning
AAAI 2026
Hyper-Opinion Vagueness Quantification for Robust Multimodal Learning
AAAI 2026
From Observation to Understanding: Front-Door Adjustments with Uncertainty Calibration for Enhancing Egocentric Reasoning in LVLMs
ACL 2025
ReCon: Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning
CVPR 2025
TAU-106K: A New Dataset for Comprehensive Understanding of Traffic Accident
ICLR 2025
PHGC: Procedural Heterogeneous Graph Completion for Natural Language Task Verification in Egocentric Videos
CVPR 2025
Adaptive Uncertainty-Based Learning for Text-Based Person Retrieval
AAAI 2024
Embracing Unimodal Aleatoric Uncertainty for Robust Multimodal Fusion
CVPR 2024
T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Large Language Model Signals for Science Question Answering
AAAI 2024
Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models
AAAI 2023
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction
ICCV 2023
ImbSAM: A Closer Look at Sharpness-Aware Minimization in Class-Imbalanced Recognition
ICCV 2023
LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
EMNLP 2023
TVT: Three-Way Vision Transformer through Multi-Modal Hypersphere Learning for Zero-Shot Sketch-Based Image Retrieval
AAAI 2022
Semi-Supervised Video Paragraph Grounding With Contrastive Encoder
CVPR 2022
Partial Feature Selection and Alignment for Multi-Source Domain Adaptation
CVPR 2021
Enhancing Audio-Visual Association with Self-Supervised Curriculum Learning
AAAI 2021
Multi-Stage Aggregated Transformer Network for Temporal Language Localization in Videos
CVPR 2021
From General to Specific: Informative Scene Graph Generation via Balance Adjustment
ICCV 2021
Feature Space Targeted Attacks by Statistic Alignment
IJCAI 2021
PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation
IJCAI 2021
What Machines See Is Not What They Get: Fooling Scene Text Recognition Models With Adversarial Text Images
CVPR 2020
Learning Cross-Aligned Latent Embeddings for Zero-Shot Cross-Modal Retrieval
AAAI 2020
Universal Weighting Metric Learning for Cross-Modal Matching
CVPR 2020
Sequence-To-Sequence Domain Adaptation Network for Robust Text Image Recognition
CVPR 2019
Deliberate Attention Networks for Image Captioning
AAAI 2019
Template-Based Math Word Problem Solvers with Recursive Neural Networks
AAAI 2019
Perceptual Pyramid Adversarial Networks for Text-to-Image Synthesis
AAAI 2019
Matrix Tri-Factorization With Manifold Regularizations for Zero-Shot Learning
CVPR 2017
A Two-Graph Guided Multi-task Lasso Approach for eQTL Mapping
AISTATS 2012