Jun Xiao
79 papers · 2016–2026 · 14 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Academic Marathon (9) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (14) π Cross-Pollinator (7)
π
Renaissance Researcher
(10)
π
Conference Polyglot
(14)
π
Academic Marathon
(9)
π
Keyword Trendsetter Combo
(4)
π
Keyword Champion
(2)
π
Grand Slam
π¬
Deep Specialist
(12)
π€
Dynamic Duo
(29)
β‘
Prolific Year
(6)
π
Century Club
(72)
π
Trend Setter
π₯
Unstoppable
(10)
ποΈ
Keyword Collector
(319)
Conferences
CVPR (18)
AAAI (12)
ACL (8)
IJCAI (7)
EMNLP (6)
ICML (6)
ICCV (5)
NIPS (5)
ECCV (4)
ICLR (3)
IJCNLP (2)
COLING (1)
NAACL (1)
SEMEVAL (1)
Top co-authors
Keywords
semantic segmentation
(5)
video localization
(5)
representation learning
(4)
scene graph generation
(4)
large language model
(4)
image captioning
(3)
video understanding
(3)
graph neural network
(3)
text-to-image generation
(3)
question generation
(3)
causal inference
(3)
multimodal learning
(3)
domain adaptation
(3)
visual question answering
(3)
reinforcement learning
(3)
self-supervised learning
(3)
visual grounding
(3)
in-context learning
(3)
counterfactual reasoning
(3)
3d reconstruction
(2)
Papers
CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution
ACL 2026
TarPro: Targeted Protection Against Malicious Image Editing
AAAI 2026
PILOT: Planning via Internalized Latent Optimization Trajectories for Large Language Models
ACL 2026
GUI-GΒ²: Gaussian Reward Modeling for GUI Grounding
AAAI 2026
UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization
ACL 2026
MAU-GPT: Enhancing Multi-type Industrial Anomaly Understanding via Anomaly-aware and Generalist Experts Adaptation
AAAI 2026
Experience-driven Multi-turn Reinforcement Learning for GUI Agents
ACL 2026
Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility
CVPR 2025
The Four Color Theorem for Cell Instance Segmentation
ICML 2025
Event-Customized Image Generation
ICML 2025
Latent Score-Based Reweighting for Robust Classification on Imbalanced Tabular Data
ICML 2025
MobiLoRA: Accelerating LoRA-based LLM Inference on Mobile Devices via Context-aware KV Cache Optimization
ACL 2025
Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing
ICML 2025
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation
ICML 2025
Decoding Correlation-Induced Misalignment in the Stable Diffusion Workflow for Text-to-Image Generation
ICCV 2025
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
CVPR 2025
Activating Sparse Part Concepts for 3D Class Incremental Learning
CVPR 2025
TAGA: Self-supervised Learning for Template-free Animatable Gaussian Articulated Model
CVPR 2025
D^3CTTA: Domain-Dependent Decorrelation for Continual Test-Time Adaption of 3D LiDAR Segmentation
CVPR 2025
MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing
CVPR 2025
Distributionally Generative Augmentation for Fair Facial Attribute Classification
CVPR 2024
$\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose Estimation
NIPS 2024
Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation
AAAI 2024
Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration
AAAI 2024
CoreRec: A Counterfactual Correlation Inference for Next Set Recommendation
AAAI 2024
Latent Learningscape Guided In-context Learning
ACL 2024
Chain-of-Quizzes: Pedagogy-inspired Example Selection in In-Context-Learning
ACL 2024
Letβs Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models
COLING 2024
Towards Progressive Multi-Frequency Representation for Image Warping
CVPR 2024
DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism
ECCV 2024
Learning Equilibrium Transformation for Gamut Expansion and Color Restoration
ECCV 2024
World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering
EMNLP 2024
SSF: Accelerating Training of Spiking Neural Networks with Stabilized Spiking Flow
ICCV 2023
Better Simultaneous Translation with Monotonic Knowledge Distillation
ACL 2023
Two Heads are Better Than One: A Simple Exploration Framework for Efficient Multi-Agent Reinforcement Learning
NIPS 2023
Informative Data Mining for One-Shot Cross-Domain Semantic Segmentation
ICCV 2023
Compositional Feature Augmentation for Unbiased Scene Graph Generation
ICCV 2023
VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation
CVPR 2023
Bit-Shrinking: Limiting Instantaneous Sharpness for Improving Post-Training Quantization
CVPR 2023
Decompose Novel into Known: Part Concept Learning For 3D Novel Class Discovery
NIPS 2023
Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models
NIPS 2023
Fairness-aware Contrastive Learning with Partially Annotated Sensitive Attributes
ICLR 2023
Video Scene Graph Generation from Single-Frame Weak Supervision
ICLR 2023
Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
ICLR 2023
The Devil Is in the Labels: Noisy Label Correction for Robust Scene Graph Generation
CVPR 2022
Classification-Then-Grounding: Reformulating Video Scene Graphs As Temporal Bipartite Graphs
CVPR 2022
Rethinking Data Augmentation for Robust Visual Question Answering
ECCV 2022
Explicit Image Caption Editing
ECCV 2022
Rethinking Multi-Modal Alignment in Multi-Choice VideoQA from Feature and Sample Perspectives
EMNLP 2022
Deconfounded Value Decomposition for Multi-Agent Reinforcement Learning
ICML 2022
ACDNet: Adaptively Combined Dilated Convolution for Monocular Panorama Depth Estimation
AAAI 2022
ECNU_ICA at SemEval-2022 Task 10: A Simple and Unified Model for Monolingual and Crosslingual Structured Sentiment Analysis
NAACL 2022
ECNU_ICA at SemEval-2022 Task 10: A Simple and Unified Model for Monolingual and Crosslingual Structured Sentiment Analysis
SEMEVAL 2022
SAViT: Structure-Aware Vision Transformer Pruning via Collaborative Optimization
NIPS 2022
Consensus Graph Representation Learning for Better Grounded Image Captioning
AAAI 2021
Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding
AAAI 2021
Boundary Proposal Network for Two-stage Natural Language Video Localization
AAAI 2021
Human-Like Controllable Image Captioning With Verb-Specific Semantic Roles
CVPR 2021
Natural Language Video Localization with Learnable Moment Proposals
EMNLP 2021
Hierarchical Attention Based Spatial-Temporal Graph-to-Sequence Learning for Grounded Video Description
IJCAI 2020
Rethinking the Bottom-Up Framework for Query-Based Video Localization
AAAI 2020
CIAN: Cross-Image Affinity Net for Weakly Supervised Semantic Segmentation
AAAI 2020
De-Biased Courtβs View Generation with Causality
EMNLP 2020
Counterfactual Samples Synthesizing for Robust Visual Question Answering
CVPR 2020
End-to-End 3D Point Cloud Instance Segmentation Without Detection
CVPR 2020
DEBUG: A Dense Bottom-Up Grounding Approach for Natural Language Video Localization
EMNLP 2019
Weak Supervision Enhanced Generative Network for Question Generation
IJCAI 2019
Video Dialog via Progressive Inference and Cross-Transformer
IJCNLP 2019
DEBUG: A Dense Bottom-Up Grounding Approach for Natural Language Video Localization
IJCNLP 2019
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
ICCV 2019
Self-Supervised Spatiotemporal Learning via Video Clip Order Prediction
CVPR 2019
Video Dialog via Progressive Inference and Cross-Transformer
EMNLP 2019
Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks
CVPR 2018
Multi-Turn Video Question Answering via Multi-Stream Hierarchical Attention Context Network
IJCAI 2018
Attentional Image Retweet Modeling via Multi-Faceted Ranking Network Learning
IJCAI 2018
SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning
CVPR 2017
Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks
IJCAI 2017
Diverse Image Captioning via GroupTalk
IJCAI 2016
Self-Paced Boost Learning for Classification
IJCAI 2016