Hongxia Yang
50 papers · 2011–2026 · 12 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (15)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (6)
Conference Polyglot (11)
Academic Marathon (14)
Cross-Pollinator (9)
Triple Crown
Topic Evolution
Mega-Team (29)
Dynamic Duo (12)
Deep Specialist (13)
Grand Slam
Trend Setter
Keyword Collector (201)
Unstoppable (8)
The Questioner
Prolific Year (10)
Century Club (46)
Conferences
ACL (10)
ICML (8)
NIPS (8)
AAAI (6)
ICLR (6)
IJCAI (4)
IJCNLP (3)
AISTATS (1)
CVPR (1)
EACL (1)
ECCV (1)
EMNLP (1)
Keywords
large language model (6)
multimodal learning (5)
multimodal large language model (4)
recommender system (4)
benchmark evaluation (3)
vision-language model (3)
graph neural network (3)
semantic alignment (3)
question answering (3)
cross-modal retrieval (3)
relation alignment (2)
image-text retrieval (2)
image generation (2)
task automation (2)
graphical user interface (2)
representation learning (2)
end-to-end learning (2)
multi-modal large language model (2)
multi-modal learning (2)
model compression (2)
Papers
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization
AAAI 2026
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection
EACL 2026
EcoAgent: An Efficient Device-Cloud Collaborative Multi-Agent Framework for Mobile Automation
AAAI 2026
Benchmarking LLMs' Mathematical Reasoning with Unseen Random Variables Questions
AAAI 2026
ParallelComp: Parallel Long-Context Compressor for Length Extrapolation
ICML 2025
OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use
ACL 2025
DavIR: Data Selection via Implicit Reward for Large Language Models
ACL 2025
Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model
NIPS 2024
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
NIPS 2024
Expedited Training of Visual Conditioned Language Generation via Redundancy Reduction
ACL 2024
An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
ACL 2024
DeVAn: Dense Video Annotation for Video-Language Models
ACL 2024
InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model
ACL 2024
LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild
ACL 2024
Let Models Speak Ciphers: Multiagent Debate through Embeddings
ICLR 2024
LEMON: Lossless model expansion
ICLR 2024
$\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis
ICLR 2024
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
ICML 2024
InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models
NIPS 2024
Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation
ICML 2024
Learning to Reweight for Generalizable Graph Neural Network
AAAI 2024
Self-Infilling Code Generation
ICML 2024
Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling
ICLR 2024
Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens
CVPR 2023
Single Stage Virtual Try-On via Deformable Attention Flows
ECCV 2022
Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably)
ICML 2022
Reliable Adversarial Distillation with Unreliable Teachers
ICLR 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
ICML 2022
KNAS: Green Neural Architecture Search
ICML 2021
CogView: Mastering Text-to-Image Generation via Transformers
NIPS 2021
UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis
NIPS 2021
Dynamic Memory based Attention Network for Sequential Recommendation
AAAI 2021
Learning with Group Noise
AAAI 2021
Learning Relation Alignment for Calibrated Cross-modal Retrieval
ACL 2021
Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation
ACL 2021
Learning to Rehearse in Long Sequence Memorization
ICML 2021
Learning Relation Alignment for Calibrated Cross-modal Retrieval
IJCNLP 2021
Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation
IJCNLP 2021
Variational Autoencoders for Highly Multivariate Spatial Point Processes Intensities
ICLR 2020
Dress like an Internet Celebrity: Fashion Retrieval in Videos
IJCAI 2020
Counterfactual Prediction for Bundle Treatment
NIPS 2020
CogLTX: Applying BERT to Long Texts
NIPS 2020
Cognitive Graph for Multi-Hop Reading Comprehension at Scale
ACL 2019
Towards Knowledge-Based Recommender Dialog System
IJCNLP 2019
Learning Disentangled Representations for Recommendation
NIPS 2019
Towards Knowledge-Based Recommender Dialog System
EMNLP 2019
Hierarchical Representation Learning for Bipartite Graphs
IJCAI 2019
Large Scale Evolving Graphs with Burst Detection
IJCAI 2019
ANRL: Attributed Network Representation Learning via Deep Neural Networks
IJCAI 2018
Dependent Hierarchical Beta Process for Image Interpolation and Denoising
AISTATS 2011