Hongxia Yang

50 papers · 2011–2026 · 12 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🗺️ Taxonomy Completionist (15) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (11)

🌍 Conference Polyglot (11) 🏃 Academic Marathon (14) 🐝 Cross-Pollinator (9) 👑 Triple Crown 🧬 Topic Evolution 👥 Mega-Team (29) 🤝 Dynamic Duo (12) 🔬 Deep Specialist (13) 🏆 Grand Slam 📈 Trend Setter 🗃️ Keyword Collector (201) 🔥 Unstoppable (8) ❓ The Questioner ⚡ Prolific Year (10) 💎 Century Club (46)

Conferences

ACL (10) ICML (8) NIPS (8) AAAI (6) ICLR (6) IJCAI (4) IJCNLP (3) AISTATS (1) CVPR (1) EACL (1) ECCV (1) EMNLP (1)

Top co-authors

Chang Zhou (12) Junyang Lin (10) Jianbo Yuan (10) Jingren Zhou (9) Fei Wu (8) Jie Tang (7) Ming Ding (6) Jianxin Ma (6) Xiaotian Han (5) An Yang (5)

Keywords

large language model (6) multimodal learning (5) multimodal large language model (4) recommender system (4) benchmark evaluation (3) vision-language model (3) graph neural network (3) semantic alignment (3) question answering (3) cross-modal retrieval (3) relation alignment (2) image-text retrieval (2) image generation (2) task automation (2) graphical user interface (2) representation learning (2) end-to-end learning (2) multi-modal large language model (2) multi-modal learning (2) model compression (2)

Papers

InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization AAAI 2026 InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection EACL 2026 EcoAgent: An Efficient Device-Cloud Collaborative Multi-Agent Framework for Mobile Automation AAAI 2026 Benchmarking LLMs’ Mathematical Reasoning with Unseen Random Variables Questions AAAI 2026 ParallelComp: Parallel Long-Context Compressor for Length Extrapolation ICML 2025 OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use ACL 2025 DavIR: Data Selection via Implicit Reward for Large Language Models ACL 2025 Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model NIPS 2024 DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation NIPS 2024 Expedited Training of Visual Conditioned Language Generation via Redundancy Reduction ACL 2024 An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing ACL 2024 DeVAn: Dense Video Annotation for Video-Language Models ACL 2024 InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model ACL 2024 LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild ACL 2024 Let Models Speak Ciphers: Multiagent Debate through Embeddings ICLR 2024 LEMON: Lossless model expansion ICLR 2024 $\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis ICLR 2024 InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks ICML 2024 InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models NIPS 2024 Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation ICML 2024 Learning to Reweight for Generalizable Graph Neural Network AAAI 2024 Self-Infilling Code Generation ICML 2024 Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling ICLR 2024 Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens CVPR 2023 Single Stage Virtual Try-On via Deformable Attention Flows ECCV 2022 Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably) ICML 2022 Reliable Adversarial Distillation with Unreliable Teachers ICLR 2022 OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework ICML 2022 KNAS: Green Neural Architecture Search ICML 2021 CogView: Mastering Text-to-Image Generation via Transformers NIPS 2021 UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis NIPS 2021 Dynamic Memory based Attention Network for Sequential Recommendation AAAI 2021 Learning with Group Noise AAAI 2021 Learning Relation Alignment for Calibrated Cross-modal Retrieval ACL 2021 Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation ACL 2021 Learning to Rehearse in Long Sequence Memorization ICML 2021 Learning Relation Alignment for Calibrated Cross-modal Retrieval IJCNLP 2021 Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation IJCNLP 2021 Variational Autoencoders for Highly Multivariate Spatial Point Processes Intensities ICLR 2020 Dress like an Internet Celebrity: Fashion Retrieval in Videos IJCAI 2020 Counterfactual Prediction for Bundle Treatment NIPS 2020 CogLTX: Applying BERT to Long Texts NIPS 2020 Cognitive Graph for Multi-Hop Reading Comprehension at Scale ACL 2019 Towards Knowledge-Based Recommender Dialog System IJCNLP 2019 Learning Disentangled Representations for Recommendation NIPS 2019 Towards Knowledge-Based Recommender Dialog System EMNLP 2019 Hierarchical Representation Learning for Bipartite Graphs IJCAI 2019 Large Scale Evolving Graphs with Burst Detection IJCAI 2019 ANRL: Attributed Network Representation Learning via Deep Neural Networks IJCAI 2018 Dependent Hierarchical Beta Process for Image Interpolation and Denoising AISTATS 2011