Wenqi Zhang
27 papers · 2021–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
๐ Conference Polyglot (6) ๐งญ Keyword Pioneer ๐ Interdisciplinary Bridge ๐บ๏ธ Taxonomy Completionist (13) ๐ Cross-Pollinator (13)
๐
Cross-Pollinator
(13)
๐
Academic Marathon
(5)
๐
Renaissance Researcher
(9)
๐งฌ
Topic Evolution
๐ค
Dynamic Duo
(18)
๐ฅ
Unstoppable
(5)
โ
The Questioner
๐๏ธ
Keyword Collector
(139)
๐
Century Club
(21)
โก
Prolific Year
(7)
Conferences
ACL (10)
EMNLP (8)
AAAI (3)
IJCAI (3)
CVPR (1)
ICCV (1)
NIPS (1)
Top co-authors
Research topics
Keywords
large language model
(8)
reinforcement learning
(5)
contrastive learning
(4)
theory of mind
(3)
mathematical reasoning
(3)
vision-language model
(2)
policy optimization
(2)
few-shot learning
(2)
egocentric video
(2)
imitation learning
(2)
visual reasoning
(2)
multimodal large language model
(2)
spatial reasoning
(2)
equation generation
(2)
game artificial intelligence
(1)
benchmark evaluation
(1)
sentiment analysis
(1)
representation learning
(1)
information extraction
(1)
relation extraction
(1)
Papers
GUI-Gยฒ: Gaussian Reward Modeling for GUI Grounding
AAAI 2026
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks
ACL 2026
Reality vs Counterfactual: Multi-World Contrastive Reinforcement Learning for Enhancing MLLMโs Theory of Mind in Egocentric Videos
AAAI 2026
CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution
ACL 2026
UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization
ACL 2026
Ground What You See: Hallucination-Resistant MLLMs via Caption Feedback, Diversity-Aware Sampling, and Conflict Regularization
AAAI 2026
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark
CVPR 2025
Scaling LLMsโ Social Reasoning: Sprinkle Cognitive โAha Momentโ into Fundamental Long-thought Logical Capabilities
ACL 2025
STaR-SQL: Self-Taught Reasoner for Text-to-SQL
ACL 2025
AskToAct: Enhancing LLMs Tool Use via Self-Correcting Clarification
EMNLP 2025
DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL
EMNLP 2025
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
ICCV 2025
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning
EMNLP 2024
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
ACL 2024
Learning Global Controller in Latent Space for Parameter-Efficient Fine-Tuning
ACL 2024
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
ACL 2024
TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Modelsโ Theory-of-Mind
ACL 2024
TaskBench: Benchmarking Large Language Models for Task Automation
NIPS 2024
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model
EMNLP 2024
PromptNER: Prompt Locating and Typing for Named Entity Recognition
ACL 2023
Enhancing Emotion Recognition in Conversation via Multi-view Feature Alignment and Memorization
EMNLP 2023
An Expression Tree Decoding Strategy for Mathematical Equation Generation
EMNLP 2023
Query-based Instance Discrimination Network for Relational Triple Extraction
EMNLP 2022
Multi-View Reasoning: Consistent Contrastive Learning for Math Word Problem
EMNLP 2022
A Closed-Loop Perception, Decision-Making and Reasoning Mechanism for Human-Like Navigation
IJCAI 2022
Dynamic Rebalancing Dockless Bike-Sharing System based on Station Community Discovery
IJCAI 2021
Deep Reinforcement Learning for Multi-contact Motion Planning of Hexapod Robots
IJCAI 2021