Biqing Qi
30 papers · 2023–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (8) 🐝 Cross-Pollinator (13)
🗺️
Taxonomy Completionist
(62)
🌍
Conference Polyglot
(8)
🤝
Dynamic Duo
(19)
🏆
Grand Slam
🏆
Keyword Champion
(2)
⚡
Prolific Year
(11)
🗃️
Keyword Collector
(115)
💎
Century Club
(23)
Conferences
ACL (10)
EMNLP (5)
NIPS (5)
AAAI (4)
CVPR (2)
NAACL (2)
ICLR (1)
ICML (1)
Top co-authors
Keywords
large language model
(8)
language model
(4)
multi-agent system
(4)
reinforcement learning
(3)
memory module
(2)
neural network
(2)
collaborative generation
(2)
state space model
(2)
supervised fine-tuning
(2)
code generation
(2)
multimodal learning
(2)
test-time scaling
(2)
reinforcement learning from human feedback
(2)
attention mechanism
(2)
multimodal large language model
(2)
knowledge distillation
(2)
model compression
(2)
preference optimization
(2)
continual learning
(2)
preference learning
(1)
Papers
A Survey of Inductive Reasoning for Large Language Models
ACL 2026
D²Pruner: Debiased Importance and Structural Diversity for MLLM Token Pruning
AAAI 2026
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
AAAI 2026
Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism
ACL 2026
MARS2: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation
ACL 2026
WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement
ACL 2026
SDAR-VL: Stable and Efficient Block-wise Diffusion for Vision-Language Understanding
ACL 2026
Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines
AAAI 2025
Less is More: Efficient Model Merging with Binary Task Switch
CVPR 2025
ReviewRL: Towards Automated Scientific Review with RL
EMNLP 2025
Fast and Slow Gradient Approximation for Binary Neural Network Optimization
AAAI 2025
Scalability of LLM-Based Multi-Agent Systems for Scientific Code Generation: A Preliminary Study
EMNLP 2025
OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees
ICLR 2025
Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization
ICML 2025
Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
ACL 2025
Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning
ACL 2025
Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System
ACL 2025
MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making
EMNLP 2024
Exploring Adversarial Robustness of Deep State Space Models
NIPS 2024
UltraMedical: Building Specialized Generalists in Biomedicine
NIPS 2024
Neural Residual Diffusion Models for Deep Scalable Vision Generation
NIPS 2024
An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning
NIPS 2024
CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following
ACL 2024
SMR: State Memory Replay for Long Sequence Modeling
ACL 2024
Interactive Continual Learning: Fast and Slow Thinking
CVPR 2024
On the token distance modeling ability of higher RoPE attention dimension
EMNLP 2024
On Large Language Models’ Hallucination with Regard to Known Facts
NAACL 2024
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning
NAACL 2024
Perturbation Towards Easy Samples Improves Targeted Adversarial Transferability
NIPS 2023
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
EMNLP 2023