Zhiyuan Ma

31 papers · 2021–2026 · 7 top CS/AI conferences

Achievements

🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (13) 🌍 Conference Polyglot (7) 🧭 Keyword Pioneer 🏃 Academic Marathon (5) 🧬 Topic Evolution 🗃️ Keyword Collector (165) 💎 Century Club (27) 🔥 Unstoppable (5) ⚡ Prolific Year (9)

Conferences

AAAI (10) CVPR (7) ACL (4) NIPS (4) COLING (2) ECCV (2) EMNLP (2)

Top co-authors

Lei Zhang (8) Bowen Zhou (8) Jianjun Li (7) Zhen Lei (4) Biqing Qi (4) Guohui Li (4) Xiangyu Zhu (4) Jintao Du (4) Lingchen Sun (3) Rongyuan Wu (3)

Keywords

diffusion model (7) multimodal learning (3) knowledge distillation (3) large language model (3) retrieval-augmented generation (3) knowledge retrieval (2) 3d reconstruction (2) multi-view generation (2) few-shot learning (2) attention mechanism (2) task-oriented dialogue (2) preference optimization (2) residual learning (2) knowledge grounding (2) visual question answering (2) video generation (2) reinforcement learning (2) multi-modal learning (2) vision-language model (2) intention reasoning (2)

Papers

AlignCVC: Aligning Cross-View Consistency for Single-Image-to-3D Generation · AAAI 2026
I2E: From Image Pixels to Actionable Interactive Environments for Text-Guided Image Editing · ACL 2026
OPERA: A Reinforcement Learning-Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval · AAAI 2026
CCAHCL: Multi-Level Hypergraph Contrastive Learning for Connected Component Awareness · AAAI 2026
Gumbel Reranking: Differentiable End-to-End Reranker Optimization · ACL 2025
Zero-Shot Blind-spot Image Denoising via Implicit Neural Sampling · CVPR 2025
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data · CVPR 2025
Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines · AAAI 2025
Automated Creation of Reusable and Diverse Toolsets for Enhancing LLM Reasoning · AAAI 2025
Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach · CVPR 2025
MVBoost: Boost 3D Reconstruction with Multi-View Refinement · CVPR 2025
DreamAlign: Dynamic Text-to-3D Optimization with Human Preference Alignment · AAAI 2025
VideoDirector: Precise Video Editing via Text-to-Video Models · CVPR 2025
Enhancing Distantly Supervised Named Entity Recognition with Strong Label Guided Lottery Training · COLING 2024
UltraMedical: Building Specialized Generalists in Biomedicine · NIPS 2024
One-Step Effective Diffusion Network for Real-World Image Super-Resolution · NIPS 2024
Neural Residual Diffusion Models for Deep Scalable Vision Generation · NIPS 2024
LMD: Faster Image Reconstruction with Latent Masking Diffusion · AAAI 2024
AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing · AAAI 2024
Generative Multi-Modal Knowledge Retrieval with Large Language Models · AAAI 2024
Exploring Adversarial Robustness of Deep State Space Models · NIPS 2024
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models · CVPR 2024
ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation · ECCV 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding · ECCV 2024
Mirror-Consistency: Harnessing Inconsistency in Majority Voting · EMNLP 2024
HybridPrompt: Bridging Language Models and Human Priors in Prompt Tuning for Visual Question Answering · AAAI 2023
OTAvatar: One-Shot Talking Face Avatar With Controllable Tri-Plane Rendering · CVPR 2023
Noise-Robust Training with Dynamic Loss and Contrastive Learning for Distantly-Supervised Named Entity Recognition · ACL 2023
GLAF: Global-to-Local Aggregation and Fission Network for Semantic Level Fact Verification · COLING 2022
UniTranSeR: A Unified Transformer Semantic Representation Framework for Multimodal Task-Oriented Dialog System · ACL 2022
Intention Reasoning Network for Multi-Domain End-to-end Task-Oriented Dialogue · EMNLP 2021