Zhiyuan Ma
31 papers · 2021–2026 · 7 conferences · across top CS/AI conferences
Achievements
🐣 Hot Topic Early Bird
🐝 Cross-Pollinator (13)
🌍 Conference Polyglot (7)
🧭 Keyword Pioneer
🏃 Academic Marathon (5)
🧬 Topic Evolution
🗃️ Keyword Collector (165)
💎 Century Club (27)
🔥 Unstoppable (5)
⚡ Prolific Year (9)
Conferences
AAAI (10)
CVPR (7)
ACL (4)
NIPS (4)
COLING (2)
ECCV (2)
EMNLP (2)
Keywords
diffusion model (7)
multimodal learning (3)
knowledge distillation (3)
large language model (3)
retrieval-augmented generation (3)
knowledge retrieval (2)
3d reconstruction (2)
multi-view generation (2)
few-shot learning (2)
attention mechanism (2)
task-oriented dialogue (2)
preference optimization (2)
residual learning (2)
knowledge grounding (2)
visual question answering (2)
video generation (2)
reinforcement learning (2)
multi-modal learning (2)
vision-language model (2)
intention reasoning (2)
Papers
AlignCVC: Aligning Cross-View Consistency for Single-Image-to-3D Generation
AAAI 2026
I2E: From Image Pixels to Actionable Interactive Environments for Text-Guided Image Editing
ACL 2026
OPERA: A Reinforcement Learning–Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval
AAAI 2026
CCAHCL: Multi-Level Hypergraph Contrastive Learning for Connected Component Awareness
AAAI 2026
Gumbel Reranking: Differentiable End-to-End Reranker Optimization
ACL 2025
Zero-Shot Blind-spot Image Denoising via Implicit Neural Sampling
CVPR 2025
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
CVPR 2025
Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines
AAAI 2025
Automated Creation of Reusable and Diverse Toolsets for Enhancing LLM Reasoning
AAAI 2025
Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach
CVPR 2025
MVBoost: Boost 3D Reconstruction with Multi-View Refinement
CVPR 2025
DreamAlign: Dynamic Text-to-3D Optimization with Human Preference Alignment
AAAI 2025
VideoDirector: Precise Video Editing via Text-to-Video Models
CVPR 2025
Enhancing Distantly Supervised Named Entity Recognition with Strong Label Guided Lottery Training
COLING 2024
UltraMedical: Building Specialized Generalists in Biomedicine
NIPS 2024
One-Step Effective Diffusion Network for Real-World Image Super-Resolution
NIPS 2024
Neural Residual Diffusion Models for Deep Scalable Vision Generation
NIPS 2024
LMD: Faster Image Reconstruction with Latent Masking Diffusion
AAAI 2024
AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing
AAAI 2024
Generative Multi-Modal Knowledge Retrieval with Large Language Models
AAAI 2024
Exploring Adversarial Robustness of Deep State Space Models
NIPS 2024
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
CVPR 2024
ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation
ECCV 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
ECCV 2024
Mirror-Consistency: Harnessing Inconsistency in Majority Voting
EMNLP 2024
HybridPrompt: Bridging Language Models and Human Priors in Prompt Tuning for Visual Question Answering
AAAI 2023
OTAvatar: One-Shot Talking Face Avatar With Controllable Tri-Plane Rendering
CVPR 2023
Noise-Robust Training with Dynamic Loss and Contrastive Learning for Distantly-Supervised Named Entity Recognition
ACL 2023
GLAF: Global-to-Local Aggregation and Fission Network for Semantic Level Fact Verification
COLING 2022
UniTranSeR: A Unified Transformer Semantic Representation Framework for Multimodal Task-Oriented Dialog System
ACL 2022
Intention Reasoning Network for Multi-Domain End-to-end Task-Oriented Dialogue
EMNLP 2021