Zhenyu Wang

28 papers · 2018–2026 · 11 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (11) 🐝 Cross-Pollinator (8)

🌍 Conference Polyglot (11) 🏃 Academic Marathon (7) 🧭 Keyword Pioneer 👥 Mega-Team (37) 🔥 Unstoppable (6) ⚡ Prolific Year (6) 💎 Century Club (27) ❓ The Questioner 🗃️ Keyword Collector (146)

Conferences

NIPS (6) AAAI (5) CVPR (5) INTERSPEECH (4) EMNLP (2) ACL (1) COLING (1) ECCV (1) ICLR (1) IJCAI (1) RSS (1)

Top co-authors

Shengjin Wang (7) Yali Li (4) Yangyang Zhao (4) Ya-Li Li (4) Hengshuang Zhao (4) Mengzhen Liu (2) Hao Li (2) Hao Luo (2) Shanghang Zhang (2) Zhenhua Huang (2)

Keywords

deep reinforcement learning (3) attention mechanism (3) pseudo label (3) dialogue policy (3) vision transformer (2) deep q-network (2) vision-language model (2) model compression (2) policy learning (2) semi-supervised learning (2) multimodal learning (2) 3d object detection (2) uncertainty quantification (2) point cloud processing (2) point cloud (2) robot manipulation (2) domain generalization (2) reinforcement learning (2) zero-shot learning (2) autoregressive transformer (1)

Papers

MUSE: Multimodal Uncertainty-Based Self-Driven Evolution for Robust Physiological-Signal–Based Driver Fatigue Detection AAAI 2026 Layered Image Vectorization via Semantic Simplification CVPR 2025 PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches ICLR 2025 RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation RSS 2025 An Efficient Dialogue Policy Agent with Model-Based Causal Reinforcement Learning COLING 2025 PatternCIR Benchmark and TisCIR: Advancing Zero-Shot Composed Image Retrieval in Remote Sensing IJCAI 2025 Large Language Models in Bioinformatics: A Survey ACL 2025 Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning INTERSPEECH 2024 OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation ECCV 2024 RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation NIPS 2024 One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection NIPS 2024 GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing NIPS 2024 BVT-IMA: Binary Vision Transformer with Information-Modified Attention AAAI 2024 Uni3DETR: Unified 3D Detection Transformer NIPS 2023 SoulChat: Improving LLMs’ Empathy, Listening, and Comfort Abilities through Fine-tuning with Multi-turn Empathy Conversations EMNLP 2023 Detecting Everything in the Open World: Towards Universal Object Detection CVPR 2023 Noisy Boundaries: Lemon or Lemonade for Semi-Supervised Instance Segmentation? CVPR 2022 VTC-LFC: Vision Transformer Compression with Low-Frequency Components NIPS 2022 Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation CVPR 2022 Audio Anti-spoofing Using Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning INTERSPEECH 2022 Combating Noise: Semi-supervised Learning by Region Uncertainty Quantification NIPS 2021 Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object Detection CVPR 2021 Melodic Phrase Attention Network for Symbolic Data-based Music Genre Classification (Student Abstract) AAAI 2021 Automatic Curriculum Learning With Over-repetition Penalty for Dialogue Policy Learning AAAI 2021 Efficient Dialogue Complementary Policy Learning via Deep Q-network Policy and Episodic Memory Policy EMNLP 2021 Cross-Domain Adaptation with Discrepancy Minimization for Text-Independent Forensic Speaker Verification INTERSPEECH 2020 Dynamic Reward-Based Dueling Deep Dyna-Q: Robust Policy Learning in Noisy Environments AAAI 2020 EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System INTERSPEECH 2018