Zhenyu Wang
28 papers · 2018–2026 · 11 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (11)
Academic Marathon (7)
Keyword Pioneer
Interdisciplinary Bridge
Cross-Pollinator (8)
Mega-Team (37)
Unstoppable (6)
Prolific Year (6)
Century Club (27)
The Questioner
Keyword Collector (146)
Conferences
NIPS (6)
AAAI (5)
CVPR (5)
INTERSPEECH (4)
EMNLP (2)
ACL (1)
COLING (1)
ECCV (1)
ICLR (1)
IJCAI (1)
RSS (1)
Keywords
deep reinforcement learning (3)
attention mechanism (3)
pseudo label (3)
dialogue policy (3)
vision transformer (2)
deep q-network (2)
vision-language model (2)
model compression (2)
policy learning (2)
semi-supervised learning (2)
multimodal learning (2)
3d object detection (2)
uncertainty quantification (2)
point cloud processing (2)
point cloud (2)
robot manipulation (2)
domain generalization (2)
reinforcement learning (2)
zero-shot learning (2)
autoregressive transformer (1)
Papers
MUSE: Multimodal Uncertainty-Based Self-Driven Evolution for Robust Physiological-Signal–Based Driver Fatigue Detection
AAAI 2026
Layered Image Vectorization via Semantic Simplification
CVPR 2025
PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches
ICLR 2025
RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation
RSS 2025
An Efficient Dialogue Policy Agent with Model-Based Causal Reinforcement Learning
COLING 2025
PatternCIR Benchmark and TisCIR: Advancing Zero-Shot Composed Image Retrieval in Remote Sensing
IJCAI 2025
Large Language Models in Bioinformatics: A Survey
ACL 2025
Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning
INTERSPEECH 2024
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation
ECCV 2024
RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation
NIPS 2024
One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection
NIPS 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
NIPS 2024
BVT-IMA: Binary Vision Transformer with Information-Modified Attention
AAAI 2024
Uni3DETR: Unified 3D Detection Transformer
NIPS 2023
SoulChat: Improving LLMs' Empathy, Listening, and Comfort Abilities through Fine-tuning with Multi-turn Empathy Conversations
EMNLP 2023
Detecting Everything in the Open World: Towards Universal Object Detection
CVPR 2023
Noisy Boundaries: Lemon or Lemonade for Semi-Supervised Instance Segmentation?
CVPR 2022
VTC-LFC: Vision Transformer Compression with Low-Frequency Components
NIPS 2022
Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation
CVPR 2022
Audio Anti-spoofing Using Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning
INTERSPEECH 2022
Combating Noise: Semi-supervised Learning by Region Uncertainty Quantification
NIPS 2021
Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object Detection
CVPR 2021
Melodic Phrase Attention Network for Symbolic Data-based Music Genre Classification (Student Abstract)
AAAI 2021
Automatic Curriculum Learning With Over-repetition Penalty for Dialogue Policy Learning
AAAI 2021
Efficient Dialogue Complementary Policy Learning via Deep Q-network Policy and Episodic Memory Policy
EMNLP 2021
Cross-Domain Adaptation with Discrepancy Minimization for Text-Independent Forensic Speaker Verification
INTERSPEECH 2020
Dynamic Reward-Based Dueling Deep Dyna-Q: Robust Policy Learning in Noisy Environments
AAAI 2020
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
INTERSPEECH 2018