Zhiyong Wang

56 papers · 2012–2026 · 15 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (15) 🏃 Academic Marathon (13) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🐝 Cross-Pollinator (13)

🗺️ Taxonomy Completionist (75) 🤝 Dynamic Duo (15) 🏆 Grand Slam 💎 Century Club (49) 🔥 Unstoppable (5) 📈 Trend Setter 🚀 Conference Pioneer ⚡ Prolific Year (10) 🗃️ Keyword Collector (220)

Conferences

AAAI (14) INTERSPEECH (7) ICCV (5) ECCV (4) ICML (4) NIPS (4) ACL (3) CVPR (3) ICLR (3) AISTATS (2) NAACL (2) WACV (2) COLING (1) EMNLP (1) SEMEVAL (1)

Top co-authors

Kun Hu (17) John C.S. Lui (9) Xiaopeng Wang (6) Ruibo Fu (6) Jianhua Tao (5) Zhengqi Wen (5) Yuankun Xie (5) Shuai Li (5) Lei Bai (5) Xin Qi (4)

Keywords

large language model (5) graph neural network (4) regret bound (4) contextual bandit (4) deepfake detection (3) motion synthesis (3) word embedding (2) federated learning (2) dueling bandit (2) speaker verification (2) attention mechanism (2) diffusion model (2) vision language model (2) vision-language model (2) multi-armed bandit (2) online learning (2) image generation (2) 3d reconstruction (2) action recognition (2) domain adaptation (2)

Papers

DuoCast: Duo-Probabilistic Diffusion for Precipitation Nowcasting AAAI 2026
Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception AAAI 2026
Federated Linear Dueling Bandits AAAI 2026
Pb4U-GNet: Resolution-Adaptive Garment Simulation via Propagation-before-Update Graph Network AAAI 2026
HKAFER: Achieve Visual Parameter-Efficient Fine-Tuning via Heterogeneous Kronecker Adaptation for Facial Expression Recognition AAAI 2026
Self-Reflective Generation at Test Time ACL 2026
Large Language Model-Enhanced Multi-Armed Bandits ACL 2026
PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks ICCV 2025
Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds ICLR 2025
Diffusing to the Top: Boost Graph Neural Networks with Minimal Hyperparameter Tuning ICLR 2025
Provable Zero-Shot Generalization in Offline Reinforcement Learning ICML 2025
RI-MAE: Rotation-Invariant Masked AutoEncoders for Self-Supervised Point Cloud Representation Learning AAAI 2025
DC-PCN: Point Cloud Completion Network with Dual-Codebook Guided Quantization AAAI 2025
Federated In-Context Learning: Iterative Refinement for Improved Answer Quality ICML 2025
Online Clustering of Dueling Bandits ICML 2025
Variance-Dependent Regret Bounds for Nonstationary Linear Bandits AISTATS 2025
HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding COLING 2025
ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks EMNLP 2025
When Graph Neural Networks Meet Dynamic Mode Decomposition ICLR 2025
B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens ICCV 2025
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior ICCV 2025
RobAVA: A Large-scale Dataset and Baseline Towards Video based Robotic Arm Action Understanding ICCV 2025
Generalized Source Tracing: Detecting Novel Audio Deepfake Algorithm with Real Emphasis and Fake Dispersion Strategy INTERSPEECH 2024
Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection AAAI 2024
SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation AAAI 2024
Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance AAAI 2024
Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation AAAI 2024
Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users AAAI 2024
AEDNet: Adaptive Embedding and Multiview-Aware Disentanglement for Point Cloud Completion ECCV 2024
Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action Segmentation ECCV 2024
Motion Keyframe Interpolation for Any Human Skeleton using Point Cloud-based Human Motion Data Homogenisation ECCV 2024
Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging ECCV 2024
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond ICML 2024
Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio INTERSPEECH 2024
Generalized Fake Audio Detection via Deep Stable Learning INTERSPEECH 2024
Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection INTERSPEECH 2024
PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation INTERSPEECH 2024
Efficient Explorative Key-Term Selection Strategies for Conversational Contextual Bandits AAAI 2023
Continuous Intermediate Token Learning With Implicit Motion Manifold for Keyframe Based Motion Interpolation CVPR 2023
Multi-Scale Control Signal-Aware Transformer for Motion Synthesis without Phase AAAI 2023
Adversarial Attacks on Online Learning to Rank with Click Feedback NIPS 2023
Online Clustering of Bandits with Misspecified User Models NIPS 2023
Online Corrupted User Detection and Regret Minimization NIPS 2023
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark NIPS 2023
Robust Audio Anti-Spoofing with Fusion-Reconstruction Learning on Multi-Order Spectrograms INTERSPEECH 2023
Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning ACL 2023
VAPCNet: Viewpoint-Aware 3D Point Cloud Completion ICCV 2023
LiDARCap: Long-Range Marker-Less 3D Human Motion Capture With LiDAR Point Clouds CVPR 2022
Sign Language Translation With Hierarchical Spatio-Temporal Graph Neural Network WACV 2022
OTExtSum: Extractive Text Summarisation with Optimal Transport NAACL 2022
1Cademy at Semeval-2022 Task 1: Investigating the Effectiveness of Multilingual, Multitask, and Language-Agnostic Tricks for the Reverse Dictionary Task NAACL 2022
1Cademy at Semeval-2022 Task 1: Investigating the Effectiveness of Multilingual, Multitask, and Language-Agnostic Tricks for the Reverse Dictionary Task SEMEVAL 2022
Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition CVPR 2020
3D Hand Pose Estimation with Disentangled Cross-Modal Latent Space WACV 2020
Speaker-Aware Monaural Speech Separation INTERSPEECH 2020
A Two-Graph Guided Multi-task Lasso Approach for eQTL Mapping AISTATS 2012