Zhiyong Wang
56 papers · 2012–2026 · 15 top CS/AI conferences
Achievements
Conference Polyglot (15)
Academic Marathon (13)
Hot Topic Early Bird
Keyword Pioneer
Cross-Pollinator (13)
Taxonomy Completionist (75)
Dynamic Duo (15)
Grand Slam
Century Club (49)
Unstoppable (5)
Trend Setter
Conference Pioneer
Prolific Year (10)
Keyword Collector (220)
Conferences
AAAI (14)
INTERSPEECH (7)
ICCV (5)
ECCV (4)
ICML (4)
NIPS (4)
ACL (3)
CVPR (3)
ICLR (3)
AISTATS (2)
NAACL (2)
WACV (2)
COLING (1)
EMNLP (1)
SEMEVAL (1)
Keywords
large language model (5)
graph neural network (4)
regret bound (4)
contextual bandit (4)
deepfake detection (3)
motion synthesis (3)
word embedding (2)
federated learning (2)
dueling bandit (2)
speaker verification (2)
attention mechanism (2)
diffusion model (2)
vision language model (2)
vision-language model (2)
multi-armed bandit (2)
online learning (2)
image generation (2)
3d reconstruction (2)
action recognition (2)
domain adaptation (2)
Papers
DuoCast: Duo-Probabilistic Diffusion for Precipitation Nowcasting
AAAI 2026
Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception
AAAI 2026
Federated Linear Dueling Bandits
AAAI 2026
Pb4U-GNet: Resolution-Adaptive Garment Simulation via Propagation-before-Update Graph Network
AAAI 2026
HKAFER: Achieve Visual Parameter-Efficient Fine-Tuning via Heterogeneous Kronecker Adaptation for Facial Expression Recognition
AAAI 2026
Self-Reflective Generation at Test Time
ACL 2026
Large Language Model-Enhanced Multi-Armed Bandits
ACL 2026
PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks
ICCV 2025
Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds
ICLR 2025
Diffusing to the Top: Boost Graph Neural Networks with Minimal Hyperparameter Tuning
ICLR 2025
Provable Zero-Shot Generalization in Offline Reinforcement Learning
ICML 2025
RI-MAE: Rotation-Invariant Masked AutoEncoders for Self-Supervised Point Cloud Representation Learning
AAAI 2025
DC-PCN: Point Cloud Completion Network with Dual-Codebook Guided Quantization
AAAI 2025
Federated In-Context Learning: Iterative Refinement for Improved Answer Quality
ICML 2025
Online Clustering of Dueling Bandits
ICML 2025
Variance-Dependent Regret Bounds for Nonstationary Linear Bandits
AISTATS 2025
HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding
COLING 2025
ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks
EMNLP 2025
When Graph Neural Networks Meet Dynamic Mode Decomposition
ICLR 2025
B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens
ICCV 2025
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
ICCV 2025
RobAVA: A Large-scale Dataset and Baseline Towards Video based Robotic Arm Action Understanding
ICCV 2025
Generalized Source Tracing: Detecting Novel Audio Deepfake Algorithm with Real Emphasis and Fake Dispersion Strategy
INTERSPEECH 2024
Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection
AAAI 2024
SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation
AAAI 2024
Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance
AAAI 2024
Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation
AAAI 2024
Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
AAAI 2024
AEDNet: Adaptive Embedding and Multiview-Aware Disentanglement for Point Cloud Completion
ECCV 2024
Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action Segmentation
ECCV 2024
Motion Keyframe Interpolation for Any Human Skeleton using Point Cloud-based Human Motion Data Homogenisation
ECCV 2024
Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging
ECCV 2024
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
ICML 2024
Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio
INTERSPEECH 2024
Generalized Fake Audio Detection via Deep Stable Learning
INTERSPEECH 2024
Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
INTERSPEECH 2024
PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation
INTERSPEECH 2024
Efficient Explorative Key-Term Selection Strategies for Conversational Contextual Bandits
AAAI 2023
Continuous Intermediate Token Learning With Implicit Motion Manifold for Keyframe Based Motion Interpolation
CVPR 2023
Multi-Scale Control Signal-Aware Transformer for Motion Synthesis without Phase
AAAI 2023
Adversarial Attacks on Online Learning to Rank with Click Feedback
NIPS 2023
Online Clustering of Bandits with Misspecified User Models
NIPS 2023
Online Corrupted User Detection and Regret Minimization
NIPS 2023
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark
NIPS 2023
Robust Audio Anti-Spoofing with Fusion-Reconstruction Learning on Multi-Order Spectrograms
INTERSPEECH 2023
Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning
ACL 2023
VAPCNet: Viewpoint-Aware 3D Point Cloud Completion
ICCV 2023
LiDARCap: Long-Range Marker-Less 3D Human Motion Capture With LiDAR Point Clouds
CVPR 2022
Sign Language Translation With Hierarchical Spatio-Temporal Graph Neural Network
WACV 2022
OTExtSum: Extractive Text Summarisation with Optimal Transport
NAACL 2022
1Cademy at Semeval-2022 Task 1: Investigating the Effectiveness of Multilingual, Multitask, and Language-Agnostic Tricks for the Reverse Dictionary Task
NAACL 2022
1Cademy at Semeval-2022 Task 1: Investigating the Effectiveness of Multilingual, Multitask, and Language-Agnostic Tricks for the Reverse Dictionary Task
SEMEVAL 2022
Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition
CVPR 2020
3D Hand Pose Estimation with Disentangled Cross-Modal Latent Space
WACV 2020
Speaker-Aware Monaural Speech Separation
INTERSPEECH 2020
A Two-Graph Guided Multi-task Lasso Approach for eQTL Mapping
AISTATS 2012