Shusuke Takahashi
6 papers · 2021–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (5) π Cross-Pollinator (13) π Renaissance Researcher (5)
πΊοΈ
Taxonomy Completionist
(18)
Conferences
INTERSPEECH (2)
ICCV (1)
ICLR (1)
ICML (1)
NIPS (1)
Top co-authors
Keywords
generative model
(2)
diffusion model
(2)
deep clustering
(1)
speech enhancement
(1)
speaker embedding
(1)
vector quantization
(1)
audio-visual correspondence
(1)
variational autoencoder
(1)
perceptual quality
(1)
text-to-video generation
(1)
latent optimization
(1)
inference-time alignment
(1)
video processing
(1)
discrete representation
(1)
stochastic quantization
(1)
direction of arrival
(1)
metric discriminator
(1)
model refinement
(1)
spatial audio
(1)
angular margin
(1)
Papers
TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models
ICCV 2025
Mining your own secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
ICLR 2025
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
NIPS 2023
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
INTERSPEECH 2023
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
ICML 2022
Manifold-Aware Deep Clustering: Maximizing Angles Between Embedding Vectors Based on Regular Simplex
INTERSPEECH 2021