Oriol Nieto
8 papers · 2023–2025 · 5 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (5)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (13)
Keyword Pioneer
Hot Topic Early Bird
Cross-Pollinator (15)
Conferences
ICLR (3)
CVPR (2)
EMNLP (1)
ICML (1)
INTERSPEECH (1)
Keywords
multimodal learning (3)
audio-visual learning (1)
efficient computing (1)
instruction tuning (1)
audio source separation (1)
multilabel classification (1)
convolutional neural network (1)
vision-language model (1)
complex reasoning (1)
open-set recognition (1)
audio synthesis (1)
audio-language model (1)
audio-visual source separation (1)
audio separation (1)
large language model (1)
sound generation (1)
audio understanding (1)
audio language model (1)
multimodal conditioning (1)
video-guided generation (1)
Papers
FLAM: Frame-Wise Language-Audio Modeling
ICML 2025
Video-Guided Foley Sound Generation with Multimodal Controls
CVPR 2025
Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs
ICLR 2025
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
ICLR 2025
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
EMNLP 2024
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
ICLR 2024
Efficient Spoken Language Recognition via Multilabel Classification
INTERSPEECH 2023
Language-Guided Audio-Visual Source Separation via Trimodal Consistency
CVPR 2023