Chiori Hori
18 papers · 2003–2024 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) πΊοΈ Taxonomy Completionist (11) π£ Hot Topic Early Bird
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Interdisciplinary Bridge
π¬
Deep Specialist
(10)
ποΈ
Keyword Collector
(63)
π
Trend Setter
π
Century Club
(18)
π₯
Unstoppable
(6)
π
Conference Pioneer
Conferences
INTERSPEECH (8)
AAAI (2)
COLING (2)
IJCNLP (2)
ACL (1)
CVPR (1)
ICCV (1)
WACV (1)
Top co-authors
Keywords
multimodal learning
(5)
video understanding
(5)
dialogue system
(3)
video captioning
(3)
automatic speech recognition
(2)
action recognition
(2)
audio-visual transformer
(2)
end-to-end speech recognition
(2)
video question answering
(2)
dialog generation
(2)
long-context speech recognition
(2)
scene graph
(2)
spoken language understanding
(1)
multi-modal learning
(1)
attention mechanism
(1)
audio-visual learning
(1)
recurrent neural network
(1)
contextual information
(1)
visual question answering
(1)
event detection
(1)
Papers
ZeroST: Zero-Shot Speech Translation
INTERSPEECH 2024
Style-transfer based Speech and Audio-visual Scene understanding for Robot Action Sequence Acquisition from Videos
INTERSPEECH 2023
(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
AAAI 2022
Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers
INTERSPEECH 2022
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers
AAAI 2021
Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers
INTERSPEECH 2021
Advanced Long-Context End-to-End Speech Recognition Using Context-Expanded Transformers
INTERSPEECH 2021
Spatio-Temporal Ranked-Attention Networks for Video Captioning
WACV 2020
Transformer-Based Long-Context End-to-End Speech Recognition
INTERSPEECH 2020
Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog
INTERSPEECH 2019
Audio Visual Scene-Aware Dialog
CVPR 2019
Attention-Based Multimodal Fusion for Video Description
ICCV 2017
Context-Sensitive and Role-Dependent Spoken Language Understanding Using Bidirectional and Attention LSTMs
INTERSPEECH 2016
Recurrent Neural Network-based Tuple Sequence Model for Machine Translation
COLING 2014
Factored Language Model based on Recurrent Neural Network
COLING 2012
Improving Related Entity Finding via Incorporating Homepages and Recognizing Fine-grained Entities
IJCNLP 2011
Answering Complex Questions via Exploiting Social Q&A Collection
IJCNLP 2011
Spoken Interactive ODQA System: SPIQA
ACL 2003