Nakamasa Inoue
29 papers · 2013–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Cross-Pollinator (13) π Conference Polyglot (10) π Academic Marathon (13) π§ Keyword Pioneer π Renaissance Researcher (6)
π
Renaissance Researcher
(6)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(50)
π
Grand Slam
π€
Dynamic Duo
(11)
β‘
Prolific Year
(8)
π
Conference Pioneer
π₯
Unstoppable
(5)
π
Century Club
(28)
ποΈ
Keyword Collector
(122)
β
The Questioner
(2)
Conferences
ICCV (8)
AAAI (5)
WACV (4)
ECCV (3)
INTERSPEECH (3)
CVPR (2)
AISTATS (1)
ICLR (1)
ICML (1)
NIPS (1)
Top co-authors
Research topics
Keywords
vision transformer
(5)
image classification
(5)
formula-driven supervised learning
(4)
vision-language model
(3)
data augmentation
(3)
object detection
(2)
transfer learning
(2)
self-supervised learning
(2)
neural network
(2)
target propagation
(2)
diffusion model
(2)
formula-driven supervision
(2)
synthetic image
(2)
multi-modal learning
(1)
test-time adaptation
(1)
autonomous driving
(1)
attention mechanism
(1)
fine-grained classification
(1)
vision-language navigation
(1)
instance segmentation
(1)
Papers
DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions
WACV 2026
DISCODE: Distribution-Aware Score Decoder for Robust Automatic Evaluation of Image Captioning
AAAI 2026
Referring Expression Comprehension for Small Objects
ICCV 2025
Rectified Lagrangian for Out-of-Distribution Detection in Modern Hopfield Networks
AAAI 2025
CityNav: A Large-Scale Dataset for Real-World Aerial Navigation
ICCV 2025
GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields
ICCV 2025
AnimalClue: Recognizing Animals by their Traces
ICCV 2025
AgroBench: Vision-Language Model Benchmark in Agriculture
ICCV 2025
HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis
ICLR 2025
Diffusion-Based Generative Regularization for Supervised Discriminative Learning
WACV 2025
Formula-Supervised Visual-Geometric Pre-training
ECCV 2024
Locally Aligned Rectified Flow Model for Speech Enhancement Towards Single-Step Diffusion
INTERSPEECH 2024
Efficient Target Propagation by Deriving Analytical Solution
AAAI 2024
Scaling Backwards: Minimal Synthetic Pre-training?
ECCV 2024
Rethinking Image Super Resolution from Training Data Perspectives
ECCV 2024
CityRefer: Geography-aware 3D Visual Grounding Dataset on City-scale Point Cloud Data
NIPS 2023
Text-Guided Object Detector for Multi-Modal Video Question Answering
WACV 2023
Fixed-Weight Difference Target Propagation
AAAI 2023
Visual Atoms: Pre-Training Vision Transformers With Sinusoidal Waves
CVPR 2023
Learning with Partial Forgetting in Modern Hopfield Networks
AISTATS 2023
Pre-training Vision Transformers with Very Limited Synthesized Images
ICCV 2023
SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning
ICCV 2023
Can Vision Transformers Learn without Natural Images?
AAAI 2022
Spatiotemporal Initialization for 3D CNNs With Generated Motion Patterns
WACV 2022
Replacing Labeled Real-Image Datasets With Auto-Generated Contours
CVPR 2022
PoF: Post-Training of Feature Extractor for Improving Generalization
ICML 2022
Detecting Alzheimerβs Disease Using Gated Convolutional Neural Network from Audio Data
INTERSPEECH 2018
I-vector Transformation Using Conditional Generative Adversarial Networks for Short Utterance Speaker Verification
INTERSPEECH 2018
Neighbor-to-Neighbor Search for Fast Coding of Feature Vectors
ICCV 2013