Yiran Zhong
34 papers · 2016–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Conference Polyglot (8) π Academic Marathon (9) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (13)
π
Cross-Pollinator
(13)
π
Renaissance Researcher
(5)
πΊοΈ
Taxonomy Completionist
(60)
π§¬
Topic Evolution
π€
Dynamic Duo
(13)
π
Triple Crown
π
Grand Slam
π
Century Club
(33)
π
Conference Pioneer
β‘
Prolific Year
(5)
ποΈ
Keyword Collector
(144)
π₯
Unstoppable
(8)
Conferences
CVPR (11)
AAAI (6)
ECCV (4)
NIPS (4)
EMNLP (3)
ICLR (3)
ICCV (2)
ICML (1)
Top co-authors
Keywords
multimodal learning
(6)
optical flow
(5)
depth estimation
(5)
attention mechanism
(4)
multi-modal learning
(4)
optical flow estimation
(3)
sequence modeling
(3)
stereo matching
(3)
motion estimation
(3)
transformer architecture
(3)
language modeling
(3)
video understanding
(3)
visual slam
(2)
linear complexity
(2)
audio-visual learning
(2)
3d vision
(2)
semantic segmentation
(2)
unsupervised learning
(2)
3d reconstruction
(2)
state space model
(2)
Papers
Learning Spatial Decay for Vision Transformers
AAAI 2026
Deep Non-Rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling
AAAI 2025
Tri-Ergon: Fine-Grained Video-to-Audio Generation with Multi-Modal Conditions and LUFS Control
AAAI 2025
Towards Open-Vocabulary Audio-Visual Event Localization
CVPR 2025
Exploring Transformer Extrapolation
AAAI 2024
CO2: Efficient Distributed Training with Full Communication-Computation Overlap
ICLR 2024
Label-anticipated Event Disentanglement for Audio-Visual Video Parsing
ECCV 2024
Scaling Laws for Linear Complexity Language Models
EMNLP 2024
MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map
NIPS 2024
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention
ICML 2024
Improving Audio-Visual Segmentation with Bidirectional Generation
AAAI 2024
Fine-Grained Audible Video Description
CVPR 2023
Hierarchically Gated Recurrent Neural Network for Sequence Modeling
NIPS 2023
Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning
CVPR 2023
Accelerating Toeplitz Neural Network with Constant-time Inference Complexity
EMNLP 2023
Multimodal Variational Auto-encoder based Audio-Visual Segmentation
ICCV 2023
Toeplitz Neural Network for Sequence Modeling
ICLR 2023
AudioβVisual Segmentation
ECCV 2022
Implicit Motion Handling for Video Camouflaged Object Detection
CVPR 2022
The Devil in Linear Transformer
EMNLP 2022
cosFormer: Rethinking Softmax In Attention
ICLR 2022
Transcribing Natural Languages for the Deaf via Neural Editing Programs
AAAI 2022
Deep Two-View Structure-From-Motion Revisited
CVPR 2021
RGB-D Saliency Detection via Cascaded Mutual Information Minimization
ICCV 2021
ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring
CVPR 2021
Positive Sample Propagation Along the Audio-Visual Event Line
CVPR 2021
Displacement-Invariant Matching Cost Learning for Accurate Optical Flow Estimation
NIPS 2020
Hierarchical Neural Architecture Search for Deep Stereo Matching
NIPS 2020
Deblurring by Realistic Blurring
CVPR 2020
Noise-Aware Unsupervised Deep Lidar-Stereo Fusion
CVPR 2019
Unsupervised Deep Epipolar Flow for Stationary or Dynamic Scenes
CVPR 2019
Stereo Computation for a Single Mixture Image
ECCV 2018
Open-World Stereo Video Matching with Deep RNN
ECCV 2018
Robust Multi-Body Feature Tracker: A Segmentation-Free Approach
CVPR 2016