Guo Chen
21 papers · 2018–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
🧭 Keyword Pioneer 🐝 Cross-Pollinator (4) 🌍 Conference Polyglot (8) 🏃 Academic Marathon (7) 🌈 Renaissance Researcher (8)
🐝
Cross-Pollinator
(4)
🧭
Keyword Pioneer
🌈
Renaissance Researcher
(8)
🧬
Topic Evolution
🔥
Unstoppable
(5)
💎
Century Club
(18)
⚡
Prolific Year
(8)
🗃️
Keyword Collector
(85)
Conferences
ICLR (6)
AAAI (5)
CVPR (4)
NSDI (2)
ECCV (1)
ICCV (1)
ICML (1)
IJCAI (1)
Top co-authors
Keywords
video understanding
(4)
multimodal learning
(3)
retrieval-augmented generation
(2)
egocentric vision
(2)
multi-modal learning
(2)
convolutional neural network
(2)
temporal modeling
(1)
vision transformer
(1)
zero-shot learning
(1)
knowledge distillation
(1)
in-context learning
(1)
video captioning
(1)
vision-language alignment
(1)
benchmark evaluation
(1)
human motion
(1)
scene understanding
(1)
network architecture
(1)
resource allocation
(1)
resource management
(1)
semantic segmentation
(1)
Papers
Boosting Adversarial Transferability via Ensemble Non-Attention
AAAI 2026
LiR3AG: A Lightweight Rerank Reasoning Strategy Framework for Retrieval-Augmented Generation
AAAI 2026
Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys
AAAI 2026
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
ICLR 2025
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
ICLR 2025
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
ICLR 2025
EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos
ICLR 2025
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
ICLR 2025
Egocentric Object-Interaction Anticipation with Retentive and Predictive Learning
IJCAI 2025
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
ICLR 2024
AVSegFormer: Audio-Visual Segmentation with Transformer
AAAI 2024
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
CVPR 2024
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
CVPR 2024
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding
ECCV 2024
Retrieval-Augmented Egocentric Video Captioning
CVPR 2024
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
CVPR 2024
NeuralIndicator: Implicit Surface Reconstruction from Neural Indicator Priors
ICML 2024
Memory-and-Anticipation Transformer for Online Action Understanding
ICCV 2023
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
AAAI 2022
Direct Universal Access: Making Data Center Resources Available to FPGA
NSDI 2019
Multi-Path Transport for RDMA in Datacenters
NSDI 2018