Guo Chen

21 papers · 2018–2026 · 8 top CS/AI conferences

Achievements

🧭 Keyword Pioneer · 🐝 Cross-Pollinator (4) · 🌍 Conference Polyglot (8) · 🏃 Academic Marathon (7) · 🌈 Renaissance Researcher (8) · 🧬 Topic Evolution · 🔥 Unstoppable (5) · 💎 Century Club (18) · ⚡ Prolific Year (8) · 🗃️ Keyword Collector (85)

Conferences

ICLR (6) · AAAI (5) · CVPR (4) · NSDI (2) · ECCV (1) · ICCV (1) · ICML (1) · IJCAI (1)

Top co-authors

Limin Wang (8) · Yifei Huang (8) · Jilan Xu (7) · Yali Wang (6) · Yu Qiao (6) · Tong Lu (6) · Baoqi Pei (5) · Kunchang Li (3) · Ping Luo (3) · Yinan He (3)

Keywords

video understanding (4) · multimodal learning (3) · retrieval-augmented generation (2) · egocentric vision (2) · multi-modal learning (2) · convolutional neural network (2) · temporal modeling (1) · vision transformer (1) · zero-shot learning (1) · knowledge distillation (1) · in-context learning (1) · video captioning (1) · vision-language alignment (1) · benchmark evaluation (1) · human motion (1) · scene understanding (1) · network architecture (1) · resource allocation (1) · resource management (1) · semantic segmentation (1)

Papers

Boosting Adversarial Transferability via Ensemble Non-Attention (AAAI 2026)
LiR3AG: A Lightweight Rerank Reasoning Strategy Framework for Retrieval-Augmented Generation (AAAI 2026)
Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys (AAAI 2026)
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation (ICLR 2025)
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios (ICLR 2025)
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning (ICLR 2025)
EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos (ICLR 2025)
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding (ICLR 2025)
Egocentric Object-Interaction Anticipation with Retentive and Predictive Learning (IJCAI 2025)
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation (ICLR 2024)
AVSegFormer: Audio-Visual Segmentation with Transformer (AAAI 2024)
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks (CVPR 2024)
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World (CVPR 2024)
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding (ECCV 2024)
Retrieval-Augmented Egocentric Video Captioning (CVPR 2024)
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark (CVPR 2024)
NeuralIndicator: Implicit Surface Reconstruction from Neural Indicator Priors (ICML 2024)
Memory-and-Anticipation Transformer for Online Action Understanding (ICCV 2023)
DCAN: Improving Temporal Action Detection via Dual Context Aggregation (AAAI 2022)
Direct Universal Access: Making Data Center Resources Available to FPGA (NSDI 2019)
Multi-Path Transport for RDMA in Datacenters (NSDI 2018)