Zhongyuan Wang

62 papers · 2015–2026 · 13 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🌍 Conference Polyglot (13) 🏃 Academic Marathon (10) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (7)

🌍 Conference Polyglot (13) 🐣 Hot Topic Early Bird 🏃 Academic Marathon (10) 🔬 Deep Specialist (14) 🧬 Topic Evolution 🔥 Unstoppable (7) ⚡ Prolific Year (14) 🚀 Conference Pioneer 💎 Century Club (57) ❓ The Questioner (2) 📈 Trend Setter 🗃️ Keyword Collector (294)

Conferences

AAAI (13) CVPR (12) COLING (8) EMNLP (7) IJCAI (7) ACL (4) ICCV (4) NIPS (2) AACL (1) INTERSPEECH (1) MICCAI (1) RSS (1) WACV (1)

Top co-authors

Jiayi Ma (6) Kui Jiang (6) Xing Wu (5) Peng Yi (5) Fuzheng Zhang (5) Songlin Hu (5) Ruiji Fu (5) Junjun Jiang (4) Zijia Lin (4) Jizhong Han (4)

Research topics

Applications (1)

Keywords

contrastive learning (7) unsupervised learning (5) latent space (3) multimodal learning (3) sentence embedding (3) self-supervised learning (3) deepfake detection (3) face forgery detection (3) convolutional neural network (2) temporal modeling (2) knowledge base (2) representation learning (2) disfluency detection (2) adversarial attack (2) language model (2) video understanding (2) image restoration (2) video super-resolution (2) information retrieval (2) attention mechanism (2)

Papers

NavA3: Understanding Any Instruction, Navigating Anywhere, Finding Anything ACL 2026 Towards Effective Code-Integrated Reasoning AAAI 2026 GLoMOT: Efficient Online GNN-based Low-Frame-Rate Multi-Object Tracker AAAI 2026 Rethinking Surgical Smoke: A Smoke-Type-Aware Laparoscopic Video Desmoking Method and Dataset AAAI 2026 DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization AAAI 2026 Multi-Shape Matching with Cycle Consistency Basis via Functional Maps AAAI 2025 Cross-Modal Stealth: A Coarse-to-Fine Attack Framework for RGB-T Tracker AAAI 2025 OODML: Whole Slide Image Classification Meets Online Pseudo-Supervision and Dynamic Mutual Learning AAAI 2025 Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks RSS 2025 MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation ACL 2025 ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges ICCV 2025 SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis EMNLP 2025 Lift3D Policy: Lifting 2D Foundation Models for Robust 3D Robotic Manipulation CVPR 2025 Rethinking the Adversarial Robustness of Multi-Exit Neural Networks in an Attack-Defense Game CVPR 2025 Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection CVPR 2025 Link-based Contrastive Learning for One-Shot Unsupervised Domain Adaptation CVPR 2025 Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection CVPR 2025 RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete CVPR 2025 FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos AAAI 2025 Refining Intraocular Lens Power Calculation: A Multi-modal Framework Using Cross-layer Attention and Effective Channel Attention MICCAI 2024 Can We Leave Deepfake Data Behind in Training Deepfake Detector? NIPS 2024 Code-Style In-Context Learning for Knowledge-Based Question Answering AAAI 2024 Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios ACL 2024 Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs COLING 2024 Decompose, Prioritize, and Eliminate: Dynamically Integrating Diverse Representations for Multimodal Named Entity Recognition COLING 2024 Learning Multi-Dimensional Human Preference for Text-to-Image Generation CVPR 2024 CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language Models EMNLP 2024 GUIDE: A Guideline-Guided Dataset for Instructional Video Comprehension IJCAI 2024 Exploring Sentence Type Effects on the Lombard Effect and Intelligibility Enhancement: A Comparative Study of Natural and Grid Sentences INTERSPEECH 2024 Improving Vision-and-Language Reasoning via Spatial Relations Modeling WACV 2024 GTR: A Grafting-Then-Reassembling Framework for Dynamic Scene Graph Generation IJCAI 2023 Augmentation-Aware Self-Supervision for Data-Efficient GAN Training NIPS 2023 LSTFE-Net:Long Short-Term Feature Enhancement Network for Video Small Object Detection CVPR 2023 ConTextual Masked Auto-Encoder for Dense Passage Retrieval AAAI 2023 FEditNet: Few-Shot Editing of Latent Semantics in GAN Spaces AAAI 2023 Implicit Identity Driven Deepfake Face Swapping Detection CVPR 2023 Adaptive Unsupervised Self-training for Disfluency Detection COLING 2022 InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings EMNLP 2022 DANet: Image Deraining via Dynamic Association Learning IJCAI 2022 Degrade Is Upgrade: Learning Degradation for Low-Light Image Enhancement AAAI 2022 Smoothed Contrastive Learning for Unsupervised Sentence Embedding COLING 2022 ESimCSE: Enhanced Sample Building Method for Contrastive Learning of Unsupervised Sentence Embedding COLING 2022 RaP: Redundancy-aware Video-language Pre-training for Text-Video Retrieval EMNLP 2022 Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing CVPR 2022 HiT: Hierarchical Transformer With Momentum Contrast for Video-Text Retrieval ICCV 2021 Dynamic Inconsistency-aware DeepFake Video Detection IJCAI 2021 Frequency-Aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection CVPR 2021 Converse, Focus and Guess - Towards Multi-Document Driven Dialogue AAAI 2021 Omniscient Video Super-Resolution ICCV 2021 Multi-Scale Progressive Fusion Network for Single Image Deraining CVPR 2020 Syntactic Graph Convolutional Network for Spoken Language Understanding COLING 2020 Table Fact Verification with Structure-Aware Transformer EMNLP 2020 Combining Self-Training and Self-Supervised Learning for Unsupervised Disfluency Detection EMNLP 2020 Learn with Noisy Data via Unsupervised Loss Correction for Weakly Supervised Reading Comprehension COLING 2020 Combining ResNet and Transformer for Chinese Grammatical Error Diagnosis AACL 2020 Progressive Fusion Video Super-Resolution Network via Exploiting Non-Local Spatio-Temporal Correlations ICCV 2019 Earlier Attention? Aspect-Aware LSTM for Aspect-Based Sentiment Analysis IJCAI 2019 Combining Knowledge with Deep Convolutional Neural Networks for Short Text Classification IJCAI 2017 Probabilistic Prototype Model for Serendipitous Property Mining COLING 2016 Understanding Short Texts ACL 2016 Syntactic Parsing of Web Queries EMNLP 2016 Query Understanding through Knowledge-Based Conceptualization IJCAI 2015