Zhongyuan Wang
62 papers · 2015–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
🌍 Conference Polyglot (13) 🏃 Academic Marathon (10) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (7)
🌍
Conference Polyglot
(13)
🐣
Hot Topic Early Bird
🏃
Academic Marathon
(10)
🔬
Deep Specialist
(14)
🧬
Topic Evolution
🔥
Unstoppable
(7)
⚡
Prolific Year
(14)
🚀
Conference Pioneer
💎
Century Club
(57)
❓
The Questioner
(2)
📈
Trend Setter
🗃️
Keyword Collector
(294)
Conferences
AAAI (13)
CVPR (12)
COLING (8)
EMNLP (7)
IJCAI (7)
ACL (4)
ICCV (4)
NIPS (2)
AACL (1)
INTERSPEECH (1)
MICCAI (1)
RSS (1)
WACV (1)
Top co-authors
Research topics
Keywords
contrastive learning
(7)
unsupervised learning
(5)
latent space
(3)
multimodal learning
(3)
sentence embedding
(3)
self-supervised learning
(3)
deepfake detection
(3)
face forgery detection
(3)
convolutional neural network
(2)
temporal modeling
(2)
knowledge base
(2)
representation learning
(2)
disfluency detection
(2)
adversarial attack
(2)
language model
(2)
video understanding
(2)
image restoration
(2)
video super-resolution
(2)
information retrieval
(2)
attention mechanism
(2)
Papers
NavA3: Understanding Any Instruction, Navigating Anywhere, Finding Anything
ACL 2026
Towards Effective Code-Integrated Reasoning
AAAI 2026
GLoMOT: Efficient Online GNN-based Low-Frame-Rate Multi-Object Tracker
AAAI 2026
Rethinking Surgical Smoke: A Smoke-Type-Aware Laparoscopic Video Desmoking Method and Dataset
AAAI 2026
DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization
AAAI 2026
Multi-Shape Matching with Cycle Consistency Basis via Functional Maps
AAAI 2025
Cross-Modal Stealth: A Coarse-to-Fine Attack Framework for RGB-T Tracker
AAAI 2025
OODML: Whole Slide Image Classification Meets Online Pseudo-Supervision and Dynamic Mutual Learning
AAAI 2025
Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks
RSS 2025
MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation
ACL 2025
ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges
ICCV 2025
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
EMNLP 2025
Lift3D Policy: Lifting 2D Foundation Models for Robust 3D Robotic Manipulation
CVPR 2025
Rethinking the Adversarial Robustness of Multi-Exit Neural Networks in an Attack-Defense Game
CVPR 2025
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection
CVPR 2025
Link-based Contrastive Learning for One-Shot Unsupervised Domain Adaptation
CVPR 2025
Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection
CVPR 2025
RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete
CVPR 2025
FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos
AAAI 2025
Refining Intraocular Lens Power Calculation: A Multi-modal Framework Using Cross-layer Attention and Effective Channel Attention
MICCAI 2024
Can We Leave Deepfake Data Behind in Training Deepfake Detector?
NIPS 2024
Code-Style In-Context Learning for Knowledge-Based Question Answering
AAAI 2024
Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios
ACL 2024
Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs
COLING 2024
Decompose, Prioritize, and Eliminate: Dynamically Integrating Diverse Representations for Multimodal Named Entity Recognition
COLING 2024
Learning Multi-Dimensional Human Preference for Text-to-Image Generation
CVPR 2024
CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language Models
EMNLP 2024
GUIDE: A Guideline-Guided Dataset for Instructional Video Comprehension
IJCAI 2024
Exploring Sentence Type Effects on the Lombard Effect and Intelligibility Enhancement: A Comparative Study of Natural and Grid Sentences
INTERSPEECH 2024
Improving Vision-and-Language Reasoning via Spatial Relations Modeling
WACV 2024
GTR: A Grafting-Then-Reassembling Framework for Dynamic Scene Graph Generation
IJCAI 2023
Augmentation-Aware Self-Supervision for Data-Efficient GAN Training
NIPS 2023
LSTFE-Net:Long Short-Term Feature Enhancement Network for Video Small Object Detection
CVPR 2023
ConTextual Masked Auto-Encoder for Dense Passage Retrieval
AAAI 2023
FEditNet: Few-Shot Editing of Latent Semantics in GAN Spaces
AAAI 2023
Implicit Identity Driven Deepfake Face Swapping Detection
CVPR 2023
Adaptive Unsupervised Self-training for Disfluency Detection
COLING 2022
InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings
EMNLP 2022
DANet: Image Deraining via Dynamic Association Learning
IJCAI 2022
Degrade Is Upgrade: Learning Degradation for Low-Light Image Enhancement
AAAI 2022
Smoothed Contrastive Learning for Unsupervised Sentence Embedding
COLING 2022
ESimCSE: Enhanced Sample Building Method for Contrastive Learning of Unsupervised Sentence Embedding
COLING 2022
RaP: Redundancy-aware Video-language Pre-training for Text-Video Retrieval
EMNLP 2022
Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing
CVPR 2022
HiT: Hierarchical Transformer With Momentum Contrast for Video-Text Retrieval
ICCV 2021
Dynamic Inconsistency-aware DeepFake Video Detection
IJCAI 2021
Frequency-Aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection
CVPR 2021
Converse, Focus and Guess - Towards Multi-Document Driven Dialogue
AAAI 2021
Omniscient Video Super-Resolution
ICCV 2021
Multi-Scale Progressive Fusion Network for Single Image Deraining
CVPR 2020
Syntactic Graph Convolutional Network for Spoken Language Understanding
COLING 2020
Table Fact Verification with Structure-Aware Transformer
EMNLP 2020
Combining Self-Training and Self-Supervised Learning for Unsupervised Disfluency Detection
EMNLP 2020
Learn with Noisy Data via Unsupervised Loss Correction for Weakly Supervised Reading Comprehension
COLING 2020
Combining ResNet and Transformer for Chinese Grammatical Error Diagnosis
AACL 2020
Progressive Fusion Video Super-Resolution Network via Exploiting Non-Local Spatio-Temporal Correlations
ICCV 2019
Earlier Attention? Aspect-Aware LSTM for Aspect-Based Sentiment Analysis
IJCAI 2019
Combining Knowledge with Deep Convolutional Neural Networks for Short Text Classification
IJCAI 2017
Probabilistic Prototype Model for Serendipitous Property Mining
COLING 2016
Understanding Short Texts
ACL 2016
Syntactic Parsing of Web Queries
EMNLP 2016
Query Understanding through Knowledge-Based Conceptualization
IJCAI 2015