Qi Chen

79 papers · 2018–2026 · 18 top CS/AI conferences

Achievements


🗺️ Taxonomy Completionist (18) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (18)

🏃 Academic Marathon (7) 🐣 Hot Topic Early Bird 🔬 Deep Specialist (13) 🏆 Keyword Champion 🏆 Grand Slam 💎 Century Club (74) ⚡ Prolific Year (8) 🚀 Conference Pioneer ❓ The Questioner 🔥 Unstoppable (8) 🗃️ Keyword Collector (382)

Conferences

NIPS (13) AAAI (11) CVPR (11) EMNLP (7) ICCV (6) IJCAI (5) ACL (5) ICML (4) ICLR (3) MICCAI (3) ECCV (2) INTERSPEECH (2) OSDI (2) AISTATS (1) NAACL (1) SEMEVAL (1) UAI (1) WACV (1)

Top co-authors

Qi Wu (9) Wei Wang (7) Mingkui Tan (6) Yuqi Wang (5) Changjian Shui (5) Mao Yang (5) Qi Zhang (4) Minh-Son To (4) Yuankai Qi (4) Jiayi Ji (4)

Research topics


Keywords

large language model (7) semantic segmentation (5) multimodal learning (4) medical imaging (4) zero-shot learning (4) self-supervised learning (3) data augmentation (3) 3d referring expression segmentation (3) diffusion model (3) contrastive learning (3) vision-language model (3) model compression (3) point cloud (3) visual grounding (3) speech synthesis (2) unsupervised learning (2) prototype learning (2) image generation (2) video generation (2) transfer learning (2)

Papers

UniABG: Unified Adversarial View Bridging and Graph Correspondence for Unsupervised Cross-View Geo-Localization AAAI 2026
SpecAgent: A Speculative Retrieval and Forecasting Agent for Code Completion ACL 2026
MMCLIP: Cross-Modal Attention Masked Modelling for Medical Language-Image Pre-Training ACL 2026
Tracking the Unstable: Appearance-Guided Motion Modeling for Robust Multi-Object Tracking in UAV-Captured Videos AAAI 2026
3D-DRES: Detailed 3D Referring Expression Segmentation AAAI 2026
TSTAI: A Time-varying Brain Effective Connectivity Network Construction Method Combining with Brain Active Information IJCAI 2025
IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression Segmentation AAAI 2025
VQTalker: Towards Multilingual Talking Avatars Through Facial Motion Tokenization AAAI 2025
Attention-Driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models Without Fine-Tuning AAAI 2025
Enhancing Large Language Model Performance with Gradient-Based Parameter Selection AAAI 2025
Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training ACL 2025
From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing CVPR 2025
Dual Energy-Based Model with Open-World Uncertainty Estimation for Out-of-distribution Detection CVPR 2025
Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering CVPR 2025
ChartMind: A Comprehensive Benchmark for Complex Real-world Multimodal Chart Question Answering EMNLP 2025
InMind: Evaluating LLMs in Capturing and Applying Individual Human Reasoning Styles EMNLP 2025
Lost in Pronunciation: Detecting Chinese Offensive Language Disguised by Phonetic Cloaking Replacement EMNLP 2025
Alleviating Performance Degradation Caused by Out-of-Distribution Issues in Embedding-Based Retrieval EMNLP 2025
Efficiently Selecting Response Generation Strategies for Synthetic Data Construction by Self-Aligned Perplexity EMNLP 2025
FinDebate: Multi-Agent Collaborative Intelligence for Financial Analysis EMNLP 2025
Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video ICCV 2025
OVG-HQ: Online Video Grounding with Hybrid-modal Queries ICCV 2025
Scaling Tumor Segmentation: Best Lessons from Real and Synthetic Data ICCV 2025
Training-Free Class Purification for Open-Vocabulary Semantic Segmentation ICCV 2025
Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding ICCV 2025
RAG-SR: Retrieval-Augmented Generation for Neural Symbolic Regression ICLR 2025
Generalization in VAE and Diffusion Models: A Unified Information-Theoretic Analysis ICLR 2025
Integrative Decoding: Improving Factuality via Implicit Self-consistency ICLR 2025
EpiCoder: Encompassing Diversity and Complexity in Code Generation ICML 2025
SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches IJCAI 2025
Localizing Before Answering: A Benchmark for Grounded Medical Visual Question Answering IJCAI 2025
Multi-Hierarchical Fine-Grained Feature Mapping Driven by Feature Contributions for Molecular Odor Prediction IJCAI 2025
Controllable Image Synthesis Workflow for Enhancing Cervical Cell Detection MICCAI 2025
PedCLIP: A Vision-Language model for Pediatric X-rays with Mixture of Body part Experts MICCAI 2025
Towards Understanding Evolving Patterns in Sequential Data NIPS 2024
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation NIPS 2024
Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles NIPS 2024
IRGen: Generative Modeling for Image Retrieval ECCV 2024
G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images CVPR 2024
Towards Generalizable Tumor Synthesis CVPR 2024
PairAug: What Can Augmented Image-Text Pairs Do for Radiology? CVPR 2024
Knowledge Distillation from Monolingual to Multilingual Models for Intelligent and Interpretable Multilingual Emotion Detection ACL 2024
DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness SEMEVAL 2024
CREAD: A Classification-Restoration Framework with Error Adaptive Discretization for Watch Time Prediction in Video Recommender Systems AAAI 2024
ProxEdit: Improving Tuning-Free Real Image Editing With Proximal Guidance WACV 2024
Intersectional Unfairness Discovery ICML 2024
Accelerated Multi-Contrast MRI Reconstruction via Frequency and Spatial Mutual Learning MICCAI 2024
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation AAAI 2024
WebVLN: Vision-and-Language Navigation on Websites AAAI 2024
DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness NAACL 2024
Attention-based Interactive Disentangling Network for Instance-level Emotional Voice Conversion INTERSPEECH 2023
Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning IJCAI 2023
Prompt Switch: Efficient CLIP Adaptation for Text-Video Retrieval ICCV 2023
On the Stability-Plasticity Dilemma in Continual Meta-Learning: Theory and Algorithm NIPS 2023
Model-enhanced Vector Index NIPS 2023
VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity OSDI 2023
Prompt-based Zero-shot Text Classification with Conceptual Knowledge ACL 2023
Algorithm-Dependent Bounds for Representation Learning of Multi-Source Domain Adaptation AISTATS 2023
An Alignment Method Leveraging Articulatory Features for Mispronunciation Detection and Diagnosis in L2 English INTERSPEECH 2022
Fair Representation Learning through Implicit Path Alignment ICML 2022
Self-Supervised Image-Specific Prototype Exploration for Weakly Supervised Semantic Segmentation CVPR 2022
V2C: Visual Voice Cloning CVPR 2022
Learning Distinct and Representative Modes for Image Captioning NIPS 2022
A Neural Corpus Indexer for Document Retrieval NIPS 2022
Sublinear time algorithms for greedy selection in high dimensions UAI 2022
Optimization-Induced Graph Implicit Nonlinear Diffusion ICML 2022
On Learning Fairness and Accuracy on Multiple Subgroups NIPS 2022
StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding AAAI 2021
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search NIPS 2021
Generalization Bounds For Meta-Learning: An Information-Theoretic Analysis NIPS 2021
PolarStream: Streaming Object Detection and Segmentation with Polar Pillars NIPS 2021
Contrastive Neural Architecture Search With Neural Architecture Comparators CVPR 2021
Every View Counts: Cross-View Consistency in 3D Object Detection with Hybrid-Cylindrical-Spherical Voxelization NIPS 2020
Closed-Loop Matters: Dual Regression Networks for Single Image Super-Resolution CVPR 2020
Object as Hotspots: An Anchor-Free 3D Object Detection Approach via Firing of Hotspots ECCV 2020
Byzantine Ordered Consensus without Byzantine Oligarchy OSDI 2020
Intelligent Home 3D: Automatic 3D-House Design From Linguistic Descriptions Only CVPR 2020
NAT: Neural Architecture Transformer for Accurate and Compact Architectures NIPS 2019
Auto-Dialabel: Labeling Dialogue Data with Unsupervised Learning EMNLP 2018