Qi Chen
79 papers · 2018–2026 · 18 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (18) π§ Keyword Pioneer π Renaissance Researcher (6) π Interdisciplinary Bridge π Conference Polyglot (18)
π
Academic Marathon
(7)
πΊοΈ
Taxonomy Completionist
(18)
π£
Hot Topic Early Bird
π¬
Deep Specialist
(13)
π
Keyword Champion
π
Grand Slam
π
Century Club
(74)
β‘
Prolific Year
(8)
π
Conference Pioneer
β
The Questioner
π₯
Unstoppable
(8)
ποΈ
Keyword Collector
(382)
Conferences
NIPS (13)
AAAI (11)
CVPR (11)
EMNLP (7)
ICCV (6)
IJCAI (5)
ACL (5)
ICML (4)
ICLR (3)
MICCAI (3)
ECCV (2)
INTERSPEECH (2)
OSDI (2)
AISTATS (1)
NAACL (1)
SEMEVAL (1)
UAI (1)
WACV (1)
Top co-authors
Research topics
Keywords
large language model
(7)
semantic segmentation
(5)
multimodal learning
(4)
medical imaging
(4)
zero-shot learning
(4)
self-supervised learning
(3)
data augmentation
(3)
3d referring expression segmentation
(3)
diffusion model
(3)
contrastive learning
(3)
vision-language model
(3)
model compression
(3)
point cloud
(3)
visual grounding
(3)
speech synthesis
(2)
unsupervised learning
(2)
prototype learning
(2)
image generation
(2)
video generation
(2)
transfer learning
(2)
Papers
UniABG: Unified Adversarial View Bridging and Graph Correspondence for Unsupervised Cross-View Geo-Localization
AAAI 2026
SpecAgent: A Speculative Retrieval and Forecasting Agent for Code Completion
ACL 2026
MMCLIP: Cross-Modal Attention Masked Modelling for Medical Language-Image Pre-Training
ACL 2026
Tracking the Unstable: Appearance-Guided Motion Modeling for Robust Multi-Object Tracking in UAV-Captured Videos
AAAI 2026
3D-DRES: Detailed 3D Referring Expression Segmentation
AAAI 2026
TSTAI: A Time-varying Brain Effective Connectivity Network Construction Method Combining with Brain Active Information
IJCAI 2025
IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression Segmentation
AAAI 2025
VQTalker: Towards Multilingual Talking Avatars Through Facial Motion Tokenization
AAAI 2025
Attention-Driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models Without Fine-Tuning
AAAI 2025
Enhancing Large Language Model Performance with Gradient-Based Parameter Selection
AAAI 2025
Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
ACL 2025
From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing
CVPR 2025
Dual Energy-Based Model with Open-World Uncertainty Estimation for Out-of-distribution Detection
CVPR 2025
Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering
CVPR 2025
ChartMind: A Comprehensive Benchmark for Complex Real-world Multimodal Chart Question Answering
EMNLP 2025
InMind: Evaluating LLMs in Capturing and Applying Individual Human Reasoning Styles
EMNLP 2025
Lost in Pronunciation: Detecting Chinese Offensive Language Disguised by Phonetic Cloaking Replacement
EMNLP 2025
Alleviating Performance Degradation Caused by Out-of-Distribution Issues in Embedding-Based Retrieval
EMNLP 2025
Efficiently Selecting Response Generation Strategies for Synthetic Data Construction by Self-Aligned Perplexity
EMNLP 2025
FinDebate: Multi-Agent Collaborative Intelligence for Financial Analysis
EMNLP 2025
Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video
ICCV 2025
OVG-HQ: Online Video Grounding with Hybrid-modal Queries
ICCV 2025
Scaling Tumor Segmentation: Best Lessons from Real and Synthetic Data
ICCV 2025
Training-Free Class Purification for Open-Vocabulary Semantic Segmentation
ICCV 2025
Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding
ICCV 2025
RAG-SR: Retrieval-Augmented Generation for Neural Symbolic Regression
ICLR 2025
Generalization in VAE and Diffusion Models: A Unified Information-Theoretic Analysis
ICLR 2025
Integrative Decoding: Improving Factuality via Implicit Self-consistency
ICLR 2025
EpiCoder: Encompassing Diversity and Complexity in Code Generation
ICML 2025
SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches
IJCAI 2025
Localizing Before Answering: A Benchmark for Grounded Medical Visual Question Answering
IJCAI 2025
Multi-Hierarchical Fine-Grained Feature Mapping Driven by Feature Contributions for Molecular Odor Prediction
IJCAI 2025
Controllable Image Synthesis Workflow for Enhancing Cervical Cell Detection
MICCAI 2025
PedCLIP: A Vision-Language model for Pediatric X-rays with Mixture of Body part Experts
MICCAI 2025
Towards Understanding Evolving Patterns in Sequential Data
NIPS 2024
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation
NIPS 2024
Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
NIPS 2024
IRGen: Generative Modeling for Image Retrieval
ECCV 2024
G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images
CVPR 2024
Towards Generalizable Tumor Synthesis
CVPR 2024
PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
CVPR 2024
Knowledge Distillation from Monolingual to Multilingual Models for Intelligent and Interpretable Multilingual Emotion Detection
ACL 2024
DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness
SEMEVAL 2024
CREAD: A Classification-Restoration Framework with Error Adaptive Discretization for Watch Time Prediction in Video Recommender Systems
AAAI 2024
ProxEdit: Improving Tuning-Free Real Image Editing With Proximal Guidance
WACV 2024
Intersectional Unfairness Discovery
ICML 2024
Accelerated Multi-Contrast MRI Reconstruction via Frequency and Spatial Mutual Learning
MICCAI 2024
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation
AAAI 2024
WebVLN: Vision-and-Language Navigation on Websites
AAAI 2024
DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness
NAACL 2024
Attention-based Interactive Disentangling Network for Instance-level Emotional Voice Conversion
INTERSPEECH 2023
Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning
IJCAI 2023
Prompt Switch: Efficient CLIP Adaptation for Text-Video Retrieval
ICCV 2023
On the Stability-Plasticity Dilemma in Continual Meta-Learning: Theory and Algorithm
NIPS 2023
Model-enhanced Vector Index
NIPS 2023
VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity
OSDI 2023
Prompt-based Zero-shot Text Classification with Conceptual Knowledge
ACL 2023
Algorithm-Dependent Bounds for Representation Learning of Multi-Source Domain Adaptation
AISTATS 2023
An Alignment Method Leveraging Articulatory Features for Mispronunciation Detection and Diagnosis in L2 English
INTERSPEECH 2022
Fair Representation Learning through Implicit Path Alignment
ICML 2022
Self-Supervised Image-Specific Prototype Exploration for Weakly Supervised Semantic Segmentation
CVPR 2022
V2C: Visual Voice Cloning
CVPR 2022
Learning Distinct and Representative Modes for Image Captioning
NIPS 2022
A Neural Corpus Indexer for Document Retrieval
NIPS 2022
Sublinear time algorithms for greedy selection in high dimensions
UAI 2022
Optimization-Induced Graph Implicit Nonlinear Diffusion
ICML 2022
On Learning Fairness and Accuracy on Multiple Subgroups
NIPS 2022
StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding
AAAI 2021
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search
NIPS 2021
Generalization Bounds For Meta-Learning: An Information-Theoretic Analysis
NIPS 2021
PolarStream: Streaming Object Detection and Segmentation with Polar Pillars
NIPS 2021
Contrastive Neural Architecture Search With Neural Architecture Comparators
CVPR 2021
Every View Counts: Cross-View Consistency in 3D Object Detection with Hybrid-Cylindrical-Spherical Voxelization
NIPS 2020
Closed-Loop Matters: Dual Regression Networks for Single Image Super-Resolution
CVPR 2020
Object as Hotspots: An Anchor-Free 3D Object Detection Approach via Firing of Hotspots
ECCV 2020
Byzantine Ordered Consensus without Byzantine Oligarchy
OSDI 2020
Intelligent Home 3D: Automatic 3D-House Design From Linguistic Descriptions Only
CVPR 2020
NAT: Neural Architecture Transformer for Accurate and Compact Architectures
NIPS 2019
Auto-Dialabel: Labeling Dialogue Data with Unsupervised Learning
EMNLP 2018