Tao Chen

108 papers · 2007–2026 · 18 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🗺️ Taxonomy Completionist (15) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (18)

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (15) 🧭 Keyword Pioneer 🤝 Dynamic Duo (15) 👑 Triple Crown 🏆 Grand Slam 👥 Mega-Team (24) 🌱 Topic Pioneer 🔬 Deep Specialist (13) 🧬 Topic Evolution 🏆 Keyword Champion (2) 💎 Century Club (103) 🔥 Unstoppable (8) 🗃️ Keyword Collector (404) ⚡ Prolific Year (26) ❓ The Questioner

Conferences

CVPR (18) NIPS (17) ACL (11) AAAI (10) ECCV (8) ICLR (6) IJCNLP (6) EMNLP (5) ICCV (5) RSS (5) IJCAI (4) CORL (3) MICCAI (3) WACV (2) ICML (2) NSDI (1) SEMEVAL (1) COLING (1)

Top co-authors

Peng Ye (16) Gang Yu (16) Jiakang Yuan (12) Chong Yu (12) Bo Zhang (11) Xin Chen (11) Pulkit Agrawal (10) Wanli Ouyang (10) Botian Shi (9) Jiayuan Fan (9)

Research topics

Applications (1) Robotics (1)

Keywords

model compression (10) reinforcement learning (8) knowledge distillation (6) large language model (6) point cloud (6) 3d vision (5) novel view synthesis (5) diffusion model (5) domain adaptation (4) multimodal learning (4) image generation (4) semantic segmentation (4) neural network (4) zero-shot learning (3) neural network optimization (3) scene understanding (3) 3d reconstruction (3) multi-instance learning (3) 3d object detection (3) relation extraction (3)

Papers

A Scalable Multi-LLM Collaboration System with Retrieval-based Selection and Exploration-Exploitation-Driven Enhancement ACL 2026 Learning from Human Gaze: Human-like Robot Social Navigation in Dense Crowds AAAI 2026 Beyond Quadratic: Linear-Time Change Detection with RWKV AAAI 2026 Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers AAAI 2026 Mitigating Low-Quality Reasoning in MLLMs: Self-Driven Refined Multimodal CoT with Selective Thinking and Step-wise Visual Enhancement AAAI 2026 HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction ICLR 2025 Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback ACL 2025 uir-cis at SemEval-2025 Task 3: Detection of Hallucinations in Generated Text ACL 2025 GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training ICLR 2025 uir-cis at SemEval-2025 Task 3: Detection of Hallucinations in Generated Text SEMEVAL 2025 Chimera: Improving Generalist Model with Domain-Specific Experts ICCV 2025 SC-Captioner: Improving Image Captioning with Self-Correction by Reinforcement Learning ICCV 2025 DeRS: Towards Extremely Efficient Upcycled Mixture-of-Experts Models CVPR 2025 Pioneering 4-Bit FP Quantization for Diffusion Models: Mixup-Sign Quantization and Timestep-Aware Fine-Tuning CVPR 2025 Hard Sample Mining-based Tongue Diagnosis for Fatty Liver Disease Severity Classification MICCAI 2025 Consistency-aware Self-Training for Iterative-based Stereo Matching CVPR 2025 Biology-Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models EMNLP 2025 PAMN: Multi-phase Correlation Modeling for Contrast-Enhanced 3D Medical Image Retrieval EMNLP 2025 All-in-One: Transferring Vision Foundation Models into Stereo Matching AAAI 2025 Cross-Modal Graph Learning for Perivascular Spaces Segmentation MICCAI 2025 Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection CVPR 2025 Autoregressive Medical Image Segmentation via Next-Scale Mask Prediction MICCAI 2025 Boost Embodied AI Models with Robust Compression Boundary IJCAI 2025 Once-Tuning-Multiple-Variants: Tuning Once and Expanded as Multiple Vision-Language Model Variants CVPR 2025 ITFormer: Bridging Time Series and Natural Language for Multi-Modal QA with Large-Scale Multitask Dataset ICML 2025 MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer CVPR 2024 Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient NIPS 2024 Foster Adaptivity and Balance in Learning with Noisy Labels ECCV 2024 Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation ECCV 2024 Better Regression Makes Better Test-time Adaptive 3D Object Detection ECCV 2024 DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism ECCV 2024 Enhanced Sparsification via Stimulative Training ECCV 2024 M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions ECCV 2024 MeshXL: Neural Coordinate Field for Generative 3D Foundation Models NIPS 2024 S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning NIPS 2024 EMR-Merging: Tuning-Free High-Performance Model Merging NIPS 2024 $\textit{Bifr\"ost}$: 3D-Aware Image Compositing with Language Instructions NIPS 2024 FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation NIPS 2024 Reconciling Reality through Simulation: A Real-To-Sim-to-Real Approach for Robust Manipulation RSS 2024 Adaptive Integration of Partial Label Learning and Negative Learning for Enhanced Noisy Label Learning AAAI 2024 Boosting Residual Networks with Group Knowledge AAAI 2024 PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation AAAI 2024 Spear: Evaluate the Adversarial Robustness of Compressed Neural Models IJCAI 2024 FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on IJCAI 2024 MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization COLING 2024 ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation ICLR 2024 Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy NIPS 2024 3DET-Mamba: Causal Sequence Modelling for End-to-End 3D Object Detection NIPS 2024 VideoMAC: Video Masked Autoencoders Meet ConvNets CVPR 2024 LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning CVPR 2024 Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression CVPR 2024 Adversarial Amendment is the Only Force Capable of Transforming an Enemy into a Friend IJCAI 2023 PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation NIPS 2023 MotionGPT: Human Motion as a Foreign Language NIPS 2023 AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset NIPS 2023 Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback NIPS 2023 Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation NIPS 2023 Boost Transformer-based Language Models with GPU-Friendly Sparsity and Quantization ACL 2023 Boost Vision Transformer With GPU-Friendly Sparsity and Quantization CVPR 2023 Executing Your Commands via Motion Diffusion in Latent Space CVPR 2023 Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection CVPR 2023 End-to-End 3D Dense Captioning With Vote2Cap-DETR CVPR 2023 Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection CVPR 2023 Creator Context for Tweet Recommendation EMNLP 2023 Reasoning Makes Good Annotators : An Automatic Task-specific Rules Distilling Framework for Low-resource Relation Extraction EMNLP 2023 Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering ICCV 2023 A Large-Scale Outdoor Multi-Modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction ICCV 2023 DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics ICLR 2023 Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation ICML 2023 ConceptFusion: Open-set multimodal 3D mapping RSS 2023 Topological Experience Replay ICLR 2022 Pre-Trained Language Models for Interactive Decision-Making NIPS 2022 Efficient Tactile Simulation with Differentiability for Robotic Manipulation CORL 2022 Stimulative Training of Residual Networks: A Social Psychology Perspective of Loafing NIPS 2022 b-DARTS: Beta-Decay Regularization for Differentiable Architecture Search CVPR 2022 Colorization for In Situ Marine Plankton Images ECCV 2022 ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference ACL 2022 Coordinates Are NOT Lonely - Codebook Prior Helps Implicit Neural 3D representations NIPS 2022 Rapid Locomotion via Reinforcement Learning RSS 2022 Fast and Constrained Absent Keyphrase Generation by Prompt-Based Learning AAAI 2022 TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation CVPR 2022 What Makes for Effective Few-Shot Point Cloud Classification? WACV 2022 Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal ECCV 2022 Learning to Jump from Pixels CORL 2021 Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation CVPR 2021 UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction ACL 2021 TexSmart: A System for Enhanced Natural Language Understanding ACL 2021 CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction ACL 2021 Concept-Based Label Embedding via Dynamic Routing for Hierarchical Text Classification ACL 2021 Concept-Based Label Embedding via Dynamic Routing for Hierarchical Text Classification IJCNLP 2021 CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction IJCNLP 2021 TexSmart: A System for Enhanced Natural Language Understanding IJCNLP 2021 UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction IJCNLP 2021 An End-to-End Differentiable Framework for Contact-Aware Robot Design RSS 2021 Empower Distantly Supervised Relation Extraction with Collaborative Adversarial Training AAAI 2021 Coarse-to-Fine Gaze Redirection With Numerical and Pictorial Guidance WACV 2021 Deep Symmetric Network for Underexposed Image Enhancement With Recurrent Attentional Learning ICCV 2021 A System for General In-Hand Object Re-Orientation CORL 2021 Cascade EF-GAN: Progressive Facial Expression Editing With Local Focuses CVPR 2020 Millions of Tiny Databases NSDI 2020 Learning Exploration Policies for Navigation ICLR 2019 CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases IJCNLP 2019 CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases EMNLP 2019 SParC: Cross-Domain Semantic Parsing in Context ACL 2019 Hardware Conditioned Policies for Multi-Robot Transfer Learning NIPS 2018 Improving Distributed Representation of Word Sense via WordNet Gloss Composition and Context Clustering IJCNLP 2015 Improving Distributed Representation of Word Sense via WordNet Gloss Composition and Context Clustering ACL 2015 Design of a Bio-inspired Dynamical Vertical Climbing Robot RSS 2007