Tao Chen
108 papers · 2007–2026 · 18 conferences · across top CS/AI conferences
Achievements
Renaissance Researcher (5)
Conference Polyglot (18)
Interdisciplinary Bridge
Taxonomy Completionist (15)
Keyword Pioneer
Dynamic Duo (15)
Triple Crown
Grand Slam
Mega-Team (24)
Topic Pioneer
Deep Specialist (13)
Topic Evolution
Keyword Champion (2)
Century Club (103)
Unstoppable (8)
Keyword Collector (404)
Prolific Year (26)
The Questioner
Conferences
CVPR (18)
NIPS (17)
ACL (11)
AAAI (10)
ECCV (8)
ICLR (6)
IJCNLP (6)
EMNLP (5)
ICCV (5)
RSS (5)
IJCAI (4)
CORL (3)
MICCAI (3)
WACV (2)
ICML (2)
NSDI (1)
SEMEVAL (1)
COLING (1)
Keywords
model compression (10)
reinforcement learning (8)
knowledge distillation (6)
large language model (6)
point cloud (6)
3d vision (5)
novel view synthesis (5)
diffusion model (5)
domain adaptation (4)
multimodal learning (4)
image generation (4)
semantic segmentation (4)
neural network (4)
zero-shot learning (3)
neural network optimization (3)
scene understanding (3)
3d reconstruction (3)
multi-instance learning (3)
3d object detection (3)
relation extraction (3)
Papers
A Scalable Multi-LLM Collaboration System with Retrieval-based Selection and Exploration-Exploitation-Driven Enhancement
ACL 2026
Learning from Human Gaze: Human-like Robot Social Navigation in Dense Crowds
AAAI 2026
Beyond Quadratic: Linear-Time Change Detection with RWKV
AAAI 2026
Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers
AAAI 2026
Mitigating Low-Quality Reasoning in MLLMs: Self-Driven Refined Multimodal CoT with Selective Thinking and Step-wise Visual Enhancement
AAAI 2026
HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction
ICLR 2025
Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback
ACL 2025
uir-cis at SemEval-2025 Task 3: Detection of Hallucinations in Generated Text
ACL 2025
GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training
ICLR 2025
uir-cis at SemEval-2025 Task 3: Detection of Hallucinations in Generated Text
SEMEVAL 2025
Chimera: Improving Generalist Model with Domain-Specific Experts
ICCV 2025
SC-Captioner: Improving Image Captioning with Self-Correction by Reinforcement Learning
ICCV 2025
DeRS: Towards Extremely Efficient Upcycled Mixture-of-Experts Models
CVPR 2025
Pioneering 4-Bit FP Quantization for Diffusion Models: Mixup-Sign Quantization and Timestep-Aware Fine-Tuning
CVPR 2025
Hard Sample Mining-based Tongue Diagnosis for Fatty Liver Disease Severity Classification
MICCAI 2025
Consistency-aware Self-Training for Iterative-based Stereo Matching
CVPR 2025
Biology-Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models
EMNLP 2025
PAMN: Multi-phase Correlation Modeling for Contrast-Enhanced 3D Medical Image Retrieval
EMNLP 2025
All-in-One: Transferring Vision Foundation Models into Stereo Matching
AAAI 2025
Cross-Modal Graph Learning for Perivascular Spaces Segmentation
MICCAI 2025
Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection
CVPR 2025
Autoregressive Medical Image Segmentation via Next-Scale Mask Prediction
MICCAI 2025
Boost Embodied AI Models with Robust Compression Boundary
IJCAI 2025
Once-Tuning-Multiple-Variants: Tuning Once and Expanded as Multiple Vision-Language Model Variants
CVPR 2025
ITFormer: Bridging Time Series and Natural Language for Multi-Modal QA with Large-Scale Multitask Dataset
ICML 2025
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer
CVPR 2024
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
NIPS 2024
Foster Adaptivity and Balance in Learning with Noisy Labels
ECCV 2024
Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation
ECCV 2024
Better Regression Makes Better Test-time Adaptive 3D Object Detection
ECCV 2024
DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism
ECCV 2024
Enhanced Sparsification via Stimulative Training
ECCV 2024
M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions
ECCV 2024
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models
NIPS 2024
S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning
NIPS 2024
EMR-Merging: Tuning-Free High-Performance Model Merging
NIPS 2024
Bifröst: 3D-Aware Image Compositing with Language Instructions
NIPS 2024
FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation
NIPS 2024
Reconciling Reality through Simulation: A Real-To-Sim-to-Real Approach for Robust Manipulation
RSS 2024
Adaptive Integration of Partial Label Learning and Negative Learning for Enhanced Noisy Label Learning
AAAI 2024
Boosting Residual Networks with Group Knowledge
AAAI 2024
PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation
AAAI 2024
Spear: Evaluate the Adversarial Robustness of Compressed Neural Models
IJCAI 2024
FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on
IJCAI 2024
MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization
COLING 2024
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation
ICLR 2024
Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
NIPS 2024
3DET-Mamba: Causal Sequence Modelling for End-to-End 3D Object Detection
NIPS 2024
VideoMAC: Video Masked Autoencoders Meet ConvNets
CVPR 2024
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning
CVPR 2024
Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
CVPR 2024
Adversarial Amendment is the Only Force Capable of Transforming an Enemy into a Friend
IJCAI 2023
PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation
NIPS 2023
MotionGPT: Human Motion as a Foreign Language
NIPS 2023
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset
NIPS 2023
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback
NIPS 2023
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
NIPS 2023
Boost Transformer-based Language Models with GPU-Friendly Sparsity and Quantization
ACL 2023
Boost Vision Transformer With GPU-Friendly Sparsity and Quantization
CVPR 2023
Executing Your Commands via Motion Diffusion in Latent Space
CVPR 2023
Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection
CVPR 2023
End-to-End 3D Dense Captioning With Vote2Cap-DETR
CVPR 2023
Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection
CVPR 2023
Creator Context for Tweet Recommendation
EMNLP 2023
Reasoning Makes Good Annotators: An Automatic Task-specific Rules Distilling Framework for Low-resource Relation Extraction
EMNLP 2023
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering
ICCV 2023
A Large-Scale Outdoor Multi-Modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction
ICCV 2023
DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics
ICLR 2023
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
ICML 2023
ConceptFusion: Open-set multimodal 3D mapping
RSS 2023
Topological Experience Replay
ICLR 2022
Pre-Trained Language Models for Interactive Decision-Making
NIPS 2022
Efficient Tactile Simulation with Differentiability for Robotic Manipulation
CORL 2022
Stimulative Training of Residual Networks: A Social Psychology Perspective of Loafing
NIPS 2022
β-DARTS: Beta-Decay Regularization for Differentiable Architecture Search
CVPR 2022
Colorization for In Situ Marine Plankton Images
ECCV 2022
ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference
ACL 2022
Coordinates Are NOT Lonely - Codebook Prior Helps Implicit Neural 3D representations
NIPS 2022
Rapid Locomotion via Reinforcement Learning
RSS 2022
Fast and Constrained Absent Keyphrase Generation by Prompt-Based Learning
AAAI 2022
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
CVPR 2022
What Makes for Effective Few-Shot Point Cloud Classification?
WACV 2022
Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal
ECCV 2022
Learning to Jump from Pixels
CORL 2021
Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation
CVPR 2021
UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction
ACL 2021
TexSmart: A System for Enhanced Natural Language Understanding
ACL 2021
CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction
ACL 2021
Concept-Based Label Embedding via Dynamic Routing for Hierarchical Text Classification
ACL 2021
Concept-Based Label Embedding via Dynamic Routing for Hierarchical Text Classification
IJCNLP 2021
CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction
IJCNLP 2021
TexSmart: A System for Enhanced Natural Language Understanding
IJCNLP 2021
UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction
IJCNLP 2021
An End-to-End Differentiable Framework for Contact-Aware Robot Design
RSS 2021
Empower Distantly Supervised Relation Extraction with Collaborative Adversarial Training
AAAI 2021
Coarse-to-Fine Gaze Redirection With Numerical and Pictorial Guidance
WACV 2021
Deep Symmetric Network for Underexposed Image Enhancement With Recurrent Attentional Learning
ICCV 2021
A System for General In-Hand Object Re-Orientation
CORL 2021
Cascade EF-GAN: Progressive Facial Expression Editing With Local Focuses
CVPR 2020
Millions of Tiny Databases
NSDI 2020
Learning Exploration Policies for Navigation
ICLR 2019
CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases
IJCNLP 2019
CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases
EMNLP 2019
SParC: Cross-Domain Semantic Parsing in Context
ACL 2019
Hardware Conditioned Policies for Multi-Robot Transfer Learning
NIPS 2018
Improving Distributed Representation of Word Sense via WordNet Gloss Composition and Context Clustering
IJCNLP 2015
Improving Distributed Representation of Word Sense via WordNet Gloss Composition and Context Clustering
ACL 2015
Design of a Bio-inspired Dynamical Vertical Climbing Robot
RSS 2007