Tong Wu

71 papers · 2019–2026 · 17 top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (11) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🏃 Academic Marathon (6) 🐝 Cross-Pollinator (11) 🔬 Deep Specialist (13) 🧬 Topic Evolution 🏆 Keyword Champion (3) 🤝 Dynamic Duo (22) 👑 Triple Crown 🏆 Grand Slam 🗃️ Keyword Collector (283) ⚡ Prolific Year (17) 📈 Trend Setter 💎 Century Club (66) 🔥 Unstoppable (7)

Conferences

CVPR (11) NIPS (10) ICLR (8) ICCV (7) AAAI (6) ECCV (6) ICML (5) ACL (4) SEMEVAL (3) WACV (2) IJCAI (2) COLING (2) EMNLP (1) EACL (1) CORL (1) UAI (1) AISTATS (1)

Top co-authors

Dahua Lin (22) Jiaqi Wang (14) Ziwei Liu (14) Pan Zhang (7) Thanet Markchom (6) Yuhang Zang (6) Huizhi Liang (6) Prateek Mittal (5) Xiaoyi Dong (5) Zeyi Sun (5)

Research topics

Differential Privacy (1)

Keywords

large language model (9) diffusion model (6) multimodal learning (5) factual verification (4) hallucination detection (4) adversarial robustness (3) attention mechanism (3) text generation (3) contrastive learning (3) semantic segmentation (3) object detection (3) multilingual nlp (3) auto-regressive model (3) 3d generation (3) vision transformer (2) vision-language model (2) generative model (2) foundation model (2) multimodal large language model (2) point cloud (2)

Papers

Eguard: Defending LLM Embeddings Against Inversion Attacks via Text Mutual Information Optimization AAAI 2026
DySy-Det: A Synergistic Framework with Dynamic Reconstruction-Path Consistency for AI-Generated Image Detection AAAI 2026
Delayed Wh-Question Development in Children with Hearing Loss: Evidence for Morphosyntactic Vulnerability from Corpus-Based NLP and LLM Analyses EACL 2026
Label Distribution Propagation-based Label Completion for Crowdsourcing ICML 2025
MotionClone: Training-Free Motion Cloning for Controllable Video Generation ICLR 2025
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models ICLR 2025
EventPillars: Pillar-based Efficient Representations for Event Data AAAI 2025
Sensing Surface Patches in Volume Rendering for Inferring Signed Distance Functions AAAI 2025
Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy ICLR 2025
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations ICLR 2025
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion ICCV 2025
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography ICCV 2025
Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data ICCV 2025
X-Prompt: Generalizable Auto-Regressive Visual Learning with In-Context Prompting ICCV 2025
An Efficient Hybrid Vision Transformer for TinyML Applications ICCV 2025
The Task Shield: Enforcing Task Alignment to Defend Against Indirect Prompt Injection in LLM Agents ACL 2025
NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SeflCheckGPT ACL 2025
UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity Representation ACL 2025
NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SelfCheckGPT ACL 2025
Automated Progressive Red Teaming COLING 2025
DepthSSC: Monocular 3D Semantic Scene Completion via Depth-Spatial Alignment and Voxel Adaptation WACV 2025
Fast Non-convex Matrix Sensing with Optimal Sample Complexity UAI 2025
UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity Representation SEMEVAL 2025
NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SeflCheckGPT SEMEVAL 2025
NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SelfCheckGPT SEMEVAL 2025
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion CVPR 2025
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts CVPR 2025
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning CVPR 2025
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way CVPR 2025
TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation ICML 2025
ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance ECCV 2024
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models NIPS 2024
An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding NIPS 2024
Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials NIPS 2024
ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction NIPS 2024
GREATS: Online Selection of High-Quality Data for LLM Training in Every Iteration NIPS 2024
PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications NIPS 2024
SoftCLIP: Softer Cross-Modal Alignment Makes CLIP Stronger AAAI 2024
Robust Data Clustering with Outliers via Transformed Tensor Low-Rank Representation AISTATS 2024
Sinkhorn Distance Minimization for Knowledge Distillation COLING 2024
GPT4Point: A Unified Framework for Point-Language Understanding and Generation CVPR 2024
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want CVPR 2024
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation CVPR 2024
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation ECCV 2024
Retargeting Visual Data with Deformation Fields ECCV 2024
Privacy-Preserving In-Context Learning for Large Language Models ICLR 2024
Large-Vocabulary 3D Diffusion Model with Transformer ICLR 2024
A Randomized Approach to Tight Privacy Accounting NIPS 2023
SLAN: Self-Locator Aided Network for Vision-Language Understanding ICCV 2023
Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction ICLR 2023
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation NIPS 2023
V3Det: Vast Vocabulary Visual Detection Dataset ICCV 2023
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation CVPR 2023
Intersectional Stereotypes in Large Language Models: Dataset and Analysis EMNLP 2023
Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise ICML 2023
Uncovering Adversarial Risks of Test-Time Adaptation ICML 2023
Towards Trustworthy Explanation: On Causal Rationalization ICML 2023
Adversarial Robustness of Deep Sensor Fusion Models WACV 2022
Human-Robot Commensality: Bite Timing Prediction for Robot-Assisted Feeding in Groups CORL 2022
Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation ECCV 2022
Balanced Chamfer Distance as a Comprehensive Metric for Point Cloud Completion NIPS 2021
Few-Shot Object Detection via Association and DIscrimination NIPS 2021
Towards Evaluating and Training Verifiably Robust Neural Networks CVPR 2021
Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation CVPR 2021
Adversarial Robustness Under Long-Tailed Distribution CVPR 2021
Defending Against Physically Realizable Attacks on Image Classification ICLR 2020
Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation ECCV 2020
Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets ECCV 2020
Meta Segmentation Network for Ultra-Resolution Medical Images IJCAI 2020
Patch Proposal Network for Fast Semantic Segmentation of High-Resolution Images AAAI 2020
Co-Attentive Multi-Task Learning for Explainable Recommendation IJCAI 2019