Tong Wu
71 papers · 2019–2026 · 17 top CS/AI conferences
Achievements
Taxonomy Completionist (11)
Keyword Pioneer
Renaissance Researcher (5)
Interdisciplinary Bridge
Hot Topic Early Bird
Academic Marathon (6)
Cross-Pollinator (11)
Deep Specialist (13)
Topic Evolution
Keyword Champion (3)
Dynamic Duo (22)
Triple Crown
Grand Slam
Keyword Collector (283)
Prolific Year (17)
Trend Setter
Century Club (66)
Unstoppable (7)
Conferences
CVPR (11)
NIPS (10)
ICLR (8)
ICCV (7)
AAAI (6)
ECCV (6)
ICML (5)
ACL (4)
SEMEVAL (3)
WACV (2)
IJCAI (2)
COLING (2)
EMNLP (1)
EACL (1)
CORL (1)
UAI (1)
AISTATS (1)
Keywords
large language model (9)
diffusion model (6)
multimodal learning (5)
factual verification (4)
hallucination detection (4)
adversarial robustness (3)
attention mechanism (3)
text generation (3)
contrastive learning (3)
semantic segmentation (3)
object detection (3)
multilingual nlp (3)
auto-regressive model (3)
3d generation (3)
vision transformer (2)
vision-language model (2)
generative model (2)
foundation model (2)
multimodal large language model (2)
point cloud (2)
Papers
Eguard: Defending LLM Embeddings Against Inversion Attacks via Text Mutual Information Optimization
AAAI 2026
DySy-Det: A Synergistic Framework with Dynamic Reconstruction-Path Consistency for AI-Generated Image Detection
AAAI 2026
Delayed Wh-Question Development in Children with Hearing Loss: Evidence for Morphosyntactic Vulnerability from Corpus-Based NLP and LLM Analyses
EACL 2026
Label Distribution Propagation-based Label Completion for Crowdsourcing
ICML 2025
MotionClone: Training-Free Motion Cloning for Controllable Video Generation
ICLR 2025
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
ICLR 2025
EventPillars: Pillar-based Efficient Representations for Event Data
AAAI 2025
Sensing Surface Patches in Volume Rendering for Inferring Signed Distance Functions
AAAI 2025
Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
ICLR 2025
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations
ICLR 2025
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
ICCV 2025
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
ICCV 2025
Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data
ICCV 2025
X-Prompt: Generalizable Auto-Regressive Visual Learning with In-Context Prompting
ICCV 2025
An Efficient Hybrid Vision Transformer for TinyML Applications
ICCV 2025
The Task Shield: Enforcing Task Alignment to Defend Against Indirect Prompt Injection in LLM Agents
ACL 2025
NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SelfCheckGPT
ACL 2025
UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity Representation
ACL 2025
NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SelfCheckGPT
ACL 2025
Automated Progressive Red Teaming
COLING 2025
DepthSSC: Monocular 3D Semantic Scene Completion via Depth-Spatial Alignment and Voxel Adaptation
WACV 2025
Fast Non-convex Matrix Sensing with Optimal Sample Complexity
UAI 2025
UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity Representation
SEMEVAL 2025
NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SelfCheckGPT
SEMEVAL 2025
NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SelfCheckGPT
SEMEVAL 2025
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
CVPR 2025
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
CVPR 2025
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning
CVPR 2025
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way
CVPR 2025
TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation
ICML 2025
ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance
ECCV 2024
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
NIPS 2024
An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
NIPS 2024
Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials
NIPS 2024
ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction
NIPS 2024
GREATS: Online Selection of High-Quality Data for LLM Training in Every Iteration
NIPS 2024
PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications
NIPS 2024
SoftCLIP: Softer Cross-Modal Alignment Makes CLIP Stronger
AAAI 2024
Robust Data Clustering with Outliers via Transformed Tensor Low-Rank Representation
AISTATS 2024
Sinkhorn Distance Minimization for Knowledge Distillation
COLING 2024
GPT4Point: A Unified Framework for Point-Language Understanding and Generation
CVPR 2024
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
CVPR 2024
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
CVPR 2024
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
ECCV 2024
Retargeting Visual Data with Deformation Fields
ECCV 2024
Privacy-Preserving In-Context Learning for Large Language Models
ICLR 2024
Large-Vocabulary 3D Diffusion Model with Transformer
ICLR 2024
A Randomized Approach to Tight Privacy Accounting
NIPS 2023
SLAN: Self-Locator Aided Network for Vision-Language Understanding
ICCV 2023
Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction
ICLR 2023
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation
NIPS 2023
V3Det: Vast Vocabulary Visual Detection Dataset
ICCV 2023
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation
CVPR 2023
Intersectional Stereotypes in Large Language Models: Dataset and Analysis
EMNLP 2023
Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise
ICML 2023
Uncovering Adversarial Risks of Test-Time Adaptation
ICML 2023
Towards Trustworthy Explanation: On Causal Rationalization
ICML 2023
Adversarial Robustness of Deep Sensor Fusion Models
WACV 2022
Human-Robot Commensality: Bite Timing Prediction for Robot-Assisted Feeding in Groups
CORL 2022
Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation
ECCV 2022
Balanced Chamfer Distance as a Comprehensive Metric for Point Cloud Completion
NIPS 2021
Few-Shot Object Detection via Association and DIscrimination
NIPS 2021
Towards Evaluating and Training Verifiably Robust Neural Networks
CVPR 2021
Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation
CVPR 2021
Adversarial Robustness Under Long-Tailed Distribution
CVPR 2021
Defending Against Physically Realizable Attacks on Image Classification
ICLR 2020
Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation
ECCV 2020
Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets
ECCV 2020
Meta Segmentation Network for Ultra-Resolution Medical Images
IJCAI 2020
Patch Proposal Network for Fast Semantic Segmentation of High-Resolution Images
AAAI 2020
Co-Attentive Multi-Task Learning for Explainable Recommendation
IJCAI 2019