conftrace_

Lei Zhang

399 papers · 2000–2026 · 22 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+17 more ↓

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (23) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (22)

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (25) 🗺️ Taxonomy Completionist (23) 🏠 Conference Loyalist (41) 🌟 Keyword Trendsetter Combo (6) 🤝 Dynamic Duo (37) 👑 Triple Crown 🏆 Grand Slam 🔬 Deep Specialist (52) 🏆 Keyword Champion (7) 📈 Trend Setter 🔥 Unstoppable (16) 🚀 Conference Pioneer ⚡ Prolific Year (22) 💎 Century Club (385) ❓ The Questioner 🗃️ Keyword Collector (61)

Conferences

CVPR (124) ICCV (60) AAAI (54) ECCV (49) ICLR (21) NIPS (19) ACL (15) EMNLP (11) MICCAI (7) IJCAI (7) ICML (6) IJCNLP (5) COLING (5) EACL (3) NAACL (3) NSDI (2) SEMEVAL (2) WACV (2) INTERSPEECH (1) L4DC (1) ACML (1) UAI (1)

Top co-authors

Wangmeng Zuo (37) Shilong Liu (23) Feng Li (21) Hao Zhang (17) Jianfeng Gao (17) Shuai Li (17) Ruihuang Li (15) Wei Wei (15) Ailing Zeng (15) Chenhang He (14)

Research topics

Keywords

object detection (32) convolutional neural network (24) domain adaptation (21) large language model (19) diffusion model (18) image restoration (16) semantic segmentation (15) contrastive learning (15) few-shot learning (13) multimodal learning (12) transfer learning (12) image super-resolution (11) unsupervised learning (11) knowledge distillation (11) image denoising (11) metric learning (9) image generation (9) attention mechanism (8) semi-supervised learning (8) image captioning (8)

Papers

Otter: Mitigating Background Distractions of Wide-Angle Few-Shot Action Recognition with Enhanced RWKV AAAI 2026 Geometric Correspondence Constrained Pseudo-Label Alignment for Source-Free Domain Adaptive Fundus Image Segmentation AAAI 2026 LayerEdit: Disentangled Multi-Object Editing via Conflict-Aware Multi-Layer Learning AAAI 2026 DualFete: Revisiting Teacher-Student Interactions from a Feedback Perspective for Semi-supervised Medical Image Segmentation AAAI 2026 Towards Better Code Understanding in Decoder-Only Models with Contrastive Learning AAAI 2026 From Completion to Editing: Unlocking Context-Aware Code Infilling via Search-and-Replace Instruction Tuning ACL 2026 T-Rex-Omni: Integrating Negative Visual Prompt in Generic Object Detection AAAI 2026 DeepSenseMoE: Harnessing Power of Time Series Foundation Models for Few-Shot Human Activity Recognition AAAI 2026 Fast Multi-view Consistent 3D Editing with Video Priors AAAI 2026 BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection AAAI 2026 JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation Promotion AAAI 2026 Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding AAAI 2026 SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features AAAI 2026 AlignCVC: Aligning Cross-View Consistency for Single-Image-to-3D Generation AAAI 2026 RORem: Training a Robust Object Remover with Human-in-the-Loop CVPR 2025 HumanMM: Global Human Motion Recovery from Multi-shot Videos CVPR 2025 Adversarial Diffusion Compression for Real-World Image Super-Resolution CVPR 2025 Minder: Faulty Machine Detection for Large-scale Distributed Model Training NSDI 2025 Rethinking Smoothness for Fast and Adaptable Entity Alignment Decoding NAACL 2025 SlimFormer-3D: A Layer-Adaptive Lightweight Transformer for Efficient 3D Medical Image Segmentation MICCAI 2025 R1Seg-3D: Rethinking Reasoning Segmentation for Medical 3D CTs MICCAI 2025 PDC-Net: Pattern Divide-and-Conquer Network for Pelvic Radiation Injury Segmentation MICCAI 2025 SLRL: Semi-Supervised Local Community Detection Based on Reinforcement Learning AAAI 2025 Generalizable Sensor-Based Activity Recognition via Categorical Concept Invariant Learning AAAI 2025 Multi-Edge Reinforced Collaborative Data Acquisition for Continuous Video Analytics by Prioritizing Quality over Quantity AAAI 2025 CustomContrast: A Multilevel Contrastive Perspective for Subject-Driven Text-to-Image Customization AAAI 2025 GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution AAAI 2025 Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence AAAI 2025 SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing AAAI 2025 CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility AAAI 2025 ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data AAAI 2025 MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis AAAI 2025 GapMatch: Bridging Instance and Model Perturbations for Enhanced Semi-Supervised Medical Image Segmentation AAAI 2025 Adversarial Contrastive Graph Augmentation with Counterfactual Regularization AAAI 2025 Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection AAAI 2025 Fine-Tuning Language Models with Collaborative and Semantic Experts AAAI 2025 Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs AAAI 2025 Controllable Skin Synthesis via Lesion-Focused Vector Autoregression Model MICCAI 2025 BayesKD: Bayesian Knowledge Distillation for Compact LLMs in Constrained Fine-tuning Scenarios ACL 2025 STORYTELLER: An Enhanced Plot-Planning Framework for Coherent and Cohesive Story Generation ACL 2025 Beyond Statistical Analysis: Multimodal Framework for Time Series Forecasting with LLM-Driven Temporal Pattern IJCAI 2025 Prompt-Free Conditional Diffusion for Multi-object Image Augmentation IJCAI 2025 Mining Word Boundaries from Speech-Text Parallel Data for Cross-domain Chinese Word Segmentation COLING 2025 A Novel Negative Sample Generation Method for Contrastive Learning in Hierarchical Text Classification COLING 2025 Synthesizing Software Engineering Data in a Test-Driven Manner ICML 2025 FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling ICLR 2025 Toward Generalizing Visual Brain Decoding to Unseen Subjects ICLR 2025 LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation ICLR 2025 Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion ICLR 2025 Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning ICLR 2025 DEEM: Diffusion models serve as the eyes of large language models for image perception ICLR 2025 On-the-fly Preference Alignment via Principle-Guided Decoding ICLR 2025 Autoregressive Pretraining with Mamba in Vision ICLR 2025 Scaling Speech-Text Pre-training with Synthetic Interleaved Data ICLR 2025 UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images ICCV 2025 Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models ICCV 2025 Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution ICCV 2025 InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction ICCV 2025 FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models ICCV 2025 Reverse Convolution and Its Applications to Image Restoration ICCV 2025 Referring to Any Person ICCV 2025 Towards Effective Foundation Model Adaptation for Extreme Cross-Domain Few-Shot Learning ICCV 2025 Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training ICCV 2025 Hierarchy-Aware Pseudo Word Learning with Text Adaptation for Zero-Shot Composed Image Retrieval ICCV 2025 Co-Painter: Fine-Grained Controllable Image Stylization via Implicit Decoupling and Adaptive Injection ICCV 2025 ForgeLens: Data-Efficient Forgery Focus for Generalizable Forgery Image Detection ICCV 2025 Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation ICCV 2025 Dual-Temporal Exemplar Representation Network for Video Semantic Segmentation ICCV 2025 Integrating Visual Interpretation and Linguistic Reasoning for Geometric Problem Solving ICCV 2025 ESGenius: Benchmarking LLMs on Environmental, Social, and Governance (ESG) and Sustainability Knowledge EMNLP 2025 CodeArena: Evaluating and Aligning CodeLLMs on Human Preference EMNLP 2025 LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians CVPR 2025 OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction CVPR 2025 Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data CVPR 2025 FeedEdit: Text-Based Image Editing with Dynamic Feedback Regulation CVPR 2025 D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation CVPR 2025 HandOS: 3D Hand Reconstruction in One Stage CVPR 2025 Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption CVPR 2025 Low-Biased General Annotated Dataset Generation CVPR 2025 MaSS13K: A Matting-level Semantic Segmentation Benchmark CVPR 2025 SkillMimic: Learning Basketball Interaction Skills from Demonstrations CVPR 2025 Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach CVPR 2025 DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric Estimation ICML 2024 State-Constrained Zero-Sum Differential Games with One-Sided Information ICML 2024 Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models EMNLP 2024 Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA EMNLP 2024 A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment ECCV 2024 Meta-Exploiting Frequency Prior for Cross-Domain Few-Shot Learning NIPS 2024 TAPTRv2: Attention-based Position Update Improves Tracking Any Point NIPS 2024 TW-NLP at SemEval-2024 Task10: Emotion Recognition and Emotion Reversal Inference in Multi-Party Dialogues. SEMEVAL 2024 Self-Supervised Video Desmoking for Laparoscopic Surgery ECCV 2024 LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models ECCV 2024 TW-NLP at SemEval-2024 Task10: Emotion Recognition and Emotion Reversal Inference in Multi-Party Dialogues. NAACL 2024 Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model ECCV 2024 General Geometry-aware Weakly Supervised 3D Object Detection ECCV 2024 MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation ECCV 2024 Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models CVPR 2024 Open-World Human-Object Interaction Detection via Multi-modal Prompts CVPR 2024 Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment CVPR 2024 UniVS: Unified and Universal Video Segmentation with Prompts as Queries CVPR 2024 SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution CVPR 2024 Visual In-Context Prompting CVPR 2024 Efficient Scene Recovery Using Luminous Flux Prior CVPR 2024 Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM CVPR 2024 Osprey: Pixel Understanding with Visual Instruction Tuning CVPR 2024 Neural Super-Resolution for Real-time Rendering with Radiance Demodulation CVPR 2024 Homology Consistency Constrained Efficient Tuning for Vision-Language Models NIPS 2024 One-Step Effective Diffusion Network for Real-World Image Super-Resolution NIPS 2024 Implicit Discriminative Knowledge Learning for Visible-Infrared Person Re-Identification CVPR 2024 Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer CVPR 2024 AdaNeg: Adaptive Negative Proxy Guided OOD Detection with Vision-Language Models NIPS 2024 Many-Shot In-Context Learning NIPS 2024 Dynamic Weighted Combiner for Mixed-Modal Image Retrieval AAAI 2024 Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing AAAI 2024 Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching AAAI 2024 Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding ECCV 2024 Tag2Text: Guiding Vision-Language Model via Image Tagging ICLR 2024 Symbol as Points: Panoptic Symbol Spotting via Point-based Representation ICLR 2024 Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts ICLR 2024 DreamTime: An Improved Optimization Strategy for Diffusion-Guided 3D Generation ICLR 2024 TOSS: High-quality Text-guided Novel View Synthesis from a Single Image ICLR 2024 Segment and Recognize Anything at Any Granularity ECCV 2024 LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents ECCV 2024 Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection ECCV 2024 X-Pose: Detecting Any Keypoints ECCV 2024 Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution ECCV 2024 LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models ECCV 2024 Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter ECCV 2024 T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy ECCV 2024 ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention ECCV 2024 Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models ECCV 2024 TinyU-Net: Lighter yet Better U-Net with Cascaded Multi-Receptive Fields MICCAI 2024 MetaUNETR: Rethinking Token Mixer Encoding for Efficient Multi-Organ Segmentation MICCAI 2024 LIDIA: Precise Liver Tumor Diagnosis on Multi-Phase Contrast-Enhanced CT via Iterative Fusion and Asymmetric Contrastive Learning MICCAI 2024 Pontryagin neural operator for solving general-sum differential games with parametric state constraints L4DC 2024 Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection NIPS 2024 One-Shot Learning as Instruction Data Prospector for Large Language Models ACL 2024 Marathon: A Race Through the Realm of Long Context with Large Language Models ACL 2024 Chinese Spoken Named Entity Recognition in Real-world Scenarios: Dataset and Approaches ACL 2024 LIRE: listwise reward enhancement for preference alignment ACL 2024 Knowledge Context Modeling with Pre-trained Language Models for Contrastive Knowledge Graph Completion ACL 2024 CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models ACL 2024 Responsible Visual Editing ECCV 2024 Compress3D: a Compressed Latent Space for 3D Generation from a Single Image ECCV 2024 TAPTR: Tracking Any Point with Transformers as Detection ECCV 2024 Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation ECCV 2024 Safeguarding Sustainable Cities: Unsupervised Video Anomaly Detection through Diffusion-based Latent Pattern Learning IJCAI 2024 Visual-Linguistic Dependency Encoding for Image-Text Retrieval COLING 2024 Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization ECCV 2024 ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation ECCV 2024 MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks EACL 2024 HumanTOMATO: Text-aligned Whole-body Motion Generation ICML 2024 Towards Fairness-aware Adversarial Network Pruning ICCV 2023 DreamWaltz: Make a Scene with Complex 3D Animatable Avatars NIPS 2023 SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation NIPS 2023 Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset NIPS 2023 Semi-Supervised Domain Generalization with Known and Unknown Classes NIPS 2023 Label-efficient Segmentation via Affinity Propagation NIPS 2023 A Comprehensive Benchmark for Neural Human Radiance Fields NIPS 2023 MomentDiff: Generative Video Moment Retrieval from Random to Real NIPS 2023 MMTN: Multi-Modal Memory Transformer Network for Image-Report Consistent Medical Report Generation AAAI 2023 DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding AAAI 2023 Revisiting Unsupervised Local Descriptor Learning AAAI 2023 Mind the Gap: Polishing Pseudo Labels for Accurate Semi-supervised Object Detection AAAI 2023 Are Transformers Effective for Time Series Forecasting? AAAI 2023 DRGCN: Dynamic Evolving Initial Residual for Deep Graph Convolutional Networks AAAI 2023 Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning ACL 2023 DynaMask: Dynamic Mask Selection for Instance Segmentation CVPR 2023 Revisiting Prototypical Network for Cross Domain Few-Shot Learning CVPR 2023 A General Regret Bound of Preconditioned Gradient Method for DNN Training CVPR 2023 OTAvatar: One-Shot Talking Face Avatar With Controllable Tri-Plane Rendering CVPR 2023 Glocal Energy-Based Learning for Few-Shot Open-Set Recognition CVPR 2023 DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training CVPR 2023 SIM: Semantic-Aware Instance Mask Generation for Box-Supervised Instance Segmentation CVPR 2023 Accelerating Dataset Distillation via Model Augmentation CVPR 2023 Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes CVPR 2023 MSF: Motion-Guided Sequential Fusion for Efficient 3D Object Detection From Point Cloud Sequences CVPR 2023 MDQE: Mining Discriminative Query Embeddings To Segment Occluded Instances on Challenging Videos CVPR 2023 Sharpness-Aware Gradient Matching for Domain Generalization CVPR 2023 One-Stage 3D Whole-Body Mesh Recovery With Component Aware Transformer CVPR 2023 Human Guided Ground-Truth Generation for Realistic Image Super-Resolution CVPR 2023 Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation CVPR 2023 Inferring and Leveraging Parts From Object Shape for Improving Semantic Image Synthesis CVPR 2023 Joint HDR Denoising and Fusion: A Real-World Mobile HDR Image Dataset CVPR 2023 MP-Former: Mask-Piloted Transformer for Image Segmentation CVPR 2023 One-to-Few Label Assignment for End-to-End Dense Detection CVPR 2023 Multi-View Adversarial Discriminator: Mine the Non-Causal Factors for Object Detection in Unseen Domains CVPR 2023 Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR CVPR 2023 E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation EMNLP 2023 A Benchmark for Chinese-English Scene Text Image Super-Resolution ICCV 2023 CORE: Cooperative Reconstruction for Multi-Agent Perception ICCV 2023 Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport ICCV 2023 A Simple Framework for Open-Vocabulary Segmentation and Detection ICCV 2023 FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation ICCV 2023 DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting ICCV 2023 RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning ICCV 2023 Generative Action Description Prompts for Skeleton-based Action Recognition ICCV 2023 Detection Transformer with Stable Matching ICCV 2023 HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation ICCV 2023 Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation ICCV 2023 ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation ICCV 2023 Neural Interactive Keypoint Detection ICCV 2023 Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle ICCV 2023 LipsFormer: Introducing Lipschitz Continuity to Vision Transformers ICLR 2023 DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection ICLR 2023 Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation ICLR 2023 The Benefit of Hindsight: Tracing Edge-Cases in Distributed Systems NSDI 2023 Conditional counterfactual causal effect for individual attribution UAI 2023 Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation CVPR 2022 A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-Resolution CVPR 2022 Spatiotemporal Self-Attention Modeling with Temporal Patch Shift for Action Recognition ECCV 2022 Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval ECCV 2022 Efficient Long-Range Attention Network for Image Super-Resolution ECCV 2022 From Face to Natural Image: Learning Real Degradation for Blind Image Super-Resolution ECCV 2022 Unfolded Deep Kernel Estimation for Blind Image Super-Resolution ECCV 2022 Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution ECCV 2022 A Dual Weighting Label Assignment Scheme for Object Detection CVPR 2022 Neural Architecture Search With Representation Mutual Information CVPR 2022 A Differentiable Two-Stage Alignment Scheme for Burst Image Reconstruction With Large Shift CVPR 2022 Towards Efficient Data Free Black-Box Adversarial Attack CVPR 2022 Large-Scale Pre-Training for Person Re-Identification With Noisy Labels CVPR 2022 Blind Image Super-Resolution With Elaborate Degradation Modeling on Noise and Kernel CVPR 2022 Grounded Language-Image Pre-Training CVPR 2022 Quantization-Aware Deep Optics for Diffractive Snapshot Hyperspectral Imaging CVPR 2022 Dense Learning Based Semi-Supervised Object Detection CVPR 2022 DN-DETR: Accelerate DETR Training by Introducing Query DeNoising CVPR 2022 QaDialMoE: Question-answering Dialogue based Fact Verification with Mixture of Experts EMNLP 2022 An Embedded Feature Whitening Approach to Deep Neural Network Optimization ECCV 2022 Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution CVPR 2022 Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization CVPR 2022 Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection From Point Clouds CVPR 2022 Box-Supervised Instance Segmentation with Level Set Evolution ECCV 2022 Attention Diversification for Domain Generalization ECCV 2022 MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources INTERSPEECH 2022 From “Dynamics on Graphs” to “Dynamics of Graphs”: An Adaptive Echo-State Network Solution (Student Abstract) AAAI 2022 Co-promotion Predictions of Financing Market and Sales Market: A Cooperative-Competitive Attention Approach AAAI 2022 Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions AAAI 2022 Reconciling Cognitive Modeling with Knowledge Forgetting: A Continuous Time-aware Neural Network Approach IJCAI 2022 DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR ICLR 2022 Learning Domain Adaptive Object Detection with Probabilistic Teacher ICML 2022 Axiomatic Explanations for Visual Search, Retrieval, and Similarity Learning ICLR 2022 OTExtSum: Extractive Text Summarisation with Optimal Transport NAACL 2022 Neighborhood-Adaptive Structure Augmented Metric Learning AAAI 2022 Deep Metric Learning with Graph Consistency AAAI 2021 MedAI at SemEval-2021 Task 10: Negation-aware Pre-training for Source-free Negation Detection Domain Adaptation ACL 2021 MedAI at SemEval-2021 Task 10: Negation-aware Pre-training for Source-free Negation Detection Domain Adaptation IJCNLP 2021 Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents IJCNLP 2021 High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network CVPR 2021 TAP: Text-Aware Pre-Training for Text-VQA and Text-Caption CVPR 2021 Deep Convolutional Dictionary Learning for Image Denoising CVPR 2021 Learning Tensor Low-Rank Prior for Hyperspectral Image Reconstruction CVPR 2021 Dynamic Head: Unifying Object Detection Heads With Attentions CVPR 2021 Dynamic Weighted Learning for Unsupervised Domain Adaptation CVPR 2021 GAN Prior Embedded Network for Blind Face Restoration in the Wild CVPR 2021 Learning Parallel Dense Correspondence From Spatio-Temporal Descriptors for Efficient and Robust 4D Reconstruction CVPR 2021 Unsupervised Pre-Training for Person Re-Identification CVPR 2021 Virtual Fully-Connected Layer: Training a Large-Scale Face Recognition Dataset With Limited Computational Resources CVPR 2021 Interactive Self-Training With Mean Teachers for Semi-Supervised Object Detection CVPR 2021 Progressive Semantic-Aware Style Transformation for Blind Face Restoration CVPR 2021 Unsupervised Part Segmentation Through Disentangling Appearance and Shape CVPR 2021 PPR10K: A Large-Scale Portrait Photo Retouching Dataset With Human-Region Mask and Group-Level Consistency CVPR 2021 Spatial Feature Calibration and Temporal Fusion for Effective One-Stage Video Instance Segmentation CVPR 2021 VinVL: Revisiting Visual Representations in Vision-Language Models CVPR 2021 Contrastive Learning Based Hybrid Networks for Long-Tailed Image Classification CVPR 2021 VirFace: Enhancing Face Recognition via Unlabeled Shallow Data CVPR 2021 Category Dictionary Guided Unsupervised Domain Adaptation for Object Detection AAAI 2021 Lite-HRNet: A Lightweight High-Resolution Network CVPR 2021 DAP: Detection-Aware Pre-Training With Weak Supervision CVPR 2021 Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding ICCV 2021 Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting ICCV 2021 SA-ConvONet: Sign-Agnostic Optimization of Convolutional Occupancy Networks ICCV 2021 Dynamic DETR: End-to-End Object Detection With Dynamic Attention ICCV 2021 CvT: Introducing Convolutions to Vision Transformers ICCV 2021 Real-World Video Super-Resolution: A Benchmark Dataset and a Decomposition Based Learning Scheme ICCV 2021 Reconcile Prediction Consistency for Balanced Object Detection ICCV 2021 HDR Video Reconstruction: A Coarse-To-Fine Network and a Real-World Benchmark Dataset ICCV 2021 MicroNet: Improving Image Recognition With Extremely Low FLOPs ICCV 2021 Improve Unsupervised Pretraining for Few-Label Transfer ICCV 2021 MedAI at SemEval-2021 Task 10: Negation-aware Pre-training for Source-free Negation Detection Domain Adaptation SEMEVAL 2021 Adversarial Pose Regression Network for Pose-Invariant Face Recognitions AAAI 2021 VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning AAAI 2021 Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents ACL 2021 Question-Driven Span Labeling Model for Aspect–Opinion Pair Extraction AAAI 2021 SEED: Self-supervised Distillation For Visual Representation ICLR 2021 Chasing Sparsity in Vision Transformers: An End-to-End Exploration NIPS 2021 Joint Intent Detection and Entity Linking on Spatial Domain Queries EMNLP 2020 Linear Symmetric Quantization of Neural Networks for Low-precision Integer Hardware ICLR 2020 A Single Stream Network for Robust and Real-time RGB-D Salient Object Detection ECCV 2020 Self-adaptive Re-weighted Adversarial Domain Adaptation IJCAI 2020 Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks ECCV 2020 Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer ECCV 2020 Anchor Box Optimization for Object Detection WACV 2020 Variational Image Deraining WACV 2020 Unified Vision-Language Pre-Training for Image Captioning and VQA AAAI 2020 Pixel-Aware Deep Function-Mixture Network for Spectral Super-Resolution AAAI 2020 A Multi-Unit Profit Competitive Mechanism for Cellular Traffic Offloading AAAI 2020 Multi-Channel Reverse Dictionary Model AAAI 2020 Gradient Centralization: A New Optimization Technique for Deep Neural Networks ECCV 2020 Suppress and Balance: A Simple Gated Network for Salient Object Detection ECCV 2020 Label Propagation with Augmented Anchors: A Simple Semi-Supervised Learning baseline for Unsupervised Domain Adaptation ECCV 2020 Multi-Domain Learning for Accurate and Few-Shot Color Constancy CVPR 2020 Integrating Task Specific Information into Pretrained Language Models for Low Resource Fine Tuning EMNLP 2020 WantWords: An Open-source Online Reverse Dictionary System EMNLP 2020 Unsupervised Adaptation Learning for Hyperspectral Imagery Super-Resolution CVPR 2020 CPR-GCN: Conditional Partial-Residual Graph Convolutional Network in Automated Anatomical Labeling of Coronary Arteries CVPR 2020 HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation CVPR 2020 Probability Weighted Compact Feature for Domain Adaptive Retrieval CVPR 2020 Structure Aware Single-Stage 3D Object Detection From Point Cloud CVPR 2020 Blind Face Restoration via Deep Multi-scale Component Dictionaries ECCV 2020 Dual Adversarial Network: Toward Real-world Noise Removal and Noise Generation ECCV 2020 LST-Net: Learning a Convolutional Neural Network with a Learnable Sparse Transform ECCV 2020 Momentum Batch Normalization for Deep Learning with Small Batch Size ECCV 2020 Bidirectional Dependency-Guided Attention for Relation Extraction ACML 2020 A Decoupled Learning Scheme for Real-world Burst Denoising from Raw Images ECCV 2020 Domain Adaptive Object Detection via Asymmetric Tri-way Faster-RCNN ECCV 2020 WSOD2: Learning Bottom-Up and Top-Down Objectness Distillation for Weakly-Supervised Object Detection ICCV 2019 View Confusion Feature Learning for Person Re-Identification ICCV 2019 Learning a Visual Tracker from a Single Movie without Annotation AAAI 2019 Optimal Projection Guided Transfer Hashing for Image Retrieval AAAI 2019 An Efficient Compressive Convolutional Network for Unified Object Detection and Image Compression AAAI 2019 Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation CVPR 2019 FOCNet: A Fractional Optimal Control Network for Image Denoising CVPR 2019 Reliable and Efficient Image Cropping: A Grid Anchor Based Approach CVPR 2019 Toward Convolutional Blind Denoising of Real Photographs CVPR 2019 REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning IJCNLP 2019 Deep Plug-And-Play Super-Resolution for Arbitrary Blur Kernels CVPR 2019 Toward Real-World Single Image Super-Resolution: A New Benchmark and a New Model ICCV 2019 TDSNN: From Deep Neural Networks to Deep Spike Neural Networks with Temporal-Coding AAAI 2019 Variational Bayesian Dropout With a Hierarchical Prior CVPR 2019 TIGEr: Text-to-Image Grounding for Image Caption Evaluation EMNLP 2019 REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning EMNLP 2019 Second-Order Attention Network for Single Image Super-Resolution CVPR 2019 TIGEr: Text-to-Image Grounding for Image Caption Evaluation IJCNLP 2019 Variational Denoising Network: Toward Blind Noise Modeling and Removal NIPS 2019 Object-Driven Text-To-Image Synthesis via Adversarial Training CVPR 2019 Dynamic Anchor Feature Selection for Single-Shot Object Detection ICCV 2019 Multi-Adversarial Faster-RCNN for Unrestricted Object Detection ICCV 2019 Joint Representation and Truncated Inference Learning for Correlation Filter based Tracking ECCV 2018 A Trilateral Weighted Sparse Coding Scheme for Real-World Image Denoising ECCV 2018 AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos ECCV 2018 Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web Prior ECCV 2018 A PID Controller Approach for Stochastic Optimization of Deep Networks CVPR 2018 Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering CVPR 2018 CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise CVPR 2018 Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking CVPR 2018 A Hybrid l1-l0 Layer Decomposition Model for Tone Mapping CVPR 2018 Learning a Single Convolutional Super-Resolution Network for Multiple Degradations CVPR 2018 Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection CVPR 2018 Social Media based Simulation Models for Understanding Disease Dynamics IJCAI 2018 Turbo Learning for CaptionBot and DrawingBot NIPS 2018 Deblurring Natural Image Using Super-Gaussian Fields ECCV 2018 Joint Convolutional Analysis and Synthesis Sparse Representation for Single Image Layer Separation ICCV 2017 Multi-Channel Weighted Nuclear Norm Minimization for Real Color Image Denoising ICCV 2017 Learning Deep CNN Denoiser Prior for Image Restoration CVPR 2017 Learning Dynamic Guidance for Depth Image Enhancement CVPR 2017 G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to Visual Recognition CVPR 2017 Fine-Tuning Convolutional Neural Networks for Biomedical Image Analysis: Actively and Incrementally CVPR 2017 3D Surface Detail Enhancement From a Single Normal Map ICCV 2017 Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization ICCV 2017 When Unsupervised Domain Adaptation Meets Tensor Representations ICCV 2017 RAID-G: Robust Estimation of Approximate Infinite Dimensional Gaussian With Application to Material Recognition CVPR 2016 A Self-Representation Induced Classifier IJCAI 2016 Joint Learning of Single-Image and Cross-Image Representations for Person Re-Identification CVPR 2016 Group MAD Competition - A New Methodology to Compare Objective Image Quality Models CVPR 2016 Multispectral Images Denoising by Intrinsic Tensor Sparsity Regularization CVPR 2016 Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection CVPR 2016 A Probabilistic Collaborative Representation Based Approach for Pattern Classification CVPR 2016 Object Tracking via Dual Linear Structured SVM and Explicit Feature Map CVPR 2016 Reweighted Laplace Prior Based Hyperspectral Compressive Sensing for Unknown Sparsity CVPR 2015 Discriminative Learning of Iteration-Wise Priors for Blind Deconvolution CVPR 2015 Patch Group Based Nonlocal Self-Similarity Prior Learning for Image Denoising ICCV 2015 External Patch Prior Guided Internal Clustering for Image Denoising ICCV 2015 Hyperspectral Compressive Sensing Using Manifold-Structured Sparsity Prior ICCV 2015 Convolutional Sparse Coding for Image Super-Resolution ICCV 2015 Point Matching in the Presence of Outliers in Both Point Sets: A Concave Optimization Approach CVPR 2014 Semantic Annotation, Analysis and Comparison: A Multilingual and Cross-lingual Text Analytics Toolkit EACL 2014 Weighted Nuclear Norm Minimization with Application to Image Denoising CVPR 2014 Projective dictionary pair learning for pattern classification NIPS 2014 XLike Project Language Analysis Services EACL 2014 Robust Principal Component Analysis with Complex Noise ICML 2014 Learning without Human Scores for Blind Image Quality Assessment CVPR 2013 Log-Euclidean Kernels for Sparse Representation and Dictionary Learning ICCV 2013 From Point to Set: Extend the Learning of Distance Metrics ICCV 2013 Sparse Variation Dictionary Learning for Face Recognition with a Single Training Sample per Person ICCV 2013 Perceptual Fidelity Aware Mean Squared Error ICCV 2013 A Novel Earth Mover's Distance Methodology for Image Matching with Gaussian Mixture Models ICCV 2013 Scalable Sparse Subspace Clustering CVPR 2013 A Generalized Iterated Shrinkage Algorithm for Non-convex Sparse Coding ICCV 2013 Efficient 2D-to-3D Correspondence Filtering for Scalable 3D Object Recognition CVPR 2013 Texture Enhanced Image Denoising via Gradient Histogram Preservation CVPR 2013 Exploring Implicit Image Statistics for Visual Representativeness Modeling CVPR 2013 Binary Code Ranking with Weighted Hamming Distance CVPR 2013 Generalization Bounds for Domain Adaptation NIPS 2012 Identifying Noun Product Features that Imply Opinions ACL 2011 Extracting Resource Terms for Sentiment Analysis IJCNLP 2011 Distributional Similarity vs. PU Learning for Entity Set Expansion ACL 2010 Extracting and Ranking Product Features in Opinion Documents COLING 2010 Chinese Named Entity Identification Using Class-based Language Model COLING 2002 Automatic Detecting/Correcting Errors in Chinese Text by an Approximate Word-Matching Algorithm ACL 2000