Lei Zhang
399 papers · 2000–2026 · 22 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (23) π Interdisciplinary Bridge π Conference Polyglot (22)
π
Interdisciplinary Bridge
π
Academic Marathon
(25)
πΊοΈ
Taxonomy Completionist
(23)
π
Conference Loyalist
(41)
π
Keyword Trendsetter Combo
(6)
π€
Dynamic Duo
(37)
π
Triple Crown
π
Grand Slam
π¬
Deep Specialist
(52)
π
Keyword Champion
(7)
π
Trend Setter
π₯
Unstoppable
(16)
π
Conference Pioneer
β‘
Prolific Year
(22)
π
Century Club
(385)
β
The Questioner
ποΈ
Keyword Collector
(61)
Conferences
CVPR (124)
ICCV (60)
AAAI (54)
ECCV (49)
ICLR (21)
NIPS (19)
ACL (15)
EMNLP (11)
MICCAI (7)
IJCAI (7)
ICML (6)
IJCNLP (5)
COLING (5)
EACL (3)
NAACL (3)
NSDI (2)
SEMEVAL (2)
WACV (2)
INTERSPEECH (1)
L4DC (1)
ACML (1)
UAI (1)
Top co-authors
Research topics
Keywords
object detection
(32)
convolutional neural network
(24)
domain adaptation
(21)
large language model
(19)
diffusion model
(18)
image restoration
(16)
semantic segmentation
(15)
contrastive learning
(15)
few-shot learning
(13)
multimodal learning
(12)
transfer learning
(12)
image super-resolution
(11)
unsupervised learning
(11)
knowledge distillation
(11)
image denoising
(11)
metric learning
(9)
image generation
(9)
attention mechanism
(8)
semi-supervised learning
(8)
image captioning
(8)
Papers
Otter: Mitigating Background Distractions of Wide-Angle Few-Shot Action Recognition with Enhanced RWKV
AAAI 2026
Geometric Correspondence Constrained Pseudo-Label Alignment for Source-Free Domain Adaptive Fundus Image Segmentation
AAAI 2026
LayerEdit: Disentangled Multi-Object Editing via Conflict-Aware Multi-Layer Learning
AAAI 2026
DualFete: Revisiting Teacher-Student Interactions from a Feedback Perspective for Semi-supervised Medical Image Segmentation
AAAI 2026
Towards Better Code Understanding in Decoder-Only Models with Contrastive Learning
AAAI 2026
From Completion to Editing: Unlocking Context-Aware Code Infilling via Search-and-Replace Instruction Tuning
ACL 2026
T-Rex-Omni: Integrating Negative Visual Prompt in Generic Object Detection
AAAI 2026
DeepSenseMoE: Harnessing Power of Time Series Foundation Models for Few-Shot Human Activity Recognition
AAAI 2026
Fast Multi-view Consistent 3D Editing with Video Priors
AAAI 2026
BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection
AAAI 2026
JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation Promotion
AAAI 2026
Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding
AAAI 2026
SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features
AAAI 2026
AlignCVC: Aligning Cross-View Consistency for Single-Image-to-3D Generation
AAAI 2026
RORem: Training a Robust Object Remover with Human-in-the-Loop
CVPR 2025
HumanMM: Global Human Motion Recovery from Multi-shot Videos
CVPR 2025
Adversarial Diffusion Compression for Real-World Image Super-Resolution
CVPR 2025
Minder: Faulty Machine Detection for Large-scale Distributed Model Training
NSDI 2025
Rethinking Smoothness for Fast and Adaptable Entity Alignment Decoding
NAACL 2025
SlimFormer-3D: A Layer-Adaptive Lightweight Transformer for Efficient 3D Medical Image Segmentation
MICCAI 2025
R1Seg-3D: Rethinking Reasoning Segmentation for Medical 3D CTs
MICCAI 2025
PDC-Net: Pattern Divide-and-Conquer Network for Pelvic Radiation Injury Segmentation
MICCAI 2025
SLRL: Semi-Supervised Local Community Detection Based on Reinforcement Learning
AAAI 2025
Generalizable Sensor-Based Activity Recognition via Categorical Concept Invariant Learning
AAAI 2025
Multi-Edge Reinforced Collaborative Data Acquisition for Continuous Video Analytics by Prioritizing Quality over Quantity
AAAI 2025
CustomContrast: A Multilevel Contrastive Perspective for Subject-Driven Text-to-Image Customization
AAAI 2025
GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution
AAAI 2025
Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence
AAAI 2025
SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing
AAAI 2025
CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility
AAAI 2025
ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data
AAAI 2025
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
AAAI 2025
GapMatch: Bridging Instance and Model Perturbations for Enhanced Semi-Supervised Medical Image Segmentation
AAAI 2025
Adversarial Contrastive Graph Augmentation with Counterfactual Regularization
AAAI 2025
Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection
AAAI 2025
Fine-Tuning Language Models with Collaborative and Semantic Experts
AAAI 2025
Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs
AAAI 2025
Controllable Skin Synthesis via Lesion-Focused Vector Autoregression Model
MICCAI 2025
BayesKD: Bayesian Knowledge Distillation for Compact LLMs in Constrained Fine-tuning Scenarios
ACL 2025
STORYTELLER: An Enhanced Plot-Planning Framework for Coherent and Cohesive Story Generation
ACL 2025
Beyond Statistical Analysis: Multimodal Framework for Time Series Forecasting with LLM-Driven Temporal Pattern
IJCAI 2025
Prompt-Free Conditional Diffusion for Multi-object Image Augmentation
IJCAI 2025
Mining Word Boundaries from Speech-Text Parallel Data for Cross-domain Chinese Word Segmentation
COLING 2025
A Novel Negative Sample Generation Method for Contrastive Learning in Hierarchical Text Classification
COLING 2025
Synthesizing Software Engineering Data in a Test-Driven Manner
ICML 2025
FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling
ICLR 2025
Toward Generalizing Visual Brain Decoding to Unseen Subjects
ICLR 2025
LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation
ICLR 2025
Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion
ICLR 2025
Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning
ICLR 2025
DEEM: Diffusion models serve as the eyes of large language models for image perception
ICLR 2025
On-the-fly Preference Alignment via Principle-Guided Decoding
ICLR 2025
Autoregressive Pretraining with Mamba in Vision
ICLR 2025
Scaling Speech-Text Pre-training with Synthetic Interleaved Data
ICLR 2025
UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images
ICCV 2025
Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
ICCV 2025
Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution
ICCV 2025
InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction
ICCV 2025
FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
ICCV 2025
Reverse Convolution and Its Applications to Image Restoration
ICCV 2025
Referring to Any Person
ICCV 2025
Towards Effective Foundation Model Adaptation for Extreme Cross-Domain Few-Shot Learning
ICCV 2025
Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training
ICCV 2025
Hierarchy-Aware Pseudo Word Learning with Text Adaptation for Zero-Shot Composed Image Retrieval
ICCV 2025
Co-Painter: Fine-Grained Controllable Image Stylization via Implicit Decoupling and Adaptive Injection
ICCV 2025
ForgeLens: Data-Efficient Forgery Focus for Generalizable Forgery Image Detection
ICCV 2025
Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation
ICCV 2025
Dual-Temporal Exemplar Representation Network for Video Semantic Segmentation
ICCV 2025
Integrating Visual Interpretation and Linguistic Reasoning for Geometric Problem Solving
ICCV 2025
ESGenius: Benchmarking LLMs on Environmental, Social, and Governance (ESG) and Sustainability Knowledge
EMNLP 2025
CodeArena: Evaluating and Aligning CodeLLMs on Human Preference
EMNLP 2025
LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians
CVPR 2025
OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction
CVPR 2025
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
CVPR 2025
FeedEdit: Text-Based Image Editing with Dynamic Feedback Regulation
CVPR 2025
D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation
CVPR 2025
HandOS: 3D Hand Reconstruction in One Stage
CVPR 2025
Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption
CVPR 2025
Low-Biased General Annotated Dataset Generation
CVPR 2025
MaSS13K: A Matting-level Semantic Segmentation Benchmark
CVPR 2025
SkillMimic: Learning Basketball Interaction Skills from Demonstrations
CVPR 2025
Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach
CVPR 2025
DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric Estimation
ICML 2024
State-Constrained Zero-Sum Differential Games with One-Sided Information
ICML 2024
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
EMNLP 2024
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
EMNLP 2024
A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment
ECCV 2024
Meta-Exploiting Frequency Prior for Cross-Domain Few-Shot Learning
NIPS 2024
TAPTRv2: Attention-based Position Update Improves Tracking Any Point
NIPS 2024
TW-NLP at SemEval-2024 Task10: Emotion Recognition and Emotion Reversal Inference in Multi-Party Dialogues.
SEMEVAL 2024
Self-Supervised Video Desmoking for Laparoscopic Surgery
ECCV 2024
LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models
ECCV 2024
TW-NLP at SemEval-2024 Task10: Emotion Recognition and Emotion Reversal Inference in Multi-Party Dialogues.
NAACL 2024
Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model
ECCV 2024
General Geometry-aware Weakly Supervised 3D Object Detection
ECCV 2024
MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation
ECCV 2024
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
CVPR 2024
Open-World Human-Object Interaction Detection via Multi-modal Prompts
CVPR 2024
Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment
CVPR 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
CVPR 2024
SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
CVPR 2024
Visual In-Context Prompting
CVPR 2024
Efficient Scene Recovery Using Luminous Flux Prior
CVPR 2024
Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM
CVPR 2024
Osprey: Pixel Understanding with Visual Instruction Tuning
CVPR 2024
Neural Super-Resolution for Real-time Rendering with Radiance Demodulation
CVPR 2024
Homology Consistency Constrained Efficient Tuning for Vision-Language Models
NIPS 2024
One-Step Effective Diffusion Network for Real-World Image Super-Resolution
NIPS 2024
Implicit Discriminative Knowledge Learning for Visible-Infrared Person Re-Identification
CVPR 2024
Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer
CVPR 2024
AdaNeg: Adaptive Negative Proxy Guided OOD Detection with Vision-Language Models
NIPS 2024
Many-Shot In-Context Learning
NIPS 2024
Dynamic Weighted Combiner for Mixed-Modal Image Retrieval
AAAI 2024
Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing
AAAI 2024
Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching
AAAI 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
ECCV 2024
Tag2Text: Guiding Vision-Language Model via Image Tagging
ICLR 2024
Symbol as Points: Panoptic Symbol Spotting via Point-based Representation
ICLR 2024
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
ICLR 2024
DreamTime: An Improved Optimization Strategy for Diffusion-Guided 3D Generation
ICLR 2024
TOSS: High-quality Text-guided Novel View Synthesis from a Single Image
ICLR 2024
Segment and Recognize Anything at Any Granularity
ECCV 2024
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
ECCV 2024
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
ECCV 2024
X-Pose: Detecting Any Keypoints
ECCV 2024
Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution
ECCV 2024
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
ECCV 2024
Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter
ECCV 2024
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
ECCV 2024
ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention
ECCV 2024
Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
ECCV 2024
TinyU-Net: Lighter yet Better U-Net with Cascaded Multi-Receptive Fields
MICCAI 2024
MetaUNETR: Rethinking Token Mixer Encoding for Efficient Multi-Organ Segmentation
MICCAI 2024
LIDIA: Precise Liver Tumor Diagnosis on Multi-Phase Contrast-Enhanced CT via Iterative Fusion and Asymmetric Contrastive Learning
MICCAI 2024
Pontryagin neural operator for solving general-sum differential games with parametric state constraints
L4DC 2024
Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection
NIPS 2024
One-Shot Learning as Instruction Data Prospector for Large Language Models
ACL 2024
Marathon: A Race Through the Realm of Long Context with Large Language Models
ACL 2024
Chinese Spoken Named Entity Recognition in Real-world Scenarios: Dataset and Approaches
ACL 2024
LIRE: listwise reward enhancement for preference alignment
ACL 2024
Knowledge Context Modeling with Pre-trained Language Models for Contrastive Knowledge Graph Completion
ACL 2024
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models
ACL 2024
Responsible Visual Editing
ECCV 2024
Compress3D: a Compressed Latent Space for 3D Generation from a Single Image
ECCV 2024
TAPTR: Tracking Any Point with Transformers as Detection
ECCV 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
ECCV 2024
Safeguarding Sustainable Cities: Unsupervised Video Anomaly Detection through Diffusion-based Latent Pattern Learning
IJCAI 2024
Visual-Linguistic Dependency Encoding for Image-Text Retrieval
COLING 2024
Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization
ECCV 2024
ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation
ECCV 2024
MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks
EACL 2024
HumanTOMATO: Text-aligned Whole-body Motion Generation
ICML 2024
Towards Fairness-aware Adversarial Network Pruning
ICCV 2023
DreamWaltz: Make a Scene with Complex 3D Animatable Avatars
NIPS 2023
SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation
NIPS 2023
Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset
NIPS 2023
Semi-Supervised Domain Generalization with Known and Unknown Classes
NIPS 2023
Label-efficient Segmentation via Affinity Propagation
NIPS 2023
A Comprehensive Benchmark for Neural Human Radiance Fields
NIPS 2023
MomentDiff: Generative Video Moment Retrieval from Random to Real
NIPS 2023
MMTN: Multi-Modal Memory Transformer Network for Image-Report Consistent Medical Report Generation
AAAI 2023
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
AAAI 2023
Revisiting Unsupervised Local Descriptor Learning
AAAI 2023
Mind the Gap: Polishing Pseudo Labels for Accurate Semi-supervised Object Detection
AAAI 2023
Are Transformers Effective for Time Series Forecasting?
AAAI 2023
DRGCN: Dynamic Evolving Initial Residual for Deep Graph Convolutional Networks
AAAI 2023
Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning
ACL 2023
DynaMask: Dynamic Mask Selection for Instance Segmentation
CVPR 2023
Revisiting Prototypical Network for Cross Domain Few-Shot Learning
CVPR 2023
A General Regret Bound of Preconditioned Gradient Method for DNN Training
CVPR 2023
OTAvatar: One-Shot Talking Face Avatar With Controllable Tri-Plane Rendering
CVPR 2023
Glocal Energy-Based Learning for Few-Shot Open-Set Recognition
CVPR 2023
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training
CVPR 2023
SIM: Semantic-Aware Instance Mask Generation for Box-Supervised Instance Segmentation
CVPR 2023
Accelerating Dataset Distillation via Model Augmentation
CVPR 2023
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
CVPR 2023
MSF: Motion-Guided Sequential Fusion for Efficient 3D Object Detection From Point Cloud Sequences
CVPR 2023
MDQE: Mining Discriminative Query Embeddings To Segment Occluded Instances on Challenging Videos
CVPR 2023
Sharpness-Aware Gradient Matching for Domain Generalization
CVPR 2023
One-Stage 3D Whole-Body Mesh Recovery With Component Aware Transformer
CVPR 2023
Human Guided Ground-Truth Generation for Realistic Image Super-Resolution
CVPR 2023
Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation
CVPR 2023
Inferring and Leveraging Parts From Object Shape for Improving Semantic Image Synthesis
CVPR 2023
Joint HDR Denoising and Fusion: A Real-World Mobile HDR Image Dataset
CVPR 2023
MP-Former: Mask-Piloted Transformer for Image Segmentation
CVPR 2023
One-to-Few Label Assignment for End-to-End Dense Detection
CVPR 2023
Multi-View Adversarial Discriminator: Mine the Non-Causal Factors for Object Detection in Unseen Domains
CVPR 2023
Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR
CVPR 2023
E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation
EMNLP 2023
A Benchmark for Chinese-English Scene Text Image Super-Resolution
ICCV 2023
CORE: Cooperative Reconstruction for Multi-Agent Perception
ICCV 2023
Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport
ICCV 2023
A Simple Framework for Open-Vocabulary Segmentation and Detection
ICCV 2023
FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation
ICCV 2023
DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting
ICCV 2023
RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning
ICCV 2023
Generative Action Description Prompts for Skeleton-based Action Recognition
ICCV 2023
Detection Transformer with Stable Matching
ICCV 2023
HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation
ICCV 2023
Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation
ICCV 2023
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
ICCV 2023
Neural Interactive Keypoint Detection
ICCV 2023
Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle
ICCV 2023
LipsFormer: Introducing Lipschitz Continuity to Vision Transformers
ICLR 2023
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
ICLR 2023
Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation
ICLR 2023
The Benefit of Hindsight: Tracing Edge-Cases in Distributed Systems
NSDI 2023
Conditional counterfactual causal effect for individual attribution
UAI 2023
Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation
CVPR 2022
A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-Resolution
CVPR 2022
Spatiotemporal Self-Attention Modeling with Temporal Patch Shift for Action Recognition
ECCV 2022
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval
ECCV 2022
Efficient Long-Range Attention Network for Image Super-Resolution
ECCV 2022
From Face to Natural Image: Learning Real Degradation for Blind Image Super-Resolution
ECCV 2022
Unfolded Deep Kernel Estimation for Blind Image Super-Resolution
ECCV 2022
Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution
ECCV 2022
A Dual Weighting Label Assignment Scheme for Object Detection
CVPR 2022
Neural Architecture Search With Representation Mutual Information
CVPR 2022
A Differentiable Two-Stage Alignment Scheme for Burst Image Reconstruction With Large Shift
CVPR 2022
Towards Efficient Data Free Black-Box Adversarial Attack
CVPR 2022
Large-Scale Pre-Training for Person Re-Identification With Noisy Labels
CVPR 2022
Blind Image Super-Resolution With Elaborate Degradation Modeling on Noise and Kernel
CVPR 2022
Grounded Language-Image Pre-Training
CVPR 2022
Quantization-Aware Deep Optics for Diffractive Snapshot Hyperspectral Imaging
CVPR 2022
Dense Learning Based Semi-Supervised Object Detection
CVPR 2022
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
CVPR 2022
QaDialMoE: Question-answering Dialogue based Fact Verification with Mixture of Experts
EMNLP 2022
An Embedded Feature Whitening Approach to Deep Neural Network Optimization
ECCV 2022
Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution
CVPR 2022
Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization
CVPR 2022
Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection From Point Clouds
CVPR 2022
Box-Supervised Instance Segmentation with Level Set Evolution
ECCV 2022
Attention Diversification for Domain Generalization
ECCV 2022
MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources
INTERSPEECH 2022
From βDynamics on Graphsβ to βDynamics of Graphsβ: An Adaptive Echo-State Network Solution (Student Abstract)
AAAI 2022
Co-promotion Predictions of Financing Market and Sales Market: A Cooperative-Competitive Attention Approach
AAAI 2022
Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions
AAAI 2022
Reconciling Cognitive Modeling with Knowledge Forgetting: A Continuous Time-aware Neural Network Approach
IJCAI 2022
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
ICLR 2022
Learning Domain Adaptive Object Detection with Probabilistic Teacher
ICML 2022
Axiomatic Explanations for Visual Search, Retrieval, and Similarity Learning
ICLR 2022
OTExtSum: Extractive Text Summarisation with Optimal Transport
NAACL 2022
Neighborhood-Adaptive Structure Augmented Metric Learning
AAAI 2022
Deep Metric Learning with Graph Consistency
AAAI 2021
MedAI at SemEval-2021 Task 10: Negation-aware Pre-training for Source-free Negation Detection Domain Adaptation
ACL 2021
MedAI at SemEval-2021 Task 10: Negation-aware Pre-training for Source-free Negation Detection Domain Adaptation
IJCNLP 2021
Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents
IJCNLP 2021
High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network
CVPR 2021
TAP: Text-Aware Pre-Training for Text-VQA and Text-Caption
CVPR 2021
Deep Convolutional Dictionary Learning for Image Denoising
CVPR 2021
Learning Tensor Low-Rank Prior for Hyperspectral Image Reconstruction
CVPR 2021
Dynamic Head: Unifying Object Detection Heads With Attentions
CVPR 2021
Dynamic Weighted Learning for Unsupervised Domain Adaptation
CVPR 2021
GAN Prior Embedded Network for Blind Face Restoration in the Wild
CVPR 2021
Learning Parallel Dense Correspondence From Spatio-Temporal Descriptors for Efficient and Robust 4D Reconstruction
CVPR 2021
Unsupervised Pre-Training for Person Re-Identification
CVPR 2021
Virtual Fully-Connected Layer: Training a Large-Scale Face Recognition Dataset With Limited Computational Resources
CVPR 2021
Interactive Self-Training With Mean Teachers for Semi-Supervised Object Detection
CVPR 2021
Progressive Semantic-Aware Style Transformation for Blind Face Restoration
CVPR 2021
Unsupervised Part Segmentation Through Disentangling Appearance and Shape
CVPR 2021
PPR10K: A Large-Scale Portrait Photo Retouching Dataset With Human-Region Mask and Group-Level Consistency
CVPR 2021
Spatial Feature Calibration and Temporal Fusion for Effective One-Stage Video Instance Segmentation
CVPR 2021
VinVL: Revisiting Visual Representations in Vision-Language Models
CVPR 2021
Contrastive Learning Based Hybrid Networks for Long-Tailed Image Classification
CVPR 2021
VirFace: Enhancing Face Recognition via Unlabeled Shallow Data
CVPR 2021
Category Dictionary Guided Unsupervised Domain Adaptation for Object Detection
AAAI 2021
Lite-HRNet: A Lightweight High-Resolution Network
CVPR 2021
DAP: Detection-Aware Pre-Training With Weak Supervision
CVPR 2021
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
ICCV 2021
Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting
ICCV 2021
SA-ConvONet: Sign-Agnostic Optimization of Convolutional Occupancy Networks
ICCV 2021
Dynamic DETR: End-to-End Object Detection With Dynamic Attention
ICCV 2021
CvT: Introducing Convolutions to Vision Transformers
ICCV 2021
Real-World Video Super-Resolution: A Benchmark Dataset and a Decomposition Based Learning Scheme
ICCV 2021
Reconcile Prediction Consistency for Balanced Object Detection
ICCV 2021
HDR Video Reconstruction: A Coarse-To-Fine Network and a Real-World Benchmark Dataset
ICCV 2021
MicroNet: Improving Image Recognition With Extremely Low FLOPs
ICCV 2021
Improve Unsupervised Pretraining for Few-Label Transfer
ICCV 2021
MedAI at SemEval-2021 Task 10: Negation-aware Pre-training for Source-free Negation Detection Domain Adaptation
SEMEVAL 2021
Adversarial Pose Regression Network for Pose-Invariant Face Recognitions
AAAI 2021
VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning
AAAI 2021
Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents
ACL 2021
Question-Driven Span Labeling Model for AspectβOpinion Pair Extraction
AAAI 2021
SEED: Self-supervised Distillation For Visual Representation
ICLR 2021
Chasing Sparsity in Vision Transformers: An End-to-End Exploration
NIPS 2021
Joint Intent Detection and Entity Linking on Spatial Domain Queries
EMNLP 2020
Linear Symmetric Quantization of Neural Networks for Low-precision Integer Hardware
ICLR 2020
A Single Stream Network for Robust and Real-time RGB-D Salient Object Detection
ECCV 2020
Self-adaptive Re-weighted Adversarial Domain Adaptation
IJCAI 2020
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
ECCV 2020
Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer
ECCV 2020
Anchor Box Optimization for Object Detection
WACV 2020
Variational Image Deraining
WACV 2020
Unified Vision-Language Pre-Training for Image Captioning and VQA
AAAI 2020
Pixel-Aware Deep Function-Mixture Network for Spectral Super-Resolution
AAAI 2020
A Multi-Unit Profit Competitive Mechanism for Cellular Traffic Offloading
AAAI 2020
Multi-Channel Reverse Dictionary Model
AAAI 2020
Gradient Centralization: A New Optimization Technique for Deep Neural Networks
ECCV 2020
Suppress and Balance: A Simple Gated Network for Salient Object Detection
ECCV 2020
Label Propagation with Augmented Anchors: A Simple Semi-Supervised Learning baseline for Unsupervised Domain Adaptation
ECCV 2020
Multi-Domain Learning for Accurate and Few-Shot Color Constancy
CVPR 2020
Integrating Task Specific Information into Pretrained Language Models for Low Resource Fine Tuning
EMNLP 2020
WantWords: An Open-source Online Reverse Dictionary System
EMNLP 2020
Unsupervised Adaptation Learning for Hyperspectral Imagery Super-Resolution
CVPR 2020
CPR-GCN: Conditional Partial-Residual Graph Convolutional Network in Automated Anatomical Labeling of Coronary Arteries
CVPR 2020
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation
CVPR 2020
Probability Weighted Compact Feature for Domain Adaptive Retrieval
CVPR 2020
Structure Aware Single-Stage 3D Object Detection From Point Cloud
CVPR 2020
Blind Face Restoration via Deep Multi-scale Component Dictionaries
ECCV 2020
Dual Adversarial Network: Toward Real-world Noise Removal and Noise Generation
ECCV 2020
LST-Net: Learning a Convolutional Neural Network with a Learnable Sparse Transform
ECCV 2020
Momentum Batch Normalization for Deep Learning with Small Batch Size
ECCV 2020
Bidirectional Dependency-Guided Attention for Relation Extraction
ACML 2020
A Decoupled Learning Scheme for Real-world Burst Denoising from Raw Images
ECCV 2020
Domain Adaptive Object Detection via Asymmetric Tri-way Faster-RCNN
ECCV 2020
WSOD2: Learning Bottom-Up and Top-Down Objectness Distillation for Weakly-Supervised Object Detection
ICCV 2019
View Confusion Feature Learning for Person Re-Identification
ICCV 2019
Learning a Visual Tracker from a Single Movie without Annotation
AAAI 2019
Optimal Projection Guided Transfer Hashing for Image Retrieval
AAAI 2019
An Efficient Compressive Convolutional Network for Unified Object Detection and Image Compression
AAAI 2019
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
CVPR 2019
FOCNet: A Fractional Optimal Control Network for Image Denoising
CVPR 2019
Reliable and Efficient Image Cropping: A Grid Anchor Based Approach
CVPR 2019
Toward Convolutional Blind Denoising of Real Photographs
CVPR 2019
REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning
IJCNLP 2019
Deep Plug-And-Play Super-Resolution for Arbitrary Blur Kernels
CVPR 2019
Toward Real-World Single Image Super-Resolution: A New Benchmark and a New Model
ICCV 2019
TDSNN: From Deep Neural Networks to Deep Spike Neural Networks with Temporal-Coding
AAAI 2019
Variational Bayesian Dropout With a Hierarchical Prior
CVPR 2019
TIGEr: Text-to-Image Grounding for Image Caption Evaluation
EMNLP 2019
REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning
EMNLP 2019
Second-Order Attention Network for Single Image Super-Resolution
CVPR 2019
TIGEr: Text-to-Image Grounding for Image Caption Evaluation
IJCNLP 2019
Variational Denoising Network: Toward Blind Noise Modeling and Removal
NIPS 2019
Object-Driven Text-To-Image Synthesis via Adversarial Training
CVPR 2019
Dynamic Anchor Feature Selection for Single-Shot Object Detection
ICCV 2019
Multi-Adversarial Faster-RCNN for Unrestricted Object Detection
ICCV 2019
Joint Representation and Truncated Inference Learning for Correlation Filter based Tracking
ECCV 2018
A Trilateral Weighted Sparse Coding Scheme for Real-World Image Denoising
ECCV 2018
AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos
ECCV 2018
Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web Prior
ECCV 2018
A PID Controller Approach for Stochastic Optimization of Deep Networks
CVPR 2018
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
CVPR 2018
CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise
CVPR 2018
Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking
CVPR 2018
A Hybrid l1-l0 Layer Decomposition Model for Tone Mapping
CVPR 2018
Learning a Single Convolutional Super-Resolution Network for Multiple Degradations
CVPR 2018
Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection
CVPR 2018
Social Media based Simulation Models for Understanding Disease Dynamics
IJCAI 2018
Turbo Learning for CaptionBot and DrawingBot
NIPS 2018
Deblurring Natural Image Using Super-Gaussian Fields
ECCV 2018
Joint Convolutional Analysis and Synthesis Sparse Representation for Single Image Layer Separation
ICCV 2017
Multi-Channel Weighted Nuclear Norm Minimization for Real Color Image Denoising
ICCV 2017
Learning Deep CNN Denoiser Prior for Image Restoration
CVPR 2017
Learning Dynamic Guidance for Depth Image Enhancement
CVPR 2017
G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to Visual Recognition
CVPR 2017
Fine-Tuning Convolutional Neural Networks for Biomedical Image Analysis: Actively and Incrementally
CVPR 2017
3D Surface Detail Enhancement From a Single Normal Map
ICCV 2017
Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization
ICCV 2017
When Unsupervised Domain Adaptation Meets Tensor Representations
ICCV 2017
RAID-G: Robust Estimation of Approximate Infinite Dimensional Gaussian With Application to Material Recognition
CVPR 2016
A Self-Representation Induced Classifier
IJCAI 2016
Joint Learning of Single-Image and Cross-Image Representations for Person Re-Identification
CVPR 2016
Group MAD Competition - A New Methodology to Compare Objective Image Quality Models
CVPR 2016
Multispectral Images Denoising by Intrinsic Tensor Sparsity Regularization
CVPR 2016
Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection
CVPR 2016
A Probabilistic Collaborative Representation Based Approach for Pattern Classification
CVPR 2016
Object Tracking via Dual Linear Structured SVM and Explicit Feature Map
CVPR 2016
Reweighted Laplace Prior Based Hyperspectral Compressive Sensing for Unknown Sparsity
CVPR 2015
Discriminative Learning of Iteration-Wise Priors for Blind Deconvolution
CVPR 2015
Patch Group Based Nonlocal Self-Similarity Prior Learning for Image Denoising
ICCV 2015
External Patch Prior Guided Internal Clustering for Image Denoising
ICCV 2015
Hyperspectral Compressive Sensing Using Manifold-Structured Sparsity Prior
ICCV 2015
Convolutional Sparse Coding for Image Super-Resolution
ICCV 2015
Point Matching in the Presence of Outliers in Both Point Sets: A Concave Optimization Approach
CVPR 2014
Semantic Annotation, Analysis and Comparison: A Multilingual and Cross-lingual Text Analytics Toolkit
EACL 2014
Weighted Nuclear Norm Minimization with Application to Image Denoising
CVPR 2014
Projective dictionary pair learning for pattern classification
NIPS 2014
XLike Project Language Analysis Services
EACL 2014
Robust Principal Component Analysis with Complex Noise
ICML 2014
Learning without Human Scores for Blind Image Quality Assessment
CVPR 2013
Log-Euclidean Kernels for Sparse Representation and Dictionary Learning
ICCV 2013
From Point to Set: Extend the Learning of Distance Metrics
ICCV 2013
Sparse Variation Dictionary Learning for Face Recognition with a Single Training Sample per Person
ICCV 2013
Perceptual Fidelity Aware Mean Squared Error
ICCV 2013
A Novel Earth Mover's Distance Methodology for Image Matching with Gaussian Mixture Models
ICCV 2013
Scalable Sparse Subspace Clustering
CVPR 2013
A Generalized Iterated Shrinkage Algorithm for Non-convex Sparse Coding
ICCV 2013
Efficient 2D-to-3D Correspondence Filtering for Scalable 3D Object Recognition
CVPR 2013
Texture Enhanced Image Denoising via Gradient Histogram Preservation
CVPR 2013
Exploring Implicit Image Statistics for Visual Representativeness Modeling
CVPR 2013
Binary Code Ranking with Weighted Hamming Distance
CVPR 2013
Generalization Bounds for Domain Adaptation
NIPS 2012
Identifying Noun Product Features that Imply Opinions
ACL 2011
Extracting Resource Terms for Sentiment Analysis
IJCNLP 2011
Distributional Similarity vs. PU Learning for Entity Set Expansion
ACL 2010
Extracting and Ranking Product Features in Opinion Documents
COLING 2010
Chinese Named Entity Identification Using Class-based Language Model
COLING 2002
Automatic Detecting/Correcting Errors in Chinese Text by an Approximate Word-Matching Algorithm
ACL 2000