Xiang Li
317 papers · 2013–2026 · 26 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
🗺️ Taxonomy Completionist (48) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🐣 Hot Topic Early Bird
🌈
Renaissance Researcher
(7)
🌉
Interdisciplinary Bridge
🐝
Cross-Pollinator
(7)
🏠
Conference Loyalist
(41)
🤝
Dynamic Duo
(40)
👑
Triple Crown
🏆
Keyword Champion
(2)
🏆
Grand Slam
👥
Mega-Team
(71)
🔬
Deep Specialist
(35)
🧬
Topic Evolution
🚀
Conference Pioneer
⚡
Prolific Year
(66)
🔥
Unstoppable
(11)
🗃️
Keyword Collector
(154)
💎
Century Club
(296)
📈
Trend Setter
❓
The Questioner
(4)
Conferences
AAAI (43)
NIPS (41)
ACL (36)
CVPR (25)
EMNLP (23)
IJCAI (18)
ICLR (18)
COLING (17)
ICCV (17)
MICCAI (14)
ECCV (11)
ICML (10)
INTERSPEECH (9)
NAACL (8)
IJCNLP (5)
WACV (5)
NSDI (4)
AISTATS (3)
MIDL (2)
AACL (2)
EACL (1)
CORL (1)
COLT (1)
JMLR (1)
SEMEVAL (1)
UAI (1)
Top co-authors
Research topics
Keywords
large language model
(29)
object detection
(18)
knowledge distillation
(17)
graph neural network
(15)
contrastive learning
(15)
multimodal learning
(14)
model compression
(11)
neural network
(10)
representation learning
(9)
diffusion model
(9)
semi-supervised learning
(8)
attention mechanism
(8)
few-shot learning
(7)
self-supervised learning
(7)
unsupervised learning
(7)
text classification
(7)
convolutional neural network
(7)
transfer learning
(7)
zero-shot learning
(6)
reinforcement learning
(6)
Papers
SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection
AAAI 2026
TCoT: Trajectory Chain-of-Thoughts for Robotic Manipulation with Failure Recovery in Vision-Language-Action Model
AAAI 2026
Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models
AAAI 2026
Unsupervised Text Style Transfer for Controllable Intensity
EACL 2026
Ego-PMOVE: Prompt-aware Mixture of View Experts Network for Egocentric Gaze Prediction
AAAI 2026
DenoDet V2: Phase-Amplitude Cross Denoising for SAR Object Detection
AAAI 2026
LPPG-RL: Lexicographically Projected Policy Gradient Reinforcement Learning with Subproblem Exploration
AAAI 2026
TTT-UNet: Enhancing U-Net with Test-Time Training Layers for Biomedical Image Segmentation
MIDL 2026
Human Cognition Inspired RAG with Knowledge Graph for Complex Problem Solving
AAAI 2026
Beyond Adapter Retrieval: Latent Geometry-Preserving Composition via Sparse Task Projection
AAAI 2026
RATE: Reviewer Profiling and Annotation-free Training for Expertise Ranking in Peer Review Systems
ACL 2026
Community-Aware Assessment of Social Textual Engagement and Resonance: A Human-Centric Perspective on User-Generated Content Evaluation
ACL 2026
Analyzing and Internalizing Complex Policy Documents for LLM Agents
ACL 2026
FinKario: Event-Enhanced Automated Construction of Financial Knowledge Graph
ACL 2026
Efficient Transcoder Adaptation for Fine-Tuned Models: Revealing Medical Reasoning Mechanisms in Large Language Models
AAAI 2026
Analyze–Compose–Execute: A Dynamic Dialogue Framework for Multi-Agent Debate
AAAI 2026
GigaMoE: Sparsity-Guided Mixture of Experts for Efficient Gigapixel Object Detection
AAAI 2026
Multiplex Heterogeneous Graph Neural Networks with Euclidean-Riemannian Mutual Space Synergy
AAAI 2026
Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection
AAAI 2026
SpatioTemporal Difference Network for Video Depth Super-Resolution
AAAI 2026
GeoBayes: Probabilistic Image Geo-Localization Inference via Sequential Bayesian Updating
AAAI 2026
Preference Adaptive and Sequential Text-to-Image Generation
ICML 2025
Distribution-aware Fairness Learning in Medical Image Segmentation From A Control-Theoretic Perspective
ICML 2025
Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video
ICLR 2025
XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser
COLING 2025
LogiGraph: Logical Reasoning with Contrastive Learning and Lightweight Graph Networks
COLING 2025
SKIntern: Internalizing Symbolic Knowledge for Distilling Better CoT Capabilities into Small Language Models
COLING 2025
Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
COLING 2025
Explain-Analyze-Generate: A Sequential Multi-Agent Collaboration Method for Complex Reasoning
COLING 2025
Impromptu Cybercrime Euphemism Detection
COLING 2025
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption
CVPR 2025
Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving
ICLR 2025
PRDetect: Perturbation-Robust LLM-generated Text Detection Based on Syntax Tree
NAACL 2025
PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization
NAACL 2025
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
ICCV 2025
Advancing Textual Prompt Learning with Anchored Attributes
ICCV 2025
SAMed-2: Selective Memory Enhanced Medical Segment Anything Model
MICCAI 2025
MS-IQA: A Multi-Scale Feature Fusion Network for PET/CT Image Quality Assessment
MICCAI 2025
MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts
MICCAI 2025
LVPNet: A Latent-variable-based Prediction-driven End-to-end Framework for Lossless Compression of Medical Images
MICCAI 2025
Mitigating Spurious Correlations via Counterfactual Contrastive Learning
EMNLP 2025
DeMAC: Enhancing Multi-Agent Coordination with Dynamic DAG and Manager-Player Feedback
EMNLP 2025
Permitted Knowledge Boundary: Evaluating the Knowledge-Constrained Responsiveness of Large Language Models
EMNLP 2025
SGCD: Subtask-Guided Causal-Debiasing Framework for Robust Cross-Utterance Sentiment Quadruple Extraction in Dialogues
EMNLP 2025
TF-Mamba: Text-enhanced Fusion Mamba with Missing Modalities for Robust Multimodal Sentiment Analysis
EMNLP 2025
CAARMA: Class Augmentation with Adversarial Mixup Regularization
EMNLP 2025
ASD-iLLM:An Intervention Large Language Model for Autistic Children based on Real Clinical Dialogue Intervention Dataset
EMNLP 2025
Multimodal Document-level Triple Extraction via Dynamic Graph Enhancement and Relation-Aware Reflection
EMNLP 2025
SEAGraph: Unveiling the Whole Story of Paper Review Comments
IJCNLP 2025
Backdoor Attacks on Neural Networks via One-Bit Flip
ICCV 2025
Not All Layers of LLMs Are Necessary During Inference
IJCAI 2025
From Words to Worth: Newborn Article Impact Prediction with LLM
AAAI 2025
Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection
AAAI 2025
Hierarchically Controlled Deformable 3D Gaussians for Talking Head Synthesis
AAAI 2025
Leveraging Large Language Models for Node Generation in Few-Shot Learning on Text-Attributed Graphs
AAAI 2025
Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent
AAAI 2025
TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning
AAAI 2025
Every Opinion Matters: Evaluating and Building Models with Pluralistic Views
AAAI 2025
LLMsPark: A Benchmark for Evaluating Large Language Models in Strategic Gaming Contexts
EMNLP 2025
SEAGraph: Unveiling the Whole Story of Paper Review Comments
AACL 2025
Multi-Modal Large Language Model with RAG Strategies in Soccer Commentary Generation
WACV 2025
UniTMGE: Uniform Text-Motion Generation and Editing Model via Diffusion
WACV 2025
GroundingMate: Aiding Object Grounding for Goal-Oriented Vision-and-Language Navigation
WACV 2025
MaskDGNN: Self-Supervised Dynamic Graph Neural Networks with Activeness-aware Temporal Masking
IJCAI 2025
Corruption-Robust Variance-aware Algorithms for Generalized Linear Bandits under Heavy-tailed Rewards
UAI 2025
Holmes: Localizing Irregularities in LLM Training with Mega-scale GPU Clusters
NSDI 2025
Text Detoxification: Data Efficiency, Semantic Preservation and Model Generalization
EMNLP 2025
Can Large Language Models Act as Ensembler for Multi-GNNs?
EMNLP 2025
Cascaded 3D Diffusion Models for Whole-body 3D 18-F FDG PET/CT synthesis from Demographics
MICCAI 2025
Leveraging Diffusion Models for Continual Test-Time Adaptation in Fundus Image Classification
MICCAI 2025
Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties
CVPR 2025
Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data
ICLR 2025
ECHOPulse: ECG Controlled Echocardio-gram Video Generation
ICLR 2025
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
ICLR 2025
Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting
ICLR 2025
DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing
ICCV 2025
Multi-level Relevance Document Identifier Learning for Generative Retrieval
ACL 2025
Demystifying Small Language Models for Edge Deployment
ACL 2025
Initializing and Retrofitting Key-Value Adaptors for Traceable Model Editing
ACL 2025
A Survey of LLM-based Agents in Medicine: How far are we from Baymax?
ACL 2025
Let’s Be Self-generated via Step by Step: A Curriculum Learning Approach to Automated Reasoning with Large Language Models
ACL 2025
See the World, Discover Knowledge: A Chinese Factuality Evaluation for Large Vision Language Models
ACL 2025
Enhancing LLM-based Hatred and Toxicity Detection with Meta-Toxic Knowledge Graph
ACL 2025
RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark
CVPR 2025
Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation
CVPR 2025
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
ICLR 2025
Unlocking ECMP Programmability for Precise Traffic Control
NSDI 2025
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
CVPR 2025
HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction
CVPR 2025
ImageFolder: Autoregressive Image Generation with Folded Tokens
ICLR 2025
Understanding Long Videos with Multimodal Language Models
ICLR 2025
Rethinking Point Cloud Data Augmentation: Topologically Consistent Deformation
ICML 2025
Masked Autoencoders Are Effective Tokenizers for Diffusion Models
ICML 2025
Hallucination Index: An Image Quality Metric for Generative Reconstruction Models
MICCAI 2024
AG-LSEC: Audio Grounded Lexical Speaker Error Correction
INTERSPEECH 2024
Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization
NIPS 2024
A General Framework for Learning from Weak Supervision
ICML 2024
Position: TrustLLM: Trustworthiness in Large Language Models
ICML 2024
Completing Visual Objects via Bridging Generation and Segmentation
ICML 2024
Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization
INTERSPEECH 2024
RisQNet: Rescuing SMEs from Financial Shocks with a Novel Networked-Loan Risk Assessment
IJCAI 2024
No Regularization Is Needed: Efficient and Effective Incomplete Label Distribution Learning
IJCAI 2024
UniAudio 1.5: Large Language Model-Driven Audio Codec is A Few-Shot Audio Task Learner
NIPS 2024
Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure
NIPS 2024
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations
NIPS 2024
DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain
NIPS 2024
Cross-model Control: Improving Multiple Large Language Models in One-time Training
NIPS 2024
Memory-Efficient Gradient Unrolling for Large-Scale Bi-level Optimization
NIPS 2024
Biomedical Visual Instruction Tuning with Clinician Preference Alignment
NIPS 2024
Suitable is the Best: Task-Oriented Knowledge Fusion in Vulnerability Detection
NIPS 2024
Slight Corruption in Pre-training Data Makes Better Diffusion Models
NIPS 2024
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection
NIPS 2024
3DCoMPaT200: Language Grounded Large-Scale 3D Vision Dataset for Compositional Recognition
NIPS 2024
Novel Object Synthesis via Adaptive Text-Image Harmony
NIPS 2024
In-Hand 3D Object Reconstruction from a Monocular RGB Video
AAAI 2024
DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection
AAAI 2024
AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization
AAAI 2024
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders
WACV 2024
Boosting Language Models Reasoning with Chain-of-Knowledge Prompting
ACL 2024
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction
ACL 2024
Fine-Grained Image-Text Alignment in Medical Imaging Enables Explainable Cyclic Image-Report Generation
ACL 2024
Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering
ACL 2024
Visual In-Context Learning for Large Vision-Language Models
ACL 2024
Parameter-Agnostic Optimization under Relaxed Smoothness
AISTATS 2024
AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework
COLING 2024
Conjoin after Decompose: Improving Few-Shot Performance of Named Entity Recognition
COLING 2024
Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives
COLING 2024
MMAD:Multi-modal Movie Audio Description
COLING 2024
MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts
COLING 2024
Structure-aware Fine-tuning for Code Pre-trained Models
COLING 2024
TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills
COLING 2024
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
NSDI 2024
CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
NAACL 2024
Beyond Read-Only: Crafting a Comprehensive Chinese Text-to-SQL Dataset for Database Manipulation and Query
NAACL 2024
Planning and Editing What You Retrieve for Enhanced Tool Learning
NAACL 2024
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition
NAACL 2024
CrossKD: Cross-Head Knowledge Distillation for Object Detection
CVPR 2024
QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition
CVPR 2024
VA3: Virtually Assured Amplification Attack on Probabilistic Copyright Protection for Text-to-Image Generative Models
CVPR 2024
PromptKD: Unsupervised Prompt Distillation for Vision-Language Models
CVPR 2024
Volumetric Conditional Score-based Residual Diffusion Model for PET/MR Denoising
MICCAI 2024
R^2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations
ECCV 2024
Uni3DL: A Unified Model for 3D Vision-Language Understanding
ECCV 2024
Cascade Prompt Learning for Visual-Language Model Adaptation
ECCV 2024
Distilling Knowledge from Large-Scale Image Models for Object Detection
ECCV 2024
VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding
NIPS 2024
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning
NIPS 2024
Achieving Near-Optimal Convergence for Distributed Minimax Optimization with Adaptive Stepsizes
NIPS 2024
Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis
EMNLP 2024
Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting
MICCAI 2024
F2TNet: FMRI to T1w MRI Knowledge Transfer Network for Brain Multi-phenotype Prediction
MICCAI 2024
Diffusion-Enhanced Transformation Consistency Learning for Retinal Image Segmentation
MICCAI 2024
CryoSAM: Training-free CryoET Tomogram Segmentation with Foundation Models
MICCAI 2024
Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction
MICCAI 2024
Cache-Driven Spatial Test-Time Adaptation for Cross-Modality Medical Image Segmentation
MICCAI 2024
A Random Projection Approach to Personalized Federated Learning: Enhancing Communication Efficiency, Robustness, and Fairness
JMLR 2024
Training-free Multi-objective Diffusion Model for 3D Molecule Generation
ICLR 2024
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
ICLR 2024
Decoding Natural Images from EEG for Object Recognition
ICLR 2024
Creative Birds: Self-Supervised Single-View 3D Style Transfer
ICCV 2023
Contact2Grasp: 3D Grasp Synthesis via Hand-Object Contact Constraint
IJCAI 2023
Multi-Target Semantic Parsing with Collaborative Deliberation Network
IJCNLP 2023
The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features
INTERSPEECH 2023
Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction
INTERSPEECH 2023
Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text
EMNLP 2023
Exploring All-In-One Knowledge Distillation Framework for Neural Machine Translation
EMNLP 2023
OssCSE: Overcoming Surface Structure Bias in Contrastive Learning for Unsupervised Sentence Embedding
EMNLP 2023
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
EMNLP 2023
Pass-Tuning: Towards Structure-Aware Parameter-Efficient Tuning for Code Representation Learning
EMNLP 2023
Uncertainty-aware Parameter-Efficient Self-training for Semi-supervised Language Understanding
EMNLP 2023
Evaluating and Enhancing the Robustness of Code Pre-trained Models through Structure-Aware Adversarial Samples Generation
EMNLP 2023
In-Image Neural Machine Translation with Segmented Pixel Sequence-to-Sequence Model
EMNLP 2023
Near-optimal Policy Identification in Active Reinforcement Learning
ICLR 2023
Ranking-Enhanced Unsupervised Sentence Representation Learning
ACL 2023
Exploring Better Text Image Translation with Multimodal Codebook
ACL 2023
Multi-Target Semantic Parsing with Collaborative Deliberation Network
AACL 2023
PGSS: Pitch-Guided Speech Separation
AAAI 2023
Decision-Making Context Interaction Network for Click-Through Rate Prediction
AAAI 2023
Structure Flow-Guided Network for Real Depth Super-resolution
AAAI 2023
Recurrent Structure Attention Guidance for Depth Super-resolution
AAAI 2023
DesNet: Decomposed Scale-Consistent Network for Unsupervised Depth Completion
AAAI 2023
Curriculum Temperature for Knowledge Distillation
AAAI 2023
LWSIS: LiDAR-Guided Weakly Supervised Instance Segmentation for Autonomous Driving
AAAI 2023
Panoramic Video Salient Object Detection with Ambisonic Audio Guidance
AAAI 2023
Distortion and Uncertainty Aware Loss for Panoramic Depth Completion
ICML 2023
Compositional Zero-Shot Artistic Font Synthesis
IJCAI 2023
Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic Model
INTERSPEECH 2023
Explaining Temporal Graph Models through an Explorer-Navigator Framework
ICLR 2023
TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax Optimization
ICLR 2023
DFRD: Data-Free Robustness Distillation for Heterogeneous Federated Learning
NIPS 2023
Causally-Aware Intraoperative Imputation for Overall Survival Time Prediction
CVPR 2023
MoStGAN-V: Video Generation With Temporal Motion Styles
CVPR 2023
GradMA: A Gradient-Memory-Based Accelerated Federated Learning With Alleviated Catastrophic Forgetting
CVPR 2023
ADNet: Lane Shape Prediction via Anchor Decomposition
ICCV 2023
Video State-Changing Object Segmentation
ICCV 2023
HopFIR: Hop-wise GraphFormer with Intragroup Joint Refinement for 3D Human Pose Estimation
ICCV 2023
Robust Referring Video Object Segmentation with Cyclic Structural Consensus
ICCV 2023
A Unified Solution for Privacy and Communication Efficiency in Vertical Federated Learning
NIPS 2023
LD2: Scalable Heterophilous Graph Neural Network with Decoupled Embeddings
NIPS 2023
PaintSeg: Painting Pixels for Training-free Segmentation
NIPS 2023
A Statistical Analysis of Polyak-Ruppert Averaged Q-Learning
AISTATS 2023
Statistical Analysis of Karcher Means for Random Restricted PSD Matrices
AISTATS 2023
The Xiaomi AI Lab’s Speech Translation Systems for IWSLT 2023 Offline Task, Simultaneous Task and Speech-to-Speech Task
ACL 2023
When Gradient Descent Meets Derivative-Free Optimization: A Match Made in Black-Box Scenario
ACL 2023
S3HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering
ACL 2023
FishNet: A Large-scale Dataset and Benchmark for Fish Recognition, Detection, and Functional Trait Prediction
ICCV 2023
Learning to Compress Prompts with Gist Tokens
NIPS 2023
Fine-Grained Visual Prompting
NIPS 2023
YouTubePD: A Multimodal Benchmark for Parkinson’s Disease Analysis
NIPS 2023
Two Sides of One Coin: the Limits of Untuned SGD and the Power of Adaptive Methods
NIPS 2023
Large Selective Kernel Network for Remote Sensing Object Detection
ICCV 2023
DLGSANet: Lightweight Dynamic Local and Global Self-Attention Networks for Image Super-Resolution
ICCV 2023
Weakly Supervised Text Classification using Supervision Signals from a Language Model
NAACL 2022
Diffusion-LM Improves Controllable Text Generation
NIPS 2022
RecursiveMix: Mixed Learning with History
NIPS 2022
DTG-SSOD: Dense Teacher Guidance for Semi-Supervised Object Detection
NIPS 2022
Nest Your Adaptive Algorithm for Parameter-Agnostic Nonconvex Minimax Optimization
NIPS 2022
Personalized Federated Learning towards Communication Efficiency, Robustness and Fairness
NIPS 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
NIPS 2022
Asymptotic Behaviors of Projected Stochastic Approximation: A Jump Diffusion Perspective
NIPS 2022
TRITON: Neural Neural Textures for Better Sim2Real
CORL 2022
Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-Guided Feature Imitation
AAAI 2022
Hybrid Instance-Aware Temporal Fusion for Online Video Instance Segmentation
AAAI 2022
JointCL: A Joint Contrastive Learning Framework for Zero-Shot Stance Detection
ACL 2022
Multi-Modal Sarcasm Detection via Cross-Modal Graph Convolutional Network
ACL 2022
A Neural Network Architecture for Program Understanding Inspired by Human Behaviors
ACL 2022
Lexical Knowledge Internalization for Neural Dialog Generation
ACL 2022
The Xiaomi Text-to-Text Simultaneous Speech Translation System for IWSLT 2022
ACL 2022
Answering Numerical Reasoning Questions in Table-Text Hybrid Contents with Graph-based Encoder and Tree-based Decoder
COLING 2022
CofeNet: Context and Former-Label Enhanced Net for Complicated Quotation Extraction
COLING 2022
Towards Robust Neural Machine Translation with Iterative Scheduled Data-Switch Training
COLING 2022
Statistical Estimation and Online Inference via Local SGD
COLT 2022
Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal Information
CVPR 2022
Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion
ECCV 2022
PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection
ECCV 2022
RigNet: Repetitive Image Guided Network for Depth Completion
ECCV 2022
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning
ECCV 2022
Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding
EMNLP 2022
CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure
EMNLP 2022
Detecting Relevant Differences Between Similar Legal Texts
EMNLP 2022
Finding Global Homophily in Graph Neural Networks When Meeting Heterophily
ICML 2022
CGMN: A Contrastive Graph Matching Network for Self-Supervised Graph Similarity Learning
IJCAI 2022
RAW-GNN: RAndom Walk Aggregation based Graph Neural Network
IJCAI 2022
Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
INTERSPEECH 2022
Towards Cross-speaker Reading Style Transfer on Audiobook Dataset
INTERSPEECH 2022
CALM: Constrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis
INTERSPEECH 2022
BIT-Xiaomi’s System for AutoSimTrans 2022
NAACL 2022
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions
ICCV 2021
Regularizing Nighttime Weirdness: Efficient Self-Supervised Monocular Depth Estimation in the Dark
ICCV 2021
HITSZ-HLT at SemEval-2021 Task 5: Ensemble Sequence Labeling and Span Boundary Detection for Toxic Span Detection
IJCNLP 2021
The Image Local Autoregressive Transformer
NIPS 2021
Reinforcement Learning Enhanced Explainer for Graph Neural Networks
NIPS 2021
Towards Multi-Scale Style Control for Expressive Speech Synthesis
INTERSPEECH 2021
Capturing Delayed Feedback in Conversion Rate Prediction via Elapsed-Time Sampling
AAAI 2021
Improving Tree-Structured Decoder Training for Code Generation via Mutual Learning
AAAI 2021
Real-Time Gait-Based Age Estimation and Gender Classification From a Single Image
WACV 2021
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation
ACL 2021
HITSZ-HLT at SemEval-2021 Task 5: Ensemble Sequence Labeling and Span Boundary Detection for Toxic Span Detection
SEMEVAL 2021
HITSZ-HLT at SemEval-2021 Task 5: Ensemble Sequence Labeling and Span Boundary Detection for Toxic Span Detection
ACL 2021
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection
CVPR 2021
Communication-Efficient Distributed SVD via Local Power Iterations
ICML 2021
Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval
ICLR 2021
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation
IJCNLP 2021
Improving One-Shot NAS by Suppressing the Posterior Fading
CVPR 2020
On the Convergence of FedAvg on Non-IID Data
ICLR 2020
Gait Recognition from a Single Image using a Phase-Aware Gait Cycle Reconstruction Network
ECCV 2020
Gait Recognition via Semi-supervised Disentangled Representation Learning to Identity and Covariate Features
CVPR 2020
Few-Shot Learning of Part-Specific Probability Space for 3D Shape Segmentation
CVPR 2020
FSS-1000: A 1000-Class Dataset for Few-Shot Segmentation
CVPR 2020
Scalog: Seamless Reconfiguration and Total Order in a Scalable Shared Log
NSDI 2020
Xiaomi’s Submissions for IWSLT 2020 Open Domain Translation Task
ACL 2020
Modeling Discourse Structure for Document-level Neural Machine Translation
ACL 2020
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
ACL 2020
Safe Sample Screening for Robust Support Vector Machine
AAAI 2020
Quadruply Stochastic Gradient Method for Large Scale Nonlinear Semi-Supervised Ordinal Regression AUC Optimization
AAAI 2020
Do Subsampled Newton Methods Work for High-Dimensional Data?
AAAI 2020
Understanding the Disharmony between Weight Normalization Family and Weight Decay
AAAI 2020
Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection
NIPS 2020
Neuron-level Structured Pruning using Polarization Regularizer
NIPS 2020
Improving Local Identifiability in Probabilistic Box Embeddings
NIPS 2020
An Iterative Multi-Source Mutual Knowledge Transfer Framework for Machine Reading Comprehension
IJCAI 2020
Understanding the Disharmony Between Dropout and Batch Normalization by Variance Shift
CVPR 2019
Selective Kernel Networks
CVPR 2019
Scalable Semi-Supervised SVM via Triply Stochastic Gradients
IJCAI 2019
Joint Optimization of Tree-based Index and Deep Model for Recommender Systems
NIPS 2019
Spectral Clustering in Heterogeneous Information Networks
AAAI 2019
Inter-Class Angular Loss for Convolutional Neural Networks
AAAI 2019
ConvLab: Multi-Domain End-to-End Dialog System Platform
ACL 2019
A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning
NIPS 2019
Smoothing the Geometry of Probabilistic Box Embeddings
ICLR 2019
Enhancing Low Light Videos by Exploring High Sensitivity Camera Noise
ICCV 2019
Arbicon-Net: Arbitrary Continuous Geometric Transformation Networks for Image Registration
NIPS 2019
Dynamic Feature Fusion for Semantic Edge Detection
IJCAI 2019
Group-Attention Single-Shot Detector (GA-SSD): Finding Pulmonary Nodules in Large-Scale CT Images
MIDL 2019
Quadruply Stochastic Gradients for Large Scale Nonlinear Semi-Supervised AUC Optimization
IJCAI 2019
Shape Robust Text Detection With Progressive Scale Expansion Network
CVPR 2019
Adversarial Metric Learning
IJCAI 2018
Pelee: A Real-Time Object Detection System on Mobile Devices
NIPS 2018
Adversarial Open-World Person Re-Identification
ECCV 2018
Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation
ECCV 2018
Mixed Link Networks
IJCAI 2018
Probabilistic Embedding of Knowledge Graphs with Box Lattice Measures
ACL 2018
Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal
CVPR 2018
Pairwise-Ranking based Collaborative Recurrent Neural Networks for Clinical Event Prediction
IJCAI 2018
Few-Shot Charge Prediction with Discriminative Legal Attributes
COLING 2018
Faster Training Algorithms for Structured Sparsity-Inducing Norm
IJCAI 2018
Joint Intensity and Spatial Metric Learning for Robust Gait Recognition
CVPR 2017
Commonsense Knowledge Base Completion
ACL 2016
LightRNN: Memory and Computation-Efficient Recurrent Neural Networks
NIPS 2016
StalemateBreaker: A Proactive Content-Introducing Approach to Automatic Human-Computer Conversation
IJCAI 2016
Top-Push Video-Based Person Re-Identification
CVPR 2016
Data Sparseness in Linear SVM
IJCAI 2015
Tackling Sparsity, the Achilles Heel of Social Networks: Language Model Smoothing via Social Regularization
ACL 2015
Tackling Sparsity, the Achilles Heel of Social Networks: Language Model Smoothing via Social Regularization
IJCNLP 2015
Partial Person Re-Identification
ICCV 2015
Multi-Scale Learning for Low-Resolution Person Re-Identification
ICCV 2015
Iterative Transformation of Annotation Guidelines for Constituency Parsing
ACL 2013