conftrace_

Xiang Li

317 papers · 2013–2026 · 26 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+18 more ↓

🗺️ Taxonomy Completionist (48) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🐣 Hot Topic Early Bird

🌈 Renaissance Researcher (7) 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (7) 🏠 Conference Loyalist (41) 🤝 Dynamic Duo (40) 👑 Triple Crown 🏆 Keyword Champion (2) 🏆 Grand Slam 👥 Mega-Team (71) 🔬 Deep Specialist (35) 🧬 Topic Evolution 🚀 Conference Pioneer ⚡ Prolific Year (66) 🔥 Unstoppable (11) 🗃️ Keyword Collector (154) 💎 Century Club (296) 📈 Trend Setter ❓ The Questioner (4)

Conferences

AAAI (43) NIPS (41) ACL (36) CVPR (25) EMNLP (23) IJCAI (18) ICLR (18) COLING (17) ICCV (17) MICCAI (14) ECCV (11) ICML (10) INTERSPEECH (9) NAACL (8) IJCNLP (5) WACV (5) NSDI (4) AISTATS (3) MIDL (2) AACL (2) EACL (1) CORL (1) COLT (1) JMLR (1) SEMEVAL (1) UAI (1)

Top co-authors

Jian Yang (42) Ming Gao (17) Bhiksha Raj (17) Jun Li (15) Quanzheng Li (12) Bin Wang (11) Qiushi Sun (10) Ming-Ming Cheng (10) Shuo Chen (9) Zhihua Zhang (9)

Research topics

Computer Vision (1) Optimization & Theory (1)

Keywords

large language model (29) object detection (18) knowledge distillation (17) graph neural network (15) contrastive learning (15) multimodal learning (14) model compression (11) neural network (10) representation learning (9) diffusion model (9) semi-supervised learning (8) attention mechanism (8) few-shot learning (7) self-supervised learning (7) unsupervised learning (7) text classification (7) convolutional neural network (7) transfer learning (7) zero-shot learning (6) reinforcement learning (6)

Papers

SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection AAAI 2026 TCoT: Trajectory Chain-of-Thoughts for Robotic Manipulation with Failure Recovery in Vision-Language-Action Model AAAI 2026 Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models AAAI 2026 Unsupervised Text Style Transfer for Controllable Intensity EACL 2026 Ego-PMOVE: Prompt-aware Mixture of View Experts Network for Egocentric Gaze Prediction AAAI 2026 DenoDet V2: Phase-Amplitude Cross Denoising for SAR Object Detection AAAI 2026 LPPG-RL: Lexicographically Projected Policy Gradient Reinforcement Learning with Subproblem Exploration AAAI 2026 TTT-UNet: Enhancing U-Net with Test-Time Training Layers for Biomedical Image Segmentation MIDL 2026 Human Cognition Inspired RAG with Knowledge Graph for Complex Problem Solving AAAI 2026 Beyond Adapter Retrieval: Latent Geometry-Preserving Composition via Sparse Task Projection AAAI 2026 RATE: Reviewer Profiling and Annotation-free Training for Expertise Ranking in Peer Review Systems ACL 2026 Community-Aware Assessment of Social Textual Engagement and Resonance: A Human-Centric Perspective on User-Generated Content Evaluation ACL 2026 Analyzing and Internalizing Complex Policy Documents for LLM Agents ACL 2026 FinKario: Event-Enhanced Automated Construction of Financial Knowledge Graph ACL 2026 Efficient Transcoder Adaptation for Fine-Tuned Models: Revealing Medical Reasoning Mechanisms in Large Language Models AAAI 2026 Analyze–Compose–Execute: A Dynamic Dialogue Framework for Multi-Agent Debate AAAI 2026 GigaMoE: Sparsity-Guided Mixture of Experts for Efficient Gigapixel Object Detection AAAI 2026 Multiplex Heterogeneous Graph Neural Networks with Euclidean-Riemannian Mutual Space Synergy AAAI 2026 Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection AAAI 2026 SpatioTemporal Difference Network for Video Depth Super-Resolution AAAI 2026 GeoBayes: Probabilistic Image Geo-Localization Inference via Sequential Bayesian Updating AAAI 2026 Preference Adaptive and Sequential Text-to-Image Generation ICML 2025 Distribution-aware Fairness Learning in Medical Image Segmentation From A Control-Theoretic Perspective ICML 2025 Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video ICLR 2025 XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser COLING 2025 LogiGraph: Logical Reasoning with Contrastive Learning and Lightweight Graph Networks COLING 2025 SKIntern: Internalizing Symbolic Knowledge for Distilling Better CoT Capabilities into Small Language Models COLING 2025 Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs COLING 2025 Explain-Analyze-Generate: A Sequential Multi-Agent Collaboration Method for Complex Reasoning COLING 2025 Impromptu Cybercrime Euphemism Detection COLING 2025 InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption CVPR 2025 Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving ICLR 2025 PRDetect: Perturbation-Robust LLM-generated Text Detection Based on Syntax Tree NAACL 2025 PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization NAACL 2025 SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models ICCV 2025 Advancing Textual Prompt Learning with Anchored Attributes ICCV 2025 SAMed-2: Selective Memory Enhanced Medical Segment Anything Model MICCAI 2025 MS-IQA: A Multi-Scale Feature Fusion Network for PET/CT Image Quality Assessment MICCAI 2025 MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts MICCAI 2025 LVPNet: A Latent-variable-based Prediction-driven End-to-end Framework for Lossless Compression of Medical Images MICCAI 2025 Mitigating Spurious Correlations via Counterfactual Contrastive Learning EMNLP 2025 DeMAC: Enhancing Multi-Agent Coordination with Dynamic DAG and Manager-Player Feedback EMNLP 2025 Permitted Knowledge Boundary: Evaluating the Knowledge-Constrained Responsiveness of Large Language Models EMNLP 2025 SGCD: Subtask-Guided Causal-Debiasing Framework for Robust Cross-Utterance Sentiment Quadruple Extraction in Dialogues EMNLP 2025 TF-Mamba: Text-enhanced Fusion Mamba with Missing Modalities for Robust Multimodal Sentiment Analysis EMNLP 2025 CAARMA: Class Augmentation with Adversarial Mixup Regularization EMNLP 2025 ASD-iLLM:An Intervention Large Language Model for Autistic Children based on Real Clinical Dialogue Intervention Dataset EMNLP 2025 Multimodal Document-level Triple Extraction via Dynamic Graph Enhancement and Relation-Aware Reflection EMNLP 2025 SEAGraph: Unveiling the Whole Story of Paper Review Comments IJCNLP 2025 Backdoor Attacks on Neural Networks via One-Bit Flip ICCV 2025 Not All Layers of LLMs Are Necessary During Inference IJCAI 2025 From Words to Worth: Newborn Article Impact Prediction with LLM AAAI 2025 Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection AAAI 2025 Hierarchically Controlled Deformable 3D Gaussians for Talking Head Synthesis AAAI 2025 Leveraging Large Language Models for Node Generation in Few-Shot Learning on Text-Attributed Graphs AAAI 2025 Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent AAAI 2025 TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning AAAI 2025 Every Opinion Matters: Evaluating and Building Models with Pluralistic Views AAAI 2025 LLMsPark: A Benchmark for Evaluating Large Language Models in Strategic Gaming Contexts EMNLP 2025 SEAGraph: Unveiling the Whole Story of Paper Review Comments AACL 2025 Multi-Modal Large Language Model with RAG Strategies in Soccer Commentary Generation WACV 2025 UniTMGE: Uniform Text-Motion Generation and Editing Model via Diffusion WACV 2025 GroundingMate: Aiding Object Grounding for Goal-Oriented Vision-and-Language Navigation WACV 2025 MaskDGNN: Self-Supervised Dynamic Graph Neural Networks with Activeness-aware Temporal Masking IJCAI 2025 Corruption-Robust Variance-aware Algorithms for Generalized Linear Bandits under Heavy-tailed Rewards UAI 2025 Holmes: Localizing Irregularities in LLM Training with Mega-scale GPU Clusters NSDI 2025 Text Detoxification: Data Efficiency, Semantic Preservation and Model Generalization EMNLP 2025 Can Large Language Models Act as Ensembler for Multi-GNNs? EMNLP 2025 Cascaded 3D Diffusion Models for Whole-body 3D 18-F FDG PET/CT synthesis from Demographics MICCAI 2025 Leveraging Diffusion Models for Continual Test-Time Adaptation in Fundus Image Classification MICCAI 2025 Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties CVPR 2025 Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data ICLR 2025 ECHOPulse: ECG Controlled Echocardio-gram Video Generation ICLR 2025 LLaRA: Supercharging Robot Learning Data for Vision-Language Policy ICLR 2025 Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting ICLR 2025 DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing ICCV 2025 Multi-level Relevance Document Identifier Learning for Generative Retrieval ACL 2025 Demystifying Small Language Models for Edge Deployment ACL 2025 Initializing and Retrofitting Key-Value Adaptors for Traceable Model Editing ACL 2025 A Survey of LLM-based Agents in Medicine: How far are we from Baymax? ACL 2025 Let’s Be Self-generated via Step by Step: A Curriculum Learning Approach to Automated Reasoning with Large Language Models ACL 2025 See the World, Discover Knowledge: A Chinese Factuality Evaluation for Large Vision Language Models ACL 2025 Enhancing LLM-based Hatred and Toxicity Detection with Meta-Toxic Knowledge Graph ACL 2025 RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark CVPR 2025 Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation CVPR 2025 OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation ICLR 2025 Unlocking ECMP Programmability for Precise Traffic Control NSDI 2025 SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer CVPR 2025 HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction CVPR 2025 ImageFolder: Autoregressive Image Generation with Folded Tokens ICLR 2025 Understanding Long Videos with Multimodal Language Models ICLR 2025 Rethinking Point Cloud Data Augmentation: Topologically Consistent Deformation ICML 2025 Masked Autoencoders Are Effective Tokenizers for Diffusion Models ICML 2025 Hallucination Index: An Image Quality Metric for Generative Reconstruction Models MICCAI 2024 AG-LSEC: Audio Grounded Lexical Speaker Error Correction INTERSPEECH 2024 Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization NIPS 2024 A General Framework for Learning from Weak Supervision ICML 2024 Position: TrustLLM: Trustworthiness in Large Language Models ICML 2024 Completing Visual Objects via Bridging Generation and Segmentation ICML 2024 Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization INTERSPEECH 2024 RisQNet: Rescuing SMEs from Financial Shocks with a Novel Networked-Loan Risk Assessment IJCAI 2024 No Regularization Is Needed: Efficient and Effective Incomplete Label Distribution Learning IJCAI 2024 UniAudio 1.5: Large Language Model-Driven Audio Codec is A Few-Shot Audio Task Learner NIPS 2024 Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure NIPS 2024 Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations NIPS 2024 DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain NIPS 2024 Cross-model Control: Improving Multiple Large Language Models in One-time Training NIPS 2024 Memory-Efficient Gradient Unrolling for Large-Scale Bi-level Optimization NIPS 2024 Biomedical Visual Instruction Tuning with Clinician Preference Alignment NIPS 2024 Suitable is the Best: Task-Oriented Knowledge Fusion in Vulnerability Detection NIPS 2024 Slight Corruption in Pre-training Data Makes Better Diffusion Models NIPS 2024 SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection NIPS 2024 3DCoMPaT200: Language Grounded Large-Scale 3D Vision Dataset for Compositional Recognition NIPS 2024 Novel Object Synthesis via Adaptive Text-Image Harmony NIPS 2024 In-Hand 3D Object Reconstruction from a Monocular RGB Video AAAI 2024 DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection AAAI 2024 AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization AAAI 2024 Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders WACV 2024 Boosting Language Models Reasoning with Chain-of-Knowledge Prompting ACL 2024 KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction ACL 2024 Fine-Grained Image-Text Alignment in Medical Imaging Enables Explainable Cyclic Image-Report Generation ACL 2024 Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering ACL 2024 Visual In-Context Learning for Large Vision-Language Models ACL 2024 Parameter-Agnostic Optimization under Relaxed Smoothness AISTATS 2024 AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework COLING 2024 Conjoin after Decompose: Improving Few-Shot Performance of Named Entity Recognition COLING 2024 Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives COLING 2024 MMAD:Multi-modal Movie Audio Description COLING 2024 MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts COLING 2024 Structure-aware Fine-tuning for Code Pre-trained Models COLING 2024 TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills COLING 2024 MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs NSDI 2024 CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models NAACL 2024 Beyond Read-Only: Crafting a Comprehensive Chinese Text-to-SQL Dataset for Database Manipulation and Query NAACL 2024 Planning and Editing What You Retrieve for Enhanced Tool Learning NAACL 2024 AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition NAACL 2024 CrossKD: Cross-Head Knowledge Distillation for Object Detection CVPR 2024 QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition CVPR 2024 VA3: Virtually Assured Amplification Attack on Probabilistic Copyright Protection for Text-to-Image Generative Models CVPR 2024 PromptKD: Unsupervised Prompt Distillation for Vision-Language Models CVPR 2024 Volumetric Conditional Score-based Residual Diffusion Model for PET/MR Denoising MICCAI 2024 R^2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations ECCV 2024 Uni3DL: A Unified Model for 3D Vision-Language Understanding ECCV 2024 Cascade Prompt Learning for Visual-Language Model Adaptation ECCV 2024 Distilling Knowledge from Large-Scale Image Models for Object Detection ECCV 2024 VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding NIPS 2024 Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning NIPS 2024 Achieving Near-Optimal Convergence for Distributed Minimax Optimization with Adaptive Stepsizes NIPS 2024 Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis EMNLP 2024 Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting MICCAI 2024 F2TNet: FMRI to T1w MRI Knowledge Transfer Network for Brain Multi-phenotype Prediction MICCAI 2024 Diffusion-Enhanced Transformation Consistency Learning for Retinal Image Segmentation MICCAI 2024 CryoSAM: Training-free CryoET Tomogram Segmentation with Foundation Models MICCAI 2024 Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction MICCAI 2024 Cache-Driven Spatial Test-Time Adaptation for Cross-Modality Medical Image Segmentation MICCAI 2024 A Random Projection Approach to Personalized Federated Learning: Enhancing Communication Efficiency, Robustness, and Fairness JMLR 2024 Training-free Multi-objective Diffusion Model for 3D Molecule Generation ICLR 2024 MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models ICLR 2024 Decoding Natural Images from EEG for Object Recognition ICLR 2024 Creative Birds: Self-Supervised Single-View 3D Style Transfer ICCV 2023 Contact2Grasp: 3D Grasp Synthesis via Hand-Object Contact Constraint IJCAI 2023 Multi-Target Semantic Parsing with Collaborative Deliberation Network IJCNLP 2023 The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features INTERSPEECH 2023 Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction INTERSPEECH 2023 Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text EMNLP 2023 Exploring All-In-One Knowledge Distillation Framework for Neural Machine Translation EMNLP 2023 OssCSE: Overcoming Surface Structure Bias in Contrastive Learning for Unsupervised Sentence Embedding EMNLP 2023 DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models EMNLP 2023 Pass-Tuning: Towards Structure-Aware Parameter-Efficient Tuning for Code Representation Learning EMNLP 2023 Uncertainty-aware Parameter-Efficient Self-training for Semi-supervised Language Understanding EMNLP 2023 Evaluating and Enhancing the Robustness of Code Pre-trained Models through Structure-Aware Adversarial Samples Generation EMNLP 2023 In-Image Neural Machine Translation with Segmented Pixel Sequence-to-Sequence Model EMNLP 2023 Near-optimal Policy Identification in Active Reinforcement Learning ICLR 2023 Ranking-Enhanced Unsupervised Sentence Representation Learning ACL 2023 Exploring Better Text Image Translation with Multimodal Codebook ACL 2023 Multi-Target Semantic Parsing with Collaborative Deliberation Network AACL 2023 PGSS: Pitch-Guided Speech Separation AAAI 2023 Decision-Making Context Interaction Network for Click-Through Rate Prediction AAAI 2023 Structure Flow-Guided Network for Real Depth Super-resolution AAAI 2023 Recurrent Structure Attention Guidance for Depth Super-resolution AAAI 2023 DesNet: Decomposed Scale-Consistent Network for Unsupervised Depth Completion AAAI 2023 Curriculum Temperature for Knowledge Distillation AAAI 2023 LWSIS: LiDAR-Guided Weakly Supervised Instance Segmentation for Autonomous Driving AAAI 2023 Panoramic Video Salient Object Detection with Ambisonic Audio Guidance AAAI 2023 Distortion and Uncertainty Aware Loss for Panoramic Depth Completion ICML 2023 Compositional Zero-Shot Artistic Font Synthesis IJCAI 2023 Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic Model INTERSPEECH 2023 Explaining Temporal Graph Models through an Explorer-Navigator Framework ICLR 2023 TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax Optimization ICLR 2023 DFRD: Data-Free Robustness Distillation for Heterogeneous Federated Learning NIPS 2023 Causally-Aware Intraoperative Imputation for Overall Survival Time Prediction CVPR 2023 MoStGAN-V: Video Generation With Temporal Motion Styles CVPR 2023 GradMA: A Gradient-Memory-Based Accelerated Federated Learning With Alleviated Catastrophic Forgetting CVPR 2023 ADNet: Lane Shape Prediction via Anchor Decomposition ICCV 2023 Video State-Changing Object Segmentation ICCV 2023 HopFIR: Hop-wise GraphFormer with Intragroup Joint Refinement for 3D Human Pose Estimation ICCV 2023 Robust Referring Video Object Segmentation with Cyclic Structural Consensus ICCV 2023 A Unified Solution for Privacy and Communication Efficiency in Vertical Federated Learning NIPS 2023 LD2: Scalable Heterophilous Graph Neural Network with Decoupled Embeddings NIPS 2023 PaintSeg: Painting Pixels for Training-free Segmentation NIPS 2023 A Statistical Analysis of Polyak-Ruppert Averaged Q-Learning AISTATS 2023 Statistical Analysis of Karcher Means for Random Restricted PSD Matrices AISTATS 2023 The Xiaomi AI Lab’s Speech Translation Systems for IWSLT 2023 Offline Task, Simultaneous Task and Speech-to-Speech Task ACL 2023 When Gradient Descent Meets Derivative-Free Optimization: A Match Made in Black-Box Scenario ACL 2023 S3HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering ACL 2023 FishNet: A Large-scale Dataset and Benchmark for Fish Recognition, Detection, and Functional Trait Prediction ICCV 2023 Learning to Compress Prompts with Gist Tokens NIPS 2023 Fine-Grained Visual Prompting NIPS 2023 YouTubePD: A Multimodal Benchmark for Parkinson’s Disease Analysis NIPS 2023 Two Sides of One Coin: the Limits of Untuned SGD and the Power of Adaptive Methods NIPS 2023 Large Selective Kernel Network for Remote Sensing Object Detection ICCV 2023 DLGSANet: Lightweight Dynamic Local and Global Self-Attention Networks for Image Super-Resolution ICCV 2023 Weakly Supervised Text Classification using Supervision Signals from a Language Model NAACL 2022 Diffusion-LM Improves Controllable Text Generation NIPS 2022 RecursiveMix: Mixed Learning with History NIPS 2022 DTG-SSOD: Dense Teacher Guidance for Semi-Supervised Object Detection NIPS 2022 Nest Your Adaptive Algorithm for Parameter-Agnostic Nonconvex Minimax Optimization NIPS 2022 Personalized Federated Learning towards Communication Efficiency, Robustness and Fairness NIPS 2022 Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels? NIPS 2022 Asymptotic Behaviors of Projected Stochastic Approximation: A Jump Diffusion Perspective NIPS 2022 TRITON: Neural Neural Textures for Better Sim2Real CORL 2022 Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-Guided Feature Imitation AAAI 2022 Hybrid Instance-Aware Temporal Fusion for Online Video Instance Segmentation AAAI 2022 JointCL: A Joint Contrastive Learning Framework for Zero-Shot Stance Detection ACL 2022 Multi-Modal Sarcasm Detection via Cross-Modal Graph Convolutional Network ACL 2022 A Neural Network Architecture for Program Understanding Inspired by Human Behaviors ACL 2022 Lexical Knowledge Internalization for Neural Dialog Generation ACL 2022 The Xiaomi Text-to-Text Simultaneous Speech Translation System for IWSLT 2022 ACL 2022 Answering Numerical Reasoning Questions in Table-Text Hybrid Contents with Graph-based Encoder and Tree-based Decoder COLING 2022 CofeNet: Context and Former-Label Enhanced Net for Complicated Quotation Extraction COLING 2022 Towards Robust Neural Machine Translation with Iterative Scheduled Data-Switch Training COLING 2022 Statistical Estimation and Online Inference via Local SGD COLT 2022 Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal Information CVPR 2022 Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion ECCV 2022 PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection ECCV 2022 RigNet: Repetitive Image Guided Network for Depth Completion ECCV 2022 StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning ECCV 2022 Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding EMNLP 2022 CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure EMNLP 2022 Detecting Relevant Differences Between Similar Legal Texts EMNLP 2022 Finding Global Homophily in Graph Neural Networks When Meeting Heterophily ICML 2022 CGMN: A Contrastive Graph Matching Network for Self-Supervised Graph Similarity Learning IJCAI 2022 RAW-GNN: RAndom Walk Aggregation based Graph Neural Network IJCAI 2022 Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis INTERSPEECH 2022 Towards Cross-speaker Reading Style Transfer on Audiobook Dataset INTERSPEECH 2022 CALM: Constrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis INTERSPEECH 2022 BIT-Xiaomi’s System for AutoSimTrans 2022 NAACL 2022 Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions ICCV 2021 Regularizing Nighttime Weirdness: Efficient Self-Supervised Monocular Depth Estimation in the Dark ICCV 2021 HITSZ-HLT at SemEval-2021 Task 5: Ensemble Sequence Labeling and Span Boundary Detection for Toxic Span Detection IJCNLP 2021 The Image Local Autoregressive Transformer NIPS 2021 Reinforcement Learning Enhanced Explainer for Graph Neural Networks NIPS 2021 Towards Multi-Scale Style Control for Expressive Speech Synthesis INTERSPEECH 2021 Capturing Delayed Feedback in Conversion Rate Prediction via Elapsed-Time Sampling AAAI 2021 Improving Tree-Structured Decoder Training for Code Generation via Mutual Learning AAAI 2021 Real-Time Gait-Based Age Estimation and Gender Classification From a Single Image WACV 2021 Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation ACL 2021 HITSZ-HLT at SemEval-2021 Task 5: Ensemble Sequence Labeling and Span Boundary Detection for Toxic Span Detection SEMEVAL 2021 HITSZ-HLT at SemEval-2021 Task 5: Ensemble Sequence Labeling and Span Boundary Detection for Toxic Span Detection ACL 2021 Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection CVPR 2021 Communication-Efficient Distributed SVD via Local Power Iterations ICML 2021 Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval ICLR 2021 Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation IJCNLP 2021 Improving One-Shot NAS by Suppressing the Posterior Fading CVPR 2020 On the Convergence of FedAvg on Non-IID Data ICLR 2020 Gait Recognition from a Single Image using a Phase-Aware Gait Cycle Reconstruction Network ECCV 2020 Gait Recognition via Semi-supervised Disentangled Representation Learning to Identity and Covariate Features CVPR 2020 Few-Shot Learning of Part-Specific Probability Space for 3D Shape Segmentation CVPR 2020 FSS-1000: A 1000-Class Dataset for Few-Shot Segmentation CVPR 2020 Scalog: Seamless Reconfiguration and Total Order in a Scalable Shared Log NSDI 2020 Xiaomi’s Submissions for IWSLT 2020 Open Domain Translation Task ACL 2020 Modeling Discourse Structure for Document-level Neural Machine Translation ACL 2020 ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems ACL 2020 Safe Sample Screening for Robust Support Vector Machine AAAI 2020 Quadruply Stochastic Gradient Method for Large Scale Nonlinear Semi-Supervised Ordinal Regression AUC Optimization AAAI 2020 Do Subsampled Newton Methods Work for High-Dimensional Data? AAAI 2020 Understanding the Disharmony between Weight Normalization Family and Weight Decay AAAI 2020 Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection NIPS 2020 Neuron-level Structured Pruning using Polarization Regularizer NIPS 2020 Improving Local Identifiability in Probabilistic Box Embeddings NIPS 2020 An Iterative Multi-Source Mutual Knowledge Transfer Framework for Machine Reading Comprehension IJCAI 2020 Understanding the Disharmony Between Dropout and Batch Normalization by Variance Shift CVPR 2019 Selective Kernel Networks CVPR 2019 Scalable Semi-Supervised SVM via Triply Stochastic Gradients IJCAI 2019 Joint Optimization of Tree-based Index and Deep Model for Recommender Systems NIPS 2019 Spectral Clustering in Heterogeneous Information Networks AAAI 2019 Inter-Class Angular Loss for Convolutional Neural Networks AAAI 2019 ConvLab: Multi-Domain End-to-End Dialog System Platform ACL 2019 A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning NIPS 2019 Smoothing the Geometry of Probabilistic Box Embeddings ICLR 2019 Enhancing Low Light Videos by Exploring High Sensitivity Camera Noise ICCV 2019 Arbicon-Net: Arbitrary Continuous Geometric Transformation Networks for Image Registration NIPS 2019 Dynamic Feature Fusion for Semantic Edge Detection IJCAI 2019 Group-Attention Single-Shot Detector (GA-SSD): Finding Pulmonary Nodules in Large-Scale CT Images MIDL 2019 Quadruply Stochastic Gradients for Large Scale Nonlinear Semi-Supervised AUC Optimization IJCAI 2019 Shape Robust Text Detection With Progressive Scale Expansion Network CVPR 2019 Adversarial Metric Learning IJCAI 2018 Pelee: A Real-Time Object Detection System on Mobile Devices NIPS 2018 Adversarial Open-World Person Re-Identification ECCV 2018 Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation ECCV 2018 Mixed Link Networks IJCAI 2018 Probabilistic Embedding of Knowledge Graphs with Box Lattice Measures ACL 2018 Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal CVPR 2018 Pairwise-Ranking based Collaborative Recurrent Neural Networks for Clinical Event Prediction IJCAI 2018 Few-Shot Charge Prediction with Discriminative Legal Attributes COLING 2018 Faster Training Algorithms for Structured Sparsity-Inducing Norm IJCAI 2018 Joint Intensity and Spatial Metric Learning for Robust Gait Recognition CVPR 2017 Commonsense Knowledge Base Completion ACL 2016 LightRNN: Memory and Computation-Efficient Recurrent Neural Networks NIPS 2016 StalemateBreaker: A Proactive Content-Introducing Approach to Automatic Human-Computer Conversation IJCAI 2016 Top-Push Video-Based Person Re-Identification CVPR 2016 Data Sparseness in Linear SVM IJCAI 2015 Tackling Sparsity, the Achilles Heel of Social Networks: Language Model Smoothing via Social Regularization ACL 2015 Tackling Sparsity, the Achilles Heel of Social Networks: Language Model Smoothing via Social Regularization IJCNLP 2015 Partial Person Re-Identification ICCV 2015 Multi-Scale Learning for Low-Resolution Person Re-Identification ICCV 2015 Iterative Transformation of Annotation Guidelines for Constituency Parsing ACL 2013