Xin Wang
343 papers · 2009–2026 · 26 top CS/AI conferences
Achievements
Taxonomy Completionist (50)
Keyword Pioneer
Hot Topic Early Bird
Academic Marathon (16)
Renaissance Researcher (9)
Interdisciplinary Bridge
Conference Loyalist (32)
Keyword Trendsetter Combo (3)
Dynamic Duo (52)
Triple Crown
Keyword Champion
Grand Slam
Deep Specialist (45)
The Questioner (9)
Trend Setter
Conference Pioneer
Unstoppable (13)
Prolific Year (29)
Century Club (320)
Keyword Collector (148)
Conferences
AAAI (45)
NIPS (35)
CVPR (35)
INTERSPEECH (31)
ICML (30)
ACL (30)
EMNLP (22)
IJCAI (20)
ICCV (18)
ECCV (11)
ICLR (11)
NAACL (10)
MICCAI (10)
IJCNLP (8)
EACL (5)
COLING (5)
AISTATS (3)
ACML (2)
JMLR (2)
AACL (2)
NSDI (2)
WACV (2)
CORL (1)
MIDL (1)
OSDI (1)
UAI (1)
Keywords
graph neural network (30)
large language model (23)
multimodal learning (17)
representation learning (16)
self-supervised learning (15)
contrastive learning (13)
neural architecture search (12)
few-shot learning (11)
neural network (11)
vision-language model (10)
speaker verification (10)
zero-shot learning (10)
domain adaptation (9)
transfer learning (9)
reinforcement learning (9)
diffusion model (8)
curriculum learning (7)
attention mechanism (7)
model compression (7)
vision-language navigation (7)
Papers
U2UData+: A Scalable Swarm UAVs Autonomous Flight Dataset for Embodied Long-horizon Tasks
AAAI 2026
A-ADAPT: Adaptive Intracranial Artery Segmentation with Morphology-Guided Prompts and Difficulty-Aware Learning
MIDL 2026
Unleashing Low-Bit Inference on Ascend NPUs: A Comprehensive Evaluation of HiFloat Formats
ACL 2026
BlindGuard: Safeguarding LLM-based Multi-Agent Systems under Unknown Attacks
ACL 2026
When Efficiency Becomes a Vulnerability: Computational Cost Attacks on WebAgents
ACL 2026
The Retrieval Bottleneck: Scaling Laws for Reinforcement Learning in RAG
ACL 2026
Probing the Safety Robustness of LLMs in Latent Space
ACL 2026
Is the Attention Matrix Really the Key to Self-Attention in Multivariate Long-Term Time Series Forecasting?
ACL 2026
LAFaCT: Attribution-based Localization and Focused Sequential Analysis of Fact-Critical Tokens for Hallucination Detection
ACL 2026
Don't Click That: Teaching Web Agents to Resist Deceptive Interfaces
ACL 2026
SMART: A Surrogate Model for Predicting Application Runtime in Dragonfly Systems
AAAI 2026
A Causal Target for Learning to Defer Under Hidden Confounding
AAAI 2026
Dual Mamba for Node-Specific Representation Learning: Tackling Over-Smoothing with Selective State Space Modeling
AAAI 2026
rMMEA: Robust Multi-Modal Entity Alignment with Missing and Noise Visual Modality
AAAI 2026
Inference Scaling Law for Retrieval Augmented Generation
AAAI 2026
Cross-Scale Collaboration between LLMs and Lightweight Sequential Recommenders with Domain-Specific Latent Reasoning
AAAI 2026
Binary Message Passing for Generalizable Semi-Supervised Graph Anomaly Detection
AAAI 2026
HyperD: Hybrid Periodicity Decoupling Framework for Traffic Forecasting
AAAI 2026
Scalable Semi-supervised Community Search via Graph Transformer on Attributed Heterogeneous Information Networks
AAAI 2026
Selective Diffusion Distillation for Real-World High-Scale Image Super-Resolution
AAAI 2026
LUMIN: A Longitudinal Multi-modal Knowledge Decomposition Network for Predicting Breast Cancer Recurrence
AAAI 2026
BuildingWorld: A Structured 3D Building Dataset for Urban Foundation Models
AAAI 2026
DpDNet: An Dual-Prompt-Driven Network for Universal PET-CT Segmentation
MICCAI 2025
SCALM: Detecting Bad Practices in Smart Contracts Through LLMs
AAAI 2025
Identity-Text Video Corpus Grounding
AAAI 2025
ConDSeg: A General Medical Image Segmentation Framework via Contrast-Driven Feature Enhancement
AAAI 2025
Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM
AAAI 2025
Behavior Importance-Aware Graph Neural Architecture Search for Cross-Domain Recommendation
AAAI 2025
Adaptive Dual Guidance Knowledge Distillation
AAAI 2025
Improving Generalization for AI-Synthesized Voice Detection
AAAI 2025
JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration
AAAI 2025
Set-Valued Sensitivity Analysis of Deep Neural Networks
AAAI 2025
MEIT: Multimodal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation
ACL 2025
MERIT: Multi-Agent Collaboration for Unsupervised Time Series Representation Learning
ACL 2025
Generation-Augmented and Embedding Fusion in Document-Level Event Argument Extraction
COLING 2025
Fusion meets Function: The Adaptive Selection-Generation Approach in Event Argument Extraction
COLING 2025
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
CVPR 2025
FIRE: Robust Detection of Diffusion-Generated Images via Frequency-Guided Reconstruction Error
CVPR 2025
TAPT: Test-Time Adversarial Prompt Tuning for Robust Inference in Vision-Language Models
CVPR 2025
Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
CVPR 2025
Complementary Advantages: Exploiting Cross-Field Frequency Correlation for NIR-Assisted Image Denoising
CVPR 2025
MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation
CVPR 2025
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
CVPR 2025
Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems
EMNLP 2025
CrystalICL: Enabling In-Context Learning for Crystal Generation
EMNLP 2025
A Sequential Multi-Stage Approach for Code Vulnerability Detection via Confidence- and Collaboration-based Decision Making
EMNLP 2025
NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls
EMNLP 2025
NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration
ICCV 2025
Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection
ICCV 2025
TopicGeo: An Efficient Unified Framework for Geolocation
ICCV 2025
From Abyssal Darkness to Blinding Glare: A Benchmark on Extreme Exposure Correction in Real World
ICCV 2025
$\text{D}_{2}\text{O}$: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models
ICLR 2025
Trajectory-LLM: A Language-based Data Generator for Trajectory Prediction in Autonomous Driving
ICLR 2025
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression
ICLR 2025
Unifying Unsupervised Graph-Level Anomaly Detection and Out-of-Distribution Detection: A Benchmark
ICLR 2025
Implicit degree bias in the link prediction task
ICML 2025
Differentiable Structure Learning with Ancestral Constraints
ICML 2025
AutoGFM: Automated Graph Foundation Model with Adaptive Architecture Customization
ICML 2025
Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction Tuning
ICML 2025
3D-LMVIC: Learning-based Multi-View Image Compression with 3D Gaussian Geometric Priors
ICML 2025
Self-supervised Masked Graph Autoencoder via Structure-aware Curriculum
ICML 2025
Disentangling Invariant Subgraph via Variance Contrastive Estimation under Distribution Shifts
ICML 2025
ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals
ICML 2025
Modularized Self-Reflected Video Reasoner for Multimodal LLM with Application to Video Question Answering
ICML 2025
Variational Counterfactual Intervention Planning to Achieve Target Outcomes
ICML 2025
Predictive Performance of Deep Quantum Data Re-uploading Models
ICML 2025
Preserving AUC Fairness in Learning with Noisy Protected Groups
ICML 2025
RLMiniStyler: Light-weight RL Style Agent for Arbitrary Sequential Neural Style Generation
IJCAI 2025
Adversarial Propensity Weighting for Debiasing in Collaborative Filtering
IJCAI 2025
Mamba-Based Graph Convolutional Networks: Tackling Over-smoothing with Selective State Space
IJCAI 2025
Latte: Transfering LLMs' Latent-level Knowledge for Few-shot Tabular Learning
IJCAI 2025
Enhancing Counterfactual Estimation: A Focus on Temporal Treatments
IJCAI 2025
FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization
IJCAI 2025
Dyn-D^2P: Dynamic Differentially Private Decentralized Learning with Provable Utility Guarantee
IJCAI 2025
LLM-based Business Process Models Generation from Textual Descriptions
IJCNLP 2025
Edge-Aware Hierarchical Graph Transformer to Decode Brain Arterial Network
MICCAI 2025
GrInAdapt: Source-free Multi-Target Domain Adaptation for Retinal Vessel Segmentation
MICCAI 2025
RefineNet: Elevating Medical Foundation Models through Quality-Centric Data Curation by MLLM-Annotated Proxy Distillation
MICCAI 2025
MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
NAACL 2025
SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression
NAACL 2025
Texture Shape and Order Matter: A New Transformer Design for Sequential DeepFake Detection
WACV 2025
LLM-based Business Process Models Generation from Textual Descriptions
AACL 2025
Exponential Hardness of Optimization from the Locality in Quantum Neural Networks
AAAI 2024
LSSNet: A Method for Colon Polyp Segmentation Based on Local Feature Supplementation and Shallow Feature Supplementation
MICCAI 2024
Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis
INTERSPEECH 2024
Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio
INTERSPEECH 2024
Data-Augmented Curriculum Graph Neural Architecture Search under Distribution Shifts
AAAI 2024
Rethinking Propagation for Unsupervised Graph Domain Adaptation
AAAI 2024
Multimodal Graph Neural Architecture Search under Distribution Shifts
AAAI 2024
Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA
ACL 2024
PokeMQA: Programmable knowledge editing for Multi-hop Question Answering
ACL 2024
Data-Centric Explainable Debiasing for Improving Fairness in Pre-trained Language Models
ACL 2024
Continuous Optical Zooming: A Benchmark for Arbitrary-Scale Image Super-Resolution in Real World
CVPR 2024
Enhancing Video Super-Resolution via Implicit Resampling-based Alignment
CVPR 2024
VTimeLLM: Empower LLM to Grasp Video Moments
CVPR 2024
In2SET: Intra-Inter Similarity Exploiting Transformer for Dual-Camera Compressive Hyperspectral Imaging
CVPR 2024
Molecular Data Programming: Towards Molecule Pseudo-labeling with Systematic Weak Supervision
CVPR 2024
Preserving Fairness Generalization in Deepfake Detection
CVPR 2024
Non-Adversarial Learning: Vector-Quantized Common Latent Space for Multi-Sequence MRI
MICCAI 2024
Ordinal Learning: Longitudinal Attention Alignment Model for Predicting Time to Future Breast Cancer Events from Mammograms
MICCAI 2024
Robustly Optimized Deep Feature Decoupling Network for Fatty Liver Diseases Detection
MICCAI 2024
Gorilla: Large Language Model Connected with Massive APIs
NIPS 2024
Differentiable Structure Learning with Partial Orders
NIPS 2024
Non-asymptotic Approximation Error Bounds of Parameterized Quantum Circuits
NIPS 2024
Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
NIPS 2024
Causal language modeling can elicit search and reasoning capabilities on logic puzzles
NIPS 2024
WelQrate: Defining the Gold Standard in Small Molecule Drug Discovery Benchmarking
NIPS 2024
VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding
NIPS 2024
FUG: Feature-Universal Graph Contrastive Pre-training for Graphs with Diverse Node Features
NIPS 2024
ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models
ACL 2024
Affinity Learning Based Brain Function Representation for Disease Diagnosis
MICCAI 2024
When Do We Not Need Larger Vision Models?
ECCV 2024
Adversarial Prompt Tuning for Vision-Language Models
ECCV 2024
Two-Stage Video Shadow Detection via Temporal-Spatial Adaption
ECCV 2024
Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models
ECCV 2024
PrivSGP-VR: Differentially Private Variance-Reduced Stochastic Gradient Push with Tight Utility Bounds
IJCAI 2024
Self-Supervised Learning for Enhancing Spatial Awareness in Free-Hand Sketches
IJCAI 2024
Mitigate Extrinsic Social Bias in Pre-trained Language Models via Continuous Prompts Adjustment
EMNLP 2024
From Text Segmentation to Enhanced Representation Learning: A Novel Approach to Multi-Label Classification for Long Texts
EMNLP 2024
Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion
EMNLP 2024
FTP: A Human Pose Estimation Method Integrating Temporal and Fine-Grained Feature Fusion
ACML 2024
CurBench: Curriculum Learning Benchmark
ICML 2024
Disentangled Continual Graph Neural Architecture Search with Invariant Modular Supernet
ICML 2024
A Dual-module Framework for Counterfactual Estimation over Time
ICML 2024
Rethinking Independent Cross-Entropy Loss For Graph-Structured Data
ICML 2024
Disentangled Graph Self-supervised Learning for Out-of-Distribution Generalization
ICML 2024
Efficient Sharpness-Aware Minimization for Molecular Graph Transformer Models
ICLR 2024
MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations
ICLR 2024
DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation
ICLR 2024
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios
INTERSPEECH 2024
Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning
NAACL 2024
ComCLIP: Training-Free Compositional Image and Text Matching
NAACL 2024
Speaker Detection by the Individual Listener and the Crowd: Parametric Models Applicable to Bonafide and Deepfake Speech
INTERSPEECH 2024
To what extent can ASV systems naturally defend against spoofing attacks?
INTERSPEECH 2024
Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances
ACL 2024
Improving Neoadjuvant Therapy Response Prediction by Integrating Longitudinal Mammogram Generation with Cross-Modal Radiological Reports: A Vision-Language Alignment-guided Model
MICCAI 2024
Wasserstein Barycenter Matching for Graph Size Generalization of Message Passing Neural Networks
ICML 2023
Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation
EMNLP 2023
R2H: Building Multimodal Navigation Helpers that Respond to Help Requests
EMNLP 2023
Parameter-Efficient Cross-lingual Transfer of Vision and Language Models via Translation-based Alignment
EMNLP 2023
AutoGT: Automated Graph Transformer Architecture Search
ICLR 2023
Decouple Before Interact: Multi-Modal Prompt Learning for Continual Visual Question Answering
ICCV 2023
HDG-ODE: A Hierarchical Continuous-Time Model for Human Pose Forecasting
ICCV 2023
HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World
ICCV 2023
Prompt Tuning Pushes Farther, Contrastive Learning Pulls Closer: A Two-Stage Approach to Mitigate Social Biases
ACL 2023
Scaling Novel Object Detection With Weakly Supervised Detection Transformers
WACV 2023
Statistical Analysis of Quantum State Learning Process in Quantum Neural Networks
NIPS 2023
Multi-task Graph Neural Architecture Search with Task-aware Collaboration and Curriculum
NIPS 2023
Understanding Zero-shot Adversarial Robustness for Large-Scale Models
ICLR 2023
Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms
INTERSPEECH 2023
SMART: A High-Performance Adaptive Radix Tree for Disaggregated Memory
OSDI 2023
Large Language Models with Controllable Working Memory
ACL 2023
Curriculum Graph Machine Learning: A Survey
IJCAI 2023
Controlling Neural Style Transfer with Deep Reinforcement Learning
IJCAI 2023
Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts
NIPS 2023
Multimodal Graph Transformer for Multimodal Question Answering
EACL 2023
Towards Single Integrated Spoofing-aware Speaker Verification Embeddings
INTERSPEECH 2023
Range-Based Equal Error Rate for Spoof Localization
INTERSPEECH 2023
Emotion Prompting for Speech Emotion Recognition
INTERSPEECH 2023
Curriculum Co-disentangled Representation Learning across Multiple Environments for Social Recommendation
ICML 2023
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation
EACL 2023
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation
EACL 2023
Joint Data-Task Generation for Auxiliary Learning
NIPS 2023
Reducing Sentiment Bias in Pre-trained Sentiment Classification via Adaptive Gumbel Attack
AAAI 2023
Dynamic Heterogeneous Graph Attention Neural Architecture Search
AAAI 2023
JR2Net: Joint Monocular 3D Face Reconstruction and Reenactment
AAAI 2023
Curriculum Multi-Negative Augmentation for Debiased Video Grounding
AAAI 2023
T2IAT: Measuring Valence and Stereotypical Biases in Text-to-Image Generation
ACL 2023
Public Opinion Field Effect Fusion in Representation Learning for Trending Topics Diffusion
NIPS 2023
Alternating Updates for Efficient Transformers
NIPS 2023
Unsupervised Graph Neural Architecture Search with Disentangled Self-Supervision
NIPS 2023
You Do Not Need Additional Priors or Regularizers in Retinex-Based Low-Light Image Enhancement
CVPR 2023
Doubly Right Object Recognition: A Why Prompt for Visual Rationales
CVPR 2023
Adversarially Robust Neural Architecture Search for Graph Neural Networks
CVPR 2023
Top-Down Visual Attention From Analysis by Synthesis
CVPR 2023
CIMI4D: A Large Multimodal Climbing Motion Dataset Under Human-Scene Interactions
CVPR 2023
Outlier Robust Adversarial Training
ACML 2023
Aerial Vision-and-Dialog Navigation
ACL 2023
On the Benefits of Learning to Route in Mixture-of-Experts Models
EMNLP 2023
Learning to Solve Travelling Salesman Problem with Hardness-Adaptive Curriculum
AAAI 2022
Sum of Ranked Range Loss for Supervised Learning
JMLR 2022
Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval Generation
COLING 2022
DETReg: Unsupervised Pretraining With Region Priors for Object Detection
CVPR 2022
Robust Contrastive Learning Against Noisy Views
CVPR 2022
Unknown-Aware Object Detection: Learning What You Don't Know From Videos in the Wild
CVPR 2022
Neural-Sim: Learning to Generate Training Data with NeRF
ECCV 2022
LiDAL: Inter-Frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation
ECCV 2022
Context-Aware Streaming Perception in Dynamic Environments
ECCV 2022
CPL: Counterfactual Prompt Learning for Vision and Language Models
EMNLP 2022
Imagination-Augmented Natural Language Understanding
NAACL 2022
Diagnosing Vision-and-Language Navigation: What Really Matters
NAACL 2022
CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training
NAACL 2022
Dependency Position Encoding for Relation Extraction
NAACL 2022
Auxiliary Learning with Joint Task and Data Scheduling
ICML 2022
DNA: Domain Generalization with Diversified Neural Averaging
ICML 2022
Parametric Visual Program Induction with Function Modularization
ICML 2022
Large-Scale Graph Neural Architecture Search
ICML 2022
Graph Neural Architecture Search Under Distribution Shifts
ICML 2022
Visual Attention Emerges from Recurrent Sparse Reconstruction
ICML 2022
Module-Aware Optimization for Auxiliary Learning
NIPS 2022
A Theoretical View on Sparsely Activated Networks
NIPS 2022
Power and limitations of single-qubit native quantum neural networks
NIPS 2022
Concentration of Data Encoding in Parameterized Quantum Circuits
NIPS 2022
Learning Invariant Graph Representations for Out-of-Distribution Generalization
NIPS 2022
Generalization Bounds for Estimating Causal Effects of Continuous Treatments
NIPS 2022
Dynamic Graph Neural Networks Under Spatio-Temporal Distribution Shift
NIPS 2022
Sketching based Representations for Robust Image Classification with Provable Guarantees
NIPS 2022
Generative Status Estimation and Information Decoupling for Image Rain Removal
NIPS 2022
VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation
NIPS 2022
NAS-Bench-Graph: Benchmarking Graph Neural Architecture Search
NIPS 2022
Towards Unified Representations of Knowledge Graph and Expert Rules for Machine Learning and Reasoning
IJCNLP 2022
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
INTERSPEECH 2022
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions
INTERSPEECH 2022
A Novel Phoneme-based Modeling for Text-independent Speaker Identification
INTERSPEECH 2022
Is Anyone There? Learning a Planner Contingent on Perceptual Uncertainty
CORL 2022
Stochastic Planner-Actor-Critic for Unsupervised Deformable Image Registration
AAAI 2022
Orthogonal Graph Neural Networks
AAAI 2022
Seq2Pat: Sequence-to-Pattern Generation for Constraint-Based Sequential Pattern Mining
AAAI 2022
Towards Unified Representations of Knowledge Graph and Expert Rules for Machine Learning and Reasoning
AACL 2022
OIE@OIA: an Adaptable and Efficient Open Information Extraction Framework
ACL 2022
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
ACL 2022
Compilable Neural Code Generation with Compiler Feedback
ACL 2022
Assessing Multilingual Fairness in Pre-trained Multimodal Representations
ACL 2022
Interpretable Research Replication Prediction via Variational Contextual Consistency Sentence Masking
ACL 2022
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
EACL 2021
A Unified Approach to Interpreting and Boosting Adversarial Transferability
ICLR 2021
Online Learning of a Probabilistic and Adaptive Scene Representation
CVPR 2021
Stochastic Actor-Executor-Critic for Image-to-Image Translation
IJCAI 2021
Automated Machine Learning on Graphs: A Survey
IJCAI 2021
A Multi-Level Attention Model for Evidence-Based Fact Checking
ACL 2021
VSQL: Variational Shadow Quantum Learning for Classification
AAAI 2021
L2C: Describing Visual Differences Needs Semantic Understanding of Individuals
EACL 2021
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing
INTERSPEECH 2021
Graph Differentiable Architecture Search with Structure Learning
NIPS 2021
Disentangled Contrastive Learning on Graphs
NIPS 2021
Not All Low-Pass Filters are Robust in Graph Convolutional Networks
NIPS 2021
Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search
EMNLP 2021
Sketch based Memory for Neural Networks
AISTATS 2021
Explainable Automated Graph Representation Learning with Hyperparameter Importance
ICML 2021
AutoAttend: Automated Attention Representation Search
ICML 2021
Wanderlust: Online Continual Object Detection in the Real World
ICCV 2021
VMNet: Voxel-Mesh Network for Geodesic-Aware 3D Semantic Segmentation
ICCV 2021
A Multi-Level Attention Model for Evidence-Based Fact Checking
IJCNLP 2021
Curriculum Disentangled Recommendation with Noisy Multi-feedback
NIPS 2021
Towards a Unified Game-Theoretic View of Adversarial Perturbations and Robustness
NIPS 2021
TkML-AP: Adversarial Attacks to Top-k Multi-Label Learning
ICCV 2021
Robust Object Detection via Instance-Level Temporal Cycle Confusion
ICCV 2021
Distilling Holistic Knowledge With Graph Neural Networks
ICCV 2021
Interpreting Attributions and Interactions of Adversarial Attacks
ICCV 2021
One Network Fits All? Modular versus Monolithic Task Formulations in Neural Networks
ICLR 2021
Dynamic Multi-Scale Convolution for Dialect Identification
INTERSPEECH 2021
A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection
INTERSPEECH 2021
An Initial Investigation for Detecting Partially Spoofed Audio
INTERSPEECH 2021
Frustratingly Simple Few-Shot Object Detection
ICML 2020
Learning by Minimizing the Sum of Ranked Range
NIPS 2020
Subjective Quality Evaluation of Speech Signals Transmitted via BPL-PLC Wired System
INTERSPEECH 2020
Self-Supervised Deep Visual Odometry With Online Adaptation
CVPR 2020
Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards
ECCV 2020
Generative Adversarial Zero-Shot Relational Learning for Knowledge Graphs
AAAI 2020
DGE: Deep Generative Network Embedding Based on Commonality and Individuality
AAAI 2020
Attention-Guide Walk Model in Heterogeneous Information Network for Multi-Style Recommendation Explanation
AAAI 2020
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning
CVPR 2020
Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations
EMNLP 2020
How fine can fine-tuning be? Learning efficient language models
AISTATS 2020
Classification with Rejection: Scaling Generative Classifiers with Supervised Deep Infomax
IJCAI 2020
TransRHS: A Representation Learning Method for Knowledge Graphs with Relation Hierarchical Structure
IJCAI 2020
Adaptive Activation Network and Functional Regularization for Efficient and Flexible Deep Multi-Task Learning
AAAI 2020
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning
EMNLP 2020
SNEQ: Semi-Supervised Attributed Network Embedding with Attention-Based Quantisation
AAAI 2020
A Predicate-Function-Argument Annotation of Natural Language for Open-Domain Information eXpression
EMNLP 2020
Introducing the VoicePrivacy Initiative
INTERSPEECH 2020
Learning to Stop: A Simple yet Effective Approach to Urban Vision-Language Navigation
EMNLP 2020
Learning Saliency Propagation for Semi-Supervised Instance Segmentation
CVPR 2020
Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation
CVPR 2020
Sentence Matching with Syntax- and Semantics-Aware BERT
COLING 2020
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
CVPR 2020
Design Choices for X-Vector Based Speaker Anonymization
INTERSPEECH 2020
Using Cyclic Noise as the Source Signal for Neural Source-Filter-Based Speech Waveform Model
INTERSPEECH 2020
Reverberation Modeling for Source-Filter-Based Neural Vocoder
INTERSPEECH 2020
Discrete Social Recommendation
AAAI 2019
Beyond Tracking: Selecting Memory and Refining Poses for Deep Visual Odometry
CVPR 2019
Visual Query Answering by Entity-Attribute Graph Matching and Reasoning
CVPR 2019
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
CVPR 2019
TAFE-Net: Task-Aware Feature Embeddings for Low Shot Learning
CVPR 2019
MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment
CVPR 2019
Disparity-preserved Deep Cross-platform Association for Cross-platform Video Recommendation
IJCAI 2019
TIGEr: Text-to-Image Grounding for Image Caption Evaluation
EMNLP 2019
Latent Suicide Risk Detection on Microblog via Suicide-Oriented Word Embeddings and Layered Attention
EMNLP 2019
Latent Part-of-Speech Sequences for Neural Machine Translation
IJCNLP 2019
Latent Suicide Risk Detection on Microblog via Suicide-Oriented Word Embeddings and Layered Attention
IJCNLP 2019
TIGEr: Text-to-Image Grounding for Image Caption Evaluation
IJCNLP 2019
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
INTERSPEECH 2019
Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet
INTERSPEECH 2019
Training Multi-Speaker Neural Text-to-Speech Systems Using Speaker-Imbalanced Speech Corpora
INTERSPEECH 2019
MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion
INTERSPEECH 2019
Towards Generating Long and Coherent Text with Multi-Level Latent Variable Models
ACL 2019
Latent Part-of-Speech Sequences for Neural Machine Translation
EMNLP 2019
Self-Supervised Learning for Contextualized Extractive Summarization
ACL 2019
Dynamic Spatial-Temporal Graph Convolutional Neural Networks for Traffic Forecasting
AAAI 2019
Recursively Learning Causal Structures Using Regression-Based Conditional Independence Test
AAAI 2019
RS3CIS: Robust Single-Step Spectral Clustering with Intrinsic Subspace
AAAI 2019
Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning
AAAI 2019
Self-Supervised Dialogue Learning
ACL 2019
Generalized Boltzmann Machine with Deep Neural Structure
AISTATS 2019
Disentangled Graph Convolutional Networks
ICML 2019
Parameter efficient training of deep convolutional neural networks by dynamic sparse reparameterization
ICML 2019
Accel: A Corrective Fusion Network for Efficient Semantic Segmentation on Video
CVPR 2019
Few-Shot Object Detection via Feature Reweighting
ICCV 2019
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
ICCV 2019
Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry
ICCV 2019
Local Supports Global: Deep Camera Relocalization With Sequence Enhancement
ICCV 2019
ACE: Adapting to Changing Environments for Semantic Segmentation
ICCV 2019
Extract and Edit: An Alternative to Back-Translation for Unsupervised Neural Machine Translation
NAACL 2019
Deep Mixture of Experts via Shallow Embedding
UAI 2019
Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation
ECCV 2018
PSDF Fusion: Probabilistic Signed Distance Function for On-the-fly 3D Data Fusion and Scene Reconstruction
ECCV 2018
Robust Auto-Weighted Multi-View Clustering
IJCAI 2018
Video Captioning via Hierarchical Reinforcement Learning
CVPR 2018
No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling
ACL 2018
Investigating Accuracy of Pitch-accent Annotations in Neural Network-based Speech Synthesis and Denoising Effects
INTERSPEECH 2018
XL-NBT: A Cross-lingual Neural Belief Tracking Framework
EMNLP 2018
Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning
NAACL 2018
SkipNet: Learning Dynamic Routing in Convolutional Networks
ECCV 2018
Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer
CVPR 2017
An RNN-Based Quantized F0 Model with Multi-Tier Feedback Links for Text-to-Speech Synthesis
INTERSPEECH 2017
Predicting Users' Negative Feedbacks in Multi-Turn Human-Computer Dialogues
IJCNLP 2017
Clipper: A Low-Latency Online Prediction Serving System
NSDI 2017
Principles for Learning Controllable TTS from Annotated and Latent Variation
INTERSPEECH 2017
Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks
NIPS 2017
Understanding Users' Budgets for Recommendation with Hierarchical Poisson Factorization
IJCAI 2017
Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks
INTERSPEECH 2016
Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks
INTERSPEECH 2016
Enhance the Word Vector with Prosodic Information for the Recurrent Neural Network Based TTS System
INTERSPEECH 2016
Diamond: Nesting the Data Center Network with Wireless Rings in 3D Space
NSDI 2016
Multiplicative Multitask Feature Learning
JMLR 2016
Constrained Preference Embedding for Item Recommendation
IJCAI 2016
Predicting Polarities of Tweets by Composing Word Embeddings with Long Short-Term Memory
ACL 2015
Recommendation Algorithms for Optimizing Hit Rate, User Satisfaction and Website Revenue
IJCAI 2015
Predicting Polarities of Tweets by Composing Word Embeddings with Long Short-Term Memory
IJCNLP 2015
On Multiplicative Multitask Feature Learning
NIPS 2014
On Algorithms for Sparse Multi-factor NMF
NIPS 2013
Chinese Sentence-Level Sentiment Classification Based on Fuzzy Sets
COLING 2010
Chinese Semantic Role Labeling with Shallow Parsing
EMNLP 2009