conftrace_

Yu Zhang

295 papers · 2005–2026 · 24 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+18 more ↓

🗺️ Taxonomy Completionist (51) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (8) 🐣 Hot Topic Early Bird

🏃 Academic Marathon (20) 🌈 Renaissance Researcher (8) 🌉 Interdisciplinary Bridge 🏠 Conference Loyalist (26) 🌟 Keyword Trendsetter Combo (4) 🤝 Dynamic Duo (16) 👑 Triple Crown 🏆 Keyword Champion 🏆 Grand Slam 👥 Mega-Team (30) 🔬 Deep Specialist (28) 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (14) ❓ The Questioner (4) 💎 Century Club (274) 🗃️ Keyword Collector (170) ⚡ Prolific Year (15)

Conferences

INTERSPEECH (38) AAAI (36) ACL (33) EMNLP (33) NIPS (26) CVPR (25) ICLR (16) ICML (15) IJCAI (15) ICCV (12) COLING (11) ECCV (9) SEMEVAL (4) MICCAI (4) IJCNLP (4) NAACL (3) ACML (2) AACL (2) CORL (2) AISTATS (1) JMLR (1) CONLL (1) MIDL (1) OSDI (1)

Top co-authors

Ting Liu (16) Zhou Zhao (13) Ying Wei (12) Heiga Zen (11) Ye Jia (11) Yonghui Wu (11) James Kwok (11) Tom Ko (11) Jiawei Han (11) Changhao Pan (10)

Research topics

Keywords

large language model (27) multi-task learning (16) automatic speech recognition (14) attention mechanism (14) domain adaptation (13) contrastive learning (12) neural network (11) data augmentation (10) speech synthesis (10) self-supervised learning (10) singing voice synthesis (9) representation learning (9) speech recognition (9) multimodal learning (9) transfer learning (9) language model (8) machine translation (8) convolutional neural network (7) graph neural network (7) semantic segmentation (6)

Papers

ReviewGrounder: Improving Review Substantiveness with Rubric-Guided, Tool-Integrated Agents ACL 2026 SimPBL: A Multi-Agent Framework for Project-Based Learning ACL 2026 Efficient and Effective In-context Demonstration Selection with Coreset AAAI 2026 Exact Optimization for Minimum Dominating Sets AAAI 2026 Revisiting Contrastive Learning in Collaborative Filtering via Parallel Graph Filters AAAI 2026 Post-Hoc Refinement for Multitask Symbolic Regression via Consensus-Accelerated Shapley Analysis AAAI 2026 Graph2Video: Leveraging Video Models to Model Dynamic Graph Evolution AAAI 2026 HalluClean: A Unified Framework to Combat Hallucinations in LLMs AAAI 2026 RPTS: Tree-Structured Reasoning Process Scoring for Faithful Multimodal Evaluation AAAI 2026 SafeNLIDB: A Privacy-Preserving Safety Alignment Framework for LLM-based Natural Language Database Interfaces AAAI 2026 Embracing Positional Bias in Multiple-Choice Question Answering via Permutation Equivariant Neural Networks AAAI 2026 Robust Integrative Analysis of Multi-omics Datasets via Nuclear-norm Maximization AAAI 2026 RSMeM: Knowledge-Enhanced Memory Evolution for Remote Sensing Agents with Systematic Evaluation ACL 2026 ChemReason-Bench: Benchmarking Large Language Models for Procedural Reasoning in Experimental Chemistry ACL 2026 ParaSuite: Boosting LLM Reasoning via Paradox Resolution ACL 2026 ReCode: Reinforcing Code Generation with Reasoning-Process Rewards ACL 2026 AIPO: Adaptive Information Guided Token-Level Reinforcement Learning for Large Language Model Reasoning ACL 2026 SAME: Spatial-Aware Multimodal Egocentric Human Pose Estimation AAAI 2026 S2O: Early Stopping for Sparse Attention via Online Permutation ACL 2026 Rectifying the Emotional Flow: Aligning Priors and Dynamic Guidance for High-Arousal Text-to-Speech ACL 2026 Beyond Self-Report: Bridging the Intention-Behavior Gap in Critical Thinking Assessment via Interpretable Multi-Agent System ACL 2026 AnchorAttention: Difference-Aware Sparse Attention with Stripe Granularity EMNLP 2025 Object-level Correlation for Few-Shot Segmentation ICCV 2025 SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images ICCV 2025 PLAN: Proactive Low-Rank Allocation for Continual Learning ICCV 2025 Protein Large Language Models: A Comprehensive Survey EMNLP 2025 ForestCast: Open-Ended Event Forecasting with Semantic News Forest EMNLP 2025 DocAssistant: Integrating Key-region Reading and Step-wise Reasoning for Robust Document Visual Question Answering EMNLP 2025 CrossQG: Improving Difficulty-Controllable Question Generation through Consistency Enhancement EMNLP 2025 Versatile Framework for Song Generation with Prompt-based Control EMNLP 2025 Corrupted but Not Broken: Understanding and Mitigating the Negative Impacts of Corrupted Data in Visual Instruction Tuning EMNLP 2025 Inter-sentence Context Modeling and Structure-aware Representation Enhancement for Conversational Sentiment Quadruple Extraction EMNLP 2025 SPE Attention: Making Attention Equivariant to Semantic-Preserving Permutation for Code Processing EMNLP 2025 Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition AAAI 2025 HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation AAAI 2025 Adaptive Wavelet-Positional Encoding for High-Frequency Information Learning in Implicit Neural Representation AAAI 2025 Multi-Label Ranking Loss Minimization for Matrix Completion AAAI 2025 TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching AAAI 2025 Filling Memory Gaps: Enhancing Continual Semantic Parsing via SQL Syntax Variance-Guided LLMs Without Real Data Replay AAAI 2025 NaFV-Net: An Adversarial Four-view Network for Mammogram Classification AAAI 2025 Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches AACL 2025 ASAudio: A Survey of Advanced Spatial Audio Research AACL 2025 Think as Cardiac Sonographers: Marrying SAM with Left Ventricular Indicators Measurements According to Clinical Guidelines MICCAI 2025 Mixture of insighTful Experts (MoTE): The Synergy of Reasoning Chains and Expert Mixtures in Self-Alignment ACL 2025 ChemActor: Enhancing Automated Extraction of Chemical Synthesis Actions with LLM-Generated Data ACL 2025 Internal and External Impacts of Natural Language Processing Papers ACL 2025 A Unified Taxonomy-Guided Instruction Tuning Framework for Entity Set Expansion and Taxonomy Expansion ACL 2025 TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis ACL 2025 STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation ACL 2025 InImageTrans: Multimodal LLM-based Text Image Machine Translation ACL 2025 ASAudio: A Survey of Advanced Spatial Audio Research IJCNLP 2025 Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches IJCNLP 2025 ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning IJCAI 2025 Improving Efficiency of Answer Set Planning with Rough Solutions from Large Language Models for Robotic Task Planning IJCAI 2025 Gaussian Mixture Model for Graph Domain Adaptation IJCAI 2025 Come Together, But Not Right Now: A Progressive Strategy to Boost Low-Rank Adaptation ICML 2025 Strategic A/B testing via Maximum Probability-driven Two-armed Bandit ICML 2025 Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG ICML 2025 Open Your Eyes: Vision Enhances Message Passing Neural Networks in Link Prediction ICML 2025 Discarding the Crutches: Adaptive Parameter-Efficient Expert Meta-Learning for Continual Semantic Parsing COLING 2025 BANER: Boundary-Aware LLMs for Few-Shot Named Entity Recognition COLING 2025 CaDA: Cross-Problem Routing Solver with Constraint-Aware Dual-Attention ICML 2025 Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models ICLR 2025 $\text{D}_{2}\text{O}$: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models ICLR 2025 HiRA: Parameter-Efficient Hadamard High-Rank Adaptation for Large Language Models ICLR 2025 ComLoRA: A Competitive Learning Approach for Enhancing LoRA ICLR 2025 MTSAM: Multi-Task Fine-Tuning for Segment Anything Model ICLR 2025 HeadMap: Locating and Enhancing Knowledge Circuits in LLMs ICLR 2025 Sharpness-Aware Black-Box Optimization ICLR 2025 Image Watermarks are Removable using Controllable Regeneration from Clean Noise ICLR 2025 EnvPoser: Environment-aware Realistic Human Motion Estimation from Sparse Observations with Uncertainty Modeling CVPR 2025 BHViT: Binarized Hybrid Vision Transformer CVPR 2025 EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions CVPR 2025 MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models ICLR 2024 Gradual Domain Adaptation via Gradient Flow ICLR 2024 Adaptive Stochastic Gradient Algorithm for Black-box Multi-Objective Learning ICLR 2024 MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization ICML 2024 Rethinking Guidance Information to Utilize Unlabeled Samples: A Label Encoding Perspective ICML 2024 Multi-Task Interactive Robot Fleet Learning with Visual World Models CORL 2024 Gated Slot Attention for Efficient Linear-Time Sequence Modeling NIPS 2024 Parallelizing Linear Transformers with the Delta Rule over Sequence Length NIPS 2024 IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation NIPS 2024 Dissect Black Box: Interpreting for Rule-Based Explanations in Unsupervised Anomaly Detection NIPS 2024 Time-Varying LoRA: Towards Effective Cross-Domain Fine-Tuning of Diffusion Models NIPS 2024 RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models NIPS 2024 TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control EMNLP 2024 A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery EMNLP 2024 SDAC: A Multimodal Synthetic Dataset for Anomaly and Corner Case Detection in Autonomous Driving AAAI 2024 Memory-Efficient Reversible Spiking Neural Networks AAAI 2024 StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis AAAI 2024 Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains AAAI 2024 Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction MICCAI 2024 CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding ICML 2024 GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning NIPS 2024 GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks NIPS 2024 ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model NIPS 2024 NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction NIPS 2024 Personalized Federated Learning for Cross-City Traffic Prediction IJCAI 2024 SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting CVPR 2024 HiPose: Hierarchical Binary Surface Encoding and Correspondence Pruning for RGB-D 6DoF Object Pose Estimation CVPR 2024 SecureSQL: Evaluating Data Leakage of Large Language Models as Natural Language Interfaces to Databases EMNLP 2024 Enabling Tensor Language Model to Assist in Generating High-Performance Tensor Programs for Deep Learning OSDI 2024 NC-SDF: Enhancing Indoor Scene Reconstruction Using Neural SDFs with View-Dependent Normal Compensation CVPR 2024 Evaluating the Quality of Brain MRI Generators MICCAI 2024 Continually Tuning a Large Language Model for Multi-domain Radiology Report Generation MICCAI 2024 Question-guided Knowledge Graph Re-scoring and Injection for Knowledge Graph Question Answering EMNLP 2024 Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models ICLR 2024 MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders ECCV 2024 IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers ECCV 2024 "Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation" ECCV 2024 Robust Singing Voice Transcription Serves Synthesis ACL 2024 Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs ACL 2024 E2-LLM: Efficient and Extreme Length Extension of Large Language Models ACL 2024 Planning First, Question Second: An LLM-Guided Method for Controllable Question Generation ACL 2024 Forward-Backward Reasoning in Large Language Models for Mathematical Verification ACL 2024 Selective Prompting Tuning for Personalized Conversations with LLMs ACL 2024 Dynamic Inertial Poser (DynaIP): Part-Based Motion Dynamics Learning for Enhanced Human Pose Estimation with Sparse Inertial Sensors CVPR 2024 Knowledge-aware Attention Network for Medication Effectiveness Prediction COLING 2024 LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus INTERSPEECH 2023 Fine-Grained Cross-View Geo-Localization Using a Correlation-Aware Homography Estimator NIPS 2023 Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference NIPS 2023 CluB: Cluster Meets BEV for LiDAR-Based 3D Object Detection NIPS 2023 Interpreting Unsupervised Anomaly Detection in Security via Rule Extraction NIPS 2023 MG-ViT: A Multi-Granularity Method for Compact and Efficient Vision Transformers NIPS 2023 Learning Conflict-Noticed Architecture for Multi-Task Learning AAAI 2023 Robust Temporal Smoothness in Multi-Task Learning AAAI 2023 Electrophysiological Brain Source Imaging via Combinatorial Search with Provable Optimality AAAI 2023 Denoising Pre-training for Machine Translation Quality Estimation with Curriculum Learning AAAI 2023 Personalized Dialogue Generation with Persona-Adaptive Attention AAAI 2023 Chain-of-Skills: A Configurable Model for Open-Domain Question Answering ACL 2023 Patton: Language Model Pretraining on Text-Rich Networks ACL 2023 Transforming Visual Scene Graphs to Image Captions ACL 2023 Explanation Graph Generation via Generative Pre-training over Synthetic Graphs ACL 2023 PersonaPKT: Building Personalized Dialogue Agents via Parameter-efficient Knowledge Transfer ACL 2023 Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection CVPR 2023 PEAL: Prior-Embedded Explicit Attention Learning for Low-Overlap Point Cloud Registration CVPR 2023 Leveraging per Image-Token Consistency for Vision-Language Pre-Training CVPR 2023 Range-Nullspace Video Frame Interpolation With Focalized Motion Estimation CVPR 2023 Learning Retrieval Augmentation for Personalized Dialogue Generation EMNLP 2023 Non-autoregressive Text Editing with Copy-aware Latent Alignments EMNLP 2023 Improved Pseudo Data for Machine Translation Quality Estimation with Constrained Beam Search EMNLP 2023 PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training EMNLP 2023 KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion EMNLP 2023 Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding EMNLP 2023 Unify Word-level and Span-level Tasks: NJUNLP’s Participation for the WMT2023 Quality Estimation Shared Task EMNLP 2023 E2NeRF: Event Enhanced Neural Radiance Fields from Blurry Images ICCV 2023 Learning Trajectory-Word Alignments for Video-Language Tasks ICCV 2023 Adaptive Positional Encoding for Bundle-Adjusting Neural Radiance Fields ICCV 2023 Multi-view Self-supervised Disentanglement for General Image Denoising ICCV 2023 Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks ICLR 2023 An Adaptive Policy to Employ Sharpness-Aware Minimization ICLR 2023 Mu$^2$SLAM: Multitask, Multilingual Speech and Language Models ICML 2023 Effective Structured Prompting by Meta-Learning and Representative Verbalizer ICML 2023 Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning ICML 2023 Multi-Task Learning via Time-Aware Neural ODE IJCAI 2023 Max Markov Chain IJCAI 2023 How to Estimate Model Transferability of Pre-Trained Speech Models? INTERSPEECH 2023 Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention INTERSPEECH 2023 Mixture-of-Expert Conformer for Streaming Multilingual ASR INTERSPEECH 2023 PronScribe: Highly Accurate Multimodal Phonemic Transcription From Speech and Text INTERSPEECH 2023 LibMTL: A Python Library for Deep Multi-Task Learning JMLR 2023 MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model MIDL 2023 LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT INTERSPEECH 2022 A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis INTERSPEECH 2022 TSGP: Two-Stage Generative Prompting for Unsupervised Commonsense Question Answering EMNLP 2022 Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures inside Arguments COLING 2022 Fast and Accurate End-to-End Span-based Semantic Role Labeling as Word-based Graph Parsing COLING 2022 Joint Goal Segmentation and Goal Success Prediction on Multi-Domain Conversations COLING 2022 Generating Training Data with Language Models: Towards Zero-Shot Language Understanding NIPS 2022 Deep Bayesian Video Frame Interpolation ECCV 2022 PCR-CG: Point Cloud Registration via Deep Explicit Color and Geometry ECCV 2022 Policy Optimization with Stochastic Mirror Descent AAAI 2022 NJUNLP’s Participation for the WMT2022 Quality Estimation Shared Task EMNLP 2022 Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation ECCV 2022 Disentangling Task Relations for Few-shot Text Classification via Self-Supervised Hierarchical Task Clustering EMNLP 2022 All Information is Valuable: Question Matching over Full Information Transmission Network NAACL 2022 JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering NAACL 2022 AutoMine: An Unmanned Mine Dataset CVPR 2022 Balanced and Hierarchical Relation Learning for One-Shot Object Detection CVPR 2022 An Efficient Person Clustering Algorithm for Open Checkout-Free Groceries ECCV 2022 LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training EMNLP 2022 Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds NAACL 2022 SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing ACL 2022 DuReadervis: A Chinese Dataset for Open-domain Document Visual Question Answering ACL 2022 Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks INTERSPEECH 2022 MAESTRO: Matched Speech Text Representations through Modality Matching INTERSPEECH 2022 Unsupervised Data Selection via Discrete Speech Representation for ASR INTERSPEECH 2022 XTREME-S: Evaluating Cross-lingual Speech Representations INTERSPEECH 2022 Reducing Domain mismatch in Self-supervised speech pre-training INTERSPEECH 2022 Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation INTERSPEECH 2022 Self-supervised learning with random-projection quantizer for speech recognition ICML 2022 Subspace Learning for Effective Meta-Learning ICML 2022 Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation INTERSPEECH 2022 Dual-Curriculum Contrastive Multi-Instance Learning for Cancer Prognosis Analysis with Whole Slide Images NIPS 2022 Dynamic Sparse Network for Time Series Classification: Learning What to “See” NIPS 2022 Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation INTERSPEECH 2022 On Convergence of Gradient Expected Sarsa(λ) AAAI 2021 Training Weakly Supervised Video Frame Interpolation With Events ICCV 2021 Personalized Image Semantic Segmentation ICCV 2021 WaveGrad: Estimating Gradients for Waveform Generation ICLR 2021 Sparse Multi-Path Corrections in Fringe Projection Profilometry CVPR 2021 Informative and Consistent Correspondence Mining for Cross-Domain Weakly Supervised Object Detection CVPR 2021 A Coarse-to-Fine Labeling Framework for Joint Word Segmentation, POS Tagging, and Constituent Parsing CONLL 2021 Effective Meta-Regularization by Kernelized Proximal Regularization NIPS 2021 Regularized Mutual Learning for Personalized Federated Learning ACML 2021 Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling INTERSPEECH 2021 PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS INTERSPEECH 2021 Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation INTERSPEECH 2021 Exploring Targeted Universal Adversarial Perturbations to End-to-End ASR Models INTERSPEECH 2021 Pushing the Limits of Non-Autoregressive Speech Recognition INTERSPEECH 2021 WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis INTERSPEECH 2021 Residual Energy-Based Models for End-to-End Speech Recognition INTERSPEECH 2021 Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction INTERSPEECH 2021 Unsupervised Learning of Disentangled Speech Content and Style Representation INTERSPEECH 2021 Multi-Objective Meta Learning NIPS 2021 Learn to Predict Vertical Track Irregularity with Extremely Imbalanced Data ACML 2021 Distant Transfer Learning via Deep Random Walk AAAI 2021 A Coarse-to-Fine Labeling Framework for Joint Word Segmentation, POS Tagging, and Constituent Parsing EMNLP 2021 Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training EMNLP 2021 Logic-level Evidence Retrieval and Graph-based Verification Network for Table-based Fact Verification EMNLP 2021 Knowledge Distillation from Internal Representations AAAI 2020 Deep Image Clustering with Category-Style Representation ECCV 2020 Learning to See in the Dark with Events ECCV 2020 Learn to Cross-lingual Transfer with Meta Graph Learning Across Heterogeneous Languages EMNLP 2020 What Is It You Really Want of Me? Generalized Reward Learning with Biased Beliefs about Domain Dynamics AAAI 2020 Scalability in Perception for Autonomous Driving: Waymo Open Dataset CVPR 2020 Efficient Second-Order TreeCRF for Neural Dependency Parsing ACL 2020 Learning Event-Based Motion Deblurring CVPR 2020 Learn to Combine Linguistic and Symbolic Information for Table-based Fact Verification COLING 2020 Fast and Accurate Neural CRF Constituency Parsing IJCAI 2020 WISE: Word-Level Interaction-Based Multimodal Fusion for Speech Emotion Recognition INTERSPEECH 2020 Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection INTERSPEECH 2020 Improved Noisy Student Training for Automatic Speech Recognition INTERSPEECH 2020 SCADA: Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR INTERSPEECH 2020 ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context INTERSPEECH 2020 Conformer: Convolution-augmented Transformer for Speech Recognition INTERSPEECH 2020 Label Enhancement for Label Distribution Learning via Prior Knowledge IJCAI 2020 Label Distribution for Learning with Noisy Labels IJCAI 2020 Transferable End-to-End Aspect-based Sentiment Analysis with Selective Adversarial Learning IJCNLP 2019 LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech INTERSPEECH 2019 Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning INTERSPEECH 2019 SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition INTERSPEECH 2019 Gaussian Transformer: A Lightweight Approach for Natural Language Inference AAAI 2019 K3S: Knowledge-Driven Solution Support System AAAI 2019 Selectivity or Invariance: Boundary-Aware Salient Object Detection ICCV 2019 Transferable End-to-End Aspect-based Sentiment Analysis with Selective Adversarial Learning EMNLP 2019 End-to-End Multi-View Fusion for 3D Object Detection in LiDAR Point Clouds CORL 2019 Causes and Corrections for Bimodal Multi-Path Scanning With Structured Light CVPR 2019 Structure-Preserving Stereoscopic View Synthesis With Multi-Scale Adversarial Correlation Matching CVPR 2019 Exploiting Coarse-to-Fine Task Transfer for Aspect-Level Sentiment Classification AAAI 2019 Learning (from) Deep Hierarchical Structure among Features AAAI 2019 Hierarchical Generative Modeling for Controllable Speech Synthesis ICLR 2019 HLT@SUDA at SemEval-2019 Task 1: UCCA Graph Parsing as Constituent Tree Parsing SEMEVAL 2019 Multi-Class Part Parsing With Joint Boundary-Semantic Awareness ICCV 2019 Zero Pronoun Resolution with Attention-based Neural Network COLING 2018 Simple Recurrent Units for Highly Parallelizable Recurrence EMNLP 2018 Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks EMNLP 2018 Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis NIPS 2018 Transfer Learning via Learning to Transfer ICML 2018 Learning to Multitask NIPS 2018 Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis ICML 2018 Deep Reinforcement Learning for Chinese Zero Pronoun Resolution ACL 2018 On the Duration of Mandarin Tones INTERSPEECH 2017 Chinese Zero Pronoun Resolution with Deep Memory Network EMNLP 2017 Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data NIPS 2017 Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector ICCV 2017 SCIR-QA at SemEval-2017 Task 3: CNN Model Based on Similar and Dissimilar Information between Keywords for Question Similarity SEMEVAL 2017 Benben: A Chinese Intelligent Conversational Robot ACL 2017 Attention-Based LSTM with Multi-Task Learning for Distant Speech Recognition INTERSPEECH 2017 Plan Explanations as Model Reconciliation: Moving Beyond Explanation as Soliloquy IJCAI 2017 End-to-End Adversarial Memory Network for Cross-domain Sentiment Classification IJCAI 2017 Deep Neural Networks for High Dimension, Low Sample Size Data IJCAI 2017 A Deep Neural Network for Chinese Zero Pronoun Resolution IJCAI 2017 Multimodal Linear Discriminant Analysis via Structural Sparsity IJCAI 2017 Learning Latent Representations for Speech Generation and Transformation INTERSPEECH 2017 Advances in Joint CTC-Attention Based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM INTERSPEECH 2017 What Is and What Is Not a Salient Object? Learning Salient Object Detector by Ensembling Linear Exemplar Regressors CVPR 2017 Exploit Bounding Box Annotations for Multi-Label Object Recognition CVPR 2016 Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition INTERSPEECH 2016 Neural Attention for Learning to Rank Questions in Community Question Answering COLING 2016 SLS at SemEval-2016 Task 3: Neural-based Approaches for Ranking in Community Question Answering SEMEVAL 2016 Semantic Object Segmentation via Detection in Weakly Labeled Video CVPR 2015 3D Reconstruction in the Presence of Glasses by Acoustic and Stereo Fusion CVPR 2015 Towards Good Practices for Action Video Encoding CVPR 2014 Compact Representation for Image Classification: To Choose or to Compress? CVPR 2014 Heterogeneous-Neighborhood-based Multi-Task Local Learning Algorithms NIPS 2013 Learning High-Order Task Relationships in Multi-Task Learning IJCAI 2013 Joint Learning of Phonetic Units and Word Pronunciations for ASR EMNLP 2013 The Use of Dependency Relation Graph to Enhance the Term Weighting in Question Retrieval COLING 2012 Multi-Task Learning using Generalized t Process AISTATS 2010 Probabilistic Multi-Task Feature Selection NIPS 2010 Worst-Case Linear Discriminant Analysis NIPS 2010 Bridging Topic Modeling and Personalized Search COLING 2010 HIT: Web based Scoring Method for English Lexical Substitution SEMEVAL 2007 Automated Generalization of Phrasal Paraphrases from the Web IJCNLP 2005