Yu Zhang
295 papers · 2005–2026 · 24 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (51) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (8) π£ Hot Topic Early Bird
π
Academic Marathon
(20)
π
Renaissance Researcher
(8)
π
Interdisciplinary Bridge
π
Conference Loyalist
(26)
π
Keyword Trendsetter Combo
(4)
π€
Dynamic Duo
(16)
π
Triple Crown
π
Keyword Champion
π
Grand Slam
π₯
Mega-Team
(30)
π¬
Deep Specialist
(28)
π
Trend Setter
π
Conference Pioneer
π₯
Unstoppable
(14)
β
The Questioner
(4)
π
Century Club
(274)
ποΈ
Keyword Collector
(170)
β‘
Prolific Year
(15)
Conferences
INTERSPEECH (38)
AAAI (36)
ACL (33)
EMNLP (33)
NIPS (26)
CVPR (25)
ICLR (16)
ICML (15)
IJCAI (15)
ICCV (12)
COLING (11)
ECCV (9)
SEMEVAL (4)
MICCAI (4)
IJCNLP (4)
NAACL (3)
ACML (2)
AACL (2)
CORL (2)
AISTATS (1)
JMLR (1)
CONLL (1)
MIDL (1)
OSDI (1)
Top co-authors
Research topics
Keywords
large language model
(27)
multi-task learning
(16)
automatic speech recognition
(14)
attention mechanism
(14)
domain adaptation
(13)
contrastive learning
(12)
neural network
(11)
data augmentation
(10)
speech synthesis
(10)
self-supervised learning
(10)
singing voice synthesis
(9)
representation learning
(9)
speech recognition
(9)
multimodal learning
(9)
transfer learning
(9)
language model
(8)
machine translation
(8)
convolutional neural network
(7)
graph neural network
(7)
semantic segmentation
(6)
Papers
ReviewGrounder: Improving Review Substantiveness with Rubric-Guided, Tool-Integrated Agents
ACL 2026
SimPBL: A Multi-Agent Framework for Project-Based Learning
ACL 2026
Efficient and Effective In-context Demonstration Selection with Coreset
AAAI 2026
Exact Optimization for Minimum Dominating Sets
AAAI 2026
Revisiting Contrastive Learning in Collaborative Filtering via Parallel Graph Filters
AAAI 2026
Post-Hoc Refinement for Multitask Symbolic Regression via Consensus-Accelerated Shapley Analysis
AAAI 2026
Graph2Video: Leveraging Video Models to Model Dynamic Graph Evolution
AAAI 2026
HalluClean: A Unified Framework to Combat Hallucinations in LLMs
AAAI 2026
RPTS: Tree-Structured Reasoning Process Scoring for Faithful Multimodal Evaluation
AAAI 2026
SafeNLIDB: A Privacy-Preserving Safety Alignment Framework for LLM-based Natural Language Database Interfaces
AAAI 2026
Embracing Positional Bias in Multiple-Choice Question Answering via Permutation Equivariant Neural Networks
AAAI 2026
Robust Integrative Analysis of Multi-omics Datasets via Nuclear-norm Maximization
AAAI 2026
RSMeM: Knowledge-Enhanced Memory Evolution for Remote Sensing Agents with Systematic Evaluation
ACL 2026
ChemReason-Bench: Benchmarking Large Language Models for Procedural Reasoning in Experimental Chemistry
ACL 2026
ParaSuite: Boosting LLM Reasoning via Paradox Resolution
ACL 2026
ReCode: Reinforcing Code Generation with Reasoning-Process Rewards
ACL 2026
AIPO: Adaptive Information Guided Token-Level Reinforcement Learning for Large Language Model Reasoning
ACL 2026
SAME: Spatial-Aware Multimodal Egocentric Human Pose Estimation
AAAI 2026
S2O: Early Stopping for Sparse Attention via Online Permutation
ACL 2026
Rectifying the Emotional Flow: Aligning Priors and Dynamic Guidance for High-Arousal Text-to-Speech
ACL 2026
Beyond Self-Report: Bridging the Intention-Behavior Gap in Critical Thinking Assessment via Interpretable Multi-Agent System
ACL 2026
AnchorAttention: Difference-Aware Sparse Attention with Stripe Granularity
EMNLP 2025
Object-level Correlation for Few-Shot Segmentation
ICCV 2025
SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images
ICCV 2025
PLAN: Proactive Low-Rank Allocation for Continual Learning
ICCV 2025
Protein Large Language Models: A Comprehensive Survey
EMNLP 2025
ForestCast: Open-Ended Event Forecasting with Semantic News Forest
EMNLP 2025
DocAssistant: Integrating Key-region Reading and Step-wise Reasoning for Robust Document Visual Question Answering
EMNLP 2025
CrossQG: Improving Difficulty-Controllable Question Generation through Consistency Enhancement
EMNLP 2025
Versatile Framework for Song Generation with Prompt-based Control
EMNLP 2025
Corrupted but Not Broken: Understanding and Mitigating the Negative Impacts of Corrupted Data in Visual Instruction Tuning
EMNLP 2025
Inter-sentence Context Modeling and Structure-aware Representation Enhancement for Conversational Sentiment Quadruple Extraction
EMNLP 2025
SPE Attention: Making Attention Equivariant to Semantic-Preserving Permutation for Code Processing
EMNLP 2025
Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition
AAAI 2025
HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation
AAAI 2025
Adaptive Wavelet-Positional Encoding for High-Frequency Information Learning in Implicit Neural Representation
AAAI 2025
Multi-Label Ranking Loss Minimization for Matrix Completion
AAAI 2025
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching
AAAI 2025
Filling Memory Gaps: Enhancing Continual Semantic Parsing via SQL Syntax Variance-Guided LLMs Without Real Data Replay
AAAI 2025
NaFV-Net: An Adversarial Four-view Network for Mammogram Classification
AAAI 2025
Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches
AACL 2025
ASAudio: A Survey of Advanced Spatial Audio Research
AACL 2025
Think as Cardiac Sonographers: Marrying SAM with Left Ventricular Indicators Measurements According to Clinical Guidelines
MICCAI 2025
Mixture of insighTful Experts (MoTE): The Synergy of Reasoning Chains and Expert Mixtures in Self-Alignment
ACL 2025
ChemActor: Enhancing Automated Extraction of Chemical Synthesis Actions with LLM-Generated Data
ACL 2025
Internal and External Impacts of Natural Language Processing Papers
ACL 2025
A Unified Taxonomy-Guided Instruction Tuning Framework for Entity Set Expansion and Taxonomy Expansion
ACL 2025
TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis
ACL 2025
STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation
ACL 2025
InImageTrans: Multimodal LLM-based Text Image Machine Translation
ACL 2025
ASAudio: A Survey of Advanced Spatial Audio Research
IJCNLP 2025
Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches
IJCNLP 2025
ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning
IJCAI 2025
Improving Efficiency of Answer Set Planning with Rough Solutions from Large Language Models for Robotic Task Planning
IJCAI 2025
Gaussian Mixture Model for Graph Domain Adaptation
IJCAI 2025
Come Together, But Not Right Now: A Progressive Strategy to Boost Low-Rank Adaptation
ICML 2025
Strategic A/B testing via Maximum Probability-driven Two-armed Bandit
ICML 2025
Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG
ICML 2025
Open Your Eyes: Vision Enhances Message Passing Neural Networks in Link Prediction
ICML 2025
Discarding the Crutches: Adaptive Parameter-Efficient Expert Meta-Learning for Continual Semantic Parsing
COLING 2025
BANER: Boundary-Aware LLMs for Few-Shot Named Entity Recognition
COLING 2025
CaDA: Cross-Problem Routing Solver with Constraint-Aware Dual-Attention
ICML 2025
Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models
ICLR 2025
$\text{D}_{2}\text{O}$: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models
ICLR 2025
HiRA: Parameter-Efficient Hadamard High-Rank Adaptation for Large Language Models
ICLR 2025
ComLoRA: A Competitive Learning Approach for Enhancing LoRA
ICLR 2025
MTSAM: Multi-Task Fine-Tuning for Segment Anything Model
ICLR 2025
HeadMap: Locating and Enhancing Knowledge Circuits in LLMs
ICLR 2025
Sharpness-Aware Black-Box Optimization
ICLR 2025
Image Watermarks are Removable using Controllable Regeneration from Clean Noise
ICLR 2025
EnvPoser: Environment-aware Realistic Human Motion Estimation from Sparse Observations with Uncertainty Modeling
CVPR 2025
BHViT: Binarized Hybrid Vision Transformer
CVPR 2025
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
ICLR 2024
Gradual Domain Adaptation via Gradient Flow
ICLR 2024
Adaptive Stochastic Gradient Algorithm for Black-box Multi-Objective Learning
ICLR 2024
MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization
ICML 2024
Rethinking Guidance Information to Utilize Unlabeled Samples: A Label Encoding Perspective
ICML 2024
Multi-Task Interactive Robot Fleet Learning with Visual World Models
CORL 2024
Gated Slot Attention for Efficient Linear-Time Sequence Modeling
NIPS 2024
Parallelizing Linear Transformers with the Delta Rule over Sequence Length
NIPS 2024
IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation
NIPS 2024
Dissect Black Box: Interpreting for Rule-Based Explanations in Unsupervised Anomaly Detection
NIPS 2024
Time-Varying LoRA: Towards Effective Cross-Domain Fine-Tuning of Diffusion Models
NIPS 2024
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
NIPS 2024
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
EMNLP 2024
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery
EMNLP 2024
SDAC: A Multimodal Synthetic Dataset for Anomaly and Corner Case Detection in Autonomous Driving
AAAI 2024
Memory-Efficient Reversible Spiking Neural Networks
AAAI 2024
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis
AAAI 2024
Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains
AAAI 2024
Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction
MICCAI 2024
CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding
ICML 2024
GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning
NIPS 2024
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
NIPS 2024
ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model
NIPS 2024
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction
NIPS 2024
Personalized Federated Learning for Cross-City Traffic Prediction
IJCAI 2024
SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting
CVPR 2024
HiPose: Hierarchical Binary Surface Encoding and Correspondence Pruning for RGB-D 6DoF Object Pose Estimation
CVPR 2024
SecureSQL: Evaluating Data Leakage of Large Language Models as Natural Language Interfaces to Databases
EMNLP 2024
Enabling Tensor Language Model to Assist in Generating High-Performance Tensor Programs for Deep Learning
OSDI 2024
NC-SDF: Enhancing Indoor Scene Reconstruction Using Neural SDFs with View-Dependent Normal Compensation
CVPR 2024
Evaluating the Quality of Brain MRI Generators
MICCAI 2024
Continually Tuning a Large Language Model for Multi-domain Radiology Report Generation
MICCAI 2024
Question-guided Knowledge Graph Re-scoring and Injection for Knowledge Graph Question Answering
EMNLP 2024
Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models
ICLR 2024
MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders
ECCV 2024
IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers
ECCV 2024
"Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation"
ECCV 2024
Robust Singing Voice Transcription Serves Synthesis
ACL 2024
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs
ACL 2024
E2-LLM: Efficient and Extreme Length Extension of Large Language Models
ACL 2024
Planning First, Question Second: An LLM-Guided Method for Controllable Question Generation
ACL 2024
Forward-Backward Reasoning in Large Language Models for Mathematical Verification
ACL 2024
Selective Prompting Tuning for Personalized Conversations with LLMs
ACL 2024
Dynamic Inertial Poser (DynaIP): Part-Based Motion Dynamics Learning for Enhanced Human Pose Estimation with Sparse Inertial Sensors
CVPR 2024
Knowledge-aware Attention Network for Medication Effectiveness Prediction
COLING 2024
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus
INTERSPEECH 2023
Fine-Grained Cross-View Geo-Localization Using a Correlation-Aware Homography Estimator
NIPS 2023
Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
NIPS 2023
CluB: Cluster Meets BEV for LiDAR-Based 3D Object Detection
NIPS 2023
Interpreting Unsupervised Anomaly Detection in Security via Rule Extraction
NIPS 2023
MG-ViT: A Multi-Granularity Method for Compact and Efficient Vision Transformers
NIPS 2023
Learning Conflict-Noticed Architecture for Multi-Task Learning
AAAI 2023
Robust Temporal Smoothness in Multi-Task Learning
AAAI 2023
Electrophysiological Brain Source Imaging via Combinatorial Search with Provable Optimality
AAAI 2023
Denoising Pre-training for Machine Translation Quality Estimation with Curriculum Learning
AAAI 2023
Personalized Dialogue Generation with Persona-Adaptive Attention
AAAI 2023
Chain-of-Skills: A Configurable Model for Open-Domain Question Answering
ACL 2023
Patton: Language Model Pretraining on Text-Rich Networks
ACL 2023
Transforming Visual Scene Graphs to Image Captions
ACL 2023
Explanation Graph Generation via Generative Pre-training over Synthetic Graphs
ACL 2023
PersonaPKT: Building Personalized Dialogue Agents via Parameter-efficient Knowledge Transfer
ACL 2023
Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection
CVPR 2023
PEAL: Prior-Embedded Explicit Attention Learning for Low-Overlap Point Cloud Registration
CVPR 2023
Leveraging per Image-Token Consistency for Vision-Language Pre-Training
CVPR 2023
Range-Nullspace Video Frame Interpolation With Focalized Motion Estimation
CVPR 2023
Learning Retrieval Augmentation for Personalized Dialogue Generation
EMNLP 2023
Non-autoregressive Text Editing with Copy-aware Latent Alignments
EMNLP 2023
Improved Pseudo Data for Machine Translation Quality Estimation with Constrained Beam Search
EMNLP 2023
PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training
EMNLP 2023
KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion
EMNLP 2023
Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding
EMNLP 2023
Unify Word-level and Span-level Tasks: NJUNLPβs Participation for the WMT2023 Quality Estimation Shared Task
EMNLP 2023
E2NeRF: Event Enhanced Neural Radiance Fields from Blurry Images
ICCV 2023
Learning Trajectory-Word Alignments for Video-Language Tasks
ICCV 2023
Adaptive Positional Encoding for Bundle-Adjusting Neural Radiance Fields
ICCV 2023
Multi-view Self-supervised Disentanglement for General Image Denoising
ICCV 2023
Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks
ICLR 2023
An Adaptive Policy to Employ Sharpness-Aware Minimization
ICLR 2023
Mu$^2$SLAM: Multitask, Multilingual Speech and Language Models
ICML 2023
Effective Structured Prompting by Meta-Learning and Representative Verbalizer
ICML 2023
Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning
ICML 2023
Multi-Task Learning via Time-Aware Neural ODE
IJCAI 2023
Max Markov Chain
IJCAI 2023
How to Estimate Model Transferability of Pre-Trained Speech Models?
INTERSPEECH 2023
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention
INTERSPEECH 2023
Mixture-of-Expert Conformer for Streaming Multilingual ASR
INTERSPEECH 2023
PronScribe: Highly Accurate Multimodal Phonemic Transcription From Speech and Text
INTERSPEECH 2023
LibMTL: A Python Library for Deep Multi-Task Learning
JMLR 2023
MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model
MIDL 2023
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
INTERSPEECH 2022
A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis
INTERSPEECH 2022
TSGP: Two-Stage Generative Prompting for Unsupervised Commonsense Question Answering
EMNLP 2022
Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures inside Arguments
COLING 2022
Fast and Accurate End-to-End Span-based Semantic Role Labeling as Word-based Graph Parsing
COLING 2022
Joint Goal Segmentation and Goal Success Prediction on Multi-Domain Conversations
COLING 2022
Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
NIPS 2022
Deep Bayesian Video Frame Interpolation
ECCV 2022
PCR-CG: Point Cloud Registration via Deep Explicit Color and Geometry
ECCV 2022
Policy Optimization with Stochastic Mirror Descent
AAAI 2022
NJUNLPβs Participation for the WMT2022 Quality Estimation Shared Task
EMNLP 2022
Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation
ECCV 2022
Disentangling Task Relations for Few-shot Text Classification via Self-Supervised Hierarchical Task Clustering
EMNLP 2022
All Information is Valuable: Question Matching over Full Information Transmission Network
NAACL 2022
JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering
NAACL 2022
AutoMine: An Unmanned Mine Dataset
CVPR 2022
Balanced and Hierarchical Relation Learning for One-Shot Object Detection
CVPR 2022
An Efficient Person Clustering Algorithm for Open Checkout-Free Groceries
ECCV 2022
LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training
EMNLP 2022
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds
NAACL 2022
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
ACL 2022
DuReadervis: A Chinese Dataset for Open-domain Document Visual Question Answering
ACL 2022
Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
INTERSPEECH 2022
MAESTRO: Matched Speech Text Representations through Modality Matching
INTERSPEECH 2022
Unsupervised Data Selection via Discrete Speech Representation for ASR
INTERSPEECH 2022
XTREME-S: Evaluating Cross-lingual Speech Representations
INTERSPEECH 2022
Reducing Domain mismatch in Self-supervised speech pre-training
INTERSPEECH 2022
Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation
INTERSPEECH 2022
Self-supervised learning with random-projection quantizer for speech recognition
ICML 2022
Subspace Learning for Effective Meta-Learning
ICML 2022
Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
INTERSPEECH 2022
Dual-Curriculum Contrastive Multi-Instance Learning for Cancer Prognosis Analysis with Whole Slide Images
NIPS 2022
Dynamic Sparse Network for Time Series Classification: Learning What to βSeeβ
NIPS 2022
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation
INTERSPEECH 2022
On Convergence of Gradient Expected Sarsa(Ξ»)
AAAI 2021
Training Weakly Supervised Video Frame Interpolation With Events
ICCV 2021
Personalized Image Semantic Segmentation
ICCV 2021
WaveGrad: Estimating Gradients for Waveform Generation
ICLR 2021
Sparse Multi-Path Corrections in Fringe Projection Profilometry
CVPR 2021
Informative and Consistent Correspondence Mining for Cross-Domain Weakly Supervised Object Detection
CVPR 2021
A Coarse-to-Fine Labeling Framework for Joint Word Segmentation, POS Tagging, and Constituent Parsing
CONLL 2021
Effective Meta-Regularization by Kernelized Proximal Regularization
NIPS 2021
Regularized Mutual Learning for Personalized Federated Learning
ACML 2021
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
INTERSPEECH 2021
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
INTERSPEECH 2021
Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation
INTERSPEECH 2021
Exploring Targeted Universal Adversarial Perturbations to End-to-End ASR Models
INTERSPEECH 2021
Pushing the Limits of Non-Autoregressive Speech Recognition
INTERSPEECH 2021
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
INTERSPEECH 2021
Residual Energy-Based Models for End-to-End Speech Recognition
INTERSPEECH 2021
Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
INTERSPEECH 2021
Unsupervised Learning of Disentangled Speech Content and Style Representation
INTERSPEECH 2021
Multi-Objective Meta Learning
NIPS 2021
Learn to Predict Vertical Track Irregularity with Extremely Imbalanced Data
ACML 2021
Distant Transfer Learning via Deep Random Walk
AAAI 2021
A Coarse-to-Fine Labeling Framework for Joint Word Segmentation, POS Tagging, and Constituent Parsing
EMNLP 2021
Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training
EMNLP 2021
Logic-level Evidence Retrieval and Graph-based Verification Network for Table-based Fact Verification
EMNLP 2021
Knowledge Distillation from Internal Representations
AAAI 2020
Deep Image Clustering with Category-Style Representation
ECCV 2020
Learning to See in the Dark with Events
ECCV 2020
Learn to Cross-lingual Transfer with Meta Graph Learning Across Heterogeneous Languages
EMNLP 2020
What Is It You Really Want of Me? Generalized Reward Learning with Biased Beliefs about Domain Dynamics
AAAI 2020
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
CVPR 2020
Efficient Second-Order TreeCRF for Neural Dependency Parsing
ACL 2020
Learning Event-Based Motion Deblurring
CVPR 2020
Learn to Combine Linguistic and Symbolic Information for Table-based Fact Verification
COLING 2020
Fast and Accurate Neural CRF Constituency Parsing
IJCAI 2020
WISE: Word-Level Interaction-Based Multimodal Fusion for Speech Emotion Recognition
INTERSPEECH 2020
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection
INTERSPEECH 2020
Improved Noisy Student Training for Automatic Speech Recognition
INTERSPEECH 2020
SCADA: Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR
INTERSPEECH 2020
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
INTERSPEECH 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
INTERSPEECH 2020
Label Enhancement for Label Distribution Learning via Prior Knowledge
IJCAI 2020
Label Distribution for Learning with Noisy Labels
IJCAI 2020
Transferable End-to-End Aspect-based Sentiment Analysis with Selective Adversarial Learning
IJCNLP 2019
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
INTERSPEECH 2019
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
INTERSPEECH 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
INTERSPEECH 2019
Gaussian Transformer: A Lightweight Approach for Natural Language Inference
AAAI 2019
K3S: Knowledge-Driven Solution Support System
AAAI 2019
Selectivity or Invariance: Boundary-Aware Salient Object Detection
ICCV 2019
Transferable End-to-End Aspect-based Sentiment Analysis with Selective Adversarial Learning
EMNLP 2019
End-to-End Multi-View Fusion for 3D Object Detection in LiDAR Point Clouds
CORL 2019
Causes and Corrections for Bimodal Multi-Path Scanning With Structured Light
CVPR 2019
Structure-Preserving Stereoscopic View Synthesis With Multi-Scale Adversarial Correlation Matching
CVPR 2019
Exploiting Coarse-to-Fine Task Transfer for Aspect-Level Sentiment Classification
AAAI 2019
Learning (from) Deep Hierarchical Structure among Features
AAAI 2019
Hierarchical Generative Modeling for Controllable Speech Synthesis
ICLR 2019
HLT@SUDA at SemEval-2019 Task 1: UCCA Graph Parsing as Constituent Tree Parsing
SEMEVAL 2019
Multi-Class Part Parsing With Joint Boundary-Semantic Awareness
ICCV 2019
Zero Pronoun Resolution with Attention-based Neural Network
COLING 2018
Simple Recurrent Units for Highly Parallelizable Recurrence
EMNLP 2018
Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks
EMNLP 2018
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
NIPS 2018
Transfer Learning via Learning to Transfer
ICML 2018
Learning to Multitask
NIPS 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
ICML 2018
Deep Reinforcement Learning for Chinese Zero Pronoun Resolution
ACL 2018
On the Duration of Mandarin Tones
INTERSPEECH 2017
Chinese Zero Pronoun Resolution with Deep Memory Network
EMNLP 2017
Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data
NIPS 2017
Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector
ICCV 2017
SCIR-QA at SemEval-2017 Task 3: CNN Model Based on Similar and Dissimilar Information between Keywords for Question Similarity
SEMEVAL 2017
Benben: A Chinese Intelligent Conversational Robot
ACL 2017
Attention-Based LSTM with Multi-Task Learning for Distant Speech Recognition
INTERSPEECH 2017
Plan Explanations as Model Reconciliation: Moving Beyond Explanation as Soliloquy
IJCAI 2017
End-to-End Adversarial Memory Network for Cross-domain Sentiment Classification
IJCAI 2017
Deep Neural Networks for High Dimension, Low Sample Size Data
IJCAI 2017
A Deep Neural Network for Chinese Zero Pronoun Resolution
IJCAI 2017
Multimodal Linear Discriminant Analysis via Structural Sparsity
IJCAI 2017
Learning Latent Representations for Speech Generation and Transformation
INTERSPEECH 2017
Advances in Joint CTC-Attention Based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM
INTERSPEECH 2017
What Is and What Is Not a Salient Object? Learning Salient Object Detector by Ensembling Linear Exemplar Regressors
CVPR 2017
Exploit Bounding Box Annotations for Multi-Label Object Recognition
CVPR 2016
Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition
INTERSPEECH 2016
Neural Attention for Learning to Rank Questions in Community Question Answering
COLING 2016
SLS at SemEval-2016 Task 3: Neural-based Approaches for Ranking in Community Question Answering
SEMEVAL 2016
Semantic Object Segmentation via Detection in Weakly Labeled Video
CVPR 2015
3D Reconstruction in the Presence of Glasses by Acoustic and Stereo Fusion
CVPR 2015
Towards Good Practices for Action Video Encoding
CVPR 2014
Compact Representation for Image Classification: To Choose or to Compress?
CVPR 2014
Heterogeneous-Neighborhood-based Multi-Task Local Learning Algorithms
NIPS 2013
Learning High-Order Task Relationships in Multi-Task Learning
IJCAI 2013
Joint Learning of Phonetic Units and Word Pronunciations for ASR
EMNLP 2013
The Use of Dependency Relation Graph to Enhance the Term Weighting in Question Retrieval
COLING 2012
Multi-Task Learning using Generalized t Process
AISTATS 2010
Probabilistic Multi-Task Feature Selection
NIPS 2010
Worst-Case Linear Discriminant Analysis
NIPS 2010
Bridging Topic Modeling and Personalized Search
COLING 2010
HIT: Web based Scoring Method for English Lexical Substitution
SEMEVAL 2007
Automated Generalization of Phrasal Paraphrases from the Web
IJCNLP 2005