Hao Zhang
322 papers · 2004–2026 · 25 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (47)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (8)
Hot Topic Early Bird
Cross-Pollinator (7)
Conference Loyalist (29)
Dynamic Duo (20)
Triple Crown
Keyword Champion (2)
Grand Slam
Mega-Team (22)
Deep Specialist (35)
The Questioner (2)
Conference Pioneer
Prolific Year (27)
Unstoppable (16)
Keyword Collector (127)
Century Club (307)
Trend Setter
Conferences
ACL (40)
CVPR (39)
AAAI (38)
NIPS (29)
ICLR (25)
EMNLP (24)
ICML (21)
ICCV (17)
INTERSPEECH (16)
ECCV (15)
COLING (11)
IJCAI (9)
IJCNLP (5)
WACV (4)
RSS (4)
OSDI (4)
NAACL (4)
MICCAI (4)
AISTATS (3)
SEMEVAL (3)
EACL (2)
CORL (2)
MLHC (1)
AACL (1)
CONLL (1)
Keywords
large language model (24)
question answering (12)
semantic segmentation (12)
zero-shot learning (11)
neural network (11)
multimodal learning (11)
representation learning (10)
diffusion model (10)
domain adaptation (10)
transfer learning (9)
adversarial attack (9)
benchmark evaluation (9)
neural network optimization (8)
vision-language model (8)
3d reconstruction (8)
language model (8)
object detection (7)
text classification (7)
causal discovery (7)
attention mechanism (7)
Papers
Invariant Feature Learning for Counterfactual Watch-time Prediction in Video Recommendation
AAAI 2026
Tiny Scales, Great Challenges: The Limits of Multimodal LLMs in Scale Recognition
ACL 2026
PDTrim: Targeted Pruning for Prefill-Decode Disaggregation in Inference
ACL 2026
Stop Mixing Things Up! BISCUIT Teaches Vision-Language Models to Learn New Concepts from Images on the Spot
AAAI 2026
Iterative Structured Pruning for Large Language Models with Multi-Domain Calibration
EACL 2026
DetectRL-X: Towards Reliable Multilingual and Real-World LLM-Generated Text Detection
ACL 2026
Robust Fusion Controller: Degradation-Aware Image Fusion with Fine-Grained Language Instructions
AAAI 2026
Revisiting Audio-language Pretraining for Learning General-purpose Audio Representation
ACL 2026
Diff-NAT: Better Naturalistic and Aggressive Adversarial Attacks via Class-Optimized Diffusion for Object Detection
AAAI 2026
SGPFeat: Semantic and Geometric Priors for Multi-modal Image Matching
AAAI 2026
CycleChemist: A Dual-Pronged Machine Learning Framework for Organic Photovoltaic Discovery
AAAI 2026
PKR-QA: A Benchmark for Procedural Knowledge Reasoning with Knowledge Module Learning
AAAI 2026
MoLoRA: Boosting LLM-based End-to-end Speech Translation with Mixture of Low-rank Experts
AAAI 2026
Audio-Thinker: Guiding Large Audio Language Model When and How to Think via Reinforcement Learning
AAAI 2026
Bidirectional Noise Injection: Enhancing Diffusion Models via Coordinated Input-Output Perturbation
AAAI 2026
FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes
CVPR 2025
OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation
CVPR 2025
GALA: Geometry-Aware Local Adaptive Grids for Detailed 3D Generation
ICLR 2025
LLaVA-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
ICLR 2025
GameArena: Evaluating LLM Reasoning through Live Computer Games
ICLR 2025
Multi-Task Dense Predictions via Unleashing the Power of Diffusion
ICLR 2025
3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery
ICLR 2025
High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity
ICLR 2025
ArcPro: Architectural Programs for Structured 3D Abstraction of Sparse Points
CVPR 2025
Planning with Multi-Constraints via Collaborative Language Agents
COLING 2025
MatryoshkaKV: Adaptive KV Compression via Trainable Orthogonal Projection
ICLR 2025
Scaling Long Context Training Data by Long-Distance Referrals
ICLR 2025
PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms
ICLR 2025
Frame-Voyager: Learning to Query Frames for Video Large Language Models
ICLR 2025
Explaining Domain Shifts in Language: Concept Erasing for Interpretable Image Classification
CVPR 2025
Subteaming and Adaptive Formation Control for Coordinated Multi-Robot Navigation
CORL 2025
PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling
ICCV 2025
IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A
ICCV 2025
TemCoCo: Temporally Consistent Multi-modal Video Fusion with Visual-Semantic Collaboration
ICCV 2025
SDMatte: Grafting Diffusion Models for Interactive Matting
ICCV 2025
GeoPQA: Bridging the Visual Perception Gap in MLLMs for Geometric Reasoning
EMNLP 2025
Sensitivity-LoRA: Low-Load Sensitivity-Based Fine-Tuning for Large Language Models
EMNLP 2025
Sugar-Coated Poison: Benign Generation Unlocks Jailbreaking
EMNLP 2025
MetaMixSpeech: Meta Task Augmentation for Low-Resource Speech Recognition
EMNLP 2025
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning
EMNLP 2025
SolEval: Benchmarking Large Language Models for Repository-level Solidity Smart Contract Generation
EMNLP 2025
Transferable Direct Prompt Injection via Activation-Guided MCMC Sampling
EMNLP 2025
Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning
AAAI 2025
Cross-Modal Stealth: A Coarse-to-Fine Attack Framework for RGB-T Tracker
AAAI 2025
Boosting Vision State Space Model with Fractal Scanning
AAAI 2025
Scalable Trajectory-User Linking with Dual-Stream Representation Networks
AAAI 2025
Efficient Constraint-based Window Causal Graph Discovery in Time Series with Multiple Time Lags
IJCAI 2025
Rethinking Federated Graph Learning: A Data Condensation Perspective
IJCAI 2025
Identifying Causal Mechanism Shifts Under Additive Models with Arbitrary Noise
IJCAI 2025
Towards Automatic Sampling of User Behaviors for Sequential Recommender Systems
IJCAI 2025
Fast Video Generation with Sliding Tile Attention
ICML 2025
Learning Adaptive Lighting via Channel-Aware Guidance
ICML 2025
FedSMU: Communication-Efficient and Generalization-Enhanced Federated Learning through Symbolic Model Updates
ICML 2025
SERENA: A Unified Stochastic Recursive Variance Reduced Gradient Framework for Riemannian Non-Convex Optimization
ICML 2025
Local Identifying Causal Relations in the Presence of Latent Variables
ICML 2025
Data-Driven Selection of Instrumental Variables for Additive Nonlinear, Constant Effects Models
ICML 2025
LBI-FL: Low-Bit Integerized Federated Learning with Temporally Dynamic Bit-Width Allocation
ICML 2025
HeightMapNet: Explicit Height Modeling for End-to-End HD Map Learning
WACV 2025
Deduce and Select Evidences with Language Models for Training-Free Video Goal Inference
WACV 2025
Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models
CVPR 2025
ACAttack: Adaptive Cross Attacking RGB-T Tracker via Multi-Modal Response Decoupling
CVPR 2025
Reverse Modeling in Large Language Models
NAACL 2025
SafetyQuizzer: Timely and Dynamic Evaluation on the Safety of LLMs
NAACL 2025
Multi-view Graph Contrastive Learning with Dynamic Self-aware and Cross-sample Topology Augmentation for Brain Disorder Diagnosis
MICCAI 2025
Multiscale Graph and Multi-Step Cross-Frame Mamba for Myocarditis Lesion Segmentation
MICCAI 2025
TrimLLM: Progressive Layer Dropping for Domain-Specific LLMs
ACL 2025
FineReason: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving
ACL 2025
Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger
ACL 2025
Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up
ACL 2025
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models
ACL 2025
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations
ACL 2025
Long-form Hallucination Detection with Self-elicitation
ACL 2025
CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control
ACL 2025
RAPID: Efficient Retrieval-Augmented Long Text Generation with Writing Planning and Information Discovery
ACL 2025
Knowledge-Enhanced Complementary Information Fusion with Temporal Heterogeneous Graph Learning for Disease Prediction
MICCAI 2025
Dispel Darkness for Better Fusion: A Controllable Visual Enhancer based on Cross-modal Conditional Adversarial Learning
CVPR 2024
Parameter-Efficient Conversational Recommender System as a Language Processing Task
EACL 2024
LONG2RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall
EMNLP 2024
MC-indexing: Effective Long Document Retrieval via Multi-view Content-aware Indexing
EMNLP 2024
DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering
EMNLP 2024
Optimizing Code Retrieval: High-Quality and Scalable Dataset Annotation through Large Language Models
EMNLP 2024
DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly
ECCV 2024
Beta-Tuned Timestep Diffusion Model
ECCV 2024
TAPTR: Tracking Any Point with Transformers as Detection
ECCV 2024
SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging
ECCV 2024
Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images
ECCV 2024
DECOLLAGE: 3D Detailization by Controllable, Localized, and Learned Geometry Enhancement
ECCV 2024
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
ECCV 2024
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
ECCV 2024
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
ECCV 2024
Segment and Recognize Anything at Any Granularity
ECCV 2024
UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation
NIPS 2024
Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation
NIPS 2024
Efficiently Learning Significant Fourier Feature Pairs for Statistical Independence Testing
NIPS 2024
ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models
NIPS 2024
TAPTRv2: Attention-based Position Update Improves Tracking Any Point
NIPS 2024
MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs
NIPS 2024
Improving Generalization in Federated Learning with Model-Data Mutual Information Regularization: A Posterior Inference Approach
NIPS 2024
Open-NeRF: Towards Open Vocabulary NeRF Decomposition
WACV 2024
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving
OSDI 2024
Hierarchical multiple instance learning for COPD grading with relatively specific similarity
MICCAI 2024
Neural Network Augmented Kalman Filter for Robust Acoustic Howling Suppression
INTERSPEECH 2024
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
AAAI 2024
Deep Unfolded Network with Intrinsic Supervision for Pan-Sharpening
AAAI 2024
CSL: Class-Agnostic Structure-Constrained Learning for Segmentation Including the Unseen
AAAI 2024
A Robust Mutual-Reinforcing Framework for 3D Multi-Modal Medical Image Fusion Based on Visual-Semantic Consistency
AAAI 2024
Clarifying the Behavior and the Difficulty of Adversarial Training
AAAI 2024
Explaining Generalization Power of a DNN Using Interactive Concepts
AAAI 2024
PointTFA: Training-Free Clustering Adaption for Large 3D Point Cloud Models
IJCAI 2024
Cross-Scale Domain Adaptation with Comprehensive Information for Pansharpening
IJCAI 2024
When Will Gradient Regularization Be Harmful?
ICML 2024
S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton of Articulated Objects from Single Monocular Video
ICML 2024
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
ICML 2024
Online Speculative Decoding
ICML 2024
CLLMs: Consistency Large Language Models
ICML 2024
Improving Adversarial Energy-Based Model via Diffusion Process
ICML 2024
Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model
ACL 2024
LMDX: Language Model-based Document Information Extraction and Localization
ACL 2024
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
ICML 2024
MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving
ICML 2024
Learning Adaptive Kernels for Statistical Independence Tests
AISTATS 2024
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
ICML 2024
InferCept: Efficient Intercept Support for Augmented Large Language Model Inference
ICML 2024
EpiGEN: An Efficient Multi-Api Code GENeration Framework under Enterprise Scenario
COLING 2024
Meta-Adapter for Self-Supervised Speech Models: A Solution to Low-Resource Speech Recognition Challenges
COLING 2024
Learning Implicit Representation for Reconstructing Articulated Objects
ICLR 2024
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
ICLR 2024
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
ICLR 2024
An Instruction Tuning-Based Contrastive Learning Framework for Aspect Sentiment Quad Prediction with Implicit Aspects and Opinions
EMNLP 2024
AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models
EMNLP 2024
CRAYM: Neural Field Optimization via Camera RAY Matching
NIPS 2024
Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model
NIPS 2024
Interfacing Foundation Models' Embeddings
NIPS 2024
Efficient LLM Scheduling by Learning to Rank
NIPS 2024
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
NIPS 2024
MeaCap: Memory-Augmented Zero-shot Image Captioning
CVPR 2024
Revisiting Single Image Reflection Removal In the Wild
CVPR 2024
Multi-Task Dense Prediction via Mixture of Low-Rank Experts
CVPR 2024
Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image Fusion
CVPR 2024
Visual In-Context Prompting
CVPR 2024
Uncovering What Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly
CVPR 2024
Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction
CVPR 2024
MRFS: Mutually Reinforcing Image Fusion and Segmentation
CVPR 2024
DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting
ICCV 2023
De novo Drug Design using Reinforcement Learning with Multiple GPT Agents
NIPS 2023
Segment Everything Everywhere All at Once
NIPS 2023
D²CSG: Unsupervised Learning of Compact CSG Trees with Dual Complements and Dropouts
NIPS 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
NIPS 2023
FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models
NIPS 2023
DiViNeT: 3D Reconstruction from Disparate Views using Neural Template Regularization
NIPS 2023
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
AAAI 2023
Multi-Level Wavelet Mapping Correlation for Statistical Dependence Measurement: Methodology and Performance
AAAI 2023
Differentially Private Nonlinear Causal Discovery from Numerical Data
AAAI 2023
MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction
ACL 2023
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
ACL 2023
UKP-SQuARE v3: A Platform for Multi-Agent QA Research
ACL 2023
QueryForm: A Simple Zero-shot Form Entity Query Framework
ACL 2023
NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in Natural Language Processing
ACL 2023
ZBL2W at SemEval-2023 Task 9: A Multilingual Fine-tuning Model with Data Augmentation for Tweet Intimacy Analysis
ACL 2023
DUTIR at SemEval-2023 Task 10: Semi-supervised Learning for Sexism Detection in English
ACL 2023
ARO-Net: Learning Implicit Fields From Anchored Radial Observations
CVPR 2023
ConZIC: Controllable Zero-Shot Image Captioning by Sampling-Based Polishing
CVPR 2023
Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation
CVPR 2023
MP-Former: Mask-Piloted Transformer for Image Segmentation
CVPR 2023
Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR
CVPR 2023
TLM: Token-Level Masking for Transformers
EMNLP 2023
Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models
EMNLP 2023
Diff-Retinex: Rethinking Low-light Image Enhancement with A Generative Diffusion Model
ICCV 2023
HAL3D: Hierarchical Active Learning for Fine-Grained 3D Part Labeling
ICCV 2023
A Simple Framework for Open-Vocabulary Segmentation and Detection
ICCV 2023
DS-Fusion: Artistic Typography via Discriminated and Stylized Diffusion
ICCV 2023
Detection Transformer with Stable Matching
ICCV 2023
SoftZoo: A Soft Robot Co-design Benchmark For Locomotion In Diverse Environments
ICLR 2023
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
ICLR 2023
MPCFormer: Fast, Performant and Private Transformer Inference with MPC
ICLR 2023
FedCR: Personalized Federated Learning Based on Across-Client Common Representation with Conditional Mutual Information Regularization
ICML 2023
Hybrid AHS: A Hybrid of Kalman Filter and Deep Learning for Acoustic Howling Suppression
INTERSPEECH 2023
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
INTERSPEECH 2023
Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR
INTERSPEECH 2023
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
OSDI 2023
ZBL2W at SemEval-2023 Task 9: A Multilingual Fine-tuning Model with Data Augmentation for Tweet Intimacy Analysis
SEMEVAL 2023
DUTIR at SemEval-2023 Task 10: Semi-supervised Learning for Sexism Detection in English
SEMEVAL 2023
ETR: An Efficient Transformer for Re-Ranking in Visual Place Recognition
WACV 2023
CAPRI-Net: Learning Compact CAD Shapes With Adaptive Primitive Assembly
CVPR 2022
STGN: an Implicit Regularization Method for Learning with Noisy Labels in Natural Language Processing
EMNLP 2022
UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA
IJCNLP 2022
UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA
AACL 2022
RIM-Net: Recursive Implicit Fields for Unsupervised Learning of Hierarchical Shape Structures
CVPR 2022
Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models
EMNLP 2022
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
CVPR 2022
GUTS at SemEval-2022 Task 4: Adversarial Training and Balancing Methods for Patronizing and Condescending Language Detection
SEMEVAL 2022
Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning
OSDI 2022
Quantitative Performance Assessment of CNN Units via Topological Entropy Calculation
ICLR 2022
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
ICLR 2022
Discovering and Explaining the Representation Bottleneck of DNNs
ICLR 2022
GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis
NIPS 2022
A Variational Edge Partition Model for Supervised Graph Representation Learning
NIPS 2022
UNIST: Unpaired Neural Implicit Shape Translation Network
CVPR 2022
Group Contextualization for Video Recognition
CVPR 2022
GUTS at SemEval-2022 Task 4: Adversarial Training and Balancing Methods for Patronizing and Condescending Language Detection
NAACL 2022
Incorporating Instructional Prompts into a Unified Generative Framework for Joint Multiple Intent Detection and Slot Filling
COLING 2022
FCGCL: Fine- and Coarse-Granularity Contrastive Learning for Speech Translation
EMNLP 2022
Residual Similarity Based Conditional Independence Test and Its Application in Causal Discovery
AAAI 2022
Hybrid Neural Networks for On-Device Directional Hearing
AAAI 2022
Quantification and Analysis of Layer-wise and Pixel-wise Information Discarding
ICML 2022
Penalizing Gradient Norm for Efficiently Improving Generalization in Deep Learning
ICML 2022
AMP: Automatically Finding Model Parallel Strategies with Heterogeneity Awareness
NIPS 2022
Attentive Recurrent Network for Low-Latency Active Noise Control
INTERSPEECH 2022
Interventional Training for Out-Of-Distribution Natural Language Understanding
EMNLP 2022
Translate-Train Embracing Translationese Artifacts
ACL 2022
Interventional Video Grounding With Dual Contrastive Learning
CVPR 2021
DECOR-GAN: 3D Shape Detailization by Conditional Refinement
CVPR 2021
Interpreting and Boosting Dropout from a Game-Theoretic View
ICLR 2021
Parallel Attention Network with Sequence Matching for Video Grounding
ACL 2021
Interpreting Multivariate Shapley Interactions in DNNs
AAAI 2021
TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models
ICML 2021
EnsLM: Ensemble Language Model for Data Diversity by Semantic Clustering
ACL 2021
COSY: COunterfactual SYntax for Cross-Lingual Understanding
ACL 2021
Enhancing Consistent Ground Maneuverability by Robot Adaptation to Complex Off-Road Terrains
CORL 2021
3D-FRONT: 3D Furnished Rooms With layOuts and semaNTics
ICCV 2021
A Deep Learning Approach to Multi-Channel and Multi-Microphone Acoustic Echo Cancellation
INTERSPEECH 2021
A Deep Learning Method to Multi-Channel Active Noise Control
INTERSPEECH 2021
A Prototype-Oriented Framework for Unsupervised Domain Adaptation
NIPS 2021
Parallel Attention Network with Sequence Matching for Video Grounding
IJCNLP 2021
PhotoChat: A Human-Human Dialogue Dataset With Photo Sharing Behavior For Joint Image-Text Modeling
IJCNLP 2021
EnsLM: Ensemble Language Model for Data Diversity by Semantic Clustering
IJCNLP 2021
COSY: COunterfactual SYntax for Cross-Lingual Understanding
IJCNLP 2021
GDPNet: Refining Latent Multi-View Graph for Relation Extraction
AAAI 2021
Interpreting Attributions and Interactions of Adversarial Attacks
ICCV 2021
PhotoChat: A Human-Human Dialogue Dataset With Photo Sharing Behavior For Joint Image-Text Modeling
ACL 2021
Building Interpretable Interaction Trees for Deep NLP Models
AAAI 2021
MetaSCI: Scalable and Adaptive Reconstruction for Video Compressive Sensing
CVPR 2021
A Hybrid Seq-2-Seq ASR Design for On-Device and Server Applications
INTERSPEECH 2021
Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning
OSDI 2021
Ada-Segment: Automated Multi-loss Adaptation for Panoptic Segmentation
AAAI 2021
D2IM-Net: Learning Detail Disentangled Implicit Fields From Single Images
CVPR 2021
Roof-GAN: Learning To Generate Roof Geometry and Relations for Residential Houses
CVPR 2021
LayoutGMN: Neural Graph Matching for Structural Layout Similarity
CVPR 2021
Memory-Efficient Network for Large-Scale Video Compressive Sensing
CVPR 2021
Bayesian Deep Graph Matching for Correspondence Identification in Collaborative Perception
RSS 2021
Appearance-Motion Memory Consistency Network for Video Anomaly Detection
AAAI 2021
Testing Independence Between Linear Combinations for Causal Discovery
AAAI 2021
GANHopper: Multi-Hop GAN for Unsupervised Image-to-Image Translation
ECCV 2020
RoboCoDraw: Robotic Avatar Drawing with GAN-Based Style Transfer and Time-Efficient Path Optimization
AAAI 2020
Long-Term Loop Closure Detection through Visual-Spatial Information Preserving Multi-Order Graph Matching
AAAI 2020
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning
AAAI 2020
AdaCoSeg: Adaptive Shape Co-Segmentation With Group Consistency Loss
CVPR 2020
Variational Hetero-Encoder Randomized GANs for Joint Image-Text Modeling
ICLR 2020
Interpretable Complex-Valued Neural Networks for Privacy Protection
ICLR 2020
BSP-Net: Generating Compact Meshes via Binary Space Partitioning
CVPR 2020
PQ-NET: A Generative Part Seq2Seq Network for 3D Shapes
CVPR 2020
Semi-supervised URL Segmentation with Recurrent Neural Networks Pre-trained on Knowledge Graph Entities
COLING 2020
Learning Dynamic Hierarchical Topic Graph with Graph Convolutional Network for Document Classification
AISTATS 2020
Multi-source Meta Transfer for Low Resource Multiple-Choice Question Answering
ACL 2020
Span-based Localizing Network for Natural Language Video Localization
ACL 2020
Speeding up Very Fast Decision Tree with Low Computational Cost
IJCAI 2020
A Deep Learning Approach to Active Noise Control
INTERSPEECH 2020
PIE-NET: Parametric Inference of Point Cloud Edges
NIPS 2020
Bidirectional Convolutional Poisson Gamma Dynamical Systems
NIPS 2020
Students Need More Attention: BERT-based Attention Model for Small Data with Application to Automatic Patient Message Triage
MLHC 2020
AutoSync: Learning to Synchronize for Data-Parallel Distributed Deep Learning
NIPS 2020
Leading Multi-Agent Teams to Multiple Goals While Maintaining Communication
RSS 2020
Regularized Graph Matching for Correspondence Identification under Uncertainty in Collaborative Perception
RSS 2020
Deep Relational Topic Modeling via Graph Poisson Gamma Belief Network
NIPS 2020
Friendly Topic Assistant for Transformer Based Abstractive Summarization
EMNLP 2020
Rethinking the Image Fusion: A Fast Unified Image Fusion Network based on Proportional Maintenance of Gradient and Intensity
AAAI 2020
FDN: Feature Decoupling Network for Head Pose Estimation
AAAI 2020
BIRNAT: Bidirectional Recurrent Neural Networks with Adversarial Training for Video Snapshot Compressive Imaging
ECCV 2020
DR-KFS: A Differentiable Visual Similarity Metric for 3D Shape Reconstruction
ECCV 2020
Learning Implicit Fields for Generative Shape Modeling
CVPR 2019
Deep Learning for Joint Acoustic Echo and Noise Cancellation with Nonlinear Distortions
INTERSPEECH 2019
Dual Encoder Classifier Models as Constraints in Neural Text Normalization
INTERSPEECH 2019
Multiple Noisy Label Distribution Propagation for Crowdsourcing
IJCAI 2019
Dual Adversarial Neural Transfer for Low-Resource Named Entity Recognition
ACL 2019
Visual Place Recognition via Robust ℓ2-Norm Distance Based Holism and Landmark Integration
AAAI 2019
CompoNet: Learning to Generate the Unseen by Part Synthesis and Composition
ICCV 2019
Recursively Learning Causal Structures Using Regression-Based Conditional Independence Test
AAAI 2019
BAE-NET: Branched Autoencoder for Shape Co-Segmentation
ICCV 2019
Toward Understanding the Impact of Staleness in Distributed Machine Learning
ICLR 2019
AutoLoss: Learning Discrete Schedule for Alternate Optimization
ICLR 2019
Improving Performance of End-to-End ASR on Numeric Sequences
INTERSPEECH 2019
Deep Learning for Acoustic Echo Cancellation in Noisy and Double-Talk Scenarios
INTERSPEECH 2018
Fast and Accurate Reordering with ITG Transition RNN
COLING 2018
SketchyScene: Richly-Annotated Scene Sketches
ECCV 2018
Learning Multi-Instance Enriched Image Representations via Non-Greedy Ratio Maximization of the l1-Norm Distances
CVPR 2018
WHAI: Weibull Hybrid Autoencoding Inference for Deep Topic Modeling
ICLR 2018
UKP-Athene: Multi-Sentence Textual Entailment for Claim Verification
EMNLP 2018
Generative Semantic Manipulation with Mask-Contrasting GAN
ECCV 2018
Deep Poisson gamma dynamical systems
NIPS 2018
Symbolic Graph Reasoning Meets Convolutions
NIPS 2018
Cross-Corpora Convolutional Deep Neural Network Dereverberation Preprocessing for Speaker Verification and Speech Enhancement
INTERSPEECH 2018
DualGAN: Unsupervised Dual Learning for Image-To-Image Translation
ICCV 2017
Recurrent Topic-Transition GAN for Visual Paragraph Generation
ICCV 2017
Structured Generative Adversarial Networks
NIPS 2017
Development of Mandarin Onset-Rime Detection in Relation to Age and Pinyin Instruction
INTERSPEECH 2016
Robust Multimodal Sequence-Based Loop Closure Detection via Structured Sparsity
RSS 2016
On the Reducibility of Submodular Functions
AISTATS 2016
The Influence of Language Experience on the Categorical Perception of Vowels: Evidence from Mandarin and Korean
INTERSPEECH 2016
Learning Concept Taxonomies from Multi-modal Data
ACL 2016
Enforcing Template Representability and Temporal Consistency for Adaptive Sparse Tracking
IJCAI 2016
HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition
ICCV 2015
Simplex-Based 3D Spatio-Temporal Feature Description for Action Recognition
CVPR 2014
Enforcing Structural Diversity in Cube-pruned Dependency Parsing
ACL 2014
Sparse Dictionary Learning for Edit Propagation of High-Resolution Images
CVPR 2014
Online Learning for Inexact Hypergraph Search
EMNLP 2013
Universal Dependency Annotation for Multilingual Parsing
ACL 2013
Generalized Higher-Order Dependency Parsing with Cube Pruning
EMNLP 2012
NiuTrans: An Open Source Toolkit for Phrase-based and Syntax-based Machine Translation
ACL 2012
Generalized Higher-Order Dependency Parsing with Cube Pruning
CONLL 2012
Binarized Forest to String Translation
ACL 2011
An Empirical Study of Translation Rule Extraction with Multiple Parsers
COLING 2010
Efficient Multi-Pass Decoding for Synchronous Context Free Grammars
ACL 2008
Bayesian Learning of Non-Compositional Phrases with Synchronous Parsing
ACL 2008
Extracting Synchronous Grammar Rules From Word-Level Alignments in Linear Time
COLING 2008
Inducing Word Alignments with Bilexical Synchronous Trees
ACL 2006
Factoring Synchronous Grammars by Sorting
COLING 2006
Efficient Search for Inversion Transduction Grammar
EMNLP 2006
Inducing Word Alignments with Bilexical Synchronous Trees
COLING 2006
Factoring Synchronous Grammars by Sorting
ACL 2006
Synchronous Binarization for Machine Translation
NAACL 2006
Stochastic Lexicalized Inversion Transduction Grammar for Alignment
ACL 2005
Syntax-Based Alignment: Supervised or Unsupervised?
COLING 2004