Yang Zhang
220 papers · 2006–2026 · 23 top CS/AI conferences
Achievements
Taxonomy Completionist (31)
Keyword Pioneer
Renaissance Researcher (8)
Interdisciplinary Bridge
Conference Polyglot (22)
Conference Loyalist (21)
Dynamic Duo (39)
Triple Crown
Keyword Champion
Grand Slam
Mega-Team (32)
Deep Specialist (20)
Trend Setter
Conference Pioneer
Unstoppable (16)
The Questioner (5)
Century Club (194)
Keyword Collector (85)
Prolific Year (41)
Conferences
ACL (29)
ICML (27)
AAAI (26)
NIPS (18)
EMNLP (18)
INTERSPEECH (15)
IJCAI (13)
ICCV (13)
CVPR (13)
ICLR (11)
NAACL (8)
EACL (5)
ECCV (5)
COLING (5)
IJCNLP (3)
WACV (3)
OSDI (2)
CORL (1)
AISTATS (1)
ACML (1)
MICCAI (1)
AACL (1)
NSDI (1)
Research topics
Keywords
large language model (28)
representation learning (11)
diffusion model (9)
adversarial attack (9)
contrastive learning (7)
neural network (7)
jailbreak attack (6)
lottery ticket hypothesis (6)
model compression (6)
transfer learning (6)
language model (6)
generative model (6)
few-shot learning (6)
adversarial robustness (6)
self-supervised learning (6)
reinforcement learning (6)
semantic segmentation (5)
attention mechanism (5)
zero-shot learning (5)
recommendation system (5)
Papers
Right Branches Matter in Failure-based Variable Ordering Heuristics
AAAI 2026
SL-CBM: Enhancing Concept Bottleneck Models with Semantic Locality for Better Interpretability
AAAI 2026
DE-CLIP: Few-Shot Anomaly Detection via Difference-Guided Embedding Editing
ACL 2026
Open Schrödinger's Closed Box: Identifying Retrieval Augmented Generation in API-Accessible Large Language Model Services
ACL 2026
OCR-Memory: Optical Context Retrieval for Long-Horizon Agent Memory
ACL 2026
Beyond Random Sampling: Efficient Language Model Pretraining via Curriculum Learning
EACL 2026
A Reinforcement Learning Framework for Robust and Secure LLM Watermarking
EACL 2026
Coordinated Humanoid Robot Locomotion with Symmetry Equivariant Reinforcement Learning Policy
AAAI 2026
Plug-and-Play Parameter-Efficient Tuning of Embeddings for Federated Recommendation
AAAI 2026
Hyperbolic-Enhanced Mixture-of-Experts Mamba for Sequential Recommendation
AAAI 2026
Seeing but Not Thinking: Routing Distraction in Multimodal Mixture-of-Experts
ACL 2026
Pruning Unsafe Tickets: A Resource-Efficient Framework for Safer and More Robust LLMs
ACL 2026
Subspace-Aware Graph Construction and Contrastive Alignment for Multimodal Recommendation with Large Language Models
AAAI 2026
Defeating Cerberus: Privacy-Leakage Mitigation in Vision Language Models
EACL 2026
The Curse of Verbalization: How Presentation Order Constrains LLM Reasoning
EACL 2026
UniToolBench: A Benchmark for Tool-Augmented LLMs in Cross-Domain, Universal Task Automation
EACL 2026
Scene Experts: Specializing in 3D Gaussian Splatting with Adaptive Decomposition
AAAI 2026
SAME: Spatial-Aware Multimodal Egocentric Human Pose Estimation
AAAI 2026
Don't Start Over: A Cost-Effective Framework for Migrating Personalized Prompts Between LLMs
AAAI 2026
UDA: Unsupervised Debiasing Alignment for Pair-wise LLM-as-a-Judge
AAAI 2026
QiMeng-CRUX: Narrowing the Gap Between Natural Language and Verilog via Core Refined Understanding eXpression
AAAI 2026
Keep On Going: Learning Robust Humanoid Motion Skills via Selective Adversarial Training
AAAI 2026
HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy
ACL 2026
Lost in the Mix: Evaluating LLM Understanding of Code-Switched Text
ACL 2026
Tailored Primitive Initialization is the Secret Key to Reinforcement Learning
ACL 2026
Pre-Trained Video Generative Models as World Simulators
AAAI 2026
SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation
ICLR 2025
Q-Supervised Contrastive Representation: A State Decoupling Framework for Safe Offline Reinforcement Learning
ICML 2025
CommVQ: Commutative Vector Quantization for KV Cache Compression
ICML 2025
Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner
ICML 2025
A Hitchhiker's Guide to Scaling Law Estimation
ICML 2025
CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance
ICML 2025
Contrastive Forward Prediction Reinforcement Learning for Adaptive Fault-Tolerant Legged Robots
CORL 2025
EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs
AAAI 2025
DoGA: Enhancing Grounded Object Detection via Grouped Pre-Training with Attributes
AAAI 2025
VIoTGPT: Learning to Schedule Vision Tools Towards Intelligent Video Internet of Things
AAAI 2025
Behavior Importance-Aware Graph Neural Architecture Search for Cross-Domain Recommendation
AAAI 2025
SIDE: Socially Informed Drought Estimation Toward Understanding Societal Impact Dynamics of Environmental Crisis
AAAI 2025
Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
AACL 2025
When GPT Spills the Tea: Comprehensive Assessment of Knowledge File Leakage in GPTs
ACL 2025
JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs
ACL 2025
Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media
ACL 2025
A Self-Denoising Model for Robust Few-Shot Relation Extraction
ACL 2025
Online Iterative Self-Alignment for Radiology Report Generation
ACL 2025
Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration
ACL 2025
K-order Ranking Preference Optimization for Large Language Models
ACL 2025
Customizing In-context Learning for Dynamic Interest Adaption in LLM-based Recommendation
ACL 2025
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning
ACL 2025
Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization
ACL 2025
PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play
ACL 2025
CLIP-Fusion: A Spatio-Temporal Quality Metric for Frame Interpolation
WACV 2025
Evolution of Aegis: Fault Diagnosis for AI Model Training Service in Production
NSDI 2025
Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression
CVPR 2025
Large Language Models Can Solve Real-World Planning Rigorously with Formal Verification Tools
NAACL 2025
A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation
NAACL 2025
Automated Characterization of Myocardial Scar Topological Patterns for Ventricular Tachycardia Screening
MICCAI 2025
Latent Inter-User Difference Modeling for LLM Personalization
EMNLP 2025
Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification
EMNLP 2025
Decoding in Latent Spaces for Efficient Inference in LLM-based Recommendation
EMNLP 2025
Augment before You Try: Knowledge-Enhanced Table Question Answering via Table Expansion
EMNLP 2025
Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
IJCNLP 2025
A Multimodal AI Dialogue System for Unified Document, Visual, and Audio Interaction
IJCAI 2025
Bidirectional Human-AI Collaboration for Equitable Student Performance Prediction via Deep Uncertainty Learning
IJCAI 2025
Token-Level Accept or Reject: A Micro Alignment Approach for Large Language Models
IJCAI 2025
Anti-Tamper Protection for Unauthorized Individual Image Generation
ICCV 2025
LDIP: Long Distance Information Propagation for Video Super-Resolution
ICCV 2025
VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for MLLMs
ICCV 2025
Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation
ICCV 2025
LOTA: Bit-Planes Guided AI-Generated Image Detection
ICCV 2025
Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions
ICCV 2025
Event-guided HDR Reconstruction with Diffusion Priors
ICCV 2025
Agreement aware and dissimilarity oriented GLOM
ICCV 2025
Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning
IJCAI 2025
AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N
ICML 2025
The Ripple Effect: On Unforeseen Complications of Backdoor Attacks
ICML 2025
Online Preference Alignment for Language Models via Count-based Exploration
ICLR 2025
Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning
ICLR 2025
DeeperForward: Enhanced Forward-Forward Training for Deeper and Better Performance
ICLR 2025
Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming
ICLR 2025
Minimalist Concept Erasure in Generative Models
ICML 2025
Are LLM-based Evaluators Confusing NLG Quality Criteria?
ACL 2024
Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments
ACL 2024
Fair Federated Learning with Biased Vision-Language Models
ACL 2024
Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction
ACL 2024
Advancing the Robustness of Large Language Models through Self-Denoised Smoothing
NAACL 2024
Evidence-Driven Retrieval Augmented Response Generation for Online Misinformation
NAACL 2024
Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models
NAACL 2024
HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion
NIPS 2024
PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based Sampling
EMNLP 2024
Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models
EMNLP 2024
Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective
EMNLP 2024
Decoding Matters: Addressing Amplification Bias and Homogeneity Issue in Recommendations for Large Language Models
EMNLP 2024
ModSCAN: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities
EMNLP 2024
Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks
EMNLP 2024
The Death and Life of Great Prompts: Analyzing the Evolution of LLM Prompts from the Structural Perspective
EMNLP 2024
Text-like Encoding of Collaborative Information in Large Language Models for Recommendation
ACL 2024
Polyper: Boundary Sensitive Polyp Segmentation
AAAI 2024
Collaborative Weakly Supervised Video Correlation Learning for Procedure-Aware Instructional Video Analysis
AAAI 2024
MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning
NIPS 2024
The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline
ICML 2024
Accurate LoRA-Finetuning Quantization of LLMs via Information Retention
ICML 2024
Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling
ICML 2024
Speech Self-Supervised Learning Using Diffusion Model Synthetic Data
ICML 2024
Memory-Efficient Gradient Unrolling for Large-Scale Bi-level Optimization
NIPS 2024
Learning Distinguishable Trajectory Representation with Contrastive Loss
NIPS 2024
Aegis: An Advanced LLM-Based Multi-Agent for Intelligent Functional Safety Engineering
EMNLP 2024
Generated Distributions Are All You Need for Membership Inference Attacks Against Generative Models
WACV 2024
Investigating Layer Importance in Large Language Models
EMNLP 2024
LSH-MoE: Communication-efficient MoE Training via Locality-Sensitive Hashing
NIPS 2024
ProgressGym: Alignment with a Millennium of Moral Progress
NIPS 2024
Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference
NIPS 2024
Correcting Diffusion Generation through Resampling
CVPR 2024
APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation
CVPR 2024
HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations
CVPR 2024
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
ICML 2024
Continual Compositional Zero-Shot Learning
IJCAI 2024
EquiPocket: an E(3)-Equivariant Geometric Graph Neural Network for Ligand Binding Site Prediction
ICML 2024
TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning
ECCV 2024
CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing
ECCV 2024
Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models
ECCV 2024
Composite Backdoor Attacks Against Large Language Models
NAACL 2024
NL2TL: Transforming Natural Languages to Temporal Logics using Large Language Models
EMNLP 2023
Dual Memory Aggregation Network for Event-Based Object Detection with Learnable Representation
AAAI 2023
Pseudo Label-Guided Model Inversion Attack via Conditional Generative Adversarial Network
AAAI 2023
A Crowd-AI Collaborative Duo Relational Graph Learning Framework towards Social Impact Aware Photo Classification
AAAI 2023
MetaAdapt: Domain Adaptive Few-Shot Misinformation Detection via Meta Learning
ACL 2023
NOTABLE: Transferable Backdoor Attacks Against Prompt-based NLP Models
ACL 2023
SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency
CVPR 2023
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
CVPR 2023
Can't Steal? Cont-Steal! Contrastive Stealing Attacks Against Image Encoders
CVPR 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
ICCV 2023
TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization
ICLR 2023
Is Adversarial Training Really a Silver Bullet for Mitigating Data Poisoning?
ICLR 2023
PromptBoosting: Black-Box Text Classification with Ten Forward Passes
ICML 2023
Generated Graph Detection
ICML 2023
Data Poisoning Attacks Against Multimodal Encoders
ICML 2023
Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning
ICML 2023
Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models
ICML 2023
On Adversarial Robustness of Demographic Fairness in Face Attribute Recognition
IJCAI 2023
Long-term Wind Power Forecasting with Hierarchical Spatial-Temporal Transformer
IJCAI 2023
On Optimizing Model Generality in AI-based Disaster Damage Assessment: A Subjective Logic-driven Crowd-AI Hybrid Learning Approach
IJCAI 2023
Unifying Margin-Based Softmax Losses in Face Recognition
WACV 2023
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings
NAACL 2022
Amplifying Membership Exposure via Data Poisoning
NIPS 2022
Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech
INTERSPEECH 2022
WavPrompt: Towards Few-Shot Spoken Language Understanding with Frozen Language Models
INTERSPEECH 2022
Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization
INTERSPEECH 2022
Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition
INTERSPEECH 2022
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
INTERSPEECH 2022
Data-Efficient Double-Win Lottery Tickets from Robust Pre-training
ICML 2022
BiFSMN: Binary Neural Network for Keyword Spotting
IJCAI 2022
On Attacking Out-Domain Uncertainty Estimation in Deep Neural Networks
IJCAI 2022
Crowd, Expert & AI: A Human-AI Interactive Approach Towards Natural Language Explanation Based COVID-19 Misinformation Detection
IJCAI 2022
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers
ICML 2022
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing
NIPS 2022
Fairness Reprogramming
NIPS 2022
An Adversarial Framework for Generating Unseen Images by Activation Maximization
AAAI 2022
TempFormer: Temporally Consistent Transformer for Video Denoising
ECCV 2022
Linking Emergent and Natural Languages via Corpus Transfer
ICLR 2022
Adversarial Support Alignment
ICLR 2022
Semi-Leak: Membership Inference Attacks against Semi-Supervised Learning
ECCV 2022
Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators
ICML 2021
User Retention: A Causal Approach with Triple Task Modeling
IJCAI 2021
Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks
NIPS 2021
Understanding Interlocking Dynamics of Cooperative Rationalization
NIPS 2021
A General Recurrent Tracking Framework Without Real Data
ICCV 2021
SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-Powered Intelligent PhlatCam
ICCV 2021
Frustratingly Simple Few-Shot Slot Tagging
IJCNLP 2021
Zero-Shot Cross-Lingual Phonetic Recognition with External Language Embedding
INTERSPEECH 2021
Speech Denoising with Auditory Models
INTERSPEECH 2021
Hi-Fi Multi-Speaker English TTS Dataset
INTERSPEECH 2021
Voting for the Right Answer: Adversarial Defense for Speaker Verification
INTERSPEECH 2021
NeMo Inverse Text Normalization: From Development to Production
INTERSPEECH 2021
NeMo (Inverse) Text Normalization: From Development to Production
INTERSPEECH 2021
The Lottery Tickets Hypothesis for Supervised and Self-Supervised Pre-Training in Computer Vision Models
CVPR 2021
Panoptic-PolarNet: Proposal-Free LiDAR Point Cloud Panoptic Segmentation
CVPR 2021
Frustratingly Simple Few-Shot Slot Tagging
ACL 2021
Coordination Between Individual Agents in Multi-Agent Reinforcement Learning
AAAI 2021
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition
NIPS 2021
BCORLE($\lambda$): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market
NIPS 2021
Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information
NIPS 2021
Global Prosody Style Transfer Without Text Transcriptions
ICML 2021
Unsupervised Speech Decomposition via Triple Information Bottleneck
ICML 2020
Deep Symbolic Superoptimization Without Human Knowledge
ICLR 2020
FASTMATCH: Accelerating the Inference of BERT-based Text Matching
COLING 2020
Dual Attention Model for Citation Recommendation
COLING 2020
SQL Generation via Machine Reading Comprehension
COLING 2020
Invariant Rationalization
ICML 2020
The Lottery Ticket Hypothesis for Pre-trained BERT Networks
NIPS 2020
Crowd-Assisted Disaster Scene Assessment with Human-AI Interactive Attention
AAAI 2020
AntMan: Dynamic Scaling on GPU Clusters for Deep Learning
OSDI 2020
BioMegatron: Larger Biomedical Domain Language Model
EMNLP 2020
Mention Extraction and Linking for SQL Query Generation
EMNLP 2020
Copy and Paste GAN: Face Hallucination From Shaded Thumbnails
CVPR 2020
PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation
CVPR 2020
Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control
IJCNLP 2019
Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control
EMNLP 2019
Domain Randomization and Pyramid Consistency: Simulation-to-Real Generalization Without Accessing Target Domain Data
ICCV 2019
CAMOU: Learning Physical Vehicle Camouflages to Adversarially Attack Detectors in the Wild
ICLR 2019
A Game Theoretic Approach to Class-wise Selective Rationalization
NIPS 2019
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
ICML 2019
Fairwalk: Towards Fair Graph Embedding
IJCAI 2019
VAE-Based Regularization for Deep Speaker Embedding
INTERSPEECH 2019
Adaptive Learning of Local Semantic and Global Structure Representations for Text Classification
COLING 2018
LSTM Based Cross-corpus and Cross-task Acoustic Emotion Recognition
INTERSPEECH 2018
A Theoretical Explanation for Perplexing Behaviors of Backpropagation-based Visualizations
ICML 2018
Speech Enhancement Using Bayesian Wavenet
INTERSPEECH 2017
Dilated Recurrent Neural Networks
NIPS 2017
Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes
ICCV 2017
Sentiment Lexicon Expansion Based on Neural PU Learning, Double Dictionary Lookup, and Polarity Association
EMNLP 2017
Glottal Model Based Speech Beamforming for ad-hoc Microphone Arrays
INTERSPEECH 2017
Fast Zero-Shot Image Tagging
CVPR 2016
Clustering Sentences with Density Peaks for Multi-document Summarization
NAACL 2015
Extracting More Concurrency from Distributed Transactions
OSDI 2014
Scene Text Recognition Using Part-Based Tree-Structured Character Detection
CVPR 2013
Probabilistic acoustic tube: a probabilistic generative model of speech for speech analysis/synthesis
AISTATS 2012
Why Press Backspace? Understanding User Input Behaviors in Chinese Pinyin Input Method
ACL 2011
Decision Tree for Dynamic and Uncertain Data Streams
ACML 2010
Exploring Distributional Similarity Based Models for Query Spelling Correction
ACL 2006
Exploring Distributional Similarity Based Models for Query Spelling Correction
COLING 2006