Yan Zhang
153 papers · 2002–2026 · 22 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (29) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) π£ Hot Topic Early Bird
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(29)
π
Conference Loyalist
(26)
π€
Dynamic Duo
(20)
π
Triple Crown
π§¬
Topic Evolution
π
Grand Slam
π₯
Mega-Team
(20)
π¬
Deep Specialist
(18)
π
Keyword Champion
π
Conference Pioneer
β‘
Prolific Year
(28)
π₯
Unstoppable
(14)
ποΈ
Keyword Collector
(60)
π
Century Club
(143)
π
Trend Setter
β
The Questioner
(6)
Conferences
EMNLP (26)
AAAI (22)
ACL (20)
CVPR (16)
IJCAI (10)
ICLR (9)
ICCV (9)
ICML (8)
NAACL (5)
ECCV (4)
COLING (4)
IJCNLP (4)
INTERSPEECH (4)
NIPS (4)
CONLL (1)
AISTATS (1)
ACML (1)
L4DC (1)
MICCAI (1)
MIDL (1)
UAI (1)
WACV (1)
Top co-authors
Keywords
large language model
(15)
contrastive learning
(9)
graph neural network
(8)
representation learning
(8)
retrieval-augmented generation
(6)
semi-supervised learning
(6)
multimodal learning
(6)
in-context learning
(5)
zero-shot learning
(5)
neural network
(5)
reinforcement learning
(5)
named entity recognition
(5)
unsupervised learning
(5)
diffusion model
(5)
machine translation
(4)
variational autoencoder
(4)
knowledge distillation
(4)
deep learning
(4)
point cloud
(4)
knowledge graph
(4)
Papers
SynPlay: Large-Scale Synthetic Human Data with Real-World Diversity for Aerial-View Perception
WACV 2026
COSMOS: Connectivity-Oriented Submodular Maximization for Optimal Subgraph Retrieval
ACL 2026
LiGen: Active Lipid Generation via a Molecular Language Model
ACL 2026
Act as you think: Reinforcing Consistent Reasoning in Medical Visual Question Answering
ACL 2026
Mitigating Error Accumulation in Knowledge Editing for Multi-Hop Question Answering
AAAI 2026
SegMem-RAG: Adaptive Memory for Retrieval-Augmented Generation in Open-Ended Knowledge Environments
AAAI 2026
SIAM: Towards Generalizable Articulated Object Modeling via Single Robot-Object Interaction
AAAI 2026
Graph-Driven Domain Co-Adaptation for Cross-Domain Image Quality Assessment
AAAI 2026
Beyond Counting: Evaluating Abstract and Emotional Reasoning in Vision-Language Models
AAAI 2026
LookFlow: Training-Free and Efficient High-Resolution Image Synthesis via Dynamic Lookahead Guidance Flow
AAAI 2026
UniFit: Towards Universal Virtual Try-on with MLLM-Guided Semantic Alignment
AAAI 2026
A Theory for Conditional Generative Modeling on Multiple Data Sources
ICML 2025
Learning Concept Prerequisite Relation via Global Knowledge Relation Optimization
AAAI 2025
BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution
AAAI 2025
Feature Denoising Diffusion Model for Blind Image Quality Assessment
AAAI 2025
Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
AAAI 2025
APKGC: Noise-enhanced Multi-Modal Knowledge Graph Completion with Attention Penalty
AAAI 2025
Towards Macro-AUC Oriented Imbalanced Multi-Label Continual Learning
AAAI 2025
M-MAD: Multidimensional Multi-Agent Debate for Advanced Machine Translation Evaluation
ACL 2025
HSCR: Hierarchical Self-Contrastive Rewarding for Aligning Medical Vision Language Models
ACL 2025
Unveil: Unified Visual-Textual Integration and Distillation for Multi-modal Document Retrieval
ACL 2025
Enhancing Retrieval-Augmented Generation via Evidence Tree Search
ACL 2025
MΒ³GQA: A Multi-Entity Multi-Hop Multi-Setting Graph Question Answering Benchmark
ACL 2025
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs
ACL 2025
Fast or Slow? Integrating Fast Intuition and Deliberate Thinking for Enhancing Visual Question Answering
ACL 2025
Mitigating Posterior Salience Attenuation in Long-Context LLMs with Positional Contrastive Decoding
ACL 2025
FairSteer: Inference Time Debiasing for LLMs with Dynamic Activation Steering
ACL 2025
Retrieval Augmented Instruction Tuning for Open NER with Large Language Models
COLING 2025
Distilling Spatially-Heterogeneous Distortion Perception for Blind Image Quality Assessment
CVPR 2025
UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning
CVPR 2025
SerialGen: Personalized Image Generation by First Standardization Then Personalization
CVPR 2025
CoA: Towards Real Image Dehazing via Compression-and-Adaptation
CVPR 2025
DecoupleSearch: Decouple Planning and Search via Hierarchical Reward Modeling
EMNLP 2025
Tuning Less, Prompting More: In-Context Preference Learning Pipeline for Natural Language Transformation
EMNLP 2025
ESGenius: Benchmarking LLMs on Environmental, Social, and Governance (ESG) and Sustainability Knowledge
EMNLP 2025
MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning
EMNLP 2025
ESCNet:Edge-Semantic Collaborative Network for Camouflaged Object Detection
ICCV 2025
Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models
ICCV 2025
PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning
ICCV 2025
ProtPainter: Draw or Drag Protein via Topology-guided Diffusion
ICLR 2025
SysBench: Can LLMs Follow System Message?
ICLR 2025
Improving Equivariant Networks with Probabilistic Symmetry Breaking
ICLR 2025
Modality-Fair Preference Optimization for Trustworthy MLLM Alignment
IJCAI 2025
MAGE: Multimodal Alignment and Generation Enhancement via Bridging Visual and Semantic Spaces
IJCAI 2025
SCOUT: Semi-supervised Camouflaged Object Detection by Utilizing Text and Adaptive Data Selection
IJCAI 2025
Towards Micro-Action Recognition with Limited Annotations: An Asynchronous Pseudo Labeling and Training Approach
IJCAI 2025
DGCPL: Dual Graph Distillation for Concept Prerequisite Relation Learning
IJCAI 2025
Knowing or Guessing? Robust Medical Visual Question Answering via Joint Consistency and Contrastive Learning
MICCAI 2025
Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction
NAACL 2025
TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement
NAACL 2025
Retrieved In-Context Principles from Previous Mistakes
EMNLP 2024
Towards Verifiable Text Generation with Evolving Memory and Self-Reflection
EMNLP 2024
DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models
EMNLP 2024
Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level
EMNLP 2024
CELLO: Causal Evaluation of Large Vision-Language Models
EMNLP 2024
Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models
EMNLP 2024
Quantifying and Mitigating Unimodal Biases in Multimodal Large Language Models: A Causal Perspective
EMNLP 2024
Med-Tuning: A New Parameter-Efficient Tuning Framework for Medical Volumetric Segmentation
MIDL 2024
EgoGen: An Egocentric Synthetic Data Generator
CVPR 2024
Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models
NAACL 2024
UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking
NAACL 2024
Degrees of Freedom Matter: Inferring Dynamics from Point Trajectories
CVPR 2024
RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method
CVPR 2024
CrossTune: Black-Box Few-Shot Classification with Label Enhancement
COLING 2024
Improving Large Language Models in Event Relation Logical Prediction
ACL 2024
Semi-Supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning
AAAI 2024
Object centric architectures enable efficient causal representation learning
ICLR 2024
DiffAIL: Diffusion Adversarial Imitation Learning
AAAI 2024
Improved Generalization of Weight Space Networks via Augmentations
ICML 2024
Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment
ICML 2024
Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity
ICML 2024
Unsupervised Concept Discovery Mitigates Spurious Correlations
ICML 2024
FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis
INTERSPEECH 2024
LiDAR-Net: A Real-scanned 3D Point Cloud Dataset for Indoor Scenes
CVPR 2024
RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-Identification
NIPS 2024
Graph Neural Networks for Learning Equivariant Representations of Neural Networks
ICLR 2024
Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives
CVPR 2024
AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning
EMNLP 2024
CrossSplit: Mitigating Label Noise Memorization through Data Splitting
ICML 2023
Data-Efficient Image Quality Assessment with Attention-Panel Decoder
AAAI 2023
Synthesizing Diverse Human Motions in 3D Indoor Scenes
ICCV 2023
Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views
ICCV 2023
Cascading Bandits: Optimizing Recommendation Frequency in Delayed Feedback Environments
NIPS 2023
CHEER: Centrality-aware High-order Event Reasoning Network for Document-level Event Causality Identification
ACL 2023
History Semantic Graph Enhanced Conversational KBQA with Temporal Information Modeling
ACL 2023
Empirical Study of Zero-Shot NER with ChatGPT
EMNLP 2023
Allies: Prompting Large Language Model with Beam Search
EMNLP 2023
How Well Do Text Embedding Models Understand Syntax?
EMNLP 2023
Unlocking Slot Attention by Changing Optimal Transport Costs
ICML 2023
Equivariance with Learned Canonicalization Functions
ICML 2023
The Wanderings of Odysseus in 3D Scenes
CVPR 2022
Small Changes Make Big Differences: Improving Multi-turn Response Selection in Dialogue Systems via Fine-Grained Contrastive Learning
INTERSPEECH 2022
EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices
ECCV 2022
SAGA: Stochastic Whole-Body Grasping with Contact
ECCV 2022
Compositional Human-Scene Interaction Synthesis with Semantic Control
ECCV 2022
Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric Segmentation
ECCV 2022
Analyzing and Evaluating Faithfulness in Dialogue Summarization
EMNLP 2022
Generate, Discriminate and Contrast: A Semi-Supervised Sentence Representation Learning Framework
EMNLP 2022
Contrastive latent variable models for neural text generation
UAI 2022
Multiset-Equivariant Set Prediction with Approximate Implicit Differentiation
ICLR 2022
IAM: A Comprehensive and Large-Scale Dataset for Integrated Argument Mining Tasks
ACL 2022
Neural Enhanced Dynamic Message Passing
AISTATS 2022
ERGO: Event Relational Graph Transformer for Document-level Event Causality Identification
COLING 2022
Pay More Attention to History: A Context Modeling Strategy for Conversational Text-to-SQL
INTERSPEECH 2022
Bootstrapped Unsupervised Sentence Representation Learning
IJCNLP 2021
ChicHealth @ MEDIQA 2021: Exploring the limits of pre-trained seq2seq models for medical summarization
NAACL 2021
Keep the Structure: A Latent Shift-Reduce Parser for Semantic Parsing
IJCAI 2021
Learning from History: Modeling Temporal Knowledge Graphs with Sequential Copy-Generation Networks
AAAI 2021
DynaEval: Unifying Turn and Dialogue Level Evaluation
ACL 2021
Bootstrapped Unsupervised Sentence Representation Learning
ACL 2021
Revisiting Self-training for Few-shot Learning of Language Model
EMNLP 2021
Attention Is Not Enough: Mitigating the Distribution Discrepancy in Asynchronous Multimodal Sequence Fusion
ICCV 2021
Partial-Label and Structure-constrained Deep Coupled Factorization Network
AAAI 2021
Learning Motion Priors for 4D Human Body Capture in 3D Scenes
ICCV 2021
We Are More Than Our Joints: Predicting How 3D Bodies Move
CVPR 2021
Learning without Knowing: Unobserved Context in Continuous Transfer Reinforcement Learning
L4DC 2021
LEAP: Learning Articulated Occupancy of People
CVPR 2021
DynaEval: Unifying Turn and Dialogue Level Evaluation
IJCNLP 2021
Better Set Representations For Relational Reasoning
NIPS 2020
βWhat Do You Mean by That?β A Parser-Independent Interactive Approach for Enhancing Text-to-SQL
EMNLP 2020
Disentangle-based Continual Graph Representation Learning
EMNLP 2020
Lightweight, Dynamic Graph Convolutional Networks for AMR-to-Text Generation
EMNLP 2020
An Unsupervised Sentence Embedding Method by Mutual Information Maximization
EMNLP 2020
Model-theoretic Characterizations of Existential Rule Languages
IJCAI 2020
Generating 3D People in Scenes Without People
CVPR 2020
ENT-DESC: Entity Description Generation by Exploring Knowledge Graph
EMNLP 2020
Learning from Positive and Unlabeled Data without Explicit Estimation of Class Prior
AAAI 2020
SK-Net: Deep Learning on Point Cloud via End-to-End Discovery of Spatial Keypoints
AAAI 2020
Towards Universal Languages for Tractable Ontology Mediated Query Answering
AAAI 2020
Non-Parallel Many-to-Many Voice Conversion with PSR-StarGAN
INTERSPEECH 2020
FSPool: Learning Set Representations with Featurewise Sort Pooling
ICLR 2020
Learning Representations of Sets through Optimized Permutations
ICLR 2019
Local Temporal Bilinear Pooling for Fine-Grained Action Parsing
CVPR 2019
EA Reader: Enhance Attentive Reader for Cloze-Style Question Answering via Multi-Space Context Fusion
AAAI 2019
Deep Set Prediction Networks
NIPS 2019
Attention Guided Graph Convolutional Networks for Relation Extraction
ACL 2019
PartNet: A Recursive Part Decomposition Network for Fine-Grained and Hierarchical Shape Segmentation
CVPR 2019
Visualization of Convolutional Neural Networks for Monocular Depth Estimation
ICCV 2019
Feature Quantization for Defending Against Distortion of Images
CVPR 2018
Learning to Count Objects in Natural Images for Visual Question Answering
ICLR 2018
Truncating Wide Networks Using Binary Tree Architectures
ICCV 2017
Expressive Completeness of Existential Rule Languages for Ontology-Based Query Answering
IJCAI 2016
Non-Linear Smoothed Transductive Network Embedding with Text Information
ACML 2016
User Based Aggregation for Biterm Topic Model
IJCNLP 2015
User Based Aggregation for Biterm Topic Model
ACL 2015
Tailor knowledge graph for query understanding: linking intent topics by propagation
EMNLP 2014
Definability of Horn Revision from Horn Contraction
IJCAI 2013
First-Order Expressibility and Boundedness of Disjunctive Logic Programs
IJCAI 2013
Summarizing Complex Events: a Cross-Modal Solution of Storylines Extraction and Reconstruction
EMNLP 2013
Combining Syntactic and Semantic Features by SVM for Unrestricted Coreference Resolution
CONLL 2011
Timeline Generation through Evolutionary Trans-Temporal Summarization
EMNLP 2011
Corpus-oriented Acquisition of Chinese Grammar
IJCNLP 2005
Chinese Syntactic Parsing Based on Extended GLR Parsing Algorithm with PCFG*
COLING 2002