Yichi Zhang
77 papers · 2020–2026 · 21 conferences · across top CS/AI conferences
Achievements
Keyword Pioneer · Taxonomy Completionist (15) · Interdisciplinary Bridge · Renaissance Researcher (6) · Conference Polyglot (21)
Interdisciplinary Bridge
Taxonomy Completionist (15)
Keyword Pioneer
Grand Slam
Dynamic Duo (10)
Deep Specialist (13)
Topic Evolution
Unstoppable (6)
Prolific Year (16)
Keyword Collector (285)
Trend Setter
The Questioner (5)
Century Club (70)
Conference Pioneer
Conferences
ACL (13)
EMNLP (11)
AAAI (9)
NIPS (8)
ICLR (8)
CVPR (6)
IJCNLP (3)
ECCV (2)
ICCV (2)
COLING (2)
ICML (2)
MIDL (2)
EACL (1)
CORL (1)
IJCAI (1)
INTERSPEECH (1)
JMLR (1)
MICCAI (1)
NAACL (1)
AACL (1)
NSDI (1)
Top co-authors
Keywords
large language model (11)
multimodal learning (8)
knowledge graph (5)
knowledge graph completion (5)
multimodal large language model (4)
semi-supervised learning (4)
vision-language model (4)
dialog system (3)
neural network optimization (3)
contrastive learning (3)
representation learning (3)
question answering (3)
adversarial attack (3)
data augmentation (3)
link prediction (3)
response generation (3)
autonomous driving (2)
reinforcement learning (2)
multi-modal learning (2)
adversarial robustness (2)
Papers
Collaboration of Fusion and Independence: Hypercomplex-driven Robust Multi-Modal Knowledge Graph Completion
ACL 2026
rMMEA: Robust Multi-Modal Entity Alignment with Missing and Noise Visual Modality
AAAI 2026
CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization
ACL 2026
Know the Known and the Unknown: Reasonable Answer Generation with Knowledge-Informed Citations
ACL 2026
UniHR: Hierarchical Representation Learning for Unified Knowledge Graph Link Prediction
AAAI 2026
Topological-Aware Regularization for Semi-Supervised Intracranial Aneurysm Vessel Segmentation
MIDL 2026
PET2Rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission Tomography
AAAI 2026
Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation
AAAI 2025
Bias Amplification: Large Language Models as Increasingly Biased Media
AACL 2025
MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
NAACL 2025
FIG: Flow with Interpolant Guidance for Linear Inverse Problems
ICLR 2025
Towards Hierarchical Rectified Flow
ICLR 2025
Multiple Heads are Better than One: Mixture of Modality Knowledge Experts for Entity Representation Learning
ICLR 2025
MetaOOD: Automatic Selection of OOD Detection Models
ICLR 2025
AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies
CORL 2025
K-ON: Stacking Knowledge on the Head Layer of Large Language Model
AAAI 2025
Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors
ACL 2025
RL-Guider: Leveraging Historical Decisions and Feedback for Drug Editing with Large Language Models
ACL 2025
Noise-powered Multi-modal Knowledge Graph Representation Framework
COLING 2025
Improve Representation for Imbalanced Regression through Geometric Constraints
CVPR 2025
Balanced Rate-Distortion Optimization in Learned Image Compression
CVPR 2025
NDD: A Decision Diagram for Network Verification
NSDI 2025
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine
ICLR 2025
Bias Amplification: Large Language Models as Increasingly Biased Media
IJCNLP 2025
STAIR: Improving Safety Alignment with Introspective Reasoning
ICML 2025
Multiobjective distribution matching
ICML 2025
Proactive Assistant Dialogue Generation from Streaming Egocentric Videos
EMNLP 2025
Looking Beyond Text: Reducing Language Bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance
EMNLP 2025
Exploring the Generalizability of Factual Hallucination Mitigation via Enhancing Precise Knowledge Utilization
EMNLP 2025
CogAtom: From Cognitive Atoms to Olympiad-level Mathematical Reasoning in Large Language Models
EMNLP 2025
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
ICLR 2025
SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images
ICCV 2025
Have We Designed Generalizable Structural Knowledge Promptings? Systematic Evaluation and Rethinking
ACL 2025
Breaking the Ceiling: Exploring the Potential of Jailbreak Attacks through Expanding Strategy Space
ACL 2025
Eliciting Honest Information from Authors Using Sequential Review
AAAI 2024
Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding
AAAI 2024
SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy Views
AAAI 2024
MKGL: Mastery of a Three-Word Language
NIPS 2024
Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering
ACL 2024
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
ACL 2024
Gliding over the Pareto Front with Uniform Designs
NIPS 2024
Prompt Your Brain: Scaffold Prompt Tuning for Efficient Adaptation of fMRI Pre-trained Model
MICCAI 2024
MultiTrust: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models
NIPS 2024
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
ECCV 2024
Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion
COLING 2024
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models
ECCV 2024
PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs
NIPS 2024
Rethinking Model Ensemble in Transfer-based Adversarial Attacks
ICLR 2024
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation
CVPR 2024
Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
CVPR 2024
Understanding the Robustness of 3D Object Detection With Bird's-Eye-View Representations in Autonomous Driving
CVPR 2023
Revisiting the Evaluation of Image Synthesis with GANs
NIPS 2023
Binarized Neural Machine Translation
NIPS 2023
Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial Attack Framework
ACL 2023
Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?
EMNLP 2023
Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?
EMNLP 2023
DANLI: Deliberative Agent for Following Natural Language Instructions
EMNLP 2022
PokeBNN: A Binary Pursuit of Lightweight Accuracy
CVPR 2022
Understanding Hyperdimensional Computing for Parallel Single-Pass Learning
NIPS 2022
Prior Adaptive Semi-supervised Learning with Application to EHR Phenotyping
JMLR 2022
Hierarchical Task Learning from Language Instructions with Unified Transformers and Self-Monitoring
IJCNLP 2021
Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability
EMNLP 2021
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-Modal Pretraining
ICCV 2021
Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding
EMNLP 2021
Drop Redundant, Shrink Irrelevant: Selective Knowledge Injection for Language Pretraining
IJCAI 2021
BulletTrain: Accelerating Robust Neural Network Training via Boundary Example Mining
NIPS 2021
Alternating Recurrent Dialog Model with Large-scale Pre-trained Language Models
EACL 2021
Hierarchical Task Learning from Language Instructions with Unified Transformers and Self-Monitoring
ACL 2021
Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making
ACL 2021
Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making
IJCNLP 2021
A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning
EMNLP 2020
Improved Learning of Word Embeddings with Word Definitions and Semantic Injection
INTERSPEECH 2020
Precision Gating: Improving Neural Network Efficiency with Dynamic Dual-Precision Activations
ICLR 2020
SAU-Net: Efficient 3D Spine MRI Segmentation Using Inter-Slice Attention
MIDL 2020
Dynamic Anticipation and Completion for Multi-Hop Reasoning over Sparse Knowledge Graph
EMNLP 2020
Paraphrase Augmented Task-Oriented Dialog Generation
ACL 2020
Task-Oriented Dialog Systems That Consider Multiple Appropriate Responses under the Same Context
AAAI 2020