Fan Yang
212 papers · 2005–2026 · 25 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (24)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (7)
Conference Polyglot (24)
Hot Topic Early Bird
Conference Loyalist (22)
Keyword Trendsetter Combo (3)
Dynamic Duo (17)
Triple Crown
Deep Specialist (11)
Grand Slam
Mega-Team (37)
Keyword Champion
Trend Setter
Unstoppable (14)
The Questioner (3)
Conference Pioneer
Century Club (200)
Prolific Year (17)
Keyword Collector (94)
Conferences
AAAI (30)
CVPR (25)
NIPS (19)
ACL (18)
ICLR (17)
OSDI (15)
ECCV (12)
EMNLP (12)
ICML (10)
COLING (9)
IJCAI (8)
ICCV (8)
WACV (7)
NAACL (5)
IJCNLP (3)
UAI (3)
EACL (2)
CORL (2)
INTERSPEECH (1)
JMLR (1)
MICCAI (1)
NSDI (1)
AISTATS (1)
RSS (1)
SEMEVAL (1)
Keywords
large language model (17)
diffusion model (8)
object detection (8)
zero-shot learning (7)
representation learning (7)
domain adaptation (7)
deep neural network (7)
attention mechanism (6)
feature learning (5)
self-supervised learning (5)
benchmark evaluation (5)
graph neural network (5)
neural network (5)
contrastive learning (4)
feature extraction (4)
text classification (4)
deep learning (4)
semantic segmentation (4)
motion estimation (4)
uncertainty quantification (4)
Papers
UDCH: Unsupervised Dynamic Weighted Cluster-cooperative Hashing for Cross-modal Retrieval
AAAI 2026
GeM-VG: Towards Generalized Multi-image Visual Grounding with Multimodal Large Language Models
AAAI 2026
TIME: Temporal-Sensitive Multi-Dimensional Instruction Tuning and Robust Benchmarking for Video-LLMs
AAAI 2026
KnowThyself: An Agentic Assistant for LLM Interpretability
AAAI 2026
Catastrophic Forgetting in Kolmogorov-Arnold Networks
AAAI 2026
Beyond Euclidean Assumptions: Geometry-Aware Adaptive Routing for Remote Sensing Segmentation
AAAI 2026
Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering
EACL 2026
FaithLM: Towards Faithful Explanations for Large Language Models
EACL 2026
Unsupervised Discovery of Long-Term Spatiotemporal Periodic Workflows in Human Activities
WACV 2026
Breaking the Stealth-Potency Trade-off in Clean-Image Backdoors with Generative Trigger Optimization
AAAI 2026
Compressing then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding
ACL 2026
Explain the Synth: Interpretable Evaluation of LLM Data Synthesis
ACL 2026
FilmSceneDesigner: Chaining Set Design for Procedural Film Scene Generation
AAAI 2026
Automated Proof Generation for Rust Code via Self-Evolution
ICLR 2025
Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution
ICLR 2025
FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs' Responsiveness to Human Feedback
EMNLP 2025
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver
ICLR 2025
SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding
ICLR 2025
Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning
ICLR 2025
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
ICLR 2025
The Source Image is the Best Attention for Infrared and Visible Image Fusion
ICCV 2025
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
ICCV 2025
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning
ICLR 2025
In vivo cell-type and brain region classification via multimodal contrastive learning
ICLR 2025
VIIS: Visible and Infrared Information Synthesis for Severe Low-Light Image Enhancement
WACV 2025
Contrasting Adversarial Perturbations: The Space of Harmless Perturbations
AAAI 2025
3DHumanEdit: Multi-modal Body Part-aware Conditioning Information Integration for 3D Human Manipulation
AAAI 2025
Language Ranker: A Metric for Quantifying LLM Performance Across High and Low-Resource Languages
AAAI 2025
NaFV-Net: An Adversarial Four-view Network for Mammogram Classification
AAAI 2025
Divide and Orthogonalize: Efficient Continual Learning with Local Model Space Projection
UAI 2025
PipeThreader: Software-Defined Pipelining for Efficient DNN Execution
OSDI 2025
MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification
ACL 2025
SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science
ACL 2025
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs
ACL 2025
EdgeInfinite: A Memory-Efficient Infinite-Context Transformer for Edge Devices
ACL 2025
Beyond Surface-Level Patterns: An Essence-Driven Defense Framework Against Jailbreak Attacks in LLMs
ACL 2025
iMOVE: Instance-Motion-Aware Video Understanding
ACL 2025
WaferLLM: Large Language Model Inference at Wafer Scale
OSDI 2025
Exploring Concept Depth: How Large Language Models Acquire Knowledge and Concept at Different Layers?
COLING 2025
Opportunistic Osteoporosis Diagnosis via Texture-Preserving Self-Supervision, Mixture of Experts and Multi-Task Integration
MICCAI 2025
Precise High-Dimensional Asymptotics for Quantifying Heterogeneous Transfers
JMLR 2025
Detection and Geographic Localization of Natural Objects in the Wild: A Case Study on Palms
IJCAI 2025
Dynamic Multiple High-order Correlations Fusion with Noise Filtering for Incomplete Multi-view Noisy-label Learning
IJCAI 2025
MagicArticulate: Make Your 3D Models Articulation-Ready
CVPR 2025
MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification
CVPR 2025
Libra-Merging: Importance-redundancy and Pruning-merging Trade-off for Acceleration Plug-in in Large Vision-Language Model
CVPR 2025
HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
CVPR 2025
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
CVPR 2025
Oracle-MoE: Locality-preserving Routing in the Oracle Space for Memory-constrained Large Language Model Inference
ICML 2025
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment
ICML 2025
Simple Policy Optimization
ICML 2025
LongRoPE2: Near-Lossless LLM Context Window Scaling
ICML 2025
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
ICML 2025
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types
ICLR 2025
Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior
CVPR 2024
Neuro-Symbolic Data Generation for Math Reasoning
NIPS 2024
Empowering and Assessing the Utility of Large Language Models in Crop Science
NIPS 2024
Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic Consistency
NIPS 2024
IRGen: Generative Modeling for Image Retrieval
ECCV 2024
Orthogonal Gradient Boosting for Simpler Additive Rule Ensembles
AISTATS 2024
MobileNetV4: Universal Models for the Mobile Ecosystem
ECCV 2024
Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
OSDI 2024
nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training
OSDI 2024
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation
OSDI 2024
Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought
COLING 2024
Towards Multi-Modal Co-Reference Resolution in Conversational Shopping Agents
COLING 2024
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
ECCV 2024
Masking Latent Gender Knowledge for Debiasing Image Captioning
NAACL 2024
RecMind: Large Language Model Powered Agent For Recommendation
NAACL 2024
FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection
ECCV 2024
Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning
ICML 2024
TVE: Learning Meta-attribution for Transferable Vision Explainer
ICML 2024
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
ICML 2024
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
WACV 2024
Sparse Bayesian Deep Learning for Cross Domain Medical Image Reconstruction
AAAI 2024
Implicit Modeling of Non-rigid Objects with Cross-Category Signals
AAAI 2024
Geometry-Guided Domain Generalization for Monocular 3D Object Detection
AAAI 2024
Multi-View Randomized Kernel Classification via Nonconvex Optimization
AAAI 2024
An Effective Augmented Lagrangian Method for Fine-Grained Multi-View Optimization
AAAI 2024
Multi-Modal Disordered Representation Learning Network for Description-Based Person Search
AAAI 2024
Causal-Driven Skill Prerequisite Structure Discovery
AAAI 2024
Once Read is Enough: Domain-specific Pretraining-free Language Models with Cluster-guided Sparse Experts for Long-tail Domain Knowledge
NIPS 2024
Exploring High-dimensional Search Space via Voronoi Graph Traversing
UAI 2024
AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing
CVPR 2024
Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning
EMNLP 2024
Enhancing Explainable Rating Prediction through Annotated Macro Concepts
ACL 2024
FlowDiffuser: Advancing Optical Flow Estimation with Diffusion Models
CVPR 2024
Optimizing Dynamic Neural Networks with Brainstorm
OSDI 2023
Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding
EMNLP 2023
Exploring Stochastic Autoregressive Image Modeling for Visual Representation
AAAI 2023
Train Faster, Perform Better: Modular Adaptive Training in Over-Parameterized Models
NIPS 2023
Model-enhanced Vector Index
NIPS 2023
Welder: Scheduling Deep Learning Memory Access via Tile-graph
OSDI 2023
Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning
OSDI 2023
VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity
OSDI 2023
On Modular Learning of Distributed Systems for Predicting End-to-End Latency
NSDI 2023
PyPose: A Library for Robot Learning With Physics-Based Optimization
CVPR 2023
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
ACL 2023
iPlanner: Imperative Path Planning
RSS 2023
Ambiguous Learning from Retrieval: Towards Zero-shot Semantic Parsing
ACL 2023
Multilingual context-based pronunciation learning for Text-to-Speech
INTERSPEECH 2023
GAFlow: Incorporating Gaussian Attention into Optical Flow
ICCV 2023
CoRTX: Contrastive Framework for Real-time Explanation
ICLR 2023
Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text
EMNLP 2023
Learning 3D Photography Videos via Self-supervised Diffusion on Single Images
IJCAI 2023
DSP: Discriminative Soft Prompts for Zero-Shot Entity and Relation Extraction
ACL 2023
Hard To Track Objects With Irregular Motions and Similar Appearances? Make It Easier by Buffering the Matching Space
WACV 2023
Over-parameterized Model Optimization with Polyak-Łojasiewicz Condition
ICLR 2023
HACMan: Learning Hybrid Actor-Critic Maps for 6D Non-Prehensile Manipulation
CORL 2023
AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts
ICCV 2023
Learning Optical Flow With Kernel Patch Attention
CVPR 2022
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks
NIPS 2022
One-Inlier is First: Towards Efficient Position Encoding for Point Cloud Registration
NIPS 2022
Forecasting Human Trajectory from Scene History
NIPS 2022
UMIX: Improving Importance Weighting for Subpopulation Shift via Uncertainty-Aware Mixup
NIPS 2022
DeTarNet: Decoupling Translation and Rotation by Siamese Network for Point Cloud Registration
AAAI 2022
Learning Optical Flow with Adaptive Graph Reasoning
AAAI 2022
Confidence Calibration for Intent Detection via Hyperspherical Space and Rebalanced Accuracy-Uncertainty Loss
AAAI 2022
Improving Relevance Quality in Product Search using High-Precision Query-Product Semantic Similarity
ACL 2022
Spelling Correction using Phonetics in E-commerce Search
ACL 2022
DESED: Dialogue-based Explanation for Sentence-level Event Detection
COLING 2022
Class-Aware Contrastive Semi-Supervised Learning
CVPR 2022
SC2-PCR: A Second Order Spatial Compatibility for Efficient and Robust Point Cloud Registration
CVPR 2022
Multimodal Dynamics: Dynamical Fusion for Trustworthy Multimodal Classification
CVPR 2022
UniVIP: A Unified Framework for Self-Supervised Visual Pre-Training
CVPR 2022
A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation
ECCV 2022
Detecting Generated Images by Real Images
ECCV 2022
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
ECCV 2022
MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts
EMNLP 2022
Multimodal Context Carryover
EMNLP 2022
DEGREE: Decomposition Based Explanation for Graph Neural Networks
ICLR 2022
EXACT: Scalable Graph Neural Networks Training via Extreme Activation Compression
ICLR 2022
Recursive Disentanglement Network
ICLR 2022
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation
ICLR 2022
Generalized Demographic Parity for Group Fairness
ICLR 2022
Accelerating Shapley Explanation via Contributive Cooperator Selection
ICML 2022
MT-Speech at SemEval-2022 Task 10: Incorporating Data Augmentation and Auxiliary Task with Cross-Lingual Pretrained Language Model for Structured Sentiment Analysis
NAACL 2022
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute
OSDI 2022
ROLLER: Fast and Efficient Tensor Compilation for Deep Learning
OSDI 2022
MT-Speech at SemEval-2022 Task 10: Incorporating Data Augmentation and Auxiliary Task with Cross-Lingual Pretrained Language Model for Structured Sentiment Analysis
SEMEVAL 2022
Multi-Motion and Appearance Self-Supervised Moving Object Detection
WACV 2022
From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding
ACL 2021
CT-Net: Complementary Transfering Network for Garment Transfer With Arbitrary Geometric Changes
CVPR 2021
Mutual Graph Learning for Camouflaged Object Detection
CVPR 2021
Probabilistic Model Distillation for Semantic Correspondence
CVPR 2021
CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning
CVPR 2021
From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding
IJCNLP 2021
Towards Compact CNNs via Collaborative Compression
CVPR 2021
Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection
ICCV 2021
Domain-Lifelong Learning for Dialogue State Tracking via Knowledge Preservation Networks
EMNLP 2021
RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting
ICCV 2021
MST: Masked Self-Supervised Transformer for Visual Representation
NIPS 2021
Evaluations of the Gap between Supervised and Reinforcement Lifelong Learning on Robotic Manipulation Tasks
CORL 2021
Cascade Network with Guided Loss and Hybrid Attention for Finding Good Correspondences
AAAI 2021
TracKlinic: Diagnosis of Challenge Factors in Visual Tracking
WACV 2021
Defending SVMs against poisoning attacks: the hardness and DBSCAN approach
UAI 2021
Learning Interpretable Decision Rule Sets: A Submodular Optimization Approach
NIPS 2021
Time Series Data Augmentation for Deep Learning: A Survey
IJCAI 2021
Towards Fast, Accurate and Stable 3D Dense Face Alignment
ECCV 2020
EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning
NIPS 2020
PAMS: Quantized Super-Resolution via Parameterized Max Scale
ECCV 2020
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation
EMNLP 2020
Cascade Graph Neural Networks for RGB-D Salient Object Detection
ECCV 2020
Beyond 3DMM Space: Towards Fine-grained 3D Face Reconstruction
ECCV 2020
On Metric DBSCAN with Low Doubling Dimension
IJCAI 2020
Bayesian Multi-type Mean Field Multi-agent Imitation Learning
NIPS 2020
Which Is Plagiarism: Fashion Image Retrieval Based on Regional Representation for Design Protection
CVPR 2020
Predicting Lymph Node Metastasis Using Histopathological Images Based on Multiple Instance Learning With Deep Graph Convolution
CVPR 2020
HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees
OSDI 2020
Rammer: Enabling Holistic Deep Learning Compiler Optimizations with rTasks
OSDI 2020
Retiarii: A Deep Learning Exploratory-Training Framework
OSDI 2020
Logic-guided Semantic Representation Learning for Zero-Shot Relation Classification
COLING 2020
Predicting Personal Opinion on Future Events with Fingerprints
COLING 2020
Learning to Detect Head Movement in Unconstrained Remote Gaze Estimation in the Wild
WACV 2020
Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval
AAAI 2020
Hybrid Graph Neural Networks for Crowd Counting
AAAI 2020
Variational Adversarial Kernel Learned Imitation Learning
AAAI 2020
Relational State-Space Model for Stochastic Multi-Object Systems
ICLR 2020
Efficient Image Retrieval via Decoupling Diffusion into Online and Offline Processing
AAAI 2019
Clustered Object Detection in Aerial Images
ICCV 2019
Large-Scale Heterogeneous Feature Embedding
AAAI 2019
LaSOT: A High-Quality Benchmark for Large-Scale Single Object Tracking
CVPR 2019
Exploring Deep Multimodal Fusion of Text and Photo for Hate Speech Classification
ACL 2019
Game Design for Eliciting Distinguishable Behavior
NIPS 2019
Decoding EEG by Visual-guided Deep Neural Networks
IJCAI 2019
Understanding Pictograph with Facial Features: End-to-End Sentence-Level Lip Reading of Chinese
AAAI 2019
Cascaded SR-GAN for Scale-Adaptive Low Resolution Person Re-identification
IJCAI 2018
Contour Knowledge Transfer for Salient Object Detection
ECCV 2018
Batch Bayesian Optimization via Multi-objective Acquisition Ensemble for Automated Analog Circuit Design
ICML 2018
Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals
CVPR 2018
Attending Sentences to detect Satirical Fake News
COLING 2018
Gandiva: Introspective Cluster Scheduling for Deep Learning
OSDI 2018
Differentiable Learning of Logical Rules for Knowledge Base Reasoning
NIPS 2017
Object-Aware Dense Semantic Correspondence
CVPR 2017
Satirical News Detection and Analysis using Attention Mechanism and Linguistic Features
EMNLP 2017
Good Semi-supervised Learning That Requires a Bad GAN
NIPS 2017
Expectation Propagation with Stochastic Kinetic Model in Complex Interaction Systems
NIPS 2017
Saliency Transfer: An Example-Based Method for Salient Object Detection
IJCAI 2016
An Empirical Study of Automatic Chinese Word Segmentation for Spoken Language Understanding and Named Entity Recognition
NAACL 2016
Selective inference for group-sparse linear models
NIPS 2016
Leveraging Multiple Domains for Sentiment Classification
COLING 2016
Exploit All the Layers: Fast and Accurate CNN Object Detector With Scale Dependent Pooling and Cascaded Rejection Classifiers
CVPR 2016
Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification
ICCV 2015
Semi-Supervised Chinese Word Segmentation Using Partial-Label Learning With Conditional Random Fields
EMNLP 2014
An Empirical Study Of Semi-Supervised Chinese Word Segmentation Using Co-Training
EMNLP 2013
A Chinese-English Organization Name Translation System Using Heuristic Web Mining and Asymmetric Alignment
IJCNLP 2009
A Chinese-English Organization Name Translation System Using Heuristic Web Mining and Asymmetric Alignment
ACL 2009
Switching to Real-Time Tasks in Multi-Tasking Dialogue
COLING 2008
Chinese-English Backward Transliteration Assisted with Mining Monolingual Web Pages
ACL 2008
CRFs-Based Named Entity Recognition Incorporated with Heuristic Entity List Searching
IJCNLP 2008
Avoiding and Resolving Initiative Conflicts in Dialogue
NAACL 2007
DialogueView: an Annotation Tool for Dialogue
EMNLP 2005