Fan Yang

212 papers · 2005–2026 · 25 top CS/AI conferences

Achievements


🗺️ Taxonomy Completionist (24) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🌍 Conference Polyglot (24)

🐣 Hot Topic Early Bird 🏠 Conference Loyalist (22) 🌟 Keyword Trendsetter Combo (3) 🤝 Dynamic Duo (17) 👑 Triple Crown 🔬 Deep Specialist (11) 🏆 Grand Slam 👥 Mega-Team (37) 🏆 Keyword Champion 📈 Trend Setter 🔥 Unstoppable (14) ❓ The Questioner (3) 🚀 Conference Pioneer 💎 Century Club (200) ⚡ Prolific Year (17) 🗃️ Keyword Collector (94)

Conferences

AAAI (30) CVPR (25) NIPS (19) ACL (18) ICLR (17) OSDI (15) ECCV (12) EMNLP (12) ICML (10) COLING (9) IJCAI (8) ICCV (8) WACV (7) NAACL (5) IJCNLP (3) UAI (3) EACL (2) CORL (2) INTERSPEECH (1) JMLR (1) MICCAI (1) NSDI (1) AISTATS (1) RSS (1) SEMEVAL (1)

Top co-authors

Mao Yang (17) Xin Li (16) Lidong Zhou (12) Ao Luo (10) Lingxiao Ma (9) Hong Cheng (9) Mengnan Du (9) Jilong Xue (8) Xia Hu (8) Tingting Gao (8)

Research topics

Computer Vision (1) Understanding (1) Applications (1) Applications (1) Education (1)

Keywords

large language model (17) diffusion model (8) object detection (8) zero-shot learning (7) representation learning (7) domain adaptation (7) deep neural network (7) attention mechanism (6) feature learning (5) self-supervised learning (5) benchmark evaluation (5) graph neural network (5) neural network (5) contrastive learning (4) feature extraction (4) text classification (4) deep learning (4) semantic segmentation (4) motion estimation (4) uncertainty quantification (4)

Papers

UDCH: Unsupervised Dynamic Weighted Cluster-cooperative Hashing for Cross-modal Retrieval AAAI 2026
GeM-VG: Towards Generalized Multi-image Visual Grounding with Multimodal Large Language Models AAAI 2026
TIME: Temporal-Sensitive Multi-Dimensional Instruction Tuning and Robust Benchmarking for Video-LLMs AAAI 2026
KnowThyself: An Agentic Assistant for LLM Interpretability AAAI 2026
Catastrophic Forgetting in Kolmogorov-Arnold Networks AAAI 2026
Beyond Euclidean Assumptions: Geometry-Aware Adaptive Routing for Remote Sensing Segmentation AAAI 2026
Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering EACL 2026
FaithLM: Towards Faithful Explanations for Large Language Models EACL 2026
Unsupervised Discovery of Long-Term Spatiotemporal Periodic Workflows in Human Activities WACV 2026
Breaking the Stealth-Potency Trade-off in Clean-Image Backdoors with Generative Trigger Optimization AAAI 2026
Compressing then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding ACL 2026
Explain the Synth: Interpretable Evaluation of LLM Data Synthesis ACL 2026
FilmSceneDesigner: Chaining Set Design for Procedural Film Scene Generation AAAI 2026
Automated Proof Generation for Rust Code via Self-Evolution ICLR 2025
Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution ICLR 2025
FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs’ Responsiveness to Human Feedback EMNLP 2025
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver ICLR 2025
SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding ICLR 2025
Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning ICLR 2025
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model ICLR 2025
The Source Image is the Best Attention for Infrared and Visible Image Fusion ICCV 2025
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring ICCV 2025
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning ICLR 2025
In vivo cell-type and brain region classification via multimodal contrastive learning ICLR 2025
VIIS: Visible and Infrared Information Synthesis for Severe Low-Light Image Enhancement WACV 2025
Contrasting Adversarial Perturbations: The Space of Harmless Perturbations AAAI 2025
3DHumanEdit: Multi-modal Body Part-aware Conditioning Information Integration for 3D Human Manipulation AAAI 2025
Language Ranker: A Metric for Quantifying LLM Performance Across High and Low-Resource Languages AAAI 2025
NaFV-Net: An Adversarial Four-view Network for Mammogram Classification AAAI 2025
Divide and Orthogonalize: Efficient Continual Learning with Local Model Space Projection UAI 2025
PipeThreader: Software-Defined Pipelining for Efficient DNN Execution OSDI 2025
MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification ACL 2025
SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science ACL 2025
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs ACL 2025
EdgeInfinite: A Memory-Efficient Infinite-Context Transformer for Edge Devices ACL 2025
Beyond Surface-Level Patterns: An Essence-Driven Defense Framework Against Jailbreak Attacks in LLMs ACL 2025
iMOVE: Instance-Motion-Aware Video Understanding ACL 2025
WaferLLM: Large Language Model Inference at Wafer Scale OSDI 2025
Exploring Concept Depth: How Large Language Models Acquire Knowledge and Concept at Different Layers? COLING 2025
Opportunistic Osteoporosis Diagnosis via Texture-Preserving Self-Supervision, Mixture of Experts and Multi-Task Integration MICCAI 2025
Precise High-Dimensional Asymptotics for Quantifying Heterogeneous Transfers JMLR 2025
Detection and Geographic Localization of Natural Objects in the Wild: A Case Study on Palms IJCAI 2025
Dynamic Multiple High-order Correlations Fusion with Noise Filtering for Incomplete Multi-view Noisy-label Learning IJCAI 2025
MagicArticulate: Make Your 3D Models Articulation-Ready CVPR 2025
MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification CVPR 2025
Libra-Merging: Importance-redundancy and Pruning-merging Trade-off for Acceleration Plug-in in Large Vision-Language Model CVPR 2025
HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator CVPR 2025
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation CVPR 2025
Oracle-MoE: Locality-preserving Routing in the Oracle Space for Memory-constrained Large Language Model Inference ICML 2025
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment ICML 2025
Simple Policy Optimization ICML 2025
LongRoPE2: Near-Lossless LLM Context Window Scaling ICML 2025
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking ICML 2025
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types ICLR 2025
Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior CVPR 2024
Neuro-Symbolic Data Generation for Math Reasoning NIPS 2024
Empowering and Assessing the Utility of Large Language Models in Crop Science NIPS 2024
Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic Consistency NIPS 2024
IRGen: Generative Modeling for Image Retrieval ECCV 2024
Orthogonal Gradient Boosting for Simpler Additive Rule Ensembles AISTATS 2024
MobileNetV4: Universal Models for the Mobile Ecosystem ECCV 2024
Parrot: Efficient Serving of LLM-based Applications with Semantic Variable OSDI 2024
nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training OSDI 2024
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation OSDI 2024
Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought COLING 2024
Towards Multi-Modal Co-Reference Resolution in Conversational Shopping Agents COLING 2024
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models ECCV 2024
Masking Latent Gender Knowledge for Debiasing Image Captioning NAACL 2024
RecMind: Large Language Model Powered Agent For Recommendation NAACL 2024
FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection ECCV 2024
Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning ICML 2024
TVE: Learning Meta-attribution for Transferable Vision Explainer ICML 2024
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens ICML 2024
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation WACV 2024
Sparse Bayesian Deep Learning for Cross Domain Medical Image Reconstruction AAAI 2024
Implicit Modeling of Non-rigid Objects with Cross-Category Signals AAAI 2024
Geometry-Guided Domain Generalization for Monocular 3D Object Detection AAAI 2024
Multi-View Randomized Kernel Classification via Nonconvex Optimization AAAI 2024
An Effective Augmented Lagrangian Method for Fine-Grained Multi-View Optimization AAAI 2024
Multi-Modal Disordered Representation Learning Network for Description-Based Person Search AAAI 2024
Causal-Driven Skill Prerequisite Structure Discovery AAAI 2024
Once Read is Enough: Domain-specific Pretraining-free Language Models with Cluster-guided Sparse Experts for Long-tail Domain Knowledge NIPS 2024
Exploring High-dimensional Search Space via Voronoi Graph Traversing UAI 2024
AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing CVPR 2024
Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning EMNLP 2024
Enhancing Explainable Rating Prediction through Annotated Macro Concepts ACL 2024
FlowDiffuser: Advancing Optical Flow Estimation with Diffusion Models CVPR 2024
Optimizing Dynamic Neural Networks with Brainstorm OSDI 2023
Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding EMNLP 2023
Exploring Stochastic Autoregressive Image Modeling for Visual Representation AAAI 2023
Train Faster, Perform Better: Modular Adaptive Training in Over-Parameterized Models NIPS 2023
Model-enhanced Vector Index NIPS 2023
Welder: Scheduling Deep Learning Memory Access via Tile-graph OSDI 2023
Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning OSDI 2023
VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity OSDI 2023
On Modular Learning of Distributed Systems for Predicting End-to-End Latency NSDI 2023
PyPose: A Library for Robot Learning With Physics-Based Optimization CVPR 2023
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation ACL 2023
iPlanner: Imperative Path Planning RSS 2023
Ambiguous Learning from Retrieval: Towards Zero-shot Semantic Parsing ACL 2023
Multilingual context-based pronunciation learning for Text-to-Speech INTERSPEECH 2023
GAFlow: Incorporating Gaussian Attention into Optical Flow ICCV 2023
CoRTX: Contrastive Framework for Real-time Explanation ICLR 2023
Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text EMNLP 2023
Learning 3D Photography Videos via Self-supervised Diffusion on Single Images IJCAI 2023
DSP: Discriminative Soft Prompts for Zero-Shot Entity and Relation Extraction ACL 2023
Hard To Track Objects With Irregular Motions and Similar Appearances? Make It Easier by Buffering the Matching Space WACV 2023
Over-parameterized Model Optimization with Polyak-Łojasiewicz Condition ICLR 2023
HACMan: Learning Hybrid Actor-Critic Maps for 6D Non-Prehensile Manipulation CORL 2023
AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts ICCV 2023
Learning Optical Flow With Kernel Patch Attention CVPR 2022
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks NIPS 2022
One-Inlier is First: Towards Efficient Position Encoding for Point Cloud Registration NIPS 2022
Forecasting Human Trajectory from Scene History NIPS 2022
UMIX: Improving Importance Weighting for Subpopulation Shift via Uncertainty-Aware Mixup NIPS 2022
DeTarNet: Decoupling Translation and Rotation by Siamese Network for Point Cloud Registration AAAI 2022
Learning Optical Flow with Adaptive Graph Reasoning AAAI 2022
Confidence Calibration for Intent Detection via Hyperspherical Space and Rebalanced Accuracy-Uncertainty Loss AAAI 2022
Improving Relevance Quality in Product Search using High-Precision Query-Product Semantic Similarity ACL 2022
Spelling Correction using Phonetics in E-commerce Search ACL 2022
DESED: Dialogue-based Explanation for Sentence-level Event Detection COLING 2022
Class-Aware Contrastive Semi-Supervised Learning CVPR 2022
SC2-PCR: A Second Order Spatial Compatibility for Efficient and Robust Point Cloud Registration CVPR 2022
Multimodal Dynamics: Dynamical Fusion for Trustworthy Multimodal Classification CVPR 2022
UniVIP: A Unified Framework for Self-Supervised Visual Pre-Training CVPR 2022
A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation ECCV 2022
Detecting Generated Images by Real Images ECCV 2022
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion ECCV 2022
MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts EMNLP 2022
Multimodal Context Carryover EMNLP 2022
DEGREE: Decomposition Based Explanation for Graph Neural Networks ICLR 2022
EXACT: Scalable Graph Neural Networks Training via Extreme Activation Compression ICLR 2022
Recursive Disentanglement Network ICLR 2022
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation ICLR 2022
Generalized Demographic Parity for Group Fairness ICLR 2022
Accelerating Shapley Explanation via Contributive Cooperator Selection ICML 2022
MT-Speech at SemEval-2022 Task 10: Incorporating Data Augmentation and Auxiliary Task with Cross-Lingual Pretrained Language Model for Structured Sentiment Analysis NAACL 2022
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute OSDI 2022
ROLLER: Fast and Efficient Tensor Compilation for Deep Learning OSDI 2022
MT-Speech at SemEval-2022 Task 10: Incorporating Data Augmentation and Auxiliary Task with Cross-Lingual Pretrained Language Model for Structured Sentiment Analysis SEMEVAL 2022
Multi-Motion and Appearance Self-Supervised Moving Object Detection WACV 2022
From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding ACL 2021
CT-Net: Complementary Transfering Network for Garment Transfer With Arbitrary Geometric Changes CVPR 2021
Mutual Graph Learning for Camouflaged Object Detection CVPR 2021
Probabilistic Model Distillation for Semantic Correspondence CVPR 2021
CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning CVPR 2021
From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding IJCNLP 2021
Towards Compact CNNs via Collaborative Compression CVPR 2021
Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection ICCV 2021
Domain-Lifelong Learning for Dialogue State Tracking via Knowledge Preservation Networks EMNLP 2021
RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting ICCV 2021
MST: Masked Self-Supervised Transformer for Visual Representation NIPS 2021
Evaluations of the Gap between Supervised and Reinforcement Lifelong Learning on Robotic Manipulation Tasks CORL 2021
Cascade Network with Guided Loss and Hybrid Attention for Finding Good Correspondences AAAI 2021
TracKlinic: Diagnosis of Challenge Factors in Visual Tracking WACV 2021
Defending SVMs against poisoning attacks: the hardness and DBSCAN approach UAI 2021
Learning Interpretable Decision Rule Sets: A Submodular Optimization Approach NIPS 2021
Time Series Data Augmentation for Deep Learning: A Survey IJCAI 2021
Towards Fast, Accurate and Stable 3D Dense Face Alignment ECCV 2020
EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning NIPS 2020
PAMS: Quantized Super-Resolution via Parameterized Max Scale ECCV 2020
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation EMNLP 2020
Cascade Graph Neural Networks for RGB-D Salient Object Detection ECCV 2020
Beyond 3DMM Space: Towards Fine-grained 3D Face Reconstruction ECCV 2020
On Metric DBSCAN with Low Doubling Dimension IJCAI 2020
Bayesian Multi-type Mean Field Multi-agent Imitation Learning NIPS 2020
Which Is Plagiarism: Fashion Image Retrieval Based on Regional Representation for Design Protection CVPR 2020
Predicting Lymph Node Metastasis Using Histopathological Images Based on Multiple Instance Learning With Deep Graph Convolution CVPR 2020
HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees OSDI 2020
Rammer: Enabling Holistic Deep Learning Compiler Optimizations with rTasks OSDI 2020
Retiarii: A Deep Learning Exploratory-Training Framework OSDI 2020
Logic-guided Semantic Representation Learning for Zero-Shot Relation Classification COLING 2020
Predicting Personal Opinion on Future Events with Fingerprints COLING 2020
Learning to Detect Head Movement in Unconstrained Remote Gaze Estimation in the Wild WACV 2020
Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval AAAI 2020
Hybrid Graph Neural Networks for Crowd Counting AAAI 2020
Variational Adversarial Kernel Learned Imitation Learning AAAI 2020
Relational State-Space Model for Stochastic Multi-Object Systems ICLR 2020
Efficient Image Retrieval via Decoupling Diffusion into Online and Offline Processing AAAI 2019
Clustered Object Detection in Aerial Images ICCV 2019
Large-Scale Heterogeneous Feature Embedding AAAI 2019
LaSOT: A High-Quality Benchmark for Large-Scale Single Object Tracking CVPR 2019
Exploring Deep Multimodal Fusion of Text and Photo for Hate Speech Classification ACL 2019
Game Design for Eliciting Distinguishable Behavior NIPS 2019
Decoding EEG by Visual-guided Deep Neural Networks IJCAI 2019
Understanding Pictograph with Facial Features: End-to-End Sentence-Level Lip Reading of Chinese AAAI 2019
Cascaded SR-GAN for Scale-Adaptive Low Resolution Person Re-identification IJCAI 2018
Contour Knowledge Transfer for Salient Object Detection ECCV 2018
Batch Bayesian Optimization via Multi-objective Acquisition Ensemble for Automated Analog Circuit Design ICML 2018
Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals CVPR 2018
Attending Sentences to detect Satirical Fake News COLING 2018
Gandiva: Introspective Cluster Scheduling for Deep Learning OSDI 2018
Differentiable Learning of Logical Rules for Knowledge Base Reasoning NIPS 2017
Object-Aware Dense Semantic Correspondence CVPR 2017
Satirical News Detection and Analysis using Attention Mechanism and Linguistic Features EMNLP 2017
Good Semi-supervised Learning That Requires a Bad GAN NIPS 2017
Expectation Propagation with Stochastic Kinetic Model in Complex Interaction Systems NIPS 2017
Saliency Transfer: An Example-Based Method for Salient Object Detection IJCAI 2016
An Empirical Study of Automatic Chinese Word Segmentation for Spoken Language Understanding and Named Entity Recognition NAACL 2016
Selective inference for group-sparse linear models NIPS 2016
Leveraging Multiple Domains for Sentiment Classification COLING 2016
Exploit All the Layers: Fast and Accurate CNN Object Detector With Scale Dependent Pooling and Cascaded Rejection Classifiers CVPR 2016
Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification ICCV 2015
Semi-Supervised Chinese Word Segmentation Using Partial-Label Learning With Conditional Random Fields EMNLP 2014
An Empirical Study Of Semi-Supervised Chinese Word Segmentation Using Co-Training EMNLP 2013
A Chinese-English Organization Name Translation System Using Heuristic Web Mining and Asymmetric Alignment IJCNLP 2009
A Chinese-English Organization Name Translation System Using Heuristic Web Mining and Asymmetric Alignment ACL 2009
Switching to Real-Time Tasks in Multi-Tasking Dialogue COLING 2008
Chinese-English Backward Transliteration Assisted with Mining Monolingual Web Pages ACL 2008
CRFs-Based Named Entity Recognition Incorporated with Heuristic Entity List Searching IJCNLP 2008
Avoiding and Resolving Initiative Conflicts in Dialogue NAACL 2007
DialogueView: an Annotation Tool for Dialogue EMNLP 2005