Beyond Discrepancy: A Closer Look at the Theory of Distribution Shift

Robi Bhattacharjee; Nicholas Rittler; Kamalika Chaudhuri

2026 ALT ALT 2026

Beyond Discrepancy: A Closer Look at the Theory of Distribution Shift

Abstract

Learning theory of distribution shift generally bounds performance on the target distribution as a function of the discrepancy between the source and target, rarely guaranteeing high target accuracy. Instead of relying on the discrepancy, we adopt an assumption inspired by Invariant Risk Minimization, where the source and target distributions are unified by an unknown feature projection. Under this assumption, we show that a learner can leverage the relationship between the source and target distributions to greatly reduce the number of required target samples to achieve high accuracy. To quantify this effect, we introduce a new combinatorial complexity measure—the distance dimension—and derive bounds for linear maps and neural networks.

Authors

Robi Bhattacharjee , Nicholas Rittler , Kamalika Chaudhuri

Topics

Machine Learning > Optimization & Theory > Learning Theory Artificial Intelligence > Learning Paradigms > Domain Adaptation Machine Learning > Learning Types > Distribution Shift

Keywords

distribution shift invariant risk minimization distance dimension

Download PDF

Related papers

No Scale Sensitive Dimension for Distribution Learning 2026

Sample-Near-Optimal Agnostic Boosting with Improved Running Time 2026

Variance Reduction and Low Sample Complexity in Stochastic Optimization via Proximal Point Method 2026

Improved Replicable Boosting with Majority-of-Majorities 2026

Learning with Monotone Adversarial Corruptions 2026