Unsupervised Supervised Learning I: Estimating Classification and Regression Errors without Labels

Pinar Donmez, Guy Lebanon, Krishnakumar Balasubramanian.

Year: 2010, Volume: 11, Issue: 44, Pages: 1323−1351


Estimating the error rates of classifiers or regression models is a fundamental task in machine learning which has thus far been studied exclusively using supervised learning techniques. We propose a novel unsupervised framework for estimating these error rates using only unlabeled data and mild assumptions. We prove consistency results for the framework and demonstrate its practical applicability on both synthetic and real world data.