Estimating Labels from Label Proportions
Novi Quadrianto, Alex J. Smola, Tibério S. Caetano, Quoc V. Le; 10(82):2349−2374, 2009.
Consider the following problem: given sets of unlabeled observations, each set with known label proportions, predict the labels of another set of observations, possibly with known label proportions. This problem occurs in areas like e-commerce, politics, spam filtering and improper content detection. We present consistent estimators which can reconstruct the correct labels with high probability in a uniform convergence sense. Experiments show that our method works well in practice.
|© JMLR 2009. (edit, beta)|