arXiv:1603.02501 Abstract | arXiv Analytics

arXiv:1603.02501 [cs.LG]Abstract References Reviews Resources

Mixture Proportion Estimation via Kernel Embedding of Distributions

Harish G. Ramaswamy, Clayton Scott, Ambuj Tewari

Published 2016-03-08Version 1

Mixture proportion estimation (MPE) is the problem of estimating the weight of a component distribution in a mixture, given samples from the mixture and component. This problem constitutes a key part in many "weakly supervised learning" problems like learning with positive and unlabelled samples, learning with label noise, anomaly detection and crowdsourcing. While there have been several methods proposed to solve this problem, to the best of our knowledge no efficient algorithm with a proven convergence rate towards the true proportion exists for this problem. We fill this gap by constructing a provably correct algorithm for MPE, and derive convergence rates under certain assumptions on the distribution. Our method is based on embedding distributions onto an RKHS, and implementing it only requires solving a simple convex quadratic programming problem a few times. We run our algorithm on several standard classification datasets, and demonstrate that it performs comparably to or better than other algorithms on most datasets.

Categories: cs.LG, stat.ML

Keywords: mixture proportion estimation, distribution, kernel embedding, simple convex quadratic programming problem, proven convergence rate

Related articles: Most relevant | Search more

arXiv:1906.03574 [cs.LG] (Published 2019-06-09)

Transfer Learning by Modeling a Distribution over Policies

Disha Shrivastava, Eeshan Gunesh Dhekane, Riashat Islam

arXiv:2010.15100 [cs.LG] (Published 2020-10-28)

Evaluating Model Robustness to Dataset Shift

Adarsh Subbaswamy, Roy Adams, Suchi Saria

arXiv:2006.06887 [cs.LG] (Published 2020-06-12)

Stochastic Optimization for Performative Prediction

Celestine Mendler-Dünner, Juan C. Perdomo, Tijana Zrnic, Moritz Hardt