Einbeck, Jochen (2011) 'Bandwidth selection for mean-shift based unsupervised learning techniques : a unified approach via self-coverage.', Journal of pattern recognition research., 6 (2). pp. 175-192.
The mean shift is a simple but powerful tool emerging from the computer science literature which shifts a point to the local center of mass around this point. It has been used as a building block for several nonparametric unsupervised learning techniques, such as density mode estimation, clustering, and the estimation of principal curves. Due to the localized way of averaging, it requires the specification of a window size in form of a bandwidth (matrix). This paper proposes to use a so-called self-coverage measure as a general device for bandwidth selection in this context. In short, a bandwidth h will be favorable if a high proportion of data points falls within circles or ``hypertubes"; of radius h centered at the fitted object. The method is illustrated through real data examples in the light of several unsupervised estimation problems.
|Full text:||(VoR) Version of Record|
Download PDF (574Kb)
|Publisher Web site:||http://dx.doi.org/10.13176/11.288|
|Date accepted:||No date available|
|Date deposited:||25 October 2011|
|Date of first online publication:||2011|
|Date first made open access:||No date available|
Save or Share this output
|Look up in GoogleScholar|