Skip to main content

Research Repository

Advanced Search

Data compression and regression based on local principal curves

Einbeck, J; Evers, L; Hinchliff, K

Data compression and regression based on local principal curves Thumbnail


Authors

L Evers

K Hinchliff



Contributors

A Fink
Editor

B Lausen
Editor

W Seidel
Editor

A Ultsch
Editor

Abstract

Frequently the predictor space of a multivariate regression problem of the type y = m(x_1, …, x_p ) + ε is intrinsically one-dimensional, or at least of far lower dimension than p. Usual modeling attempts such as the additive model y = m_1(x_1) + … + m_p (x_p ) + ε, which try to reduce the complexity of the regression problem by making additional structural assumptions, are then inefficient as they ignore the inherent structure of the predictor space and involve complicated model and variable selection stages. In a fundamentally different approach, one may consider first approximating the predictor space by a (usually nonlinear) curve passing through it, and then regressing the response only against the one-dimensional projections onto this curve. This entails the reduction from a p- to a one-dimensional regression problem. As a tool for the compression of the predictor space we apply local principal curves. Taking things on from the results presented in Einbeck et al. (Classification – The Ubiquitous Challenge. Springer, Heidelberg, 2005, pp. 256–263), we show how local principal curves can be parametrized and how the projections are obtained. The regression step can then be carried out using any nonparametric smoother. We illustrate the technique using data from the physical sciences.

Citation

Einbeck, J., Evers, L., & Hinchliff, K. (2010). Data compression and regression based on local principal curves. In A. Fink, B. Lausen, W. Seidel, & A. Ultsch (Eds.), Advances in data analysis, data handling and business intelligence (701-712). https://doi.org/10.1007/978-3-642-01044-6_64

Conference Name 32nd annual Conference of the German Classification Society
Conference Location Hamburg
Publication Date Jan 1, 2010
Deposit Date Jan 13, 2011
Publicly Available Date Jan 31, 2011
Pages 701-712
Series Title Studies in classification, data analysis, and knowledge organization
Series ISSN 1431-8814
Book Title Advances in data analysis, data handling and business intelligence.
ISBN 9783642010446
DOI https://doi.org/10.1007/978-3-642-01044-6_64
Keywords Dimension reduction, Principal component regression, Principal curves, Smoothing,
Public URL https://durham-repository.worktribe.com/output/1158408

Files

Accepted Conference Proceeding (3.2 Mb)
PDF

Copyright Statement
The original publication is available at www.springerlink.com





You might also like



Downloadable Citations