Generalised correlated cross-validation

Patrick S. Carmack; Jeffrey S. Spence; William R. Schucany

doi:10.1080/10485252.2012.655733

Generalised correlated cross-validation

Patrick S. Carmack, Jeffrey S. Spence, William R. Schucany

Clinical Sciences

Research output: Contribution to journal › Article › peer-review

9 Scopus citations

Abstract

Since its introduction by [Stone, M. (1974), 'Cross-validatory Choice and the Assessment of Statistical Predictions (with discussion)', Journal of the Royal Statistical Society, B36, 111-133] and [Geisser, S. (1975), 'The Predictive Sample Reuse Method with Applications', Journal of the American Statistical Association, 70, 320-328], cross-validation has been studied and improved by several authors including [Burman, P., Chow, E., and Nolan, D. (1994), 'A Cross-validatory Method for Dependent Data', Biometrika, 81(2), 351-358], [Hart, J. and Yi, S. (1998), 'One-sided Cross-validation', Journal of the American Statistical Association, 93(442), 620-630], [Racine, J. (2000), 'Consistent Cross-validatory Model-selection for Dependent Data: hv-block Cross-validation', Journal of Econometrics, 99, 39-61], [Hart, J. and Lee, C. (2005), 'Robustness of One-sided Cross-validation to Autocorrelation', Journal of Multivariate Analysis, 92(1), 77-96], and [Carmack, P., Spence, J., Schucany, W., Gunst, R., Lin, Q., and Haley, R. (2009), 'Far Casting Cross Validation', Journal of Computational and Graphical Statistics, 18(4), 879-893]. Perhaps the most widely used and best known is generalised cross-validation (GCV) [Craven, P. and Wahba, G. (1979), 'Smoothing Noisy Data with Spline Functions', Numerical Mathematics, 31, 377-403], which establishes a single-pass method that penalises the fit by the trace of the smoother matrix assuming independent errors. We propose an extension to GCV in the context of correlated errors, which is motivated by a natural definition for residual degrees of freedom. The efficacy of the new method is investigated with a simulation experiment on a kernel smoother with bandwidth selection in local linear regression. Next, the winning methodology is illustrated by application to spatial modelling of fMRI data using a nonparametric semivariogram. We conclude with remarks about the heteroscedastic case and a potential maximum likelihood framework for Gaussian random processes.

Original language	English (US)
Pages (from-to)	269-282
Number of pages	14
Journal	Journal of Nonparametric Statistics
Volume	24
Issue number	2
DOIs	https://doi.org/10.1080/10485252.2012.655733
State	Published - Jun 2012

Keywords

effective degrees of freedom
fMRI
model selection
nonparametric
spatial semivariogram
supervised learning
tuning parameter

ASJC Scopus subject areas

Statistics and Probability
Statistics, Probability and Uncertainty

Access to Document

10.1080/10485252.2012.655733

Cite this

@article{1c244d18195d4ca38adab9b4cfada506,

title = "Generalised correlated cross-validation",

abstract = "Since its introduction by [Stone, M. (1974), 'Cross-validatory Choice and the Assessment of Statistical Predictions (with discussion)', Journal of the Royal Statistical Society, B36, 111-133] and [Geisser, S. (1975), 'The Predictive Sample Reuse Method with Applications', Journal of the American Statistical Association, 70, 320-328], cross-validation has been studied and improved by several authors including [Burman, P., Chow, E., and Nolan, D. (1994), 'A Cross-validatory Method for Dependent Data', Biometrika, 81(2), 351-358], [Hart, J. and Yi, S. (1998), 'One-sided Cross-validation', Journal of the American Statistical Association, 93(442), 620-630], [Racine, J. (2000), 'Consistent Cross-validatory Model-selection for Dependent Data: hv-block Cross-validation', Journal of Econometrics, 99, 39-61], [Hart, J. and Lee, C. (2005), 'Robustness of One-sided Cross-validation to Autocorrelation', Journal of Multivariate Analysis, 92(1), 77-96], and [Carmack, P., Spence, J., Schucany, W., Gunst, R., Lin, Q., and Haley, R. (2009), 'Far Casting Cross Validation', Journal of Computational and Graphical Statistics, 18(4), 879-893]. Perhaps the most widely used and best known is generalised cross-validation (GCV) [Craven, P. and Wahba, G. (1979), 'Smoothing Noisy Data with Spline Functions', Numerical Mathematics, 31, 377-403], which establishes a single-pass method that penalises the fit by the trace of the smoother matrix assuming independent errors. We propose an extension to GCV in the context of correlated errors, which is motivated by a natural definition for residual degrees of freedom. The efficacy of the new method is investigated with a simulation experiment on a kernel smoother with bandwidth selection in local linear regression. Next, the winning methodology is illustrated by application to spatial modelling of fMRI data using a nonparametric semivariogram. We conclude with remarks about the heteroscedastic case and a potential maximum likelihood framework for Gaussian random processes.",

keywords = "effective degrees of freedom, fMRI, model selection, nonparametric, spatial semivariogram, supervised learning, tuning parameter",

author = "Carmack, {Patrick S.} and Spence, {Jeffrey S.} and Schucany, {William R.}",

year = "2012",

month = jun,

doi = "10.1080/10485252.2012.655733",

language = "English (US)",

volume = "24",

pages = "269--282",

journal = "Journal of Nonparametric Statistics",

issn = "1048-5252",

publisher = "Taylor and Francis Ltd.",

number = "2",

}

TY - JOUR

T1 - Generalised correlated cross-validation

AU - Carmack, Patrick S.

AU - Spence, Jeffrey S.

AU - Schucany, William R.

PY - 2012/6

Y1 - 2012/6

N2 - Since its introduction by [Stone, M. (1974), 'Cross-validatory Choice and the Assessment of Statistical Predictions (with discussion)', Journal of the Royal Statistical Society, B36, 111-133] and [Geisser, S. (1975), 'The Predictive Sample Reuse Method with Applications', Journal of the American Statistical Association, 70, 320-328], cross-validation has been studied and improved by several authors including [Burman, P., Chow, E., and Nolan, D. (1994), 'A Cross-validatory Method for Dependent Data', Biometrika, 81(2), 351-358], [Hart, J. and Yi, S. (1998), 'One-sided Cross-validation', Journal of the American Statistical Association, 93(442), 620-630], [Racine, J. (2000), 'Consistent Cross-validatory Model-selection for Dependent Data: hv-block Cross-validation', Journal of Econometrics, 99, 39-61], [Hart, J. and Lee, C. (2005), 'Robustness of One-sided Cross-validation to Autocorrelation', Journal of Multivariate Analysis, 92(1), 77-96], and [Carmack, P., Spence, J., Schucany, W., Gunst, R., Lin, Q., and Haley, R. (2009), 'Far Casting Cross Validation', Journal of Computational and Graphical Statistics, 18(4), 879-893]. Perhaps the most widely used and best known is generalised cross-validation (GCV) [Craven, P. and Wahba, G. (1979), 'Smoothing Noisy Data with Spline Functions', Numerical Mathematics, 31, 377-403], which establishes a single-pass method that penalises the fit by the trace of the smoother matrix assuming independent errors. We propose an extension to GCV in the context of correlated errors, which is motivated by a natural definition for residual degrees of freedom. The efficacy of the new method is investigated with a simulation experiment on a kernel smoother with bandwidth selection in local linear regression. Next, the winning methodology is illustrated by application to spatial modelling of fMRI data using a nonparametric semivariogram. We conclude with remarks about the heteroscedastic case and a potential maximum likelihood framework for Gaussian random processes.

AB - Since its introduction by [Stone, M. (1974), 'Cross-validatory Choice and the Assessment of Statistical Predictions (with discussion)', Journal of the Royal Statistical Society, B36, 111-133] and [Geisser, S. (1975), 'The Predictive Sample Reuse Method with Applications', Journal of the American Statistical Association, 70, 320-328], cross-validation has been studied and improved by several authors including [Burman, P., Chow, E., and Nolan, D. (1994), 'A Cross-validatory Method for Dependent Data', Biometrika, 81(2), 351-358], [Hart, J. and Yi, S. (1998), 'One-sided Cross-validation', Journal of the American Statistical Association, 93(442), 620-630], [Racine, J. (2000), 'Consistent Cross-validatory Model-selection for Dependent Data: hv-block Cross-validation', Journal of Econometrics, 99, 39-61], [Hart, J. and Lee, C. (2005), 'Robustness of One-sided Cross-validation to Autocorrelation', Journal of Multivariate Analysis, 92(1), 77-96], and [Carmack, P., Spence, J., Schucany, W., Gunst, R., Lin, Q., and Haley, R. (2009), 'Far Casting Cross Validation', Journal of Computational and Graphical Statistics, 18(4), 879-893]. Perhaps the most widely used and best known is generalised cross-validation (GCV) [Craven, P. and Wahba, G. (1979), 'Smoothing Noisy Data with Spline Functions', Numerical Mathematics, 31, 377-403], which establishes a single-pass method that penalises the fit by the trace of the smoother matrix assuming independent errors. We propose an extension to GCV in the context of correlated errors, which is motivated by a natural definition for residual degrees of freedom. The efficacy of the new method is investigated with a simulation experiment on a kernel smoother with bandwidth selection in local linear regression. Next, the winning methodology is illustrated by application to spatial modelling of fMRI data using a nonparametric semivariogram. We conclude with remarks about the heteroscedastic case and a potential maximum likelihood framework for Gaussian random processes.

KW - effective degrees of freedom

KW - fMRI

KW - model selection

KW - nonparametric

KW - spatial semivariogram

KW - supervised learning

KW - tuning parameter

UR - http://www.scopus.com/inward/record.url?scp=84860819075&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84860819075&partnerID=8YFLogxK

U2 - 10.1080/10485252.2012.655733

DO - 10.1080/10485252.2012.655733

M3 - Article

AN - SCOPUS:84860819075

SN - 1048-5252

VL - 24

SP - 269

EP - 282

JO - Journal of Nonparametric Statistics

JF - Journal of Nonparametric Statistics

IS - 2

ER -

Generalised correlated cross-validation

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this