Selecting neural network architectures via the prediction risk: application to corporate bond rating prediction

10 December 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 35-41
https://doi.org/10.1109/aiaws.1991.236576

Abstract

The notion of generalization can be defined precisely as the prediction risk, the expected performance of an estimator on new observations. The authors propose the prediction risk as a measure of the generalization ability of multi-layer perceptron networks and use it to select the optimal network architecture. The prediction risk must be estimated from the available data. The authors approximate the prediction risk by v-fold cross-validation and asymptotic estimates of generalized cross-validation or H. Akaike's (1970) final prediction error. They apply the technique to the problem of predicting corporate bond ratings. This problem is very attractive as a case study, since it is characterized by the limited availability of the data and by the lack of complete a priori information that could be used to impose a structure to the network architecture.<>

Keywords

This publication has 5 references indexed in Scilit:

Spline Models for Observational Data
Published by Society for Industrial & Applied Mathematics (SIAM) ,1990
Cross-validation:a review²
Series Statistics, 1978
The Predictive Sample Reuse Method with Applications
Journal of the American Statistical Association, 1975
A completely automatic french curve: fitting spline functions by cross validation
Communications in Statistics, 1975
Statistical predictor identification
Annals of the Institute of Statistical Mathematics, 1970