Learning Curves for Error Minimum and Maximum Likelihood Algorithms
- 1 September 1992
- journal article
- Published by MIT Press in Neural Computation
- Vol. 4 (5) , 712-719
- https://doi.org/10.1162/neco.1992.4.5.712
Abstract
For the problem of dividing the space originally partitioned by a blurred boundary, every learning algorithm can make the probability of incorrect prediction of an individual example ε decrease with the number of training examples t. We address here the question of how the asymptotic form of ε(t) as well as its limit of convergence reflect the choice of learning algorithms. The error minimum algorithm is found to exhibit rather slow convergence of ε(t) to its lower bound ε0, ε(t) - ε0 ∼ O(t-2/3). Even for the purpose of minimizing prediction error, the maximum likelihood algorithm can be utilized as an alternative. If the true probability distribution happens to be contained in the family of hypothetical functions, then the boundary estimated from the hypothetical distribution function eventually converges to the best choice. Convergence of the prediction error is then ε(t) - ε0 ∼ O(t-1). If the true distribution is not available from the algorithm, however, the boundary generally does not converge to the best choice, but instead ε(t) - ε1 ∼ ±O(t-1/2), where ε1 > ε0 > 0.Keywords
This publication has 4 references indexed in Scilit:
- Four Types of Learning CurvesNeural Computation, 1992
- A statistical approach to learning and generalization in layered neural networksProceedings of the IEEE, 1990
- What Size Net Gives Valid Generalization?Neural Computation, 1989
- A theory of the learnableCommunications of the ACM, 1984