TY - JOUR
T1 - Statistical inference
T2 - Learning in artificial neural networks
AU - Yang, Howard Hua
AU - Murata, Noboru
AU - Amari, Shun Ichi
PY - 1998/1/1
Y1 - 1998/1/1
N2 - Artificial neural networks (ANNs) are widely used to model low-level neural activities and high-level cognitive functions. In this article, we review the applications of statistical inference for learning in ANNs. Statistical inference provides an objective way to derive learning algorithms both for training and for evaluation of the performance of trained ANNs. Solutions to the over-fitting problem by model-selection methods, based on either conventional statistical approaches or on a Bayesian approach, are discussed. The use of supervised and unsupervised learning algorithms for ANNs are reviewed. Training a multilayer ANN by supervised learning is equivalent to nonlinear regression. The ensemble methods, bagging and arching, described here, can be applied to combine ANNs to form a new predictor with improved performance. Unsupervised learning algorithms that are derived either by the Hebbian law for bottom-up self-organization, or by global objective functions for top-down self-organization are also discussed.
AB - Artificial neural networks (ANNs) are widely used to model low-level neural activities and high-level cognitive functions. In this article, we review the applications of statistical inference for learning in ANNs. Statistical inference provides an objective way to derive learning algorithms both for training and for evaluation of the performance of trained ANNs. Solutions to the over-fitting problem by model-selection methods, based on either conventional statistical approaches or on a Bayesian approach, are discussed. The use of supervised and unsupervised learning algorithms for ANNs are reviewed. Training a multilayer ANN by supervised learning is equivalent to nonlinear regression. The ensemble methods, bagging and arching, described here, can be applied to combine ANNs to form a new predictor with improved performance. Unsupervised learning algorithms that are derived either by the Hebbian law for bottom-up self-organization, or by global objective functions for top-down self-organization are also discussed.
UR - http://www.scopus.com/inward/record.url?scp=0031813794&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0031813794&partnerID=8YFLogxK
U2 - 10.1016/S1364-6613(97)01114-5
DO - 10.1016/S1364-6613(97)01114-5
M3 - Review article
C2 - 21244956
AN - SCOPUS:0031813794
SN - 1364-6613
VL - 2
SP - 4
EP - 10
JO - Trends in Cognitive Sciences
JF - Trends in Cognitive Sciences
IS - 1
ER -