eArticles

Home

eArticles

검색결과 돌아가기

검색화면

Export 프린트

Simple and Fast Calculation of the Second-Order Gradients for Globalized Dual Heuristic Dynamic Programming in Neural Networks

Resource Type: Periodical
Authors: Fairbank, M.; Alonso, E.; Prokhorov, D.
Source: IEEE Transactions on Neural Networks and Learning Systems IEEE Trans. Neural Netw. Learning Syst. Neural Networks and Learning Systems, IEEE Transactions on. 23(10):1671-1676 Oct, 2012
Subject: Computing and Processing
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
General Topics for Engineers
Neural networks
Vectors
Heuristic algorithms
Backpropagation
Training
Equations
Trajectory
Adaptive dynamic programming
dual heuristic programming
neural networks
value-gradient learning
Language
ISSN: 2162-237X
2162-2388

Online Access

초록

We derive an algorithm to exactly calculate the mixed second-order derivatives of a neural network's output with respect to its input vector and weight vector. This is necessary for the adaptive dynamic programming (ADP) algorithms globalized dual heuristic programming (GDHP) and value-gradient learning. The algorithm calculates the inner product of this second-order matrix with a given fixed vector in a time that is linear in the number of weights in the neural network. We use a “forward accumulation” of the derivative calculations which produces a much more elegant and easy-to-implement solution than has previously been published for this task. In doing so, the algorithm makes GDHP simple to implement and efficient, bridging the gap between the widely used DHP and GDHP ADP methods.

공지

DAU Library

eArticles

요약정보

Simple and Fast Calculation of the Second-Order Gradients for Globalized Dual Heuristic Dynamic Programming in Neural Networks

Online Access

초록