Use this url to cite publication: https://hdl.handle.net/20.500.12259/47460
On the asymptotic equivalence between differential Hebbian and temporal difference learning using a local third factor
Type of publication
Straipsnis konferencijos medžiagoje Scopus duomenų bazėje / Article in conference proceedings in Scopus database (P1a2)
Author(s)
Author | Affiliation | |||
---|---|---|---|---|
Kolodziejski, Christoph | ||||
Title [en]
On the asymptotic equivalence between differential Hebbian and temporal difference learning using a local third factor
Is part of
Advances in neural information processing systems 21 : 22nd annual conference on neural information processing systems, Vancouver, British Columbia, Canada, December 8-11, 2008 : proceedings. Red Hook, NY : Curran, 2009
Date Issued
Date |
---|
2009 |
Publisher
Red Hook, NY : Curran, 2009
Is Referenced by
Extent
p. 857-864
Field of Science
Abstract (en)
In this theoretical contribution we provide mathematical proof that two of the most important classes of network learning - correlation-based differential Hebbian learning and reward-based temporal difference learning - are asymptotically equivalent when timing the learning with a localmodulatory signal. This opens the opportunity to consistently reformulate most of the abstract reinforcement learning framework from a correlation based perspective that is more closely related to the biophysics of neurons.
Type of document
type::text::journal::journal article::research article
Language
Anglų / English (en)
Coverage Spatial
Jungtinės Amerikos Valstijos / United States of America (US)
Date Reporting
2010
ISBN (of the container)
9781605609492
Other Identifier(s)
VDU02-000008387