Path-finding in real and simulated rats :  assessing the influence of path characteristics on navigation learning

Tamošiūnaitė, Minija; Ainge, James; Kulvičius, Tomas; Porr, Bernd; Dudchenko, Paul; Wörgötter, Florentin

doi:10.1007/s10827-008-0094-6

Use this url to cite publication: https://hdl.handle.net/20.500.12259/54303

Path-finding in real and simulated rats : assessing the influence of path characteristics on navigation learning

Type of publication

Straipsnis Web of Science ir Scopus duomenų bazėje / Article in Web of Science and Scopus database (S1)

Author(s)

Author	Affiliation
Tamošiūnaitė, Minija	Vytauto Didžiojo universitetas / Vytautas Magnus University	LT	University of Stirling, Scotland	GB
Ainge, James	University of Stirling, Scotland	GB
Kulvičius, Tomas	Vytauto Didžiojo universitetas / Vytautas Magnus University	LT	University Göttingen, Germany	DE
Porr, Bernd	University of Glasgow, Scotland	DE
Dudchenko, Paul	University of Stirling, Scotland	DE
Wörgötter, Florentin

Title

Path-finding in real and simulated rats : assessing the influence of path characteristics on navigation learning

[en]

Is part of

Journal of vomputational neuroscience. Dordrecht: Springer, 2008, Vol. 25, no. 3

Date Issued

Date
2008

Publisher

Dordrecht: Springer

Publisher (trusted)

Springer

Is Referenced by

Science Citation Index Expanded (Web of Science)

INSPEC

SpringerLink

ProQuest Central

Extent

p. 562-582

URI

URI
https://doi.org/10.1007/s10827-008-0094-6
https://hdl.handle.net/20.500.12259/54303

DOI

10.1007/s10827-008-0094-6

Field of Science

Informatika / Inform...

Keywords (en)

Reinforcement learnin...

SARSA

Place field system

Function approximatio...

Weight decay

Abstract (en)

A large body of experimental evidence suggests that the hippocampal place field system is involved in reward based navigation learning in rodents. Reinforcement learning (RL) mechanisms have been used to model this, associating the state space in an RL-algorithm to the place-field map in a rat. The convergence properties of RL-algorithms are affected by the exploration patterns of the learner. Therefore, we first analyzed the path characteristics of freely exploring rats in a test arena. We found that straight path segments with mean length 23 cm up to a maximal length of 80 cm take up a significant proportion of the total paths. Thus, rat paths are biased as compared to random exploration. Next we designed a RL system that reproduces these specific path characteristics. Our model arena is covered by overlapping, probabilistically firing place fields (PF) of realistic size and coverage. Because convergence of RL-algorithms is also influenced by the state space characteristics, different PF-sizes and densities, leading to a different degree of overlap, were also investigated. The model rat learns finding a reward opposite to its starting point. We observed that the combination of biased straight exploration, overlapping coverage and probabilistic firing will strongly impair the convergence of learning. When the degree of randomness in the exploration is increased, convergence improves, but the distribution of straight path segments becomes unrealistic and paths become 'wiggly'. To mend this situation without affecting the path characteristic two additional mechanisms are implemented: a gradual drop of the learned weights (weight decay) and path length limitation, which prevents learning if the reward is not found after some expected time. Both mechanisms limit the memory of the system and thereby counteract effects of getting trapped on a wrong path.

Type of document

type::text::journal::journal article::research article

Language

Anglų / English (en)

Coverage Spatial

Nyderlandai / Netherlands (NL)

ISSN (of the container)

0929-5313

WOS

WOS:000259438100009

Other Identifier(s)

VDU02-000005123

Journal	IF	AIF	AIF (min)	AIF (max)	Cat	AV	Year	Quartile
JOURNAL OF COMPUTATIONAL NEUROSCIENCE	2.75	3.352	2.835	3.869	2	0.741	2008	Q1

Journal	IF	AIF	AIF (min)	AIF (max)	Cat	AV	Year	Quartile
JOURNAL OF COMPUTATIONAL NEUROSCIENCE	2.75	3.352	2.835	3.869	2	0.741	2008	Q1