Learning to reach by reinforcement learning using a receptive field based function approximation approach with continuous actions

Tamošiūnaitė, Minija; Asfour, Tamim; Wörgötter, Florentin

doi:10.1007/s00422-009-0295-8

Use this url to cite publication: https://hdl.handle.net/20.500.12259/54371

Learning to reach by reinforcement learning using a receptive field based function approximation approach with continuous actions

Type of publication

Straipsnis Web of Science ir Scopus duomenų bazėje / Article in Web of Science and Scopus database (S1)

Author(s)

Author	Affiliation
Tamošiūnaitė, Minija	Informatikos fakultetas / Faculty of Informatics	LT	University Göttingen, Germany	DE
Asfour, Tamim	University of Gottingen, Germany	DE
Wörgötter, Florentin	University Göttingen, Germany	DE

Title

Learning to reach by reinforcement learning using a receptive field based function approximation approach with continuous actions

[en]

Is part of

Biological Cybernetics. Berlyn : Springer, Vol. 100, no. 3 (2009)

Date Issued

Date
2009

Publisher

Berlyn : Springer

Publisher (trusted)

Springer

Is Referenced by

Science Citation Index Expanded (Web of Science)

SpringerLink

MEDLINE

Academic Search Complete

Scopus

Extent

p. 249-260

URI

URI
https://doi.org/10.1007/s00422-009-0295-8
https://hdl.handle.net/20.500.12259/54371

DOI

10.1007/s00422-009-0295-8

Field of Science

Informatika / Inform...

Keywords (en)

Reinforcement learnin...

Function approximatio...

Robot control

Abstract (en)

Reinforcement learning methods can be used in robotics applications especially for specific target-oriented problems, for example the reward-based recalibration of goal directed actions. To this end still relatively large and continuous state-action spaces need to be efficiently handled. The goal of this paper is, thus, to develop a novel, rather simple method which uses reinforcement learning with function approximation in conjunction with different reward-strategies for solving such problems. For the testing of our method, we use a four degree-of-freedom reaching problem in 3D-space simulated by a two-joint robot arm system with two DOF each. Function approximation is based on 4D, overlapping kernels (receptive fields) and the state-action space contains about 10,000 of these. Different types of reward structures are being compared, for example, reward-on- touching-only against reward-on-approach. Furthermore, forbidden joint configurations are punished. A continuous action space is used. In spite of a rather large number of states and the continuous action space these reward/punishment strategies allow the system to find a good solution usually within about 20 trials. The efficiency of our method demonstrated in this test scenario suggests that it might be possible to use it on a real robot for problems where mixed rewards can be defined in situations where other types of learning might be difficult.

Type of document

type::text::journal::journal article::research article

Language

Anglų / English (en)

Coverage Spatial

Vokietija / Germany (DE)

ISSN (of the container)

0340-1200

WOS

WOS:000264260600005

Other Identifier(s)

VDU02-000006216

Informatikos fakultetas / Faculty of Informatics

Vytauto Didžiojo universitetas / Vytautas Magnus University

Journal	IF	AIF	AIF (min)	AIF (max)	Cat	AV	Year	Quartile
BIOLOGICAL CYBERNETICS	1.697	2.652	1.439	3.864	2	0.495	2009	Q2

Journal	IF	AIF	AIF (min)	AIF (max)	Cat	AV	Year	Quartile
BIOLOGICAL CYBERNETICS	1.697	2.652	1.439	3.864	2	0.495	2009	Q2