Generalizing objects by analyzing language

Tamošiūnaitė, Minija; Markelic, Irene; Kulvičius, Tomas; Wörgötter, Florentin

Use this url to cite publication: https://hdl.handle.net/20.500.12259/41145

Generalizing objects by analyzing language

Type of publication

Straipsnis konferencijos medžiagoje kitoje duomenų bazėje / Article in conference proceedings in other databases (P1c)

Author(s)

Author	Affiliation
Tamošiūnaitė, Minija	Informatikos fakultetas / Faculty of Informatics	LT
Markelic, Irene	University Gottingen, Germany	DE
Kulvičius, Tomas	Informatikos fakultetas / Faculty of Informatics	LT
Wörgötter, Florentin

Title

Generalizing objects by analyzing language

[en]

Is part of

IEEE-RAS : 11th international conference on Humanoid robots bled, Slovenia, October 26-28, 2011 Piscataway : IEEE Press

Date Issued

Date
2011

Publisher

Piscataway : IEEE Press

Publisher (trusted)

IEEE Press

Is Referenced by

IEEE Xplore

Extent

p. 557-563

URI

URI
https://hdl.handle.net/20.500.12259/41145

Field of Science

Abstract (en)

Generalizing objects in an action-context by a robot, for example addressing the problem: ”Which items can be cut with which tools?”, is an unresolved and difficult problem. Answering such a question defines a complete action class and robots cannot do this so far. We use a bootstrapping mechanism similar to that known from human language acquisition, and combine language- with image-analysis to create action classes built around the verb (action) in an utterance. A human teaches the robot a certain sentence, for example: ”Cut a sausage with a knife”, from where on the machine generalizes the arguments (nouns) that the verb takes and searches for possible alternative nouns. Then, by ways of an internet-based image search and a classification algorithm, image classes for the alternative nouns are extracted, by which a large ”picture book” of the possible objects involved in an action is created. This concludes the generalization step. Using the same classifier, the machine can now also perform a recognition procedure. Without having seen the objects before, it can analyze a visual scene, discovering, for example, a cucumber and a mandolin, which match to the earlier found nouns allowing it to suggest actions like: ”I could cut a cucumber with a mandolin”. The algorithm for generalizing objects by analyzing language (GOAL) presented here, allows, thus, generalization and recognition of objects in an action-context. It can then be combined with methods for action execution (e.g. action generation-based on human demonstration) to execute so far unknown actions.

Type of document

type::text::journal::journal article::research article

Language

Anglų / English (en)

Coverage Spatial

Jungtinės Amerikos Valstijos / United States of America (US)

ISBN (of the container)

9781612848686

Other Identifier(s)

VDU02-000011206