Reinforcement Learning with History Lists

von Stephan Timmer

Softcover - 9783838106212

69,90 €

Versandkostenfrei

Auf meine Merkliste

Hinweis: Print on Demand. Lieferbar in 2 Tagen.

Lieferzeit nach Versand: ca. 1-2 Tage
inkl. MwSt. & Versandkosten (innerhalb Deutschlands)

Autorenfreundlich Bücher kaufen?!

Beschreibung

A very general framework for modeling uncertainty in learning environments is given by Partially observable Markov Decision Processes (POMDPs). In a POMDP setting, the learning agent infers a policy for acting optimally in all possible states of the environment, while receiving only observations of these states. The basic idea for coping with partial observability is to include memory into the representation of the policy. Perfect memory is provided by the belief space, i.e. the space of probability distributions over environmental states. However, computing policies defined on the belief space requires a considerable amount of prior knowledge about the learning problem and is expensive in terms of computation time.The author Stephan Timmer presents a reinforcement learning algorithm for solving POMDPs based on short term memory. In contrast to belief states, short term memory is not capable of representing optimal policies, but is far more practical and requires no prior knowledge about the learning problem. It can be shown that the algorithm can also be used to solve large Markov Decision Processes (MDPs) with continuous, multi-dimensional state spaces.

Solving Partially Observable Decision Processes by Using Short Term Memory

Details

Verlag	Südwestdeutscher Verlag für Hochschulschriften
Ersterscheinung	21. April 2009
Maße	22 cm x 15 cm x 1.1 cm
Gewicht	256 Gramm
Format	Softcover
ISBN-13	9783838106212
Seiten	160

Schlagwörter

Digital- und Informationstechnologien: allgemeine Themen

Reinforcement Learning with History Lists

von Stephan Timmer

Autorenfreundlich Bücher kaufen?!

Beschreibung

Solving Partially Observable Decision Processes by Using Short Term Memory

Details

Schlagwörter

Sinn-volles Banking mit

Verantwortungseigentum

Mitglied im

Gefördert durch

Kontakt

Shop-FAQ

Autorenprogramm

Signieraktionen

Versand und Zahlung

Datenschutz

Shop-AGB

Impressum

Widerruf

Vertrag widerrufen

Reinforcement Learning with History Lists

von Stephan Timmer

Autorenfreundlich Bücher kaufen?!

Beschreibung

Solving Partially Observable Decision Processes by Using Short Term Memory

Details

Schlagwörter

Bekannt durch

Widerrufsantrag einreichen