A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning

Geramifard, Alborz, Walsh, Thomas J., Stefanie, Tellex, Chowdhary, Girish

Omschrijving

A Markov Decision Process (MDP) is a natural framework for formulating sequential decision-making problems under uncertainty. In recent years, researchers have greatly advanced algorithms for learning and acting in MDPs. This book reviews such algorithms.
Gratis verzending vanaf
€ 19,95 binnen Nederland
Schrijver
Geramifard, Alborz, Walsh, Thomas J., Stefanie, Tellex, Chowdhary, Girish
Titel
A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning
Uitgever
now publishers Inc
Jaar
2013
Taal
Engels
Pagina's
92
Gewicht
143 gr
EAN
9781601987600
Afmetingen
234 x 156 x 5 mm
Bindwijze
Paperback

U ontvangt bij ons altijd de laatste druk!


Rubrieken

Boekstra