Książka Reinforcement Learning Richard S. Sutton

Reinforcement Learning

Język: Angielski
Oprawa: Miękka
Wydawca: Springer, Berlin
Dostępność: Dostępna u dostawcy
Wysyłamy za 5-8 dni
844.55
Reinforcement learning is the learning of a mapping from situations to actions so as to maximize a s...

Informacje o książce

Język
Angielski
Oprawa
Książka - Miękka
Data wydania
2013
strony
172
EAN
9781461366089
ISBN
1461366089
Enbook ID
06796639
Waga
284
Wymiary
155 x 235 x 10

Pełny opis

Reinforcement learning is the learning of a mapping from situations to actions so as to maximize a scalar reward or reinforcement signal. The learner is not told which action to take, as in most forms of machine learning, but instead must discover which actions yield the highest reward by trying them. In the most interesting and challenging cases, actions may affect not only the immediate reward, but also the next situation, and through that all subsequent rewards. These two characteristics -- trial-and-error search and delayed reward -- are the most important distinguishing features of reinforcement learning. §Reinforcement learning is both a new and a very old topic in AI. The term appears to have been coined by Minsk (1961), and independently in control theory by Walz and Fu (1965). The earliest machine learning research now viewed as directly relevant was Samuel's (1959) checker player, which used temporal-difference learning to manage delayed reward much as it is used today. Of course learning and reinforcement have been studied in psychology for almost a century, and that work has had a very strong impact on the AI/engineering work. One could in fact consider all of reinforcement learning to be simply the reverse engineering of certain psychological learning processes (e.g. operant conditioning and secondary reinforcement). §Reinforcement Learning is an edited volume of original research, comprising seven invited contributions by leading researchers. §

Możesz być zainteresowany

Reinforcement Learning

Richard S. Sutton
440.40
537.42
1 426.61
1 840.57

Tidal

Amanda Hocking
83.02
33.05
65.81
64.84

Revelation

David R Veerman
64.84
1 426.61
42.96

Klienci, którzy kupili tę książkę, kupili również

217.86

Stredovek

Michele Angelico
17.68

Informes de Auditoria

Cristino Mu Oz Ortiz
358.05

Sehnsuchtsfaden

Anahita Pasalar
44.61
143.20
55.02