[electronic resource] /, 172 psl.

English kalba

Publikuota 1992 m. rugpjūčio 26 d., Springer US.

ISBN:
978-1-4613-6608-9
Copied ISBN!
OCLC numeris:
851793946

Žiūrėti „OpenLibrary“

Įvertinimų nėra (0 atsiliepimai)

Reinforcement learning is the learning of a mapping from situations to actions so as to maximize a scalar reward or reinforcement signal. The learner is not told which action to take, as in most forms of machine learning, but instead must discover which actions yield the highest reward by trying them. In the most interesting and challenging cases, actions may affect not only the immediate reward, but also the next situation, and through that all subsequent rewards. These two characteristics -- trial-and-error search and delayed reward -- are the most important distinguishing features of reinforcement learning. Reinforcement learning is both a new and a very old topic in AI. The term appears to have been coined by Minsk (1961), and independently in control theory by Walz and Fu (1965). The earliest machine learning research now viewed as directly relevant was Samuel's (1959) checker player, which used temporal-difference learning to manage …

7 leidimai

Temos

  • Computer science
  • Artificial intelligence