Inhalt

[ 536MLPEREIV20 ] VL (*)Reinforcement Learning

Versionsauswahl
(*) Leider ist diese Information in Deutsch nicht verfügbar.
Workload Ausbildungslevel Studienfachbereich VerantwortlicheR Semesterstunden Anbietende Uni
3 ECTS B3 - Bachelor 3. Jahr Artificial Intelligence Gerhard Widmer 2 SSt Johannes Kepler Universität Linz
Detailinformationen
Quellcurriculum Bachelorstudium Artificial Intelligence 2025W
Lernergebnisse
Kompetenzen
(*)Students have a basic understanding of core concepts, theories, and methods relating to the field of reinforcement learning. They understand what kinds of problems can be suitably modeled as sequential decision processes and addressed with reinforcement learning algorithms.
Fertigkeiten Kenntnisse
(*)Students understand the basic concepts of and assumptions behind Markov decision processes (k2), and how to model a given sequential decision and optimisation task as a reinforcement learning problem (k3). They know fundamental algorithms of reinforcement learning, and can set up reinforcement learning agents and experiments (k3/4). (*)Fundamental concepts of reinforcement learning, underlying modeling assumpsions, and basic algorithms of reinforcement learning:

  • Solution methods for k-armed Bandits, and their practical application
  • Formal treatment of Markov Decision Problems (MDPs)
  • Theory for solving MDPs
  • Table-based solution methods for MDPs with discrete state spaces
  • Selected approximate solution methods for MDPs with continuous state spaces
  • Outlook on approximate solution of MDPs with very large discrete state spaces and deterministic dynamics (e.g. board games such as „Chess“ and „Go“)
Beurteilungskriterien (*)Positive grade on the final exam (written).
Lehrmethoden (*)Standard lectures. Positive reinforcement of active lecture participation through rewards in the form of small chocolate treats
Abhaltungssprache Englisch
Literatur (*)Richard S. Sutton and Andrew G. Barto. 2018. Introduction to Reinforcement Learning (2nd. edition). MIT Press, Cambridge, MA, USA.
Lehrinhalte wechselnd? Nein
Sonstige Informationen (*)This lecture course (VO) and the corresponding exercise course (UE) form a didactic unit. The study results described here are achieved through the combination of these two courses.
Äquivalenzen (*)in collaboration with 536MLPEREIU20: UE Reinforcement Learning (1.5 ECTS) equivalent to
536MLPEREIK19: KV Reinforcement Learning (4.5 ECTS)
Präsenzlehrveranstaltung
Teilungsziffer -
Zuteilungsverfahren Direktzuteilung