- Academic Registry
Course & Unit Catalogues


CM30359: Reinforcement learning

[Page last updated: 09 August 2024]

Academic Year: 2024/25
Owning Department/School: Department of Computer Science
Credits: 6 [equivalent to 12 CATS credits]
Notional Study Hours: 120
Level: Honours (FHEQ level 6)
Period:
Semester 1
Assessment Summary: CW 100%
Assessment Detail:
  • Coursework (CW 100%)
Supplementary Assessment:
Like-for-like reassessment (where allowed by programme regulations)
Requisites: Before taking this module you must take CM10310 OR take CM20220 OR take CM20252
In taking this module you cannot take CM50270
Learning Outcomes: On completion of the unit, the students will be able to:
1. formulate reinforcement learning problems by defining a state space, an action space, and a reward function, appropriate for the context,
2. apply a range of solution methods to reinforcement learning problems,
3. appreciate the difficulties encountered in solving large, complex reinforcement learning problems in practice.


Aims: To explore reinforcement learning as an approach to artificial intelligence; to understand how reinforcement learning differs from other approaches to machine learning such as supervised and unsupervised learning; to learn how to formulate and solve reinforcement learning problems, and to appreciate the difficultes involved in solving large, complex reinforcement learning problems in practice.

Skills: Intellectual skills:
* Conceptual understanding of sequential decision making under uncertainty (T, F, A)
Practical skills:
* Programming reinforcement learning algorithms (T, F, A)
* Use of software libraries for reinforcement learning (T, F, A)
Transferable skills:
* Technical report writing (F, A)

Content: Topics covered normally include: dynamic programming, Monte Carlo methods, temporal-difference algorithms (e.g., Q-learning), integration of planning and learning, value function approximation (e.g., with deep neural networks), policy-gradient methods, application areas, and an introduction to active areas of research (e.g., hierarchical reinforcement learning, intrinsically-motivated reinforcement learning systems).

Course availability:

CM30359 is Optional on the following courses:

Department of Computer Science
  • USCM-AFB06 : BSc(Hons) Computer Science (Year 3)
  • USCM-AAB07 : BSc(Hons) Computer Science with Study year abroad (Year 4)
  • USCM-AKB07 : BSc(Hons) Computer Science with Year long work placement (Year 4)
  • USCM-AFB27 : BSc(Hons) Computer Science and Artificial Intelligence (Year 3)
  • USCM-AAB27 : BSc(Hons) Computer Science and Artificial Intelligence with Study year abroad (Year 4)
  • USCM-AKB27 : BSc(Hons) Computer Science and Artificial Intelligence with Year long work placement (Year 4)
  • USCM-AFB20 : BSc(Hons) Computer Science and Mathematics (Year 3)
  • USCM-AAB20 : BSc(Hons) Computer Science and Mathematics with Study year abroad (Year 4)
  • USCM-AKB20 : BSc(Hons) Computer Science and Mathematics with Year long work placement (Year 4)
  • USCM-AFM01 : MComp(Hons) Computer Science (Year 3)
  • USCM-AAM02 : MComp(Hons) Computer Science with Study year abroad (Year 3)
  • USCM-AKM02 : MComp(Hons) Computer Science with Year long work placement (Year 3)
  • USCM-AFM27 : MComp(Hons) Computer Science and Artificial Intelligence (Year 3)
  • USCM-AAM27 : MComp(Hons) Computer Science and Artificial Intelligence with Study year abroad (Year 3)
  • USCM-AKM27 : MComp(Hons) Computer Science and Artificial Intelligence with Year long work placement (Year 3)
  • USCM-AFM14 : MComp(Hons) Computer Science and Mathematics (Year 3)
  • USCM-AAM14 : MComp(Hons) Computer Science and Mathematics with Study year abroad (Year 3)
  • USCM-AKM14 : MComp(Hons) Computer Science and Mathematics with Year long work placement (Year 3)

Notes:

  • This unit catalogue is applicable for the 2024/25 academic year only. Students continuing their studies into 2025/26 and beyond should not assume that this unit will be available in future years in the format displayed here for 2024/25.
  • Courses and units are subject to change in accordance with normal University procedures.
  • Availability of units will be subject to constraints such as staff availability, minimum and maximum group sizes, and timetabling factors as well as a student's ability to meet any pre-requisite rules.
  • Find out more about these and other important University terms and conditions here.