skip to primary navigationskip to content

Department of Computer Science and Technology

Data Science

Course pages 2021–22

Data Science

Course arrangements

  • Lecture notes These printed notes have some sections marked *, which are non-examinable. If you spot a mistake in the printed notes, please first check if it's corrected in the extended online notes (which I will update continually) and if not then email me.
  • Supervisions Your college may have have arranged either three or four supervisions for this course. If three, the suggested breakdown is:
    – Not for supervision: example sheet 0
    – Supervision 1: example sheet 1
    – Supervision 2: example sheet 2, and half of 3
    – Supervision 3: half of example sheet 3, and example sheet 4

Videos and example sheets

The videos have exactly the same material as a normal lecture, but they are short because there's no need for the lecturer to pause. You have a pause button — use it! You'll need to watch these videos several times, with the printed notes beside you. When there are handwritten equations in the videos, you should copy out the equations yourself to make certain you understand the method. When the video presents code, you should download the code and try it yourself. This will make the example sheets much easier. — The optional in-person sessions are digressions about off-syllabus topics and why it's all worth studying. THE IN-PERSON SESSIONS ARE OPTIONAL. THEY ARE NOT EXAMINABLE MATERIAL.
Example sheet 0 and solutions (prerequisites)
Lecture 1
8 Oct, 11am Optional in-person: introduction [slides]
Lecture 2
Lecture 3
Lecture 4 Mock exam question 1 and walkthrough (23:36)
15 Oct, 11am Optional in-person: prediction and parameters [slides]
Lecture 5
2.1 Linear modelling (13:27) lm
(Videos for section 2 are unchanged from last year)
Lecture 6
Lecture 7
Example sheet 1
22 Oct, 11am Optional in-person: climate dataset challenge [slides]
Lecture 8
4.2 Bayes's rule calculations (15:37)
The printed notes have an error in Example 4.2.1. See extended notes for correction.
Lecture 9
Lecture 10
29 Oct, 11am Optional in-person: R2 and model choice [slides]
Climate confidence challenge: find a 95% Bayesian confidence interval for the rate of temperature increase in the UK. Submit your answer on Moodle by 9am on Friday 5 November.
Lecture 11
Lecture 12
Lecture 13
5 Nov Optional in-person: the problem of induction [slides]
Climate confidence challenge: find a 95% frequentist confidence interval for the rate of temperature increase in the UK. Submit your answer on Moodle by 9am on Friday 12 November.
Lecture 14
Lecture 15
Lecture 16
Example sheet 4
  • Stoat-finding challenge: submit your answer on Moodle by midnight on 1 December. Top three submissions win stylish and elegant Data Stoat t-shirts.
12 Nov Optional in-person: Bayesian versus frequentist? [slides]