Draft syllabus 2017–18

Foundations of Data Science

Lecturer: Dr Damon Wischik
Taken by: Part IB
Past exam questions for the predecessor course Mathematical Methods for Computer Science
No. of lectures (including practical classes): 12
Suggested hours of supervision: 3

This course is a prerequisite for Part IB Formal Models of Language, and for Part II Machine Learning and Bayesian Inference, Bioinformatics, Computer Systems Modelling, Information Theory, Quantum Computing, Natural Language Processing, Advanced Graphics.

What's new, compared to the predecessor course Mathematical Methods for Computer Science? (i) a more practical focus on working with data, including Ticks, (ii) three new lectures on the basics of statistical inference, expanding on the ideas introduced in Machine Learning and Real World Data, (iii) material on inner product spaces will be presented as "linear data modelling" rather than abstract linear algebra, (iv) Fourier analysis is removed.

Aims

This course introduces fundamental tools for describing and reasoning about data. There are two themes: describing the behaviour of randomised systems; and making inferences based on data generated by such systems. The course will survey a wide range of models and tools, and it will emphasize how to design a model and what sorts of questions one might ask about it.

Lectures

Objectives

At the end of the course students should

Recommended reading