Department of Computer Science and Technology

Course pages 2024–25 (working draft)

Deep Neural Networks

Principal lecturer: Dr Ferenc Huszar
Additional lecturer: Dr Nic Lane
Taken by: Part II CST
Code: DNN
Term: Lent
Hours: 14 (lectures)
Format: In-person lectures
Class limit: max. 50 students

Objectives

You will gain detailed knowledge of

  1. Current understanding of generalization in neural networks vs classical statistical models.
  2. Optimization procedures for neural network models such as stochastic gradient descent and ADAM.
  3. Automatic differentiation and at least one software framework (PyTorch, TensorFlow), as well as an overview of other software approaches (a minimal autodiff sketch follows this list).
  4. Architectures that are deployed to deal with different data types such as images or sequences, including (a) convolutional networks, (b) recurrent networks, and (c) transformers.
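
To give a flavour of objective 3, here is a minimal sketch of reverse-mode automatic differentiation in PyTorch; the function and values are illustrative placeholders, not course material:

    import torch

    x = torch.tensor(2.0, requires_grad=True)  # track operations involving x
    y = x ** 3 + 2 * x                         # builds a computation graph
    y.backward()                               # reverse-mode automatic differentiation

    print(x.grad)  # dy/dx = 3*x**2 + 2, i.e. 14.0 at x = 2

Calling backward() traverses the recorded graph and accumulates gradients into the .grad attribute of each leaf tensor; this is the mechanism underlying the optimization procedures in objective 2.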

In addition, you will gain knowledge of more advanced topics reflecting recent research in machine learning. These change from year to year; examples include:

  1. Approaches to unsupervised learning, including autoencoders and self-supervised learning.
  2. Techniques for deploying models in low data regimes such as transfer learning and meta-learning.
  3. Techniques for propagating uncertainty such as Bayesian neural networks.
  4. Reinforcement learning and its applications to robotics or games.
  5. Graph Neural Networks and Geometric Deep Learning.

Teaching Style

The start of the course will focus on the current theoretical understanding of neural networks, contrasting it with the classical view of generalization performance. We will then move to practical examples of network architectures and their deployment. We will end with more advanced topics reflecting current research, mostly delivered by guest lecturers from the Department and beyond.

Schedule

Week 1

Two lectures: Generalization and Approximation with Neural Networks

Week 2

Two lectures: Optimization: Stochastic Gradient Descent and ADAM

Week 3

Two lectures: Hardware Acceleration and Convolutional Neural Networks

Week 4

Two lectures: Vanishing Gradients, Recurrent and Residual Networks, Sequence Modeling

Week 5

Two lectures: Attention, The Transformer Architecture, Large Language Models

Weeks 6–8

Lectures and guest lectures from the special topics below.

Special topics

  1. Self-Supervised Learning, Autoencoders, Generative Modeling of Images
  2. Hardware Implementations
  3. Reinforcement Learning
  4. Transfer Learning and Meta-Learning
  5. Uncertainty and Bayesian Neural Networks
  6. Graph Neural Networks

Assessment

Assessment involves solving a number of exercises in a Jupyter notebook environment. Familiarity with the Python programming language is assumed. The exercises rely on the PyTorch deep learning framework, whose syntax we do not cover in detail in the lectures; students are referred to the PyTorch documentation and tutorials to learn this component of the assessment. A sketch of the kind of training loop the exercises build on follows.
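
As a rough illustration of the style of code involved, here is a minimal PyTorch training loop; the model, data, and hyperparameters are placeholders, not assignment specifics:

    import torch
    import torch.nn as nn

    # Placeholder model and synthetic data standing in for a real task.
    model = nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 1))
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)  # or torch.optim.Adam
    loss_fn = nn.MSELoss()

    inputs = torch.randn(64, 10)
    targets = torch.randn(64, 1)

    for step in range(100):
        optimizer.zero_grad()                   # clear gradients from the previous step
        loss = loss_fn(model(inputs), targets)  # forward pass and loss
        loss.backward()                         # backpropagation via autodiff
        optimizer.step()                        # stochastic-gradient update

The documentation and tutorials mentioned above cover each of these pieces (tensors, modules, optimizers, autograd) in detail.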

Assignment 1

30% of the total marks, with guided exercises.

Assignment 2

70% of the total marks, a combination of guided exercises and a more exploratory mini-project.