Haoyan Luo: Explainability of large language models.
Yuval Shaval: Training Language Agents to Learn from Experience.
George Shaikovski: Explainable and Steerable Complex Reasoning in Transformer-based Language Models.
Elaf Almahmoud: The Value of AI Assistance. (First supervisor with Umang Bhatt)
Past PhD Students
Mateo Espinosa Zarlenga (2025): Help Wanted: Robust Concept Interventions for Interpretable Deep Neural Networks.
Albert Qiaochu Jiang (2024): Language models for verifiable mathematical automation: Interaction, integration, and autoformalization.
Andrei Margeloiu (2025): Breaking the curse of dimensionality in low-data tasks.
Dimitrios Deslis (2025): Developing an Interactive Computer Support System to Facilitate Teacher Learning of Proof-related Instruction in Mathematics. (First supervisor with Andreas Stylianides in Education Faculty)
Mei Yang (2024): Designing a Computer-assisted System For Prospective Mathematical Teachers’ Training of Proof-Related Instruction
. (First supervisor with Andreas Stylianides in Education Faculty)
Dmitry Kazhdan (2024): Enhancing Interpretability: The Role of Concept-based Explanations Across Data Types. (First supervisor with Pietro Lio)
Paul Scherer (2023): Distributional and relational inductive biases for graph representation learning in biomedicine. (First supervisor with Pietro Lio)
Agnieszka Slowik (2023): Out-of-distribution generalisation in machine learning. (First supervisor with Sean Holden)
Aaron
Stockdill (2021): Automating representation change across domains for reasoning.