New! Paper accepted for CogInterp @ NeurIPS 2025: ‘Position: Pedagogical Alignment of LLMs requires Diverse Cognitively-Inspired Student Proxies’ by Suchir Salhan, Andrew Caines, Paula Buttery: full text paper to follow

New! Paper accepted for BabyLM 2025: ‘Teacher Demonstrations in a BabyLM’s Zone of Proximal Development for Contingent Multi-Turn Interaction’ by Suchir Salhan, Hongyi Gu, Donya Rooein, Diana Galvan-Sosa, Gabrielle Gaudeau, Andrew Caines, Zheng Yuan, Paula Buttery: full text paper to follow

New! Paper accepted for BabyLM 2025: ‘Looking to Learn: Token-wise Dynamic Gating for Low-Resource Vision-Language Modelling’ byBianca-Mihaela Ganescu, Suchir Salhan, Andrew Caines, Paula Buttery: full text paper to follow

New! Paper accepted for BabyLM 2025: ‘BLiSS: Evaluating Bilingual Learner Competence in Second Language Small Language Models’ by Yuan Gao, Suchir Salhan, Andrew Caines, Paula Buttery, Weiwei Sun: full text paper to follow

New! New paper in a special issue of Education Sciences on Technology & Language Teacher Education: ‘Corpus-Based Reflective Practice to Support Chatroom Teaching Practice’ by Elaine Riordan, Fiona Farr, Andrew Caines, and Paula Buttery: full text paper

New! New paper and dataset: ‘DACTYL: Diverse Adversarial Corpus of Texts Yielded from Large Language Models’ by Shantanu Thorat,& Andrew Caines: full text paper

New! Paper accepted for BEA 2025: ‘LLM-based post-editing as reference-free GEC evaluation’ with Robert Östling and Murathan Kurfalı: full text paper


Hello, I’m a Research Professor in the NLIP Group & ALTA Institute directed by Prof Paula Buttery, based in the Computer Laboratory at the University of Cambridge, U.K. I’m a member of the Cambridge Language Sciences Interdisciplinary Research Centre and I’m also currently working on a collaboration with the Cambridge Cybercrime Centre, funded by the ESRC and led by Dr Alice Hutchings. I recently obtained funding from the Cambridge Global Challenges Research Fund for a new project on machine translation of public health documents (more information).

I’ve been on the staff at the University since October 2013, having previously worked in Literature Services at the European Bioinformatics Institute, carried out research for English Profile, and studied for a PhD in what was the Research Centre for English & Applied Linguistics (before its amalgamation into the section of Theoretical & Applied Linguistics) from 2006 to 2010.

You can find out more about my research, teaching and outreach activities by exploring the rest of my site, my ACL Anthology, DBLP or Google Scholar profiles, and my GitHub page.


Contact me: firstname.lastname @ cl.cam.ac.uk