New! Paper accepted for LREC-COLING 2024: ‘Logging Keystrokes in Writing by English Learners’ with Georgios Velentzas, Rita Borgo, Erin Pacquetet, Clive Hamilton, Taylor Arnold, Diane Nicholls, Paula Buttery, Thomas Gaillat, Nicolas Ballier and Helen Yannakoudakis: full text paper to follow.

New! Paper accepted for LREC-COLING 2024: ‘Grammatical Error Correction for Code-Switched Sentences by Learners of English’ with Kelvin Chan, Christopher Bryant, Li Nguyen and Zheng Yuan: full text paper to follow.

New! Paper published online: ‘Prompting open-source and commercial language models for grammatical error correction of English learner text’ with Christopher Davis, Andrew Caines, Øistein Andersen, Shiva Taslimipoor, Helen Yannakoudakis, Zheng Yuan, Christopher Bryant, Marek Rei, and Paula Buttery: full text paper

New! Paper accepted for the BabyLM Challenge at CoNLL 2023: ‘CLIMB – Curriculum Learning for Infant-inspired Model Building’ with Richard Diehl Martinez, Hope McGovern, Zébulon Goriely, Christopher Davis, Paula Buttery and Lisa Beinborn: full text paper. Winner of the “Most Interesting Paper Award”.

New! Paper accepted for the 1st Empowering Education with LLMs – the Next-Gen Interface and Content Generation Workshop at AIED: ‘On the application of Large Language Models for language teaching and assessment technology’ with Luca Benedetto, Shiva Taslimipoor, Christopher Davis, Yuan Gao, Øistein Andersen, Zheng Yuan, Mark Elliott, Russell Moore, Christopher Bryant, Marek Rei, Helen Yannakoudakis, Andrew Mullooly, Diane Nicholls and Paula Buttery: full text paper

New! Paper accepted for NLP4CALL 2022: ‘The Teacher-Student Chatroom Corpus version 2: more lessons, new annotation, automatic detection of sequence shifts’ with Helen Yannakoudakis, Helen Allen, Pascual Pérez-Paredes, Bill Byrne and Paula Buttery: full text paper


Hello, I’m a Senior Research Associate in the NLIP Group & ALTA Institute directed by Prof Paula Buttery, based in the Computer Laboratory at the University of Cambridge, U.K. I’m a member of the Cambridge Language Sciences Interdisciplinary Research Centre and I’m also currently working on a collaboration with the Cambridge Cybercrime Centre, funded by the ESRC and led by Dr Alice Hutchings. I recently obtained funding from the Cambridge Global Challenges Research Fund for a new project on machine translation of public health documents (more information).

I’ve been on the staff at the University since October 2013. Before that I worked in Literature Services at the European Bioinformatics Institute, carried out research for English Profile, and studied for a PhD from 2006 to 2010 in what was then the Research Centre for English & Applied Linguistics (before its amalgamation into the section of Theoretical & Applied Linguistics).

You can find out more about my research, teaching and outreach activities by exploring the rest of my site, my ACL Anthology, DBLP and Google Scholar profiles, and my GitHub page.


Contact me: firstname.lastname @ cl.cam.ac.uk