New! New paper and dataset: ‘The Write & Improve Corpus 2024’ with Diane Nicholls, Paula Buttery: full text paper

New! Paper accepted for BabyLM 2024: ‘From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes’ with Zebulon Goriely, Richard Diehl Martinez, Lisa Beinborn, Paula Buttery: full text paper Winner of Outstanding Paper

New! Paper accepted for EMNLP 2024: ‘Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing’ with Richard Diehl Martinez, Zebulon Goriely, Paula Buttery, Lisa Beinborn: full text paper

New! Paper accepted for NLP4CALL 2024: ‘LLM chatbots as a language practice tool: a user study’ with Gladys Tyen and Paula Buttery: full text paper

New! Paper accepted for ACL Findings 2024: ‘Prompting open-source and commercial language models for grammatical error correction of English learner text’ with Christopher Davis, Øistein Andersen, Shiva Taslimipoor, Helen Yannakoudakis, Zheng Yuan, Christopher Bryant, Marek Rei, and Paula Buttery: full text paper

New! Paper accepted for LREC-COLING 2024: ‘Logging Keystrokes in Writing by English Learners’ with Georgios Velentzas, Rita Borgo, Erin Pacquetet, Clive Hamilton, Taylor Arnold, Diane Nicholls, Paula Buttery, Thomas Gaillat, Nicolas Ballier and Helen Yannakoudakis: full text paper


Hello, I’m a Senior Research Associate in the NLIP Group & ALTA Institute directed by Prof Paula Buttery, based in the Computer Laboratory at the University of Cambridge, U.K. I’m a member of the Cambridge Language Sciences Interdisciplinary Research Centre and I’m also currently working on a collaboration with the Cambridge Cybercrime Centre, funded by the ESRC and led by Dr Alice Hutchings. I recently obtained funding from the Cambridge Global Challenges Research Fund for a new project on machine translation of public health documents (more information).

I’ve been on the staff at the University since October 2013, having previously worked in Literature Services at the European Bioinformatics Institute, carried out research for English Profile, and studied for a PhD in what was the Research Centre for English & Applied Linguistics (before its amalgamation into the section of Theoretical & Applied Linguistics) from 2006 to 2010.

You can find out more about my research, teaching and outreach activities by exploring the rest of my site, my ACL Anthology, DBLP or Google Scholar profiles, and my GitHub page.


Contact me: firstname.lastname @ cl.cam.ac.uk