New! Paper accepted for COLING 2020 with Christian Bentz, Kate Knill, Marek Rei and Paula Buttery, ‘Grammatical error detection in transcriptions of spoken English’.

New! Paper accepted for CoNLL 2020 with Roddy MacSween, ‘An Expectation Maximisation Algorithm for Automated Cognate Detection’.

New! Paper accepted for W-NUT 2020 with Jack Hughes, Seth Aycock, Paula Buttery and Alice Hutchings, ‘Detecting Trending Terms in Cybersecurity Forum Discussions’.

New! New article online about our collaboration with Dr Fridah Katushemererwe of Makerere University, Uganda, on ‘Building natural language processing tools for Runyakitara’, funded by the Cambridge-Africa Programme, with Prof Paula Buttery. https://doi.org/10.1515/applirev-2020-2004

New! Open dataset of ML & NLP papers put together by Marek Rei, used for our blogpost on NLP conferences, geographic diversity and carbon emissions, and also for Marek’s new study of 2019 publications.


Hello, I’m a Senior Research Associate in the NLIP Group & ALTA Institute, based in the Computer Laboratory at the University of Cambridge, U.K. I’m also a member of the Cambridge Language Sciences network.

I’ve been on the staff at the University since October 2013, having previously worked in Literature Services at the European Bioinformatics Institute, carried out research for English Profile, and studied for a PhD in what was the Research Centre for English & Applied Linguistics (before its amalgamation into the section of Theoretical & Applied Linguistics) from 2006 to 2010.

You can find out more about my research, teaching and outreach activities by exploring the rest of my site, my Google Scholar profile, my GitHub page and Twitter feed.


Contact me: firstname.lastname @ cl.cam.ac.uk