We hosted the Workshop on
Computational Linguistic Methods for Language Learning Technology:
Writing, Reading, Interaction, Content Creation, Evaluation (WRICE) on
12th & 13th March in Cambridge: more information, including slides
and recordings for some of the talks, available here
Paper accepted for Findings of
the ACL @ ACL 2026: ‘Bias Dynamics in BabyLMs: Towards a
Compute-Efficient Sandbox for Democratising Pre-Training Debiasing’ by
Filip Trhlík, Andrew Caines, Paula Buttery: full text paper to
follow
Paper accepted for EACL 2026:
‘PictureStories: Predicting the Task Adherence of Language Learner
Answers to a Picture Story-Based Writing Task’ with Marie Bexte, Diane
Nicholls, Paula Buttery, Torsten Zesch: full text paper
and dataset available here
Paper accepted for CogInterp @
NeurIPS 2025: ‘Pedagogical Alignment of LLMs requires Diverse
Cognitively-Inspired Student Proxies’ by Suchir Salhan, Andrew Caines,
Paula Buttery: full
text paper
Paper accepted for BabyLM 2025:
‘Teacher Demonstrations in a BabyLM’s Zone of Proximal Development for
Contingent Multi-Turn Interaction’ by Suchir Salhan, Hongyi Gu, Donya
Rooein, Diana Galvan-Sosa, Gabrielle Gaudeau, Andrew Caines, Zheng Yuan,
Paula Buttery: full text
paper
Paper accepted for BabyLM 2025:
‘Looking to Learn: Token-wise Dynamic Gating for Low-Resource
Vision-Language Modelling’ by Bianca-Mihaela Ganescu, Suchir Salhan,
Andrew Caines, Paula Buttery: full text
paper
Paper accepted for BabyLM 2025:
‘BLiSS: Evaluating Bilingual Learner Competence in Second Language Small
Language Models’ by Yuan Gao, Suchir Salhan, Andrew Caines, Paula
Buttery, Weiwei Sun: full text
paper
New paper and dataset: ‘DACTYL:
Diverse Adversarial Corpus of Texts Yielded from Large Language Models’
by Shantanu Thorat & Andrew Caines: full text paper
Hello, I’m a Research Professor in the NLIP Group & ALTA Institute directed by Prof Paula Buttery, based in the Computer Laboratory at the University of Cambridge, U.K. I’m a member of the Cambridge Language Sciences Interdisciplinary Research Centre and I’m also currently working on a collaboration with the Cambridge Cybercrime Centre, funded by the ESRC and led by Dr Alice Hutchings. I recently obtained funding from the Cambridge Global Challenges Research Fund for a new project on machine translation of public health documents (more information).
I’ve been on the staff at the University since October 2013, having previously worked in Literature Services at the European Bioinformatics Institute, carried out research for English Profile, and studied for a PhD in what was the Research Centre for English & Applied Linguistics (before its amalgamation into the section of Theoretical & Applied Linguistics) from 2006 to 2010.
You can find out more about my research, teaching and outreach activities by exploring the rest of my site, my ACL Anthology, DBLP or Google Scholar profiles, and my GitHub page.
Contact me: firstname.lastname @ cl.cam.ac.uk