Department of Computer Science and Technology – Natural Language and Information Processing Research Group: Publications

Natural Language and Information Processing Research Group

2023

Language Variety Identification with True Labels

Marcos Zampieri, Kai North, Tommi Jauhiainen, Mariano Felice, Neha Kumari, Nishant Nair, Yash Bangera

arXiv. 2023

[pdf]

Word segmentation from transcriptions of child-directed speech using lexical and sub-lexical cues

Zébulon Goriely, Andrew Caines, Paula Buttery

Journal of Child Language. 2023

[pdf]

Automated hate speech detection and span extraction in underground hacking and extremist forums

Linda Zhou, Andrew Caines, Ildiko Pete, Alice Hutchings

Natural Language Engineering. 2023

[pdf]

Shibboleth: An agent-based model of signalling mimicry

Jonathan R Goodman, Andrew Caines, Robert A Foley

PLoS ONE. 2023

[pdf]

On the application of Large Language Models for language teaching and assessment technology

Andrew Caines, Luca Benedetto, Shiva Taslimipoor, Christopher Davis, Yuan Gao, Øistein Andersen, Zheng Yuan, Mark Elliott, Russell Moore, Christopher Bryant, Marek Rei, Helen Yannakoudakis, Andrew Mullooly, Diane Nicholls, Paula Buttery

Proceedings of Empowering Education with LLMs – the Next-Gen Interface and Content Generation. 2023

[pdf]

Argot as a Trust Signal: Slang, Jargon & Reputation on a Large Cybercrime Forum

Jack Hughes, Andrew Caines, Alice Hutchings

Proceedings of the 22nd Annual Workshop on the Economics of Information Security. 2023

[pdf]

MultiGED-2023 shared task at NLP4CALL: Multilingual Grammatical Error Detection

Elena Volodina, Christopher Bryant, Andrew Caines, Orphée De Clercq, Jennifer-Carmen Frey, Elizaveta Ershova, Alexandr Rosen, Olga Vinogradova

Proceedings of the 12th Workshop on NLP for Computer Assisted Language Learning. 2023

[pdf]

Visual Spatial Reasoning

Fangyu Liu, Guy Emerson, Nigel Collier

Transactions of the Association for Computational Linguistics (TACL). 2023

[pdf]

Functional Distributional Semantics at Scale

Chun Hei Lo, Hong Cheng, Wai Lam, Guy Emerson

Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM). 2023

[pdf]

SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)

Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Seid Muhie Yimam, David Ifeoluwa Adelani, Ibrahim Sa'id Ahmad, Nedjma Ousidhoum, Abinew Ayele, Saif M Mohammad, Meriem Beloucif

Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval 2023)

[pdf]

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages

Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Abinew Ali Ayele, Nedjma Ousidhoum, David Ifeoluwa Adelani, Seid Muhie Yimam, Ibrahim Sa'id Ahmad, Meriem Beloucif, Saif Mohammad, Sebastian Ruder, Oumaima Hourrane, Pavel Brazdil, Felermino Dário Mário António Ali, Davis David, Salomey Osei, Bello Shehu Bello, Falalu Ibrahim, Tajuddeen Gwadabe, Samuel Rutunda, Tadesse Belay, Wendimu Baye Messelle, Hailu Beshada Balcha, Sisay Adugna Chala, Hagos Tesfahun Gebremichael, Bernard Opoku, Steven Arthur

arXiv. 2023

[pdf]

On the Intersection of Context-Free and Regular Languages

Clemente Pasti, Andreas Opedal, Tiago Pimentel, Tim Vieira, Jason Eisner, Ryan Cotterell

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics. 2023

[pdf]

On the Usefulness of Embeddings, Clusters and Strings for Text Generation Evaluation

Tiago Pimentel, Clara Meister, Ryan Cotterell

Proceedings of the International Conference on Learning Representations (ICLR). 2023

[pdf]

On the Effect of Anticipation on Reading Times

Tiago Pimentel, Clara Meister, Ethan G. Wilcox, Roger Levy, Ryan Cotterell

Transactions of the Association for Computational Linguistics. 2023

[pdf]

Locally Typical Sampling

Clara Meister, Tiago Pimentel, Gian Wiher, Ryan Cotterell

Transactions of the Association for Computational Linguistics. 2023

[pdf]

A survey on recent approaches to Question Difficulty Estimation from text

Luca Benedetto, Paolo Cremonesi, Andrew Caines, Paula Buttery, Andrea Cappelli, Andrea Giussani and Roberto Turrin

ACM Computing Surveys. 2023

[pdf]

Probabilistic Lexical Semantics: From Gaussian Embeddings to Bernoulli Fields

Guy Emerson

Probabilistic Approaches to Linguistic Theory. 2023

2022

Varifocal Question Generation for Fact-checking

Nedjma Ousidhoum, Zhangdie Yuan, Andreas Vlachos

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing

[pdf]

The Architectural Bottleneck Principle

Tiago Pimentel, Josef Valvoda, Niklas Stoehr, Ryan Cotterell

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing

[pdf]

Opening up Minds with Argumentative Dialogues

Youmna Farag, Charlotte O. Brand, Jacopo Amidei, Paul Piwek, Tom Stafford, Svetlana Stoyanchev, Andreas Vlachos

Findings of the Association for Computational Linguistics: EMNLP 2022

[pdf]

CEPOC: The Cambridge Exams Publishing Open Cloze dataset

Mariano Felice, Shiva Taslimipoor, Øistein E. Andersen and Paula Buttery

Proceedings of the 2022 International Conference on Language Resources and Evaluation (LREC 2022)

[pdf]

Prompting for a conversation: How to control a dialog model?

Josef Valvoda, Yimai Fang, David Vandyke

Proceedings of the 2nd Workshop on When Creative AI Meets Conversational AI, 29th International Conference on Computational Linguistics. 2022

[pdf]

On the Role of Negative Precedent in Legal Outcome Prediction

Josef Valvoda, Ryan Cotterell, Simone Teufel

Transactions of the Association for Computational Linguistics. 2022

[pdf]

Benchmarking Compositionality with Formal Languages

Josef Valvoda, Naomi Saphra, Jonathan Rawski, Adina Williams, Ryan Cotterell

Proceedings of the 29th International Conference on Computational Linguistics. 2022

[pdf]

Identifying relevant common sense information in knowledge graphs

Guy Aglionby, Simone Teufel

Proceedings of the First Workshop on Commonsense Representation and Reasoning. 2022

[pdf]

Using machine learning to create a repository of judgments concerning a new practice area: a case study in animal protection law

Joe Watson, Guy Aglionby, Samuel March

Artificial Intelligence and Law. 2022

[pdf]

20 years of the Grammar Matrix: cross-linguistic hypothesis testing of increasingly complex interactions

Olga Zamaraeva, Chris Curtis, Guy Emerson, Antske Fokkens, Michael Wayne Goodman, Kristen Howell, T.J. Trimble, Emily M. Bender

Journal of Language Modelling. 2022

[pdf]

Using dependency parsing for few-shot learning in distributional semantics

Stefania Preda, Guy Emerson

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. 2022

[pdf]

Extended Rater Representations in the Many-Facet Rasch Model

Mark Elliott, Paula Buttery

Journal of Applied Measurement. 2022

Accelerating Human Translation of Public Health Information into Low-Resource Languages with Machine Translation

Dimitra Stasinou, Theresa Biberauer, Ebele Mọgọ and Andrew Caines

Cambridge Occasional Papers in Linguistics. 2022

[pdf]

ALEN App: Argumentative Writing Support To Foster English Language Learning

Thiemo Wambsganss, Andrew Caines and Paula Buttery

Proceedings of the 17th Workshop on Innovative Use of NLP for Building Educational Applications. 2022

[pdf]

Towards an open-domain chatbot for language practice

Gladys Tyen, Mark Brenchley, Andrew Caines and Paula Buttery

Proceedings of the 17th Workshop on Innovative Use of NLP for Building Educational Applications. 2022

[pdf]

The Specificity and Helpfulness of Peer-to-Peer Feedback in Higher Education

Roman Rietsche, Andrew Caines, Cornelius Schramm, Dominik Pfütze and Paula Buttery

Proceedings of the 17th Workshop on Innovative Use of NLP for Building Educational Applications. 2022

[pdf]

POSTCOG: A tool for interdisciplinary research into underground forums at scale

Ildikó Pete, Jack Hughes, Andrew Caines, Alice Hutchings, Ross Anderson and Paula Buttery

Proceedings of WACCO. 2022

[pdf]

Probing for targeted syntactic knowledge through grammatical error detection

Christopher Davis, Christopher Bryant, Andrew Caines, Marek Rei and Paula Buttery

Proceedings of the 2022 SIGNLL Conference on Computational Natural Language Learning

Naturalistic Causal Probing for Morpho-Syntax

Afra Amini, Tiago Pimentel, Clara Meister, Ryan Cotterell

Transactions of the Association for Computational Linguistics. 2022

[pdf]

Rethinking Reinforcement Learning for Recommendation: A Prompt Perspective

Xin Xin, Tiago Pimentel, Alexandros Karatzoglou, Pengjie Ren, Konstantina Christakopoulou, Zhaochun Ren

SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2022

[pdf]

Probing for the Usage of Grammatical Number

Karim Lasri, Tiago Pimentel, Alessandro Lenci, Thierry Poibeau, Ryan Cotterell

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2022

[pdf]

Analyzing Wrap-Up Effects through an Information-Theoretic Lens

Clara Meister, Tiago Pimentel, Thomas Clark, Ryan Cotterell, Roger Levy

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 2022

[pdf]

On the probability-quality paradox in language generation

Clara Meister, Gian Wiher, Tiago Pimentel, Ryan Cotterell

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 2022

[pdf]

Constructing Open Cloze Tests Using Generation and Discrimination Capabilities of Transformers

Mariano Felice, Shiva Taslimipoor and Paula Buttery

Findings of the Association for Computational Linguistics: ACL 2022

Learning Functional Distributional Semantics with Visual Data

Yinhong Liu, Guy Emerson

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022

[pdf]

2021

Non-Iterative Conditional Pairwise Estimation for the Rating Scale Model

Mark Elliott, Paula Buttery

Educational and Psychological Measurement. 2021

[pdf]

Word Complexity is in the Eye of the Beholder

S Gooding, E Kochmar, SM Yimam, C Biemann

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

[pdf]

Predicting Text Readability from Scrolling Interactions

S Gooding, Y Berzak, T Mak, M Sharifi

Proceedings of the 25th Conference on Computational Natural Language Learning. 2021

[pdf]

Efficient Unsupervised NMT for Related Languages with Cross-Lingual Language Models and Fidelity Objectives

Rami Aly, Andrew Caines, Paula Buttery

Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects. 2021

[pdf]

A surprisal–duration trade-off across and within the world’s languages

Tiago Pimentel, Clara Meister, Elizabeth Salesky, Simone Teufel, Damián Blasi, Ryan Cotterell

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

[pdf]

Revisiting the Uniform Information Density Hypothesis

Clara Meister, Tiago Pimentel, Patrick Haller, Lena Jäger, Ryan Cotterell, Roger Levy

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

[pdf]

A Bayesian Framework for Information-Theoretic Probing

Tiago Pimentel, Ryan Cotterell

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

[pdf]

On Homophony and Rényi Entropy

Tiago Pimentel, Clara Meister, Simone Teufel, Ryan Cotterell

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

[pdf]

Disambiguatory Signals are Stronger in Word-initial Positions

Tiago Pimentel, Ryan Cotterell, Brian Roark

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. 2021

[pdf]

Modeling the Unigram Distribution

Irene Nikkarinen, Tiago Pimentel, Damián Blasi, Ryan Cotterell

Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021

[pdf]

A Non-Linear Structural Probe

Jennifer C. White, Tiago Pimentel, Naomi Saphra, Ryan Cotterell

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

[pdf]

What About the Precedent: An Information-Theoretic Analysis of Common Law

Josef Valvoda, Tiago Pimentel, Niklas Stoehr, Ryan Cotterell, Simone Teufel

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

[pdf]

Finding Concept-specific Biases in Form–Meaning Associations

Tiago Pimentel, Brian Roark, Søren Wichmann, Ryan Cotterell, Damián Blasi

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

[pdf]

How (Non-)Optimal is the Lexicon?

Tiago Pimentel, Irene Nikkarinen, Kyle Mahowald, Ryan Cotterell, Damián Blasi

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

[pdf]

Incremental Beam Manipulation for Natural Language Generation

James Hargreaves, Andreas Vlachos, Guy Emerson

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL). 2021

[pdf]

Synthetic Textual Features for the Large-Scale Detection of Basic-level Categories in English and Mandarin

Yiwen Chen and Simone Teufel

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

[pdf]

Quantifying Social Biases in NLP: A Generalization and Empirical Comparison of Extrinsic Fairness Metrics

Paula Czarnowska, Yogarshi Vyas, Kashif Shah

Transactions of the Association for Computational Linguistics (TACL). 2021

[pdf]

Computational linguistics and grammar engineering

Emily M. Bender, Guy Emerson

Head-Driven Phrase Structure Grammar: The handbook. 2021

[pdf]

2020

Analyzing Neural Discourse Coherence Models

Youmna Farag, Josef Valvoda, Helen Yannakoudakis, Ted Briscoe

Proceedings of the First Workshop on Computational Approaches to Discourse. 2020

[pdf]

The Teacher-Student Chatroom Corpus

Andrew Caines, Helen Yannakoudakis, Helena Edmondson, Helen Allen, Pascual Pérez-Paredes, Bill Byrne, Paula Buttery

Proceedings of the 9th Workshop on NLP for Computer Assisted Language Learning (NLP4CALL). 2020

[pdf]

Morphologically Aware Word-Level Translation

Paula Czarnowska, Sebastian Ruder, Ryan Cotterell and Ann Copestake

Proceedings of the 2020 International Conference on Computational Linguistics (COLING)

[pdf]

A Graph Based Framework for Structured Prediction Tasks in Sanskrit

Amrith Krishna, Bishal Santra, Ashim Gupta, Pavankumar Satuluri, Pawan Goyal

Computational Linguistics. 2020

[pdf]

Keep it Surprisingly Simple: A Simple First Order Graph Based Parsing Model for Joint Morphosyntactic Parsing in Sanskrit

Amrith Krishna, Ashim Gupta, Deepak Garasangi, Pavankumar Satuluri, Pawan Goyal

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

[pdf]

Verbal Multiword Expressions for Identification of Metaphor

Omid Rohanian, Marek Rei, Shiva Taslimipoor, Le An Ha

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020)

[pdf]

Seeing Both the Forest and the Trees: Multi-head Attention for Joint Classification on Different Compositional Levels

Miruna Pislar, Marek Rei

The 28th International Conference on Computational Linguistics (COLING-2020)

[pdf]

Grammatical error detection in transcriptions of spoken English

Andrew Caines, Christian Bentz, Kate Knill, Marek Rei, Paula Buttery

The 28th International Conference on Computational Linguistics (COLING-2020)

[pdf]

Coding Textual Inputs Boosts the Accuracy of Neural Networks

Abdul Rafae Khan, Jia Xu, and Weiwei Sun

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

[pdf]

CanVEC - the Canberra Vietnamese-English code-switching natural speech corpus

Li Nguyen, Christopher Bryant

Proceedings of The 12th Language Resources and Evaluation Conference. 2020

[pdf]

Social-Computation-Supporting Kinds

David Strohmaier

Canadian Journal of Philosophy. 2020

[pdf]

SeCoDa: Sense Complexity Dataset

David Strohmaier, Sian Gooding, Shiva Taslimipoor, Ekaterina Kochmar

Proceedings of LREC. 2020

[pdf]

Building natural language processing tools for Runyakitara

Fridah Katushemererwe, Andrew Caines, Paula Buttery

Applied Linguistics Review. 2020

[pdf]

Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions

Hannah Craighead, Andrew Caines, Paula Buttery, Helen Yannakoudakis

Proceedings of ACL. 2020

[pdf]

REPROLANG 2020: Automatic Proficiency Scoring of Czech, English, German, Italian, and Spanish learner essays

Andrew Caines, Paula Buttery

Proceedings of LREC. 2020

[pdf]

Adaptive Forgetting Curves for Spaced Repetition Language Learning

Ahmed Zaidi, Andrew Caines, Russell Moore, Paula Buttery, Andrew Rice

Proceedings of AIED. 2020

[pdf]

Investigating Cross-Linguistic Adjective Ordering Tendencies with a Latent-Variable Model

Jun Yen Leung, Guy Emerson, Ryan Cotterell

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

[pdf]

Multiple Question Fronting without Relational Constraints: An Analysis of Russian as a Basis for Cross-Linguistic Modeling

Olga Zamaraeva, Guy Emerson

Proceedings of the 27th International Conference on Head-Driven Phrase Structure Grammar (HPSG). 2020

[pdf]

Linguists Who Use Probabilistic Models Love Them: Quantification in Functional Distributional Semantics

Guy Emerson

Proceedings of the Probability and Meaning Conference (PaM 2020)

[pdf]

Please Mind the Root: Decoding Arborescences for Dependency Parsing

Ran Zmigrod, Tim Vieira, Ryan Cotterell

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

[pdf]

Speakers Fill Lexical Semantic Gaps with Context

Tiago Pimentel, Rowan Hall Maudslay, Damián Blasi, Ryan Cotterell

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

[pdf]

Pareto Probing: Trading Off Accuracy for Complexity

Tiago Pimentel, Naomi Saphra, Adina Williams, Ryan Cotterell

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

[pdf]

Information-Theoretic Probing for Linguistic Structure

Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay, Ran Zmigrod, Adina Williams, Ryan Cotterell

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020

[pdf]

A Corpus for Large-Scale Phonetic Typology

Elizabeth Salesky, Eleanor Chodroff, Tiago Pimentel, Matthew Wiesner, Ryan Cotterell, Alan W Black, Jason Eisner

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020

[pdf]

Predicting Declension Class from Form and Meaning

Adina Williams, Tiago Pimentel, Hagen Blix, Arya D. McCarthy, Eleanor Chodroff, Ryan Cotterell

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020

[pdf]

A Tale of a Probe and a Parser

Rowan Hall Maudslay, Josef Valvoda, Tiago Pimentel, Adina Williams, Ryan Cotterell

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020

[pdf]

Phonotactic Complexity and Its Trade-offs

Tiago Pimentel, Brian Roark, Ryan Cotterell

Transactions of the Association for Computational Linguistics. 2020

[pdf]

Leveraging sentence similarity in natural language generation: Improving beam search using range voting

Sebastian Borgeaud, Guy Emerson

Proceedings of the 4th Workshop on Neural Generation and Translation (WNGT). 2020

[pdf]

What are the Goals of Distributional Semantics?

Guy Emerson

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020

[pdf]

Autoencoding Pixies: Amortised Variational Inference with Graph Convolutions for Functional Distributional Semantics

Guy Emerson

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020

[pdf]

2019

Meaning to Form: Measuring Systematicity as Information

Tiago Pimentel, Arya D. McCarthy, Damián Blasi, Brian Roark, Ryan Cotterell

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019

[pdf]

Counterfactual Data Augmentation for Mitigating Gender Stereotypes in Languages with Rich Morphology

Ran Zmigrod, Sabrina J. Mielke, Hanna Wallach, Ryan Cotterell

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019

[pdf]

Active Learning for Financial Investment Reports

Sian Gooding and Ted Briscoe

Proceedings of the Second Financial Narrative Processing Workshop (FNP 2019)

[pdf]

Don't Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction

Paula Czarnowska, Sebastian Ruder, Edouard Grave, Ryan Cotterell and Ann Copestake

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019)

[pdf]

Multi-Task Learning for Coherence Modeling

Youmna Farag and Helen Yannakoudakis

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019

[pdf]

Entropy as a proxy for gap complexity in open cloze tests

Mariano Felice and Paula Buttery

Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP 2019)

[pdf]

The BEA-2019 Shared Task on Grammatical Error Correction

Christopher Bryant, Mariano Felice, Øistein E. Andersen and Ted Briscoe

Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications (BEA-2019)

[pdf]

Recursive Context-Aware Lexical Simplification

Sian Gooding, Ekaterina Kochmar

Proceedings of EMNLP 2019

Complex Word Identification as a Sequence Labelling Task

Sian Gooding, Ekaterina Kochmar

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019

[pdf]

Comparative judgments are more consistent than binary classification for labelling word complexity

Sian Gooding, Ekaterina Kochmar, Advait Sarkar, Alan Blackwell

Proceedings of the 13th Linguistic Annotation Workshop. 2019

[pdf]

Automatic learner summary assessment for reading comprehension

Menglin Xia, Ekaterina Kochmar, Ted Briscoe

Proceedings of NAACL-HLT 2019

[pdf]

Words are Vectors, Dependencies are Matrices: Learning Word Embeddings from Dependency Graphs

Paula Czarnowska, Guy Emerson, Ann Copestake

Proceedings of the 13th International Conference on Computational Semantics (IWCS). 2019

[pdf]

The cross-linguistic performance of statistical word segmentation models

Andrew Caines, Emma Altmann-Richer & Paula Buttery

Journal of Child Language 46(6): 1169-1201. 2019

Overview of the 2019 Spoken CALL Shared Task

Claudia Baur, Andrew Caines, Cathy Chua, Johanna Gerlach, Mengjie Qian, Manny Rayner, Martin Russell, Helmer Strik & Xizi Wei

Proceedings of the 8th ISCA Workshop on Speech and Language Technology in Education (SLaTE). 2019

Skills Embeddings: a neural approach to multicomponent representations of students and tasks

Russell Moore, Andrew Caines, Mark Elliott, Ahmed Zaidi, Andrew Rice & Paula Buttery

Proceedings of the 12th International Conference on Educational Data Mining (EDM 2019)

Accurate modelling of language learning tasks and students using representations of grammatical proficiency

Ahmed Zaidi, Andrew Caines, Christopher Davis, Russell Moore, Paula Buttery & Andrew Rice

Proceedings of the 12th International Conference on Educational Data Mining (EDM 2019)

Automatic homework selection with deep behavioural cloning

Russell Moore, Andrew Caines, Andrew Rice & Paula Buttery

Proceedings of the 20th International Conference on Artificial Intelligence in Education (AIED 2019)

Bad Form: Comparing Context-Based and Form-Based Few-Shot Learning in Distributional Semantic Models

Jeroen Van Hautte, Guy Emerson and Marek Rei

Proceedings of the Second Workshop on Deep Learning for Low-Resource NLP (DeepLo 2019)

[pdf]

Modelling the interplay of metaphor and emotion through multitask learning

Verna Dankers, Marek Rei, Martha Lewis and Ekaterina Shutova

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019)

Semi-Supervised Bootstrapping of Dialogue State Trackers for Task-Oriented Modelling

Bo-Hsiang Tseng, Marek Rei, Paweł Budzianowski, Richard Turner, Bill Byrne and Anna Korhonen

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019)

Neural and FST-based approaches to grammatical error correction

Zheng Yuan, Felix Stahlberg, Marek Rei, Bill Byrne and Helen Yannakoudakis

Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2019)

[pdf]

Context is Key: Grammatical Error Detection with Contextual Word Representations

Samuel Bell, Helen Yannakoudakis and Marek Rei

Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2019)

[pdf]

CAMsterdam at SemEval-2019 Task 6: Neural and graph-based feature extraction for the identification of offensive tweets

Guy Aglionby, Christopher Davis, Pushkar Mishra, Andrew Caines, Helen Yannakoudakis, Marek Rei, Ekaterina Shutova and Paula Buttery

Proceedings of the International Workshop on Semantic Evaluation 2019 (SemEval 2019)

[pdf]

Factorising AMR generation through syntax

Kris Cao, Stephen Clark