Department of Computer Science and Technology

Natural Language and Information Processing Research Group

2023

Language Variety Identification with True Labels
Marcos Zampieri, Kai North, Tommi Jauhiainen, Mariano Felice, Neha Kumari, Nishant Nair, Yash Bangera
ArXiv. 2023
Word segmentation from transcriptions of child-directed speech using lexical and sub-lexical cues
Zébulon Goriely, Andrew Caines, Paula Buttery
Journal of Child Language. 2023
Automated hate speech detection and span extraction in underground hacking and extremist forums
Linda Zhou, Andrew Caines, Ildiko Pete, Alice Hutchings
Natural Language Engineering. 2023
Shibboleth: An agent-based model of signalling mimicry
Jonathan R Goodman, Andrew Caines, Robert A Foley
PLoS ONE. 2023
On the application of Large Language Models for language teaching and assessment technology
Andrew Caines, Luca Benedetto, Shiva Taslimipoor, Christopher Davis, Yuan Gao, Oeistein Andersen, Zheng Yuan, Mark Elliott, Russell Moore, Christopher Bryant, Marek Rei, Helen Yannakoudakis, Andrew Mullooly, Diane Nicholls, Paula Buttery
Proceedings of Empowering Education with LLMs – the Next-Gen Interface and Content Generation. 2023
Argot as a Trust Signal: Slang, Jargon & Reputation on a Large Cybercrime Forum
Jack Hughes, Andrew Caines, Alice Hutchings
Proceedings of the 22nd Annual Workshop on the Economics of Information Security. 2023
MultiGED-2023 shared task at NLP4CALL: Multilingual Grammatical Error Detection
Elena Volodina, Christopher Bryant, Andrew Caines, Orphée De Clercq, Jennifer-Carmen Frey, Elizaveta Ershova, Alexandr Rosen, Olga Vinogradova
Proceedings of the 12th Workshop on NLP for Computer Assisted Language Learning. 2023
Visual Spatial Reasoning
Fangyu Liu, Guy Emerson, Nigel Collier
Transactions of the Association for Computational Linguistics (TACL). 2023
Functional Distributional Semantics at Scale
Chun Hei Lo, Hong Cheng, Wai Lam, Guy Emerson
Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM). 2023
SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)
Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Seid Muhie Yimam, David Ifeoluwa Adelani, Ibrahim Sa'id Ahmad, Nedjma Ousidhoum, Abinew Ayele, Saif M Mohammad, Meriem Beloucif
Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval 2023)
AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages
Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Abinew Ali Ayele, Nedjma Ousidhoum, David Ifeoluwa Adelani, Seid Muhie Yimam, Ibrahim Sa'id Ahmad, Meriem Beloucif, Saif Mohammad, Sebastian Ruder, Oumaima Hourrane, Pavel Brazdil, Felermino Dário Mário António Ali, Davis Davis, Salomey Osei, Bello Shehu Bello, Falalu Ibrahim, Tajuddeen Gwadabe, Samuel Rutunda, Tadesse Belay, Wendimu Baye Messelle, Hailu Beshada Balcha, Sisay Adugna Chala, Hagos Tesfahun Gebremichael, Bernard Opoku, Steven Arthur
Arxiv. 2023
On the Intersection of Context-Free and Regular Languages
Clemente Pasti, Andreas Opedal, Tiago Pimentel, Tim Vieira, Jason Eisner, Ryan Cotterell
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics. 2023
On the Usefulness of Embeddings, Clusters and Strings for Text Generation Evaluation
Tiago Pimentel, Clara Meister, Ryan Cotterell
Proceedings of the International Conference on Learning Representations (ICLR). 2023
On the Effect of Anticipation on Reading Times
Tiago Pimentel, Clara Meister, Ethan G. Wilcox, Roger Levy, Ryan Cotterell
Transactions of the Association for Computational Linguistics. 2023
Locally Typical Sampling
Clara Meister, Tiago Pimentel, Gian Wiher, Ryan Cotterell
Transactions of the Association for Computational Linguistics. 2023
A survey on recent approaches to Question Difficulty Estimation from text
Luca Benedetto, Paolo Cremonesi, Andrew Caines, Paula Buttery, Andrea Cappelli, Andrea Giussani and Roberto Turrin
ACM Computing Surveys. 2023
Probabilistic Lexical Semantics: From Gaussian Embeddings to Bernoulli Fields
Guy Emerson
Probabilistic Approaches to Linguistic Theory. 2023

2022

Varifocal Question Generation for Fact-checking
Nedjma Ousidhoum, Zhangdie Yuan, Andreas Vlachos
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
The Architectural Bottleneck Principle
Tiago Pimentel, Josef Valvoda, Niklas Stoehr, Ryan Cotterell
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Opening up Minds with Argumentative Dialogues
Youmna Farag, Charlotte O. Brand, Jacopo Amidei, Paul Piwek, Tom Stafford, Svetlana Stoyanchev, Andreas Vlachos
Findings of the Association for Computational Linguistics: EMNLP 2022
CEPOC: The Cambridge Exams Publishing Open Cloze dataset
Mariano Felice, Shiva Taslimipoor, Øistein E. Andersen and Paula Buttery.
Proceedings of the 2022 International Conference on Language Resources and Evaluation (LREC 2022)
Prompting for a conversation: How to control a dialog model?
Josef Valvoda, Yimai Fang, David Vandyke
Proceedings of the 2nd Workshop on When Creative AI Meets Conversational AI 29th International Conference on Computational Linguistics. 2022
On the Role of Negative Precedent in Legal Outcome Prediction
Josef Valvoda, Ryan Cotterell, Simone Teufel
Transations of the Association for Computational Linguistics. 2022
Benchmarking Compositionality with Formal Languages
Josef Valvoda, Naomi Saphra, Jonathan Rawski, Adina Williams, Ryan Cotterell
Proceedings of the 29th International Conference on Computational Linguistics. 2022
Identifying relevant common sense information in knowledge graphs
Guy Aglionby, Simone Teufel
Proceedings of the First Workshop on Commonsense Representation and Reasoning. 2022
Using machine learning to create a repository of judgments concerning a new practice area: a case study in animal protection law
Joe Watson, Guy Aglionby, Samuel March
Artificial Intelligence and Law. 2022
20 years of the Grammar Matrix: cross-linguistic hypothesis testing of increasingly complex interactions
Olga Zamaraeva, Chris Curtis, Guy Emerson, Antske Fokkens, Michael Wayne Goodman, Kristen Howell, T.J. Trimble, Emily M. Bender
Journal of Language Modelling. 2022
Using dependency parsing for few-shot learning in distributional semantics
Stefania Preda, Guy Emerson
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. 2022
Extended Rater Representations in the Many-Facet Rasch Model
Mark Elliott, Paula Buttery
Journal of Applied Measurement. 2022
Accelerating Human Translation of Public Health Information into Low-Resource Languages with Machine Translation
Dimitra Stasinou, Theresa Biberauer, Ebele M\d{o}g\d{o} and Andrew Caines
Cambridge Occasional Papers in Linguistics. 2022
ALEN App: Argumentative Writing Support To Foster English Language Learning
Thiemo Wambsganss, Andrew Caines and Paula Buttery
Proceedings of the 17th Workshop on Innovative Use of {NLP} for Building Educational Applications. 2022
Towards an open-domain chatbot for language practice
Gladys Tyen, Mark Brenchley, Andrew Caines and Paula Buttery
Proceedings of the 17th Workshop on Innovative Use of {NLP} for Building Educational Applications. 2022
The Specificity and Helpfulness of Peer-to-Peer Feedback in Higher Education
Roman Rietsche, Andrew Caines, Cornelius Schramm, Dominik Pf{\"u}tze and Paula Buttery
Proceedings of the 17th Workshop on Innovative Use of {NLP} for Building Educational Applications. 2022
POSTCOG: A tool for interdisciplinary research into underground forums at scale
Ildikó Pete, Jack Hughes, Andrew Caines, Alice Hutchings, Ross Anderson and Paula Buttery
Proceedings of WACCO. 2022
Probing for targeted syntactic knowledge through grammatical error detection
Christopher Davis, Christopher Bryant, Andrew Caines, Marek Rei and Paula Buttery
Proceedings of the 2022 SIGNLL Conference on Computational Natural Language Learning
Naturalistic Causal Probing for Morpho-Syntax
Afra Amini, Tiago Pimentel, Clara Meister, Ryan Cotterell
Transactions of the Association for Computational Linguistics. 2022
Rethinking Reinforcement Learning for Recommendation: A Prompt Perspective
Xin Xin, Tiago Pimentel, Alexandros Karatzoglou, Pengjie Ren, Konstantina Christakopoulou, Zhaochun Ren
SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2022
Probing for the Usage of Grammatical Number
Karim Lasri, Tiago Pimentel, Alessandro Lenci, Thierry Poibeau, Ryan Cotterell
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2022
Analyzing Wrap-Up Effects through an Information-Theoretic Lens
Clara Meister, Tiago Pimentel, Thomas Clark, Ryan Cotterell, Roger Levy
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 2022
On the probability-quality paradox in language generation
Clara Meister, Gian Wiher, Tiago Pimentel, Ryan Cotterell
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 2022
Constructing Open Cloze Tests Using Generation and Discrimination Capabilities of Transformers
Mariano Felice, Shiva Taslimipoor and Paula Buttery
Findings of the Association for Computational Linguistics: ACL 2022
Learning Functional Distributional Semantics with Visual Data
Yinhong Liu, Guy Emerson
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022

2021

Non-Iterative Conditional Pairwise Estimation for the Rating Scale Model
Mark Elliott, Paula Buttery
Educational and Psychological Measurement. 2021
Word Complexity is in the Eye of the Beholder
S Gooding, E Kochmar, SM Yimam, C Biemann
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Predicting Text Readability from Scrolling Interactions
S Gooding, Y Berzak, T Mak, M Sharifi
Proceedings of the 25th Conference on Computational Natural Language Learning. 2021
Efficient Unsupervised NMT for Related Languages with Cross-Lingual Language Models and Fidelity Objectives
Rami Aly, Andrew Caines, Paula Buttery
Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects. 2021
A surprisal–duration trade-off across and within the world’s languages
Tiago Pimentel, Clara Meister, Elizabeth Salesky, Simone Teufel, Damián Blasi, Ryan Cotterell
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Revisiting the Uniform Information Density Hypothesis
Clara Meister, Tiago Pimentel, Patrick Haller, Lena Jäger, Ryan Cotterell, Roger Levy
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
A Bayesian Framework for Information-Theoretic Probing
Tiago Pimentel, Ryan Cotterell
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
On Homophony and Rényi Entropy
Tiago Pimentel, Clara Meister, Simone Teufel, Ryan Cotterell
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Disambiguatory Signals are Stronger in Word-initial Positions
Tiago Pimentel, Ryan Cotterell, Brian Roark
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. 2021
Modeling the Unigram Distribution
Irene Nikkarinen, Tiago Pimentel, Damián Blasi, Ryan Cotterell
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
A Non-Linear Structural Probe
Jennifer C. White, Tiago Pimentel, Naomi Saphra, Ryan Cotterell
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
What About the Precedent: An Information-Theoretic Analysis of Common Law
Josef Valvoda, Tiago Pimentel, Niklas Stoehr, Ryan Cotterell, Simone Teufel
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Finding Concept-specific Biases in Form–Meaning Associations
Tiago Pimentel, Brian Roark, Søren Wichmann, Ryan Cotterell, Damián Blasi
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
How (Non-)Optimal is the Lexicon?
Tiago Pimentel, Irene Nikkarinen, Kyle Mahowald, Ryan Cotterell, Damián Blasi
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Incremental Beam Manipulation for Natural Language Generation
James Hargreaves, Andreas Vlachos, Guy Emerson
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL). 2021
Synthetic Textual Features for the Large-Scale Detection of Basic-level Categories in English and Mandarin
Yiwen Chen and Simone Teufel
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Quantifying Social Biases in NLP: A Generalization and Empirical Comparison of Extrinsic Fairness Metrics
Paula Czarnowska, Yogarshi Vyas, Kashif Shah
Transactions of the Association for Computational Linguistics (TACL). 2021
Computational linguistics and grammar engineering
Emily M. Bender, Guy Emerson
Head-Driven Phrase Structure Grammar: The handbook. 2021

2020

Analyzing Neural Discourse Coherence Models
Youmna Farag, Josef Valvoda, Helen Yannakoudakis, Ted Briscoe
Proceedings of the First Workshop on Computational Approaches to Discourse. 2020
The Teacher-Student Chatroom Corpus
Andrew Caines, Helen Yannakoudakis, Helena Edmondson, Helen Allen, Pascual Pérez-Paredes, Bill Byrne, Paula Buttery
Proceedings of the 9th Workshop on NLP for Computer Assisted Language Learning (NLP4CALL). 2020
Morphologically Aware Word-Level Translation
Paula Czarnowska, Sebastian Ruder, Ryan Cotterell and Ann Copestake
Proceedings of the 2020 International Conference on Computational Linguistics (COLING)
A Graph Based Framework for Structured Prediction Tasks in Sanskrit
Amrith Krishna, Bishal Santra, Ashim Gupta, Pavankumar Satuluri, Pawan Goyal,
Computational Linguistics. 2020
Keep it Surprisingly Simple: A Simple First Order Graph Based Parsing Model for Joint Morphosyntactic Parsing in Sanskrit
Amrith Krishna, Ashim Gupta, Deepak Garasangi, Pavankumar Satuluri, Pawan Goyal
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Verbal Multiword Expressions for Identification of Metaphor
Omid Rohanian, Marek Rei, Shiva Taslimipoor, Le An Ha
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020)
Seeing Both the Forest and the Trees: Multi-head Attention for Joint Classification on Different Compositional Levels
Miruna Pislar, Marek Rei
The 28th International Conference on Computational Linguistics (COLING-2020)
Grammatical error detection in transcriptions of spoken English
Andrew Caines, Christian Bentz, Kate Knill, Marek Rei, Paula Buttery
The 28th International Conference on Computational Linguistics (COLING-2020)
Coding Textual Inputs Boosts the Accuracy of Neural Networks
Abdul Rafae Khan, Jia Xu, and Weiwei Sun
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
CanVEC - the Canberra Vietnamese-English code-switching natural speech corpus
Li Nguyen, Christopher Bryant
Proceedings of The 12th Language Resources and Evaluation Conference. 2020
Social-Computation-Supporting Kinds
David Strohmaier
Canadian Journal of Philosophy. 2020
SeCoDa: Sense Complexity Dataset
David Strohmaier, Sian Gooding, Shiva Taslimipoor, Ekaterina Kochmar
Proceedings of LREC. 2020
Building natural language processing tools for Runyakitara
Fridah Katushemererwe, Andrew Caines, Paula Buttery
Applied Linguistics Review. 2020
Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions
Hannah Craighead, Andrew Caines, Paula Buttery, Helen Yannakoudakis
Proceedings of ACL. 2020
REPROLANG 2020: Automatic Proficiency Scoring of Czech, English, German, Italian, and Spanish learner essays
Andrew Caines, Paula Buttery
Proceedings of LREC. 2020
Adaptive Forgetting Curves for Spaced Repetition Language Learning
Ahmed Zaidi, Andrew Caines, Russell Moore, Paula Buttery, Andrew Rice
Proceedings of AIED. 2020
Investigating Cross-Linguistic Adjective Ordering Tendencies with a Latent-Variable Model
Jun Yen Leung, Guy Emerson, Ryan Cotterell
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Multiple Question Fronting without Relational Constraints: An Analysis of Russian as a Basis for Cross-Linguistic Modeling
Olga Zamaraeva, Guy Emerson
Proceedings of the 27th International Conference on Head-Driven Phrase Structure Grammar (HPSG). 2020
Linguists Who Use Probabilistic Models Love Them: Quantification in Functional Distributional Semantics
Guy Emerson
Proceedings of the Probability and Meaning Conference (PaM 2020)
Please Mind the Root: Decoding Arborescences for Dependency Parsing
Ran Zmigrod, Tim Vieira, Ryan Cotterell
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Speakers Fill Lexical Semantic Gaps with Context
Tiago Pimentel, Rowan Hall Maudslay, Damián Blasi, Ryan Cotterell
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Pareto Probing: Trading Off Accuracy for Complexity
Tiago Pimentel, Naomi Saphra, Adina Williams, Ryan Cotterell
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Information-Theoretic Probing for Linguistic Structure
Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay, Ran Zmigrod, Adina Williams, Ryan Cotterell
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020
A Corpus for Large-Scale Phonetic Typology
Elizabeth Salesky, Eleanor Chodroff, Tiago Pimentel, Matthew Wiesner, Ryan Cotterell, Alan W Black, Jason Eisner
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020
Predicting Declension Class from Form and Meaning
Adina Williams, Tiago Pimentel, Hagen Blix, Arya D. McCarthy, Eleanor Chodroff, Ryan Cotterell
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020
A Tale of a Probe and a Parser
Rowan Hall Maudslay, Josef Valvoda, Tiago Pimentel, Adina Williams, Ryan Cotterell
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020
Phonotactic Complexity and Its Trade-offs
Tiago Pimentel, Brian Roark, Ryan Cotterell
Transactions of the Association for Computational Linguistics. 2020
Leveraging sentence similarity in natural language generation: Improving beam search using range voting
Sebastian Borgeaud, Guy Emerson
Proceedings of the 4th Workshop on Neural Generation and Translation (WNGT). 2020
What are the Goals of Distributional Semantics?
Guy Emerson
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020
Autoencoding Pixies: Amortised Variational Inference with Graph Convolutions for Functional Distributional Semantics
Guy Emerson
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020

2019

Meaning to Form: Measuring Systematicity as Information
Tiago Pimentel, Arya D. McCarthy, Damian Blasi, Brian Roark, Ryan Cotterell
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019
Counterfactual Data Augmentation for Mitigating Gender Stereotypes in Languages with Rich Morphology
Quantifying Social Biases in NLP: A Generalization and Empirical Comparison of Extrinsic Fairness Metrics
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019
Active Learning for Financial Investment Reports
Sian Gooding and Ted Briscoe
Proceedings of the Second Financial Narrative Processing Workshop (FNP 2019)
Don't Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction
Paula Czarnowska, Sebastian Ruder, Edouard Grave, Ryan Cotterell and Ann Copestake
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019)
Multi-Task Learning for Coherence Modeling
Youmna Farag and Helen Yannakoudakis
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019
Entropy as a proxy for gap complexity in open cloze tests
Mariano Felice and Paula Buttery
Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP 2019)
The BEA-2019 Shared Task on Grammatical Error Correction
Christopher Bryant, Mariano Felice, Øistein E. Andersen and Ted Briscoe
Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications (BEA-2019)
Recursive Context-Aware Lexical Simplification
Sian Gooding, Ekaterina Kochmar
Proceedings of the EMNLP 2019
Complex Word Identification as a Sequence Labelling Task
Sian Gooding, Ekaterina Kochmar
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019
Comparative judgments are more consistent than binary classification for labelling word complexity
Sian Gooding, Ekaterina Kochmar, Advait Sarkar, Alan Blackwell
Proceedings of the 13th Linguistic Annotation Workshop. 2019
Automatic learner summary assessment for reading comprehension.
Menglin Xia, Ekaterina Kochmar, Ted Briscoe
Proceedings of NAACL-HLT 2019
Words are Vectors, Dependencies are Matrices: Learning Word Embeddings from Dependency Graphs
Paula Czarnowska, Guy Emerson, Ann Copestake
Proceedings of the 13th International Conference on Computational Semantics (IWCS). 2019
The cross-linguistic performance of statistical word segmentation models
Andrew Caines, Emma Altmann-Richer & Paula Buttery
Journal of Child Language 46(6): 1169-1201. 2019
Overview of the 2019 Spoken CALL Shared Task
Claudia Baur, Andrew Caines, Cathy Chua, Johanna Gerlach, Mengjie Qian, Manny Rayner, Martin Russell, Helmer Strik & Xizi Wei
Proceedings of the 8th ISCA Workshop on Speech and Language Technology in Education (SLaTE). 2019
Skills Embeddings: a neural approach to multicomponent representations of students and tasks
Russell Moore, Andrew Caines, Mark Elliott, Ahmed Zaidi, Andrew Rice & Paula Buttery
Proceedings of the 12th International Conference on Educational Data Mining (EDM 2019)
Accurate modelling of language learning tasks and students using representations of grammatical proficiency
Ahmed Zaidi, Andrew Caines, Christopher Davis, Russell Moore, Paula Buttery & Andrew Rice
Proceedings of the 12th International Conference on Educational Data Mining (EDM 2019)
Automatic homework selection with deep behavioural cloning
Russell Moore, Andrew Caines, Andrew Rice & Paula Buttery
Proceedings of the 20th International Conference on Artificial Intelligence in Education (AIED 2019)
Bad Form: Comparing Context-Based and Form-Based Few-Shot Learning in Distributional Semantic Models
Jeroen Van Hautte, Guy Emerson and Marek Rei
Proceedings of the Second Workshop on Deep Learning for Low-Resource NLP (DeepLo 2019)
Modelling the interplay of metaphor and emotion through multitask learning
Verna Dankers, Marek Rei, Martha Lewis and Ekaterina Shutova
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019)
Semi-Supervised Bootstrapping of Dialogue State Trackers for Task-Oriented Modelling
Bo-Hsiang Tseng, Marek Rei, Paweł Budzianowski, Richard Turner, Bill Byrne and Anna Korhonen
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019)
Neural and FST-based approaches to grammatical error correction
Zheng Yuan, Felix Stahlberg, Marek Rei, Bill Byrne and Helen Yannakoudakis
Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2019)
Context is Key: Grammatical Error Detection with Contextual Word Representations
Samuel Bell, Helen Yannakoudakis and Marek Rei
Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2019)
CAMsterdam at SemEval-2019 Task 6: Neural and graph-based featureextraction for the identification of offensive tweets
Guy Aglionby, Christopher Davis, Pushkar Mishra, Andrew Caines, Helen Yannakoudakis, Marek Rei, Ekaterina Shutova and Paula Buttery
Proceedings of the International Workshop on Semantic Evaluation 2019 (SemEval 2019)
Factorising AMR generation through syntax
Kris Cao, Stephen Clark
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
A Simple Joint Model for Improved Contextual Neural Lemmatization
Chaitanya Malaviya*, Shijie Wu* and Ryan Cotterell
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
On the Idiosyncrasies of the Mandarin Chinese Classifier System
Shijia Liu, Hongyuan Mei, Adina Williams and Ryan Cotterell
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
A Probabilistic Generative Model of Linguistic Typology
Johannes Bjerva, Yova Kementchedjhieva, Ryan Cotterell and Isabelle Augenstein
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
Combining Disparate Sentiment Lexica with a Multi-View Variational Autoencoder
Alexander Hoyle, Lawrence Wolf-Sonkin, Hanna Wallach, Ryan Cotterell and Isabelle Augenstein
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
Gender Bias in Contextualized Word Embeddings
Jieyu Zhao, Tianlu Wang, Mark Yatskar, Ryan Cotterell, Vicente Ordóñez and Kai-Wei Chang
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
Contextualization of Morphological Inflection
Ekaterina Vylomova, Ryan Cotterell, Trevor Cohn, Timothy Baldwin and Jason Eisner
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
A Simple and Robust Approach to Detecting Subject-Verb Agreement Errors
Simon Flachs, Ophélie Lacroix, Marek Rei, Helen Yannakoudakis and Anders Søgaard
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
Abusive Language Detection with Graph Convolutional Networks
Pushkar Mishra, Marco Del Tredici, Helen Yannakoudakis and Ekaterina Shutova
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
Broader context improves metaphor identification
Jesse Mu, Helen Yannakoudakis, Noah Goodman and Ekaterina Shutova
Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019
Neural Grammatical Error Correction with Finite State Transducers
Felix Stahlberg, Christopher Bryant, Bill Byrne
Proceedings of the 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Automated Fact Checking in the News Room
Sebastião Miranda, David Nogueira, Afonso Mendes, Andreas Vlachos, Andrew Secker, Rebecca Garrett, Jeff Mitchel and Zita Marinho
Proceedings of The WebConf 2019 Conference Demonstrations
Strong Baselines for Complex Word Identification across Multiple Languages
Pierre Finnimore, Elisabeth Fritzsch, Daniel King, Alison Sneyd, Aneeq Ur Rehman, Fernando Alva-Manchego and Andreas Vlachos
Proceedings of the 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Generating Token-Level Explanations for Natural Language Inference
James Thorne, Andreas Vlachos, Christos Christodoulopoulos and Arpit Mittal
Proceedings of the 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Jointly Learning to Label Sentences and Tokens
Marek Rei and Anders Søgaard
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019)

2018

CAMB at CWI Shared Task 2018: Complex Word Identification with Ensemble-Based Voting
Sian Gooding, Ekaterina Kochmar
Proceedings of the 13th Workshop on Innovative Use of NLP for Building Educational Applications, pages 184-194, New Orleans, Louisiana, June 5, 2018
Functional Distributional Semantics: Learning Linguistically Informed Representations from a Precisely Annotated Corpus
Guy Emerson
PhD thesis, University of Cambridge. 2018
Emergent Communication Through Negotiation
Kris Cao, Angeliki Lazaridou, Marc Lanctot, Joel Z Leibo, Karl Tuyls, Stephen Clark
International Conference on Learning Representations. 2018
Neural Automated Essay Scoring and Coherence Modeling for Adversarially Crafted Input
Youmna Farag, Helen Yannakoudakis and Ted Briscoe
Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2018
Author Profiling for Abuse Detection
Pushkar Mishra, Marco Del Tredici, Helen Yannakoudakis and Ekaterina Shutova
Proceedings of the 27th International Conference on Computational Linguistics. 2018
Developing an Automated Writing Placement System for ESL Learners
Helen Yannakoudakis, Øistein E. Andersen, Ardeshir Geranpayeh, Ted Briscoe and Diane Nicholls
Journal of Applied Measurement in Education. 2018
Neural Character-based Composition Models for Abuse Detection
Pushkar Mishra, Helen Yannakoudakis and Ekaterina Shutova
Proceedings of the EMNLP 2018 Workshop on Abusive Language Online
Advance Prediction of Ventricular Tachyarrhythmias using Patient Metadata and Multi-Task Networks
Marek Rei, Josh Oppenheimer and Marek Sirendi
Proceedings of the NIPS Workshop on Machine Learning for Health (ML4H 2018)
Characterizing Eve: Analysing Cybercrime Actors in a Large Underground Forum
Sergio Pastrana, Alice Hutchings, Andrew Caines, Paula Buttery
Proceedings of the 21st International Symposium on Research in Attacks, Intrusions and Defenses (RAID 2018)
You Still Talking to Me?' The Zero Auxiliary Progressive in Spoken British English, Twenty Years On
Andrew Caines, Paula Buttery, Michael McCarthy
In: Vaclav Brezina, Robbie Love and Karin Aijmer (eds.), Corpus Approaches to Contemporary British Speech: Sociolinguistic Studies of the Spoken BNC2014. 2018
Overview of the 2018 Spoken CALL Shared Task
Claudia Baur, Andrew Caines, Cathy Chua, Johanna Gerlach, Mengjie Qian, Manny Rayner, Martin Russell, Helmer Strik, Xizi Wei
Proceedings of INTERSPEECH. 2018
Impact of ASR Performance on Free Speaking Language Assessment
Kate Knill, Mark Gales, Konstantinos Kyriakopoulos, Andrey Malinin, Anton Ragni, Yu Wang, Andrew Caines
Proceedings of INTERSPEECH. 2018
Aggressive language in an online hacking forum
Andrew Caines, Sergio Pastrana, Alice Hutchings, Paula Buttery
Proceedings of the 2nd Abusive Language Workshop (ALW 2018)
How clever is the FiLM model, and how clever can it be?
Alexander Kuhnle, Huiyuan Xie, Ann Copestake
Workshop on Shortcomings in Vision and Language (ECCV 2018)
Deep learning evaluation using deep linguistic processing
Alexander Kuhnle, Ann Copestake
Workshop on Generalization in the Age of Deep Learning (NAACL 2018)
Neural sequence modelling for learner error prediction
Zheng Yuan
The 13th Workshop on Innovative Use of NLP for Building Educational Applications (BEA-2018)
Sequence classification with human attention
Maria Barrett, Joachim Bingel, Nora Hollenstein, Marek Rei and Anders Søgaard
Proceedings of the SIGNLL Conference on Computational Natural Language Learning (CoNLL 2018)
Scoring Lexical Entailment with a Supervised Directional Similarity Network
Marek Rei, Daniela Gerz and Ivan Vulić
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018)
Towards automatically generating supply chain maps from natural language text
Pascal Wichmann, Alexandra Brintrup, Simon Baker, Philip Woodall, Duncan Campbell McFarlane
Proceedings of INCOM2018 (To Appear)
Language Model Based Grammatical Error Correction without Annotated Training Data
Christopher Bryant and Ted Briscoe
The 13th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2018
Zero-shot Sequence Labeling through Transfer Learning
Marek Rei and Anders Søgaard
Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2018)
Neural Multi-task Learning in Automated Assessment
Ronan Cummins, Marek Rei
arXiv. 2018
Variable Typing: Assigning Meaning to Variables in Mathematical Text
Yiannos Stathopoulos, Simon Baker, Marek Rei and Simone Teufel
Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2018)

2017

Finding enthymemes in real-world texts: A feasibility study
Olesya Razuvayevskaya, Simone Teufel
Journal of Argument & Computation, pp. 1-17, doi: 10.3233/AAC-170020. 2017
Speaking, Seeing, Understanding: Correlating semantic models with conceptual representation in the brain
Luana Bulat, Stephen Clark, Ekaterina Shutova
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Semantic Composition via Probabilistic Model Theory
Guy Emerson, Ann Copestake
Proceedings of the 12th International Conference on Computational Semantics (IWCS). 2017
Variational Inference for Logical Inference
Guy Emerson, Ann Copestake
Proceedings of the 2017 Conference on Logic and Machine Learning in Natural Language (LaML)
Grammatical error correction in non-native English
Zheng Yuan
PhD thesis, University of Cambridge. 2017
Initializing neural networks for hierarchical multi-label text classification
Simon Baker, Anna Korhonen
BioNLP 2017
Text mining for improved exposure assessment
Kristin Larsson , Simon Baker, Ilona Silins, Yufan Guo, Ulla Stenius, Anna Korhonen, Marika Berglund
PLOS ONE. 2017
Cancer Hallmarks Analytics Tool (CHAT): A text mining approach to organise and evaluate scientific literature on cancer
Simon Baker, Imran Ali, Ilona Silins, Sampo Pyysalo, Yufan Guo, Johan Högberg, Ulla Stenius, Anna Korhonen
Bioinformatics. 2017
Detecting Off-topic Responses to Visual Prompts
Marek Rei
The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2017
An Error-Oriented Approach to Word Embedding Pre-Training
Youmna Farag, Marek Rei, Ted Briscoe
The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2017
Auxiliary Objectives for Neural Error Detection Models
Marek Rei, Helen Yannakoudakis
The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2017
Artificial Error Generation with Machine Translation and Syntactic Patterns
Marek Rei, Mariano Felice, Zheng Yuan, Ted Briscoe
The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2017
Neural Sequence-Labelling Models for Grammatical Error Correction
Helen Yannakoudakis, Marek Rei, Øistein E. Andersen, Zheng Yuan
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Grasping the finer point: A Supervised Similarity Network for Metaphor Detection
Marek Rei, Luana Bulat, Douwe Kiela, Ekaterina Shutova
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Semi-supervised Multitask Learning for Sequence Labeling
Marek Rei
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL). 2017
Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction
Christopher Bryant, Mariano Felice, Ted Briscoe
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL). 2017
The Representational Geometry of Word Embeddings Learned by Neural MT Systems
Felix Hill, Kyunghyun Cho, Yoshua Bengio
MT. 2017
Latent Variable Dialogue Models and their Diversity
Kris Cao, Stephen Clark
Proceedings of the short papers of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017)
Learning to Negate Adjectives with Bilinear Models
Laura Rimell, Amandla Mabona, Luana Bulat and Douwe Kiela
Proceedings of the short papers of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017)
Modelling metaphor with attribute-based semantics
Luana Bulat, Stephen Clark, Ekaterina Shutova
Proceedings of the short papers of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017)
Modelling semantic acquisition in second language learning
Ekaterina Kochmar and Ekaterina Shutova
BEA 2017
Multilingual Metaphor Processing: Experiments with Semi-supervised and Unsupervised Learning
Ekaterina Shutova, Lin Sun, Dario Gutierrez, Patricia Lichtenstein, Srini Narayanan
Computational Linguistics. 2017

2016

The Goldilocks Principle: Reading Children's Books with Explicity Memory Representations
Felix Hill, Antoine Bordes, Sumit Chopra, Jason Weston
Proceedings of the International Conference on Learning Representations (ICLR). 2016
Automatic Extraction of Learner Errors in ESL Sentences Using Linguistically Enhanced Alignments
Mariano Felice, Christopher Bryant, Ted Briscoe
The 26th International Conference on Computational Linguistics (COLING-2016)
Robust Text Classification for Sparsely Labelled Data Using Multi-level Embeddings
Simon Baker, Douwe Kiela, Anna Korhonen
The 26th International Conference on Computational Linguistics (COLING-2016)
A Proposition-Based Abstractive Summariser
Yimai Fang, Haoyue Zhu, Ewa Muszyńska, Alexander Kuhnle, Simone Teufel
The 26th International Conference on Computational Linguistics (COLING-2016)
Attending to characters in neural sequence labeling models
Marek Rei, Sampo Pyysalo, Gamal K.O. Crichton
The 26th International Conference on Computational Linguistics (COLING-2016)
Recognising enthymemes in real-world texts: a feasibility study
Olesya Razuvayevskaya, Simone Teufel
Proceedings of the 6th COMMA Workshop on the Foundations of the Language of Argumentation. 2016
RELPRON: A Relative Clause Composition Data Set for Compositional Distributional Semantics
Laura Rimell, Jean Maillard, Tamara Polajnar, Stephen Clark
Computational Linguistics. 2016
Predicting the Direction of Derivation in English Conversion
Max Kisselew, Laura Rimell, Alexis Palmer, Sebastian Padó
Proceedings of the 14th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. 2016
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vector Differences for Lexical Relation Learning
Ekaterina Vylomova, Laura Rimell, Trevor Cohn, Timothy Baldwin
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL-16). 2016
SLEDDED: A Proposed Dataset of Event Descriptions for Evaluating Phrase Representations
Laura Rimell, Eva Maria Vecchi
Proceedings of The First Workshop on Evaluating Vector Space Representations for NLP (RepEval). 2016
Meta4meaning
Ping Xiao, Khalid Alnajjar, Mark Granroth-Wilding, Kat Agres, Hannu Toivonen
Proceedings of The Seventh International Conference on Computational Creativity. 2016
The Categorial Framework for Compositional Distributional Semantics
Stephen Clark, Laura Rimell, Tamara Polajnar, Jean Maillard
Technical Report, University of Cambridge Computer Laboratory. 2016
Comparing Data Sources and Architectures for Deep Visual Representation Learning in Semantics
Douwe Kiela, Anita Vero, Stephen Clark
Empirical Methods in Natural Language Processing Conference. 2016
Citation Block Determination using Textual Coherence
Dain Kaplan, Takenobu Tokunaga, Simone Teufel
Journal of Information Processing. 2016
Solving the AL Chicken-and-Egg Corpus and Model Problem: Model-free Active Learning for Phenomena-driven Corpus Construction
Dain Kaplan, Neil Rubens, Simone Teufel, Takenobu Tokunaga
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)
Issues in preprocessing current datasets for grammatical error correction
Christopher Bryant and Mariano Felice
Technical report UCAM-CL-TR-895, Computer Laboratory, University of Cambridge. 2016
Artificial error generation for translation-based grammatical error correction
Mariano Felice
Ph.D. thesis, University of Cambridge. 2016
Predicting the impact of scientific concepts using full‐text features
Kathy McKeown, Hal Daume, Snigdha Chaturvedi, John Paparrizos, Kapil Thadani, Pablo Barrio, Or Biran, Suvarna Bothe, Michael Collins, Kenneth R Fleischmann, Luis Gravano, Rahul Jha, Ben King, Kevin McInerney, Taesun Moon, Arvind Neelakantan, Diarmuid O'Seaghdha, Dragomir Radev, Clay Templeton, Simone Teufel
Journal of the Association for Information Science and Technology. 2016
Meta4meaning: Automatic Metaphor Interpretation Using Corpus-Derived Word Associations
Ping Xiao, Khalid Alnajjar, Mark Granroth-Wilding, Kathleen Agres and Hannu Toivonen
Proceedings 7th International Conference on Computational Creativity (ICCC 2016)
What Happens Next? Event Prediction Using a Compositional Neural Network Model
Mark Granroth-Wilding and Stephen Clark
Proceedings 13th AAAI Conference on Artificial Intelligence (AAAI 2016)
Extracting Structured Scholarly Information from the Machine Translation Literature
Eunsol Choi, Matic Horvat, Jonathan May, Kevin Knight, Daniel Marcu
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)
Candidate re-ranking for SMT-based grammatical error correction
Zheng Yuan, Ted Briscoe, Mariano Felice
Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications. 2016
Grammatical error correction using neural machine translation
Zheng Yuan, Ted Briscoe
Proceedings of NAACL-HLT. 2016
Constrained Multi-Task Learning for Automated Essay Scoring
Ronan Cummins, Meng Zhang, Ted Briscoe
Association for Computational Linguistics (ACL). 2016
Don’t Interrupt Me While I Type: Inferring Text Entered Through Gesture Typing on Android Keyboards
Laurent Simon, Wenduan Xu, Ross Anderson
Proceedings of 16th Privacy Enhancing Technologies Symposium. 2016
Expected F-measure Training for Shift-Reduce Parsing with Recurrent Neural Networks
Wenduan Xu, Michael Auli, Stephen Clark
Proceedings of NAACL. 2016
LSTM Shift-Reduce CCG Parsing
Wenduan Xu
Proceedings of EMNLP. 2016
Cancer Hallmark Text Classification Using Convolutional Neural Networks
Simon Baker, Anna Korhonen, Sampo Pyysalo
BioTxtM 2016
Automatic semantic classification of scientific literature according to the hallmarks of cancer
Simon Baker, Ilona Silins, Yufan Guo, Imran Ali, Johan Hogberg,Ulla Stenius, Anna Korhonen
Bioinformatics. 2016 Feb 1;32(3):432-40
Improving Argument Overlap for Proposition-Based Summarisation
Yimai Fang, Simone Teufel
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 479-485. 2016
Error Detection in Content Word Combinations.
Ekaterina Kochmar
Technical report, Computer Laboratory. 2016
Text Readability Assessment for Second Language Learners
Menglin Xia, Ekaterina Kochmar, Ted Briscoe
BEA 2016
Calling on the classical phone': a distributional model of adjective-noun errors in learners' English
Aurélie Herbelot and Ekaterina Kochmar
COLING 2016
Graph- and surface-level sentence chunking
Ewa Muszyńska
ACL SRW, 2016
Vision and Feature Norms: Improving automatic feature norm learning through cross-modal maps
Luana Bulat, Douwe Kiela, Stephen Clark
Proceedings of NAACL-HLT, 579-588, 2016
HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment
Ivan Vulić, Daniela Gerz, Douwe Kiela, Felix Hill, Anna Korhonen
arXiv preprint arXiv:1608.02117, 2016
MMFEAT: A Toolkit for Extracting Multi-Modal Features
Douwe Kiela
ACL 2016, 55, 2016
Multi-modal representations for improved bilingual lexicon learning
Ivan Vulic, Douwe Kiela, Stephen Clark, Marie-Francine Moens
The 54th Annual Meeting of the Association for Computational Linguistics, 188, 2016
Virtual Embodiment: A Scalable Long-Term Strategy for Artificial Intelligence Research
Douwe Kiela, Luana Bulat, Anita L. Verő, Stephen Clark
arXiv preprint arXiv:1610.07432, 2016
Metaphor as a Medium for Emotion: An Empirical Study
Saif Mohammad, Ekaterina Shutova, Peter Turney
*SEM. 2016
Semantic classifications for detection of verb metaphors
Beata Beigman Klebanov, Chee Wee Leong, Dario Gutierrez, Ekaterina Shutova, Michael Flor
ACL. 2016
Literal and Metaphorical Senses in Compositional Distributional Semantic Models
Dario Gutierrez, Ekaterina Shutova, Tyler Marghetis, Benjamin Bergen
ACL. 2016
Cross-Lingual Lexico-Semantic Transfer in Language Learning
Ekaterina Kochmar, Ekaterina Shutova
ACL. 2016
Detecting Cross-cultural Differences Using a Multilingual Topic Model
Dario Gutierrez, Ekaterina Shutova, Patricia Lichtenstein, Gerard de Melo, Luca Gilardi
TACL. 2016
Black Holes and White Rabbits: Metaphor Identification with Visual Features
Ekaterina Shutova, Douwe Kiela, Jean Maillard
NAACL. 2016
Metaphor: A Computational Perspective
Tony Veale, Ekaterina Shutova, Beata Beigman Klebanov
Synthesis Lectures on Human Language Technologies. Edited by Graeme Hirst. Morgan & Claypool, USA. 2016
Functional Distributional Semantics
Guy Emerson, Ann Copestake
The 1st Workshop on Representation Learning for NLP (RepL4NLP-2016)
Resources for Building Applications with Dependency Minimal Recursion Semantics
Ann Copestake, Guy Emerson, Michael Wayne Goodman, Matic Horvat, Alexander Kuhnle, Ewa Muszyńska
The 10th International Conference on Language Resources and Evaluation. 2016
Unsupervised Modeling of Topical Relevance in L2 Learner Text
Ronan Cummins, Helen Yannakoudakis, Ted Briscoe
The 11th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2016
A Joint Model for Word Embedding and Word Morphology
Kris Cao, Marek Rei
The 1st Workshop on Representation Learning for NLP (RepL4NLP-2016)
Compositional Sequence Labeling Models for Error Detection in Learner Writing
Marek Rei, Helen Yannakoudakis
The 54th Annual Meeting of the Association for Computational Linguistics (ACL-2016)
Automatic Text Scoring Using Neural Networks
Dimitrios Alikaniotis, Helen Yannakoudakis, Marek Rei
The 54th Annual Meeting of the Association for Computational Linguistics (ACL-2016)
Sentence Similarity Measures for Fine-Grained Estimation of Topical Relevance in Learner Essays
Marek Rei, Ronan Cummins
The 11th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2016

2015

Learning Distributed Representations of Sentences from Unlabelled Data
Felix Hill, Kyunghyun Cho, Anna Korhonen
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Learning to Understand Phrases by Embedding the Dictionary
Felix Hill, Kyunghyun Cho, Anna Korhonen, Yoshua Bengio
Transactions of the Association for Computational Linguistics. 2015
Vector Space Models of Lexical Meaning
Stephen Clark
Handbook of Contemporary Semantic Theory — second edition, edited by Shalom Lappin and Chris Fox. 2015
The Frobenius anatomy of word meanings II: possessive relative pronouns
Merhnoosh Sadrzadeh, Stephen Clark, Bob Coecke
Journal of Logic and Computation. 2015
Computational Syntax
Emily M. Bender, Stephen Clark, Tracy Holloway King
Syntax – Theory and Analysis. An International Handbook. Handbooks of Linguistics and Communication Science. 2015
CCG Supertagging with a Recurrent Neural Network
Wenduan Xu, Michael Auli, Stephen Clark
Proceedings of the Short Papers of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015)
Learning Adjective Meanings with a Tensor-Based Skip-Gram Model
Jean Maillard, Stephen Clark
SIGNLL Conference on Computational Natural Language Learning (CoNLL 2015)
Discriminative Syntax-Based Word Ordering for Text Generation
Yue Zhang, Stephen Clark
Computational Linguistics. 2015
Learning low-rank tensors for transitive verbs
Daniel Fried, Tamara Polajnar, Stephen Clark
Advances in Distributional Semantics Workshop. 2015
The Java Version of the C&C Parser: Version 0.95
Stephen Clark, Darren Foong, Luana Bulat, Wenduan Xu
Technical report, University of Cambridge Computer Laboratory, August. 2015
Low-Rank Tensors for Verbs in Compositional Distributional Semantics
Daniel Fried, Tamara Polajnar, Stephen Clark
Proceedings of the Short Papers of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015)
An Exploration of Discourse-Based Sentence Spaces for Compositional Distributional Semantics
Tamara Polajnar, Laura Rimell, Stephen Clark
Proceedings of the Workshop on Linking Models of Lexical, Sentential and Discourse-level Semantics (LSDSem). 2015
Hierarchical Statistical Semantic Realization for Minimal Recursion Semantics
Matic Horvat, Ann Copestake, William Byrne
IWCS 2015
Layers of interpretation: On grammar and compositionality
Emily M. Bender, Dan Flickinger, Stephan Oepen, Woodley Packard, Ann Copestake
Proceedings of the 11th International Conference on Computational Semantics. 2015
Towards a standard evaluation method for grammatical error detection and correction
Mariano Felice, Ted Briscoe
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics, Denver, CO, June. Association for Computational Linguistics
CCG Supertagging with Recurrent Neural Networks
Wenduan Xu, Michael Auli, Stephen Clark
Proceedings of ACL. 2015
Improving Literature-Based Discovery with Advanced Text Mining
Anna Korhonen, Yufan Guo, Simon Baker, Meliha Yetisgen-Yildiz, Ulla Stenius, Masashi Narita, Pietro Liò
CIBB 2015
From distributional semantics to feature norms: grounding semantic models in human perceptual data
Luana Fagarasan, Eva Maria Vecchi, Stephen Clark
Proceedings of the Short Papers of the 11th International Conference on Computational Semantics (IWCS 2015), London, UK
Using Learner Data to Improve Error Correction in Adjective–Noun Combinations.
Ekaterina Kochmar, Ted Briscoe
BEA 2015
Grounding semantics in olfactory perception
Douwe Kiela, Luana Bulat, Stephen Clark
Proceedings of ACL 2, 231-6, 2015
Exploiting image generality for lexical entailment detection
Douwe Kiela, Laura Rimell, Ivan Vulic, Stephen Clark
Proceedings of the 53rd Annual Meeting of the Association for Computational ..., 2015
Multi-and cross-modal semantics beyond vision: Grounding in auditory perception
Douwe Kiela, Stephen Clark
Proceedings of EMNLP, 2015
Specializing word embeddings for similarity or relatedness
Douwe Kiela, Felix Hill, Stephen Clark
Proceedings of EMNLP, 2015
Unsupervised discovery of information structure in biomedical documents
Douwe Kiela, Yufan Guo, Ulla Stenius, Anna Korhonen
Bioinformatics 31 (7), 1084-1092, 2015
Adaptive communication: Languages with more non-native speakers tend to have fewer word forms
Christian Bentz, Annemarie Verkerk, Douwe Kiela, Felix Hill, Paula Buttery
PloS one 10 (6), e0128254, 2015
Visual bilingual lexicon induction with transferred convnet features
Douwe Kiela, Ivan Vulic, Stephen Clark
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
Perceptually grounded selectional preferences
Ekaterina Shutova, Niket Tandon, Gerard de Melo
ACL. 2015
Lacking Integrity: HPSG as a Morphosyntactic Theory
Guy Emerson, Ann Copestake
The 22nd International Conference on Head-Driven Phrase Structure Grammar (HPSG). 2015
Leveraging a Semantically Annotated Corpus to Disambiguate Prepositional Phrase Attachment
Guy Emerson, Ann Copestake
The 11th International Conference on Computational Semantics. 2015
Evaluating the performance of Automated Text Scoring systems
Helen Yannakoudakis, Ronan Cummins
The 10th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2015
Online Representation Learning in Recurrent Neural Language Models
Marek Rei
The 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP)

2014

Simlex 999: Evaluating Semantic Models with Genuine Similarity Estimation
Felix Hill, Roi Reichart, Anna Korhonen
Computational Linguistics. 2014
A quantitative empirical analysis of the abstract/concrete distinction
Felix Hill, Anna Korhonen, Christian Bentz
Cognitive science. 2014
Native Language Identification Using Large, Longitudinal Data.
Xiao Jiang, Yufan Guo, Jeroen Geertzen, Dora Alexopoulou, Lin Sun, Anna Korhonen
LREC. 2014
Technologies and Tools for Lexical Acquisition
Laura Rimell, Anna Korhonen, Valeria Quochi, Núria Bel Rafecas, Tommaso Caselli, Prokopis Prokopidis, Maria Gavrilidou, Thierry Poibeau, Muntsa Padró, Eva Revilla, Monica Monachini, Maurizio Tesconi, Matteo Abrate, Clara Bacciu
. 2014
Evaluation of carcinogenic modes of action for pesticides in fruit on the Swedish market using a text-mining tool
Ilona Silins, Anna Korhonen, Ulla Stenius
Frontiers in pharmacology. 2014
Enter search terms
Jeroen Geertzen, Theodora Alexopoulou, Anna Korhonen
. 2014
Concreteness and Subjectivity as Dimensions of Lexical Meaning.
Felix Hill, Anna Korhonen
ACL (2). 2014
Learning Abstract Concept Embeddings from Multi-Modal Data: Since You Probably Can't See What I Mean.
Felix Hill, Anna Korhonen
EMNLP. 2014
Text mining for improved human exposure assessment
Kristin Larsson, Ilona Silins, Yufan Guo, Anna Korhonen, Ulla Stenius, Marika Berglund
Toxicology Letters. 2014
Verb clustering for brazilian portuguese
Carolina Scarton, Lin Sun, Karin Kipper-Schuler, Magali Sanches Duran, Martha Palmer, Anna Korhonen
International Conference on Intelligent Text Processing and Computational Linguistics. 2014
Automatic Extraction of Property Norm‐Like Data From Large Text Corpora
Colin Kelly, Barry Devereux, Anna Korhonen
Cognitive Science. 2014
A text-mining approach for chemical risk assessment and cancer research
Ilona Silins, Anna Korhonen, Yufan Guo, Ulla Stenius
Toxicology Letters. 2014
Multi-modal models for concrete and abstract concept meaning
Felix Hill, Roi Reichart, Anna Korhonen
Transactions of the Association for Computational Linguistics. 2014
Distributional Lexical Entailment by Topic Coherence
Laura Rimell
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL-14). 2014
A robust parser-interpreter for jazz chord sequences
Mark Granroth-Wilding, Mark Steedman
Journal of New Music Research. 2014
Sentiment analysis of scientific citations
Awais Athar
Technical Report, University of Cambridge, Computer Laboratory. 2014
Learning a Theory of Marriage (and other relations) from a Web Corpus
Sandro Bauer, Stephen Clark, Laura Rimell, Thore Graepel
Proceedings of the Short Papers of the European Conference on Information Retrieval (ECIR 2014)
A Type-Driven Tensor-Based Semantics for CCG
Jean Maillard, Stephen Clark, Edward Grefenstette
EACL 2014 Type Theory and Natural Language Semantics Workshop (TTNLS)
Practical Linguistic Steganography using Contextual Synonym Substitution and a Novel Vertex Coding Method
Ching-Yun Chang, Stephen Clark
Computational Linguistics. 2014
Application-Driven Relation Extraction with Limited Distant Supervision
Andreas Vlachos, Stephen Clark
COLING-14 Aha!-Workshop on Information Discovery in Text. 2014
Learning to Identify Historical Figures for Timeline Creation from Wikipedia Articles
Sandro Bauer, Stephen Clark, Thore Graepel
HistoInformatics 2014 - the 2nd International Workshop on Computational History
A New Corpus and Imitation Learning Framework for Context-Dependent Semantic Parsing
Andreas Vlachos, Stephen Clark
Transactions of the Association for Computational Linguistics (TACL). 2014
Scientific Argumentation Detection as Limited-domain Intention Recognition.
Simone Teufel
ArgNLP. 2014
Resolving Coreferent and Associative Noun Phrases in Scientific Text.
Ina Rösiger, Simone Teufel
EACL. 2014
Topical PageRank: A Model of Scientific Expertise for Bibliographic Search.
James Jardine, Simone Teufel
EACL. 2014
Generating artificial errors for grammatical error correction.
Mariano Felice and Zheng Yuan
Proceedings of the Student Research Workshop at the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014)
To err is human, to correct is divine
Mariano Felice and Zheng Yuan
XRDS: Crossroads, The ACM Magazine for Students, vol. 21 num. 1. 2014
Unsupervised learning of rhetorical structure with un-topic models.
Diarmuid O Séaghdha, Simone Teufel
COLING. 2014
Semantic Relations between Nominals
Vivi Nastase, Preslav Nakov, Diarmuid O Séaghdha, Stan Szpakowicz
Morgan and Claypool. 2014
Emoticons and Phrases: Status Symbols in Social Media.
Simo Editha Tchokni, Diarmuid O Séaghdha, Daniele Quercia
ICWSM. 2014
CRAB 2.0: A text mining tool for supporting literature review in chemical cancer risk assessment.
Yufan Guo, Diarmuid O Séaghdha, Ilona Silins, Lin Sun, Johan Högberg, Ulla Stenius, Anna Korhonen
COLING (Demos). 2014
Probabilistic distributional semantics with latent variable models
Diarmuid O Séaghdha, Anna Korhonen
Computational Linguistics. 2014
Improving Distributional Semantic Vectors through Context Selection and Normalisation
Tamara Polajnar, Stephen Clark
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL). 2014
Evaluation of Simple Distributional Compositional Operations on Longer Texts
Tamara Polajnar, Laura Rimell, Stephen Clark
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). 2014
Using Sentence Plausibility to Learn the Semantics of Transitive Verbs
Tamara Polajnar, Laura Rimell, Stephen Clark
NIPS workshop on Learning Semantics, Montreal, Canada. 2014
Baseline Methods for Automated Fictional Ideation
Maria Teresa Llano, Rose Hepworth, Simon Colton, Jeremy Gow, John Charnley, Nada Lavrač, Martin Žnidaršič, Matic Perovšek, Mark Granroth-Wilding and Stephen Clark
Proceedings 5th International Conference on Computational Creativity (ICCC 2014)
A Graph-Based Approach to String Regeneration
Matic Horvat, William Byrne
Proceedings of the Student Research Workshop at the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014)
TagNText: A parallel corpus for the induction of resource-specific non-taxonomical relations from tagged images.
Theodosia Togia, Ann A Copestake
LREC. 2014
The CoNLL-2014 Shared Task on Grammatical Error Correction
Hwee Tou Ng, Siew Mei Wu, Ted Briscoe, Christian Hadiwinoto, Raymond Hendy Susanto, Christopher Bryant
CoNLL Shared Task. 2014
Shift-Reduce CCG Parsing with a Dependency Model
Wenduan Xu, Stephen Clark, Yue Zhang
Proceedings of ACL. 2014
An Unsupervised Model for Instance Level Subcategorization Acquisition
Simon Baker, Roi Reichart, Anna Korhonen
EMNLP 2014
A Summariser based on Human Memory Limitations and Lexical Competition
Yimai Fang, Simone Teufel
Proceedings of EACL, 732-741. 2014
Reducing Dimensions of Tensors in Type-Driven Distributional Semantics
Tamara Polajnar, Luana Făgărășan, Stephen Clark
Proceedings of the Empirical Methods in Natural Language Processing Conference (EMNLP 2014), Doha, Qatar
Detecting Learner Errors in the Choice of Content Words Using Compositional Distributional Semantics.
Ekaterina Kochmar, Ted Briscoe
COLING 2014
Improving Multi-Modal Representations Using Image Dispersion: Why Less is Sometimes More.
Douwe Kiela, Felix Hill, Anna Korhonen, Stephen Clark
ACL (2), 835-841, 2014
Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics.
Douwe Kiela, Léon Bottou
EMNLP, 36-45, 2014
ZIPF’S LAW ACROSS LANGUAGES OF THE WORLD: TOWARDS A QUANTITATIVE MEASURE OF LEXICAL DIVERSITY
Christian Bentz, Douwe Kiela
Evolution of Language: Proceedings of the 10th International Conference ..., 2014
A systematic study of semantic vector space model parameters
Douwe Kiela, Stephen Clark
Proceedings of the 2nd Workshop on Continuous Vector Space Models and their ..., 2014
Zipf's law and the grammar of languages: A quantitative study of Old and Modern English parallel texts
Christian Bentz, Douwe Kiela, Felix Hill, Paula Buttery
Corpus Linguistics and Linguistic Theory 10 (2), 175-211, 2014
Grammatical error correction using hybrid systems and type filtering
Mariano Felice, Zheng Yuan, Øistein E. Andersen, Helen Yannakoudakis, Ekaterina Kochmar
The Seventeenth Conference on Computational Natural Language Learning (CoNLL 2014): Shared Task
Using an Expert System to Automatically Map the Learning Profile of Individuals.
John Yannakoudakis, Irene Yannakoudakis, Helen Yannakoudakis, George Papadourakis
The Sixth International Conference on Mobile, Hybrid, and On-line Learning, eLmL. 2014
Looking for hyponyms in vector space
Marek Rei, Ted Briscoe
The Eighteenth Conference on Computational Natural Language Learning (CoNLL-14). 2014

2013

Computational modeling as a methodology for studying human language learning
Thierry Poibeau, Aline Villavicencio, Anna Korhonen, Afra Alishahi
. 2013
Improved Lexical Acquisition through DPP-based Verb Clustering.
Roi Reichart, Anna Korhonen
ACL. 2013
Automatic linguistic annotation of large scale L2 databases: The EF-Cambridge Open Language Database (EFCAMDAT)
Jeroen Geertzen, Theodora Alexopoulou, Anna Korhonen
Proceedings of the 31st Second Language Research Forum. Somerville, MA: Cascadilla Proceedings Project. 2013
The ef cambridge open language database (efcamdat) user manual part i: written production
Jeroen Geertzen, Theodora Alexopoulou, Rachel Baker, Henriëtte Hendriks, Sichu Jiang, Anna Korhonen, EE First
. 2013
Minimally supervised learning for unconstrained conceptual property extraction
Colin Kelly, Anna Korhonen, Barry Devereux
Proceedings of the 35th Annual Conference of the Cognitive Science Society. 2013
Theory and Applications of Natural Language Processing
Graeme Hirst, Eduard Hovy, Mark Johnson
. 2013
Active learning-based information structure analysis of full scientific articles and two applications for biomedical literature review
Yufan Guo, Ilona Silins, Ulla Stenius, Anna Korhonen
Bioinformatics. 2013
A tensor-based factorization model of semantic compositionality
Tim Van de Cruys, Thierry Poibeau, Anna Korhonen
Conference of the North American Chapter of the Association of Computational Linguistics (HTL-NAACL). 2013
Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints.
Yufan Guo, Roi Reichart, Anna Korhonen
HLT-NAACL. 2013
Diathesis alternation approximation for verb clustering.
Lin Sun, Diana McCarthy, Anna Korhonen
ACL. 2013
Conceptual metaphor theory meets the data: a corpus-based human annotation study
Ekaterina Shutova, Barry J Devereux, Anna Korhonen
Language resources and evaluation. 2013
UCAM-CORE: Incorporating structured distributional similarity into STS
Tamara Polajnar, Laura Rimell, Douwe Kiela
Proceedings of *SEM 2013 Shared Task
Parser Evaluation Using Textual Entailments
Deniz Yuret, Laura Rimell, Aydin Han
Language Resources and Evaluation. 2013
Acquisition and Evaluation of Verb Subcategorization Resources for Biomedicine
Laura Rimell, Thomas Lippincott, Karin Verspoor, Helen L. Johnson, Anna Korhonen
Journal of Biomedical Informatics. 2013
Approaches to Verb Subcategorization for Biomedicine
Thomas Lippincott, Laura Rimell, Karin Verspoor, Anna Korhonen
Journal of Biomedical Informatics. 2013
Type-Driven Syntax and Semantics for Composing Meaning Vectors
Stephen Clark
Quantum Physics and Linguistics: A Compositional, Diagrammatic Discourse. 2013
Getting Creative with Semantic Similarity
Ching-Yun Chang, Stephen Clark, Brian Harrington
Short Papers of the Seventh IEEE International Conference on Semantic Computing (ICSC-13). 2013
The Frobenius Anatomy of Relative Pronouns
Stephen Clark, Bob Coecke, Mehrnoosh Sadrzadeh
13th Meeting on the Mathematics of Language (MoL 13). 2013
A quantum teleportation inspired algorithm produces sentence meaning from word meaning and grammatical structure
Stephen Clark, Bob Coecke, Edward Grefenstette, Stephen Pulman, Mehrnoosh Sadrzadeh
arXiv preprint arXiv:1305.0556. 2013
The Frobenius anatomy of word meanings I: subject and object relative pronouns
Mehrnoosh Sadrzadeh, Stephen Clark, Bob Coecke
Journal of Logic and Computation. 2013
Statistical metaphor processing
Ekaterina Shutova, Simone Teufel, Anna Korhonen
Computational Linguistics. 2013
A computational model of logical metonymy
Ekaterina Shutova, Jakub Kaplan, Simone Teufel, Anna Korhonen
ACM Transactions on Speech and Language Processing (TSLP). 2013
Constrained grammatical error correction using Statistical Machine Translation
Zheng Yuan and Mariano Felice
Proceedings of the Seventeenth Conference on Computational Natural Language Learning (CoNLL 2013): Shared Task
Reading tweeting minds: real-time analysis of short text for computational social science
Zhe Wang, Daniele Quercia, Diarmuid Ó Séaghdha
Proceedings of the 24th ACM Conference on Hypertext and Social Media. 2013
SemEval-2013 task 4: Free paraphrases of noun compounds
Iris Hendrickx, Zornitsa Kozareva, Preslav Nakov, Diarmuid O Séaghdha, Stan Szpakowicz, Tony Veale
Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013)
Dependency language models for sentence completion
Joseph Gubbins, Andreas Vlachos
Proceedings of 2013 Conference on Empirical Methods in Natural Language Processing
Semantic Parsing as Machine Translation
Jacob Andreas, Andreas Vlachos, Stephen Clark
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. 2013
The Semi-generative Lexicon: Limits on Productivity
Ann Copestake
Advances in Generative Lexicon Theory, 455-474. 2013
Can distributional approaches improve on good old-fashioned lexical semantics?
Ann Copestake
IWCS Workshop Towards a Formal Distributional Semantics. 2013
Interpreting compound nouns with kernel methods
Diarmuid O Séaghdha, Ann Copestake
Natural Language Engineering. 2013
Learning to Prune: Context-Sensitive Pruning for Syntactic MT
Wenduan Xu, Yue Zhang, Philip Williams, Philipp Koehn
Proceedings of ACL. 2013
Capturing Anomalies in the Choice of Content Words in Compositional Distributional Semantic Space.
Ekaterina Kochmar, Ted Briscoe
RANLP 2013
Detecting Compositionality of Multi-Word Expressions using Nearest Neighbours in Vector Space Models.
Douwe Kiela, Stephen Clark
EMNLP, 1427-1432, 2013
Concreteness and corpora: A theoretical and practical analysis
Felix Hill, Douwe Kiela, Anna Korhonen
Proceedings of the Workshop on Cognitive Modeling and Computational ..., 2013
Classifying Intermediate Learner English: A Data-driven Approach to Learner Corpora
Theodora Alexopoulou, Helen Yannakoudakis, Angeliki Salamoura
In S. Granger, G. Gilquin & F. Meunier (eds) Twenty Years of Learner Corpus Research: Looking back, Moving ahead. Corpora and Language in Use – Proceedings 1, Louvain-la-Neuve: Presses universitaires de Louvain. 2013
Developing and Testing a Self-Assessment and Tutoring System
Øistein E. Andersen, Helen Yannakoudakis, Fiona Barker, Tim Parish
The 9th Workshop on Innovative Use of NLP for Building Educational Applications (BEA). 2013
Parser lexicalisation through self-learning
Marek Rei, Ted Briscoe
The 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2013)
Minimally supervised dependency-based methods for natural language processing
Marek Rei
PhD thesis, University of Cambridge. 2013

2012

Using Argumentative Zones for Extractive Summarization of Scientific Articles.
Danish Contractor, Yufan Guo, Anna Korhonen
COLING. 2012
PANACEA (Platform for Automatic, Normalised Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies)
Núria Bel Rafecas, Marc Poch, Antonio Toral
Proceedings of the 16th Annual Conference of the European Association for Machine Translation: EAMT 2012; 2012 May 28-30; Trento, Italy. Trento: Fondazione Bruno Kessler; 2012. p. 90
Semi-supervised learning for automatic conceptual property extraction
Colin Kelly, Barry Devereux, Anna Korhonen
Proceedings of the 3rd Workshop on Cognitive Modeling and Computational Linguistics. 2012
Exocrine pancreatic tumorigenesis and autotaxin expression
Sandeep Kadekar, Ilona Silins, Anna Korhonen, Kristian Dreij, Lauy Al-Anati, Johan Högberg, Ulla Stenius
Toxicology Letters. 2012
A text mining approach for chemical cancer research and risk assessment
Ilona Silins, Anna Korhonen, Lin Sun, Johan Högberg, Ulla Stenius
Toxicology Letters. 2012
Data and literature gathering in chemical cancer risk assessment
Ilona Silins, Anna Korhonen, Johan Högberg, Ulla Stenius
Integrated environmental assessment and management. 2012
Exocrine pancreatic carcinogenesis and autotaxin expression
Sandeep Kadekar, Ilona Silins, Anna Korhonen, Kristian Dreij, Lauy Al-Anati, Johan Högberg, Ulla Stenius
PloS one. 2012
Unsupervised Metaphor Paraphrasing using a Vector Space Model.
Ekaterina Shutova, Tim Van de Cruys, Anna Korhonen
COLING (Posters). 2012
CRAB Reader: A Tool for Analysis and Visualization of Argumentative Zones in Scientific Literature.
Yufan Guo, Ilona Silins, Roi Reichart, Anna Korhonen
COLING (Demos). 2012
Multi-way Tensor Factorization for Unsupervised Lexical Acquisition
Tim Van de Cruys, Laura Rimell, Thierry Poibeau, Anna Korhonen
Proceedings of the 24th International Conference on Computational Linguistics (COLING-12). 2012
Merging Lexicons for Higher Precision Subcategorization Frame Acquisition
Laura Rimell, Thierry Poibeau, Anna Korhonen
Proceedings of the LREC 2012 Workshop on Language Resource Merging
Adjective Deletion for Linguistic Steganography and Secret Sharing
Ching-Yun Chang, Stephen Clark
COLING. 2012
The Secret's in the Word Order: Text-to-Text Generation for Linguistic Steganography
Ching-Yun Chang, Stephen Clark
COLING. 2012
Syntax-Based Word Ordering Incorporating a Large-Scale Language Model
Yue Zhang, Graeme Blackwood, Stephen Clark
EACL 2012
Context-enhanced citation sentiment detection
Awais Athar, Simone Teufel
Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Detection of implicit citations for sentiment detection
Awais Athar, Simone Teufel
Proceedings of the Workshop on Detecting Structure in Scholarly Discourse. 2012
Auralist: introducing serendipity into music recommendation
Yuan Cao Zhang, Diarmuid Ó Séaghdha, Daniele Quercia, Tamas Jambor
Proceedings of the fifth ACM international conference on Web search and data mining. 2012
Text mining for literature review and knowledge discovery in cancer risk assessment and research
Anna Korhonen, Diarmuid O Séaghdha, Ilona Silins, Lin Sun, Johan Högberg, Ulla Stenius
PloS one. 2012
Talk of the city: Our tweets, our community happiness
Daniele Quercia, Diarmuid Ò Séaghdha, Jon Crowcroft
Proceedings of the 6th International AAAI Conference on Weblogs and Social Media (ICWSM 2012)
Modelling selectional preferences in a lexical hierarchy
Diarmuid O Séaghdha, Anna Korhonen
Proceedings of the First Joint Conference on Lexical and Computational Semantics-Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation. 2012
Learning syntactic verb frames using graphical models
Thomas Lippincott, Diarmuid O Séaghdha, Anna Korhonen
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1. 2012
Talking places: Modelling and analysing linguistic content in foursquare
Sandro Bauer, Anastasios Noulas, Diarmuid O Séaghdha, Stephen Clark, Cecilia Mascolo
Privacy, Security, Risk and Trust (PASSAT), 2012 International Conference on and 2012 International Confernece on Social Computing (SocialCom)
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora.
Carmen Dayrell, Arnaldo Candido Jr, Gabriel Lima, Danilo Machado Jr, Ann A Copestake, Valéria Delisandra Feltrim, Stella EO Tagnin, Sandra M Aluísio
LREC. 2012
Lexicalised compositionality
Ann Copestake, Aurelie Herbelot
Unpublished draft. 2012
HOO 2012 Error Recognition and Correction Shared Task: Cambridge University Submission Report
Ekaterina Kochmar, Øistein Andersen, Ted Briscoe
BEA 2012
Automating Second Language Acquisition Research: Integrating Information Visualisation and Machine Learning
Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou
The EACL 2012 joint workshop of LINGVIS & UNCLH
Modeling Coherence in ESOL Learner Texts
Helen Yannakoudakis, Ted Briscoe
The NAACL 2012 workshop on Innovative Use of Natural Language Processing for Building Educational Applications

2011

A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment
Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Johan Hogberg, Ulla Stenius
BMC bioinformatics. 2011
A weakly-supervised approach to argumentative zoning of scientific documents
Yufan Guo, Anna Korhonen, Thierry Poibeau
Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011
Latent vector weighting for word meaning in context
Tim Van de Cruys, Thierry Poibeau, Anna Korhonen
Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011
Hierarchical verb clustering using graph factorization
Lin Sun, Anna Korhonen
Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011
Weakly supervised learning of information structure of scientific abstracts—is it accurate enough to benefit real-world tasks in biomedicine?
Yufan Guo, Anna Korhonen, Ilona Silins, Ulla Stenius
Bioinformatics. 2011
Sentiment analysis of citations using sentence structure-based features
Awais Athar
ACL HLT 2011
Concrete sentence spaces for compositional distributional models of meaning
Edward Grefenstette, Mehrnoosh Sadrzadeh, Stephen Clark, Bob Coecke, Stephen Pulman
International Conference on Computational Semantics. 2011
Syntax-Based Grammaticality Improvement using CCG and Guided Search
Yue Zhang, Stephen Clark
Conference on Empirical Methods in Natural Language Processing. 2011
Syntactic processing using the generalized perceptron and beam search
Yue Zhang, Stephen Clark
Computational linguistics. 2011
Shift-reduce CCG parsing
Yue Zhang, Stephen Clark
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1. 2011
Robust argumentative zoning for sensemaking in scholarly documents
Simone Teufel, Min-Yen Kan
Advanced Language Technologies for Digital Libraries (ALTDL), Lecture Notes in Computer Science. 2011
Probabilistic models of similarity in syntactic context
Diarmuid O Séaghdha, Anna Korhonen
Proc. of EMNLP. 2011
Exploring subdomain variation in biomedical language
Thomas Lippincott, Diarmuid O Seaghdha, Anna Korhonen
BMC bioinformatics. 2011
Formalising and specifying underquantification
Aurelie Herbelot, Ann Copestake
Proceedings of the Ninth International Conference on Computational Semantics. 2011
Towards an on-demand simple portuguese wikipedia
Arnaldo Candido Junior, Ann Copestake, Lucia Specia, Sandra Maria Aluísio
Proceedings of the Second Workshop on Speech and Language Processing for Assistive Technologies. 2011
Exciting and interesting: issues in the generation of binomials
Ann Copestake, Aurélie Herbelot
Proceedings of the UCNLG+ Eval: Language Generation and Evaluation Workshop. 2011
Introduction to Linguistics for Natural Language Processing
Ted Briscoe
Computer Laboratory, University of Cambridge. 2011
Identification of a Writer’s Native Language by Error Analysis.
Ekaterina Kochmar
MPhil dissertation, Computer Laboratory. 2011
A New Dataset and Method for Automatically Grading ESOL Texts
Helen Yannakoudakis, Ted Briscoe, Ben Medlock
The 49th Annual Meeting of the Association for Computational Linguistics. 2011
Unsupervised Entailment Detection between Dependency Graph Fragments
Marek Rei, Ted Briscoe
The 2011 Workshop on Biomedical Natural Language Processing (BioNLP-11)
Intelligent Information Access from Scientific Papers
Ted Briscoe, Karl Harrison, Andrew Naish-Guzman, Andy Parker, Marek Rei, Advaith Siddharthan, David Sinclair, Mark Slater, Rebecca Watson
Current Challenges in Patent Information Retrieval, edited by Mihai Lupu, Katja Mayer, John Tait and Anthony J. Trippe. 2011

2010

Large-scale acquisition of feature-based conceptual representations from textual corpora
Barry Devereux, Nicholas Pilkington, Thierry Poibeau, Anna Korhonen
The Annual Meeting of the Cognitive Science Society. 2010
Annotating the Enron Email Corpus with Number Senses.
Stuart Moore, Sabine Buchholz, Anna Korhonen
LREC. 2010
Methods for the automatic acquisition of Language Resources and their evaluation methods
Núria Bel, Béatrice Daille, Andrejs Vasiljevs
. 2010
Using fMRI activation to conceptual stimuli to evaluate methods for extracting conceptual representations from corpora
Barry Devereux, Colin Kelly, Anna Korhonen
Proceedings of the NAACL HLT 2010 First Workshop on Computational Neurolinguistics
Acquiring human-like feature-based conceptual representations from corpora
Colin Kelly, Barry Devereux, Anna Korhonen
Proceedings of the NAACL HLT 2010 First Workshop on Computational Neurolinguistics
Identifying the information structure of scientific abstracts: an investigation of three different schemes
Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins Karolinska, Lin Sun, Ulla Stenius
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing
The acquisition of unrestricted feature-based conceptual representations from corpora
Barry Devereux, Colin Kelly, Thierry Poibeau, Nicholas Pilkington, Anna Korhonen
. 2010
Automatic lexical classification: bridging research and practice
Anna Korhonen
Philosophical Transactions of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. 2010
Investigating the cross-linguistic potential of verbnet: style classification
Lin Sun, Anna Korhonen, Thierry Poibeau, Cédric Messiant
Proceedings of the 23rd International Conference on Computational Linguistics. 2010
Metaphor identification using verb and noun clustering
Ekaterina Shutova, Lin Sun, Anna Korhonen
Proceedings of the 23rd International Conference on Computational Linguistics. 2010
Proceedings of Verb 2010
Pier Marco Bertinetto, Anna Korhonen, Alessandro Lenci, Alissa Melinger, Sabine Schulte im Walde, Aline Villavicencio
. 2010
Evaluation of Dependency Parsers on Unbounded Dependencies
Joakim Nivre, Laura Rimell, Ryan McDonald, Carlos Gómez-Rodríguez
Proceedings of the 23rd International Conference on Computational Linguistics (COLING-10). 2010
Mathematical foundations for a compositional distributional model of meaning
Bob Coecke, Mehrnoosh Sadrzadeh, Stephen Clark
Linguistic Analysis. 2010
Supertagging for Efficient Wide-Coverage CCG Parsing
Stephen Clark, James R Curran
Supertagging: Using Complex Lexical Descriptions in Natural Language Processing. 2010
Statistical parsing
Stephen Clark
Handbook of Computational Linguistics and Natural Language Processing. 2010
Automated collage generation-with intent
Anna Krzeczkowska, Jad El-Hage, Simon Colton, Stephen Clark
Proceedings of the 1st International Conference on Computational Creativity. 2010
Linguistic steganography using automatically generated paraphrases
Ching-Yun Chang, Stephen Clark
Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Faster parsing by supertagger adaptation
Jonathan K Kummerfeld, Jessika Roesner, Tim Dawborn, James Haggerty, James R Curran, Stephen Clark
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. 2010
Cambridge: Parser evaluation using textual entailment by grammatical relation comparison
Laura Rimell, Stephen Clark
Proceedings of the 5th International Workshop on Semantic Evaluation. 2010
Chart pruning for fast lexicalised-grammar parsing
Yue Zhang, Byung-Gyu Ahn, Stephen Clark, Curt Van Wyk, James R Curran, Laura Rimell
Proceedings of the 23rd International Conference on Computational Linguistics: Posters. 2010
A fast decoder for joint word segmentation and POS-tagging using a single discriminative model
Yue Zhang, Stephen Clark
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Practical linguistic steganography using contextual synonym substitution and vertex colour coding
Ching-Yun Chang, Stephen Clark
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Metaphor Corpus Annotated for Source-Target Domain Mappings.
Ekaterina Shutova, Simone Teufel
LREC. 2010
Corpora for the Conceptualisation and Zoning of Scientific Papers.
Maria Liakata, Simone Teufel, Advaith Siddharthan, Colin R Batchelor
LREC. 2010
Exploring variation across biomedical subdomains
Diarmuid Ó Séaghdha, Lin Sun, Anna Korhonen
In Proceedings of the 23rd International Conference on Computational Linguistics. 2010
Latent variable models of selectional preference
Diarmuid O Séaghdha
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. 2010
Semeval-2010 task 8: Multi-way classification of semantic relations between pairs of nominals
Iris Hendrickx, Su Nam Kim, Zornitsa Kozareva, Preslav Nakov, Diarmuid O Séaghdha, Sebastian Padó, Marco Pennacchiotti, Lorenza Romano, Stan Szpakowicz
Proceedings of the 5th International Workshop on Semantic Evaluation. 2010
SemEval-2010 task 9: The interpretation of noun compounds using paraphrasing verbs and prepositions
Cristina Butnariu, Su Nam Kim, Preslav Nakov, Diarmuid O Séaghdha, Stan Szpakowicz, Tony Veale
Proceedings of the 5th International Workshop on Semantic Evaluation. 2010
Exploring variations across biomedical subdomains
Tom Lippincott, Diarmuid O Séaghdha, Lin Sun, Anna Korhonen
Proceedings of the 23rd International Conference on Computational Linguistics. 2010
Scaling the iHMM: Parallelization versus Hadoop
Sébastien Bratières, Jurgen van Gael, Andreas Vlachos, Zoubin Ghahramani
Proceedings of the Workshop on Scalable Machine Learning and Applications, IEEE International Conference on Computing and Information Technology. 2010
Two strong baselines for the BioNLP 2009 event extraction task
Andreas Vlachos
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing
Semi-supervised learning for biomedical information extraction
Andreas Vlachos
PhD thesis, University of Cambridge. 2010
Underquantification: an application to mass terms
Aurelie Herbelot, Ann Copestake
Proceedings of Empirical, Theoretical and Computational Approaches to Countability in Natural Language, Bochum, Germany. 2010
Annotating underquantification
Aurelie Herbelot, Ann Copestake
Proceedings of the Fourth Linguistic Annotation Workshop. 2010
Camtology: intelligent information access for science
Ted Briscoe, Karl Harrison, Andrew Naish-Guzman, Andy Parker, Advaith Siddharthan, David Sinclair, Mark Slater, Rebecca Watson
Proceedings of the NAACL HLT 2010 Demonstration Session
Active learning for constrained Dirichlet process mixture models
Andreas Vlachos, Zoubin Ghahramani, Ted Briscoe
Proceedings of the 2010 workshop on geometrical models of natural language semantics
Automated assessment of ESOL free text examinations
Ted Briscoe, Ben Medlock, Øistein Andersen
Computer Laboratory, University of Cambridge. 2010
Combining Manual Rules and Supervised Learning for Hedge Cue and Scope Detection
Marek Rei, Ted Briscoe
The 14th Conference on Natural Language Learning (CoNLL-10). 2010

2009

Automatic Lexical Classification--Balancing between Machine Learning and Linguistics.
Anna Korhonen
PACLIC. 2009
Number sense disambiguation
Stuart Moore, Anna Korhonen, Sabine Buchholz
Proceedings of the Conference of Pacific Association for Computational Linguistics (PACLING’09). 2009
GEMS: GEometrical Models of Natural Language Semantics
Roberto Basili, Marco Pennacchiotti
. 2009
Improved cancer risk assessment using text mining
Ilona Silins, Anna Korhonen, Johan Högberg, Lin Sun, Ulla Stenius
Cancer Research. 2009
VerbNet overview, extensions, mappings and applications
Karin Kipper Schuler, Anna Korhonen, Susan Brown
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Tutorial Abstracts
User-driven development of text mining resources for cancer risk assessment
Lin Sun, Anna Korhonen, Ilona Silins, Ulla Stenius
Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing. 2009
Improving verb clustering with automatically acquired selectional preferences
Lin Sun, Anna Korhonen
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2-Volume 2
The first step in the development of text mining technology for cancer risk assessment: Identifying and organizing scientific evidence in risk assessment literature
Anna Korhonen, Ilona Silins, Lin Sun, Ulla Stenius
BMC bioinformatics. 2009
Towards unrestricted, large-scale acquisition of feature-based conceptual representations from corpus data
Barry Devereux, Nicholas Pilkington, Thierry Poibeau, Anna Korhonen
Research on Language and Computation. 2009
EBMT for SMT: a new EBMT-SMT hybrid
James Smith, Stephen Clark
Proceedings of the 3rd International Workshop on Example-Based Machine Translation. 2009
Comparing the accuracy of CCG and Penn Treebank parsers
Stephen Clark, James R Curran
Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
Unbounded dependency recovery for parser evaluation
Laura Rimell, Stephen Clark, Mark Steedman
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2-Volume 2
Transition-based parsing of the Chinese treebank using a global discriminative model
Yue Zhang, Stephen Clark
Proceedings of the 11th International Conference on Parsing Technologies. 2009
Porting a lexicalized-grammar parser to the biomedical domain
Laura Rimell, Stephen Clark
Journal of biomedical informatics. 2009
An annotation scheme for citation function
Simone Teufel, Advaith Siddharthan, Dan Tidhar
Proceedings of the 7th SIGdial Workshop on Discourse and Dialogue. 2009
Logical metonymy: Discovering classes of meanings
Ekaterina Shutova, Simone Teufel
DiSCo 2009
Towards discipline-independent argumentative zoning: evidence from chemistry and computational linguistics
Simone Teufel, Advaith Siddharthan, Colin Batchelor
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3-Volume 3
Semantic classification with WordNet kernels
Diarmuid Ó Séaghdha
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Unsupervised and Constrained Dirichlet Process Mixture Models for Verb Clustering
Andreas Vlachos, Anna Korhonen, Zoubin Ghahramani
Proceedings of the ACL Workshop on Geometrical Models of Natural Language Semantics. 2009
The infinite HMM for unsupervised PoS tagging
Jurgen Van Gael, Andreas Vlachos, Zoubin Ghahramani
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing
Slacker semantics: why superficiality, dependency and avoidance of commitment can be the right way to go
Ann Copestake
Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics. 2009
Using lexical and relational similarity to classify semantic relations
Diarmuid O Séaghdha, Ann Copestake
Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics. 2009
Investigating content selection for language generation using machine learning
Colin Kelly, Ann Copestake, Nikiforos Karamanis
Proceedings of the 12th European Workshop on Natural Language Generation. 2009
Annotating genericity: How do humans decide? (A case study in ontology extraction)
Aurelie Herbelot, Ann Copestake
Studies in Generative Grammar. 2009
Large-scale syntactic processing: Parsing the web
Stephen Clark, Ann Copestake, James R Curran, Yue Zhang, Aurelie Herbelot, James Haggerty, Byung-Gyu Ahn, Curt Van Wyk, Jessika Roesner, Jonathan Kummerfeld, Tim Dawborn
Final report of the 2009 JHU CLSP workshop
What can formal or computational models tell us about how (much) language shaped the brain
Ted Briscoe
Biological Foundations and the Origin of Syntax. 2009
Biomedical event extraction without training data
Andreas Vlachos, Paula Buttery, Diarmuid O Séaghdha, Ted Briscoe
Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task. 2009
Adaptive Interactive Information Extraction
Marek Rei
MPhil Thesis, University of Cambridge. 2009

2008

Automatic classification of English verbs using rich syntactic features
Lin Sun, Anna Korhonen, Yuval Krymolowski
. 2008
A new challenge for text mining: Cancer risk assessment
Ian Lewin, Ilona Silins, Anna Korhonen, Johan Hogberg, Ulla Stenius
Proceedings of the ISMB BioLINK Special Interest Group on Text Data Mining. 2008
Verb class discovery from rich syntactic data
Lin Sun, Anna Korhonen, Yuval Krymolowski
International Conference on Intelligent Text Processing and Computational Linguistics. 2008
A large-scale classification of English verbs
Karin Kipper, Anna Korhonen, Neville Ryant, Martha Palmer
Language Resources and Evaluation. 2008
Lexschem: A large subcategorization lexicon for french verbs
Cédric Messiant, Thierry Poibeau
Language Resource and Evaluation conference. 2008
The choice of features for classification of verbs in biomedical texts
Anna Korhonen, Yuval Krymolowski, Nigel Collier
Proceedings of the 22nd International Conference on Computational Linguistics-Volume 1. 2008
Towards a Compositional Distributional Model of Meaning
Stephen Clark, Bob Coecke, Mehrnoosh Sadrzadeh
. 2008
Joint Word Segmentation and POS Tagging Using a Single Perceptron.
Yue Zhang, Stephen Clark
ACL. 2008
Constructing a parser evaluation scheme
Laura Rimell, Stephen Clark
Coling 2008: Proceedings of the workshop on Cross-Framework and Cross-Domain Parser Evaluation
Asknet: Creating and evaluating large scale integrated semantic networks
Brian Harrington, Stephen Clark
International Journal of Semantic Computing. 2008
A tale of two parsers: investigating and combining graph-based and transition-based dependency parsing using beam-search
Yue Zhang, Stephen Clark
Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2008
Adapting a lexicalized-grammar parser to contrasting domains
Laura Rimell, Stephen Clark
Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2008
Using terms from citations for IR: some first results
Anna Ritchie, Simone Teufel, Stephen Robertson
European Conference on Information Retrieval. 2008
Sentence-based emotion classification for text-to-speech
Eirini Spyropoulou, Sabine Buchholz, Simone Teufel
International Workshop on Computational Aspects of Affectual and Emotional Interaction. 2008
Comparing citation contexts for information retrieval
Anna Ritchie, Stephen Robertson, Simone Teufel
Proceedings of the 17th ACM conference on Information and knowledge management. 2008
Learning compound noun semantics
Diarmuid O Séaghdha
University of Cambridge, Cambridge, UK. 2008
A stopping criterion for active learning
Andreas Vlachos
Computer, Speech and Language, Volume 22, Issue 3, July 2008
Dirichlet Process Mixture Models for Verb Clustering
Andreas Vlachos, Zoubin Ghahramani, Anna Korhonen
Proceedings of the ICML workshop on Prior Knowledge for Text and Language. 2008
Pyridines, pyridine and pyridine rings: disambiguating chemical name entities
Peter Corbett, Colin Batchelor, Ann Copestake
in Proceedings of BERBMTM-08 at LREC-2008
Language Resources and Chemical Informatics.
CJ Rupp, Ann A Copestake, Peter T Corbett, Peter Murray-Rust, Advaith Siddharthan, Simone Teufel, Benjamin Waldron
LREC. 2008
Generating research websites using summarisation techniques
Advaith Siddharthan, Ann Copestake
Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Demo Session. 2008
Semantic classification with distributional kernels
Diarmuid Ó Séaghdha, Ann Copestake
Proceedings of the 22nd International Conference on Computational Linguistics-Volume 1. 2008
Cascaded classifiers for confidence-based chemical named entity recognition
Peter Corbett, Ann Copestake
BMC bioinformatics. 2008
Linguistic Adaptations for Resolving Ambiguity
Ted Briscoe, Paula Buttery
The Evolution of Language: Proceedings of the 7th International Conference (EVOLANG7), Barcelona, Spain, 12-15 March 2008
Bootstrapping an interactive information extraction system for FlyBase curation.
Ted Briscoe, Caroline Gasperin, Ian Lewin, Andreas Vlachos
Ontologies and Text Mining for Life Sciences. 2008
Natural Language Processing in aid of FlyBase curators
Nikiforos Karamanis, Ruth Seal, Ian Lewin, Peter McQuilton, Andreas Vlachos, Caroline Gasperin, Rachel Drysdale, Ted Briscoe
BMC bioinformatics. 2008
The BNC Parsed with RASP4UIMA.
Øistein E Andersen, Julien Nioche, Ted Briscoe, John A Carroll
LREC. 2008
Language learning, power laws, and sexual selection
Ted Briscoe
Mind & Society. 2008
Statistical anaphora resolution in biomedical texts
Caroline Gasperin, Ted Briscoe
Proceedings of the 22nd International Conference on Computational Linguistics-Volume 1. 2008

2007

I will shoot your shopping down and you can shoot all my tins: automatic lexical acquisition from the CHILDES database
Paula Buttery, Anna Korhonen
Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition. 2007
Combining Symbolic and Distributional Models of Meaning
Stephen Clark, Stephen Pulman
AAAI Spring Symposium: Quantum Interaction. 2007
Chinese segmentation with a word-based perceptron algorithm
Yue Zhang, Stephen Clark
Annual Meeting-Association for Computational Linguistics. 2007
Formalism-independent parser evaluation with CCG and DepBank
Stephen Clark, James R Curran
Annual Meeting-Association for Computational Linguistics. 2007
Improving the efficiency of a wide-coverage CCG parser
Bojan Djordjevic, James R Curran, Stephen Clark
Proceedings of the 10th International Conference on Parsing Technologies. 2007
Linguistically motivated large-scale NLP with C&C and Boxer
James R Curran, Stephen Clark, Johan Bos
Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions. 2007
Perceptron training for a wide-coverage lexicalized-grammar parser
Stephen Clark, James R Curran
Proceedings of the Workshop on Deep Linguistic Processing. 2007
Asknet: Automated semantic knowledge network
Brian Harrington, Stephen Clark
PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE. 2007
Wide-coverage efficient statistical parsing with CCG and log-linear models
Stephen Clark, James R Curran
Computational Linguistics. 2007
Whose Idea Was This, and Why Does it Matter? Attributing Scientific Work to Citations.
Advaith Siddharthan, Simone Teufel
HLT-NAACL. 2007
An overview of evaluation methods in TREC ad hoc information retrieval and TREC question answering
Simone Teufel
In Evaluation of Text and Speech Systems. L. Dybkjaer, H. Hemsen, W. Minker (Eds.) Springer, Dordrecht (The Netherlands). 2007
Annotation of chemical named entities
Peter Corbett, Colin Batchelor, Simone Teufel
Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing
Designing and evaluating a semantic annotation scheme for compound nouns
Diarmuid O Séaghdha
Proc. Corpus Linguistics. 2007
Annotating and learning compound noun semantics
Diarmuid Ó Séaghdha
Proceedings of the 45th Annual Meeting of the ACL: Student Research Workshop. 2007
Evaluating and combining biomedical named entity recognition systems
Andreas Vlachos
Proceedings of the 2007 Workshop on Biological, translational, and clinical language processing
Tackling the BioCreative2 Gene Mention task with Conditional Random Fields and syntactic parsing
Andreas Vlachos
Proceedings of the Second BioCreative Challenge Evaluation Workshop. 2007
From gene names to actual genes
Ioannis Korkontzelos, Andreas Vlachos, Ian Lewin
Proceedings of BioLINK SIG: Linking Literature, Information and Knowledge for Biology . 2007
Evaluating an Open Domain GRE algorithm on closed domains System IDs: CAM-B, CAM-T, CAM-BU and CAM-TU
Advaith Siddharthan, Ann Copestake
Proceedings of the Workshop on Using Corpora for NLG: Language Generation and Machine Translation (UCNLG+ MT). 2007
Semantic composition with (robust) minimal recursion semantics
Ann Copestake
Proceedings of the Workshop on Deep Linguistic Processing. 2007
Co-occurrence contexts for noun compound interpretation
Diarmuid Ó Séaghdha, Ann Copestake
proceedings of the Workshop on A Broader Perspective on Multiword Expressions. 2007
Applying robust semantics
Ann Copestake
Proceedings of the 10th Conference of the Pacific Assocation for Computational Linguistics (PACLING). 2007
Integrating general-purpose and domain-specific components in the analysis of scientific text
CJ Rupp, Ann Copestake, Peter Corbett, Ben Waldron
Proc. of the UK e-Science Programme All Hands Meeting. 2007
Adapting the RASP system for the CoNLL07 domain-adaptation task
Rebecca Watson, Ted Briscoe
Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL. 2007
Weakly supervised learning for hedge classification in scientific literature
Ben Medlock, Ted Briscoe
Annual Meeting of the Association for Computational Linguistics. 2007
A system for large-scale acquisition of verbal, nominal and adjectival subcategorization frames from corpora
Judita Preiss, Ted Briscoe, Anna Korhonen
Annual Meeting of the Association for Computational Linguistics. 2007
Semi-supervised training of a statistical parser from unlabeled partially-bracketed data
Rebecca Watson, Ted Briscoe, John Carroll
Proceedings of the 10th International Conference on Parsing Technologies. 2007

2006

A large-scale extension of VerbNet with novel verb classes
Karin Kipper, Anna Korhonen, Neville Ryant, Martha Palmer
Atti del XII Congresso Internazionale di Lessicografia: Torino, 6-9 settembre 2006
Extending VerbNet with novel verb classes
Karin Kipper, Anna Korhonen, Neville Ryant, Martha Palmer
Proceedings of LREC. 2006
Zone analysis in biology articles as a basis for information extraction
Yoko Mizuta, Anna Korhonen, Tony Mullen, Nigel Collier
International journal of medical informatics. 2006
Automatic classification of verbs in biomedical texts
Anna Korhonen, Yuval Krymolowski, Nigel Collier
Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics. 2006
Extensive classifications of english verbs
Karin Kipper, Anna Korhonen, Neville Ryant, Martha Palmer
Proceedings of the 12th EURALEX International Congress. 2006
Partial training for a lexicalized-grammar parser
Stephen Clark, James R Curran
Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics. 2006
Multi-tagging for lexicalized-grammar parsing
James R Curran, Stephen Clark, David Vadas
Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics. 2006
Argumentative zoning applied to critiquing novices’ scientific abstracts
Valéria D Feltrim, Simone Teufel, Maria Graças V das Nunes, Sandra M Aluísio
In ``Computing Attitude and Affect in Text: Theory and Applications'' James G. Shanahan, Yan Qu, Janyce Wiebe (Eds.) Springer, Dordrecht, The Netherlands, 2005. 2006
Argumentative zoning for improved citation indexing
Simone Teufel
In ``Computing Attitude and Affect in Text: Theory and Applications'' James G. Shanahan, Yan Qu, Janyce Wiebe (Eds.) Springer, Dordrecht, The Netherlands, 2005.. 2006
Creating a test collection for citation-based IR experiments
Anna Ritchie, Simone Teufel, Stephen Robertson
Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics. 2006
A bootstrapping approach to unsupervised detection of cue phrase variants
Rashid M Abdalla, Simone Teufel
Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics. 2006
Automatic classification of citation function
Simone Teufel, Advaith Siddharthan, Dan Tidhar
Proceedings of the 2006 conference on empirical methods in natural language processing
How to find better index terms through citations
Anna Ritchie, Simone Teufel, Stephen Robertson
Proceedings of the Workshop on How Can Computational Linguistics Improve Information Retrieval?. 2006
Bootstrapping and Evaluating Named Entity Recognition in the Biomedical Domain
Andreas Vlachos, Caroline Gaseprin
Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology. 2006
Active Annotation
Andreas Vlachos
Proceedings of the Workshop on Adaptive Text Extraction and Mining. 2006
An architecture for language processing for scientific texts
Ann Copestake, Peter Corbett, Peter Murray-Rust, CJ Rupp, Advaith Siddharthan, Simone Teufel, Ben Waldron
Proceedings of the UK e-Science All Hands Meeting 2006
Robust minimal recursion semantics
Ann Copestake
unpublished draft. 2006
Flexible interfaces in the application of language technology to an escience corpus
CJ Rupp, Ann Copestake, Simone Teufel, Ben Waldron
Proceedings of the UK e-Science programme all hands meeting. 2006
Preprocessing and tokenisation standards in DELPH-IN tools
Benjamin Waldron, Ann Copestake, Ulrich Schäfer, Bernd Kiefer
Proceedings of the 5th International Conference on Language Resources and Evaluation. 2006
A standoff annotation interface between DELPH-IN components
Benjamin Waldron, Ann Copestake
Proceedings of the 5th Workshop on NLP and XML: Multi-Dimensional Markup in Natural Language Processing. 2006
Conventional speech act formulae: from corpus findings to formalization
Ann Copestake, Marina Terkourafi
Proceedings of Constraints in Discourse, NUI Maynooth, Ireland. 2006
Conventional speech act formulae in HPSG
Ann Copestake, Marina Terkourafi
poster). 13th International Conference on Head-Driven Phrase Structure Grammar, Varna. 2006
Acquiring ontological relationships from wikipedia using rmrs
Aurelie Herbelot, Ann Copestake
Proceedings of Workshop on Web content Mining with Human Language Technologies, ISWC06. 2006
Bootstrapping the recognition and anaphoric linking of named entities in drosophila articles
Andreas Vlachos, Caroline Gasperin, Ian Lewin, Ted Briscoe
Proc. of the Pacific Symposium on Biocomputing. 2006
An introduction to tag sequence grammars and the RASP system parser
Ted Briscoe
Computer Laboratory Technical Report. 2006
Annotation guidelines for Named Entity Recognition in the FlySLIP project
Andreas Vlachos, Nikiforos Karamanis, Ruth Seal, Ian Lewin, Chihiro Yamada, Caroline Gasperin, Ted Briscoe
University of Cambridge, CRL, Cambridge. 2006
A large subcategorization lexicon for natural language processing applications
Anna Korhonen, Yuval Krymolowski, Ted Briscoe
Proc. of the 5th LREC. 2006
The second release of the RASP system
Ted Briscoe, John Carroll, Rebecca Watson
Proceedings of the COLING/ACL on Interactive presentation sessions. 2006
Evaluating the accuracy of an unlexicalized statistical parser on the PARC DepBank
Ted Briscoe, John Carroll
Proceedings of the COLING/ACL on Main conference poster sessions. 2006