Cloze Test Generation

Proposer: Ekaterina Kochmar
Supervisors: Ekaterina Kochmar
Special Resources: Access to the Cambridge Learner Corpus (CLC) + Access to an NLIP machine/server
Applications: automated test generation, reading comprehension assessment

Description

Cloze tests, or fill-in-the-blank multiple-choice exercises, are tests that use sentences either directly extracted from the text or constructed based on the text content, where the tested words are replaced with a gap. The reader is typically presented with a list of alternatives, with one correct answer and a number of distractors, to choose from. For example, a cloze test assessing the use and command of prepositions may include the following question (from Lee & Seneff, 2007):

If you don't have anything planned for this evening, let's go __ a movie.

with the following options to choose from:

(a) to
(b) of
(c) on
(d) null

Cloze tests have been shown to be an effective means of assessing reading comprehension. In the past, such tests have been widely used to assess the language proficiency of non-native readers, but nowadays they can also be used to assess machine comprehension (Hermann et al., 2015).

In the example above, all options have been generated automatically using different approaches to distractor selection. Option (a) is the correct preposition in this case; option (b) is chosen because it has a frequency comparable to that of the correct preposition and might therefore be confused with it by non-native readers; distractor (c) is generated using collocation similarity between the two prepositions; finally, generation of distractor (d) relies on confusion probability in non-native data. This project will look into automated cloze test generation for a wide variety of lexical items, and in particular for open class words (verbs, nouns, adjectives and adverbs). As a starting point, the following methods may be applied to this task:
1. Approach 1: The baseline method that uses words of the same part of speech with comparable frequency in native English as possible distractors;
2. Approach 2: A more advanced approach that uses lexical resources such as WordNet and learner dictionaries to extract synonymous and related words; as an extension, native languages of the speakers can be taken into account and native language-related confusion patterns can be used to generate more challenging distractors in a more personalised way;
3. Approach 3: An approach that uses distributionally similar words as distractors, relying on a distributional semantic space; the word vectors should be built from large native English corpora using state-of-the-art approaches and tools (e.g., word2vec);
4. Approach 4: A language modelling-based approach that generates the alternatives in a way similar to the one used for sentence completion task generation in Zweig & Burges (2011).
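
As a rough illustration of Approaches 1 and 3, the sketch below selects distractors from toy data. The frequency counts, part-of-speech tags and two-dimensional vectors are invented for the example; a real system would take frequencies from a native English corpus and vectors from a trained model such as word2vec. The function names and the `ratio` threshold are assumptions made for illustration, not part of the project specification.

```python
import math

# Toy lexicon: word -> (part of speech, corpus frequency). All values are invented.
LEXICON = {
    "movie": ("NOUN", 900),
    "film": ("NOUN", 1000),
    "table": ("NOUN", 950),
    "galaxy": ("NOUN", 40),
    "watch": ("VERB", 920),
}

# Toy word vectors (invented); real ones would come from a trained model.
VECTORS = {
    "movie": (0.9, 0.1),
    "film": (0.8, 0.2),
    "table": (0.1, 0.9),
    "galaxy": (0.5, 0.5),
}


def frequency_distractors(target, lexicon, ratio=1.5, k=3):
    """Approach 1 (sketch): same part of speech, frequency within a factor of `ratio`."""
    pos, freq = lexicon[target]
    candidates = [
        (abs(f - freq), word)
        for word, (p, f) in lexicon.items()
        if word != target and p == pos and freq / ratio <= f <= freq * ratio
    ]
    # Closest frequency first.
    return [word for _, word in sorted(candidates)[:k]]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def similarity_distractors(target, vectors, k=3):
    """Approach 3 (sketch): the k words whose vectors are closest to the target's."""
    ranked = sorted(
        (word for word in vectors if word != target),
        key=lambda word: cosine(vectors[target], vectors[word]),
        reverse=True,
    )
    return ranked[:k]


freq_based = frequency_distractors("movie", LEXICON)
sim_based = similarity_distractors("movie", VECTORS, k=2)
```

With this toy data the similarity ranking puts "film" first for the target "movie". Near-synonyms like this may fit the gap just as well as the correct answer, so candidates from either method would still need filtering before being used as distractors.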
A successful cloze test generation system should be able to identify the fragments of text to be tested, and to generate distractors that are plausible enough to be picked by readers but that yield incorrect or ungrammatical sentences when picked. The project will address these challenges.
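
The two steps named above, choosing the fragment to test and attaching distractors, can be put together in a minimal sketch. It assumes that the target word and the distractor candidates are already given; `ClozeItem`, `make_cloze_item` and their parameters are hypothetical names. The check that a distractor really makes the sentence incorrect is left to a caller-supplied function, because in practice it would need a language model or human judgement.

```python
import random
from dataclasses import dataclass


@dataclass
class ClozeItem:
    stem: str          # sentence with the tested word replaced by a gap
    options: list      # correct answer and distractors, shuffled
    answer_index: int  # position of the correct answer in `options`


def make_cloze_item(sentence, target, candidates, is_acceptable,
                    n_distractors=3, seed=0):
    """Build one item; `is_acceptable(stem, word)` is a hypothetical filter hook."""
    tokens = sentence.split()
    if target not in tokens:
        raise ValueError(f"target {target!r} not found in sentence")
    gap_at = tokens.index(target)  # first occurrence only
    stem = " ".join(tokens[:gap_at] + ["__"] + tokens[gap_at + 1:])

    # Keep candidates that differ from the answer and do not also fit the gap.
    # Fewer than `n_distractors` may be returned if too few candidates survive.
    distractors = []
    for word in candidates:
        if word != target and word not in distractors and not is_acceptable(stem, word):
            distractors.append(word)
        if len(distractors) == n_distractors:
            break

    options = distractors + [target]
    random.Random(seed).shuffle(options)  # fixed seed keeps the order reproducible
    return ClozeItem(stem, options, options.index(target))


# Usage with the preposition example; the filter here accepts nothing (toy stand-in).
item = make_cloze_item(
    "If you don't have anything planned for this evening, let's go to a movie.",
    target="to",
    candidates=["of", "on", "null"],
    is_acceptable=lambda stem, word: False,
)
```

Splitting on whitespace and gapping the first occurrence of the target keeps the sketch short; a real system would need proper tokenisation and a principled way of choosing which words to test.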
Additional resources:

A dataset of cloze tests used in language exams will be provided.

Background reading:

Hermann et al. (2015). Teaching Machines to Read and Comprehend
Lee & Seneff (2007). Automatic Generation of Cloze Items for Prepositions
Zweig & Burges (2011). The Microsoft Research Sentence Completion Challenge
A comprehensive reading list summarising the past research on cloze test and distractor generation can be found here.
