Readability Data

Cambridge English Exams dataset

Please cite the following paper if you use the dataset. Please also do not reproduce more than one of the reading texts in any publication or presentation arising from further analysis of the dataset.

Menglin Xia, Ekaterina Kochmar, Ted Briscoe. 2016. "Text Readability Assessment for Second Language Learners". Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2016). San Diego, California, USA.

Please email me (menglin.xia at …) if you would like to download the data.