Computer Laboratory

The Cambridge Dictionary of Mathematical Types (version 1.0)

Yiannos A. Stathopoulos and Simone Teufel, with contributions from Simon Baker and Marek Rei

Thank you for your interest in downloading the Cambridge Dictionary of Mathematical Types (CDMT) and for using it in your research.
This release bundles two type dictionaries and a human-annotated gold-standard data set for type detection. Types are technical terminology phrases that represent mathematical concepts and objects; type detection is the task of determining whether a technical term phrase is a type.
The seed dictionary contains 10601 types, while the extended type dictionary contains 1.23 million automatically detected type phrases. The relevant papers for each data set are listed below.

The data set is distributed under the Open Database License (ODbL).

Please provide your details in the form below. While the Comments field is optional, you are very welcome to enter any comments, thoughts or questions you may have; your feedback is very much appreciated.

Please cite the following paper if you use the seed dictionary or the gold-standard annotation in your research:

Yiannos A. Stathopoulos and Simone Teufel:
Mathematical Information Retrieval Based on Type Embeddings and Query Expansion
In Proceedings of the 26th International Conference on Computational Linguistics (Coling 2016). Osaka, Japan, 2016.

@inproceedings{DBLP:conf/coling/StathopoulosT16,
  author    = {Yiannos Stathopoulos and Simone Teufel},
  title     = {Mathematical Information Retrieval based on Type Embeddings and Query Expansion},
  booktitle = {{COLING} 2016, 26th International Conference on Computational Linguistics, Proceedings of the Conference: Technical Papers, December 11-16, 2016, Osaka, Japan},
  pages     = {2344--2355},
  year      = {2016},
  crossref  = {DBLP:conf/coling/2016},
  url       = {http://aclweb.org/anthology/C/C16/C16-1221.pdf},
  timestamp = {Tue, 16 Jan 2018 17:41:35 +0100},
  biburl    = {https://dblp.org/rec/bib/conf/coling/StathopoulosT16},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

Please cite the following paper if you use the extended type dictionary in your research:

Yiannos A. Stathopoulos, Simon Baker, Marek Rei and Simone Teufel:
Variable Typing: Assigning Meaning to Variables in Mathematical Text
In Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2018). New Orleans, United States, 2018.

@InProceedings{N18-1028,
  author    = "Stathopoulos, Yiannos and Baker, Simon and Rei, Marek and Teufel, Simone",
  title     = "Variable Typing: Assigning Meaning to Variables in Mathematical Text",
  booktitle = "Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)",
  year      = "2018",
  publisher = "Association for Computational Linguistics",
  pages     = "303--312",
  location  = "New Orleans, Louisiana",
  url       = "http://aclweb.org/anthology/N18-1028"
}

Name:
Institution:
Email address:
Comments:
Fields marked with bold font are mandatory.