Computer Laboratory

Douwe Kiela

I'm a PhD student working on the A Unified Model of Compositional and Distributional Semantics: Theory and Applications project under Steve Clark. I did an undergraduate degree at Utrecht University with a double major in Cognitive Artificial Intelligence and Philosophy. After that I went on to get a master's degree in Logic from the University of Amsterdam's Institute for Logic, Language & Computation, with brief stints as a visiting researcher at Stanford's CSLI and NYU's Philosophy and Linguistics departments. Before I started the PhD, I completed the MPhil in Advanced Computer Science here in Cambridge. See my LinkedIn profile for industry-related experience.

How do I pronounce your name?

In Dutch, "ou" is pronounced like in loud and "we" like in were, but you can simply call me Dow if that's easier.



  • Multi- and Cross-Modal Semantics Beyond Vision: Grounding in Auditory Perception
    D. Kiela and S. Clark
    Proceedings of EMNLP 2015, Lisbon, Portugal.
    [pdf] [bib] [data]
  • Visual Bilingual Lexicon Induction with Transferred ConvNet Features
    D. Kiela, I. Vulić and S. Clark
    Proceedings of EMNLP 2015, Lisbon, Portugal.
    [pdf] [bib] [data]
  • Specializing Word Embeddings for Similarity or Relatedness
    D. Kiela, F. Hill and S. Clark
    Proceedings of EMNLP 2015, Lisbon, Portugal.
    [pdf] [bib]
  • Grounding Semantics in Olfactory Perception
    D. Kiela, L. Bulat and S. Clark
    Proceedings of ACL 2015, Beijing, China.
    [pdf] [bib] [data]
  • Exploiting Image Generality for Lexical Entailment Detection
    D. Kiela, L. Rimell, I. Vulić and S. Clark
    Proceedings of ACL 2015, Beijing, China.
    [pdf] [bib] [data]
  • Adaptive communication: Languages with more non-native speakers tend to have fewer word forms.
    C. Bentz, A. Verkerk, D. Kiela, F. Hill and P. Buttery
    PLoS One.


  • Unsupervised Discovery of Information Structure in Biomedical Documents
    D. Kiela, Y. Guo, U. Stenius and A. Korhonen
    Bioinformatics, Volume 31, Number 7
    [paper] [bib] [data]
  • Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics
    D. Kiela and L. Bottou
    Proceedings of EMNLP 2014, Doha, Qatar.
    [pdf] [bib] [data]
  • Improving Multi-Modal Representations Using Image Dispersion: Why Less is Sometimes More
    D. Kiela*, F. Hill*, A. Korhonen and S. Clark
    Proceedings of ACL 2014, Baltimore, MA.
    [pdf] [bib] [data]
  • A Systematic Study of Semantic Vector Space Model Parameters
    D. Kiela and S. Clark
    Proceedings of EACL 2014, Second Workshop on Continuous Vector Space Models and their Compositionality (CVSC), Gothenburg, Sweden.
    [pdf] [bib]
  • Zipf's law across languages of the world: Towards a quantitative measure of lexical diversity
    C. Bentz and D. Kiela
    Proceedings of the 10th International Conference on the Evolution of Language (EvoLang-10), Vienna, Austria.
  • Zipf's law and the grammar of languages: A quantitative study of Old and Modern English parallel texts
    C. Bentz, D. Kiela, F. Hill and P. Buttery
    Corpus Linguistics and Linguistic Theory, Volume 10, Issue 1. Berlin: de Gruyter
    [pdf] [bib]


  • Detecting Compositionality of Multi-Word Expressions using Nearest Neighbours in Vector Space Models
    D. Kiela and S. Clark
    Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-13), Seattle, WA
    [pdf] [bib]
  • Concreteness and corpora: A theoretical and practical analysis.
    F. Hill, D. Kiela and A. Korhonen
    Proceedings of ACL 2013, Workshop on Cognitive Modelling and Computational Linguistics, Sofia, Bulgaria (Best student paper award)
    [pdf] [bib]
  • UCAM-CORE: Incorporating structured distributional similarity into STS.
    T. Polajnar, L. Rimell and D. Kiela
    Proceedings of *SEM 2013 (shared task), Atlanta, Georgia.
  • [pdf] [bib]

Full list of publications



University of Cambridge Computer Laboratory
William Gates Building, 15 JJ Thomson Avenue
Cambridge CB3 0FD, UK
+44 (0)1223 763612