Department of Computer Science and Technology

Technical reports

Colour videos with depth: acquisition, processing and evaluation

Christian Richardt

March 2012, 132 pages

This technical report is based on a dissertation submitted November 2011 by the author for the degree of Doctor of Philosophy to the University of Cambridge, Gonville & Caius College.

Some figures in this document are best viewed in colour. If you received a black-and-white copy, please consult the online version if necessary.

DOI: 10.48456/tr-815


The human visual system lets us perceive the world around us in three dimensions by integrating evidence from depth cues into a coherent visual model of the world. The equivalent in computer vision and computer graphics is a geometric model, which provides a wealth of information about the represented objects, such as depth and surface normals. Videos do not contain this information; they provide only per-pixel colour. In this dissertation, I therefore investigate a combination of videos and geometric models: videos with per-pixel depth (also known as RGBZ videos). I consider the full life cycle of these videos: from their acquisition, via filtering and processing, to stereoscopic display.

