There is provided a method of processing image data, comprising the steps of: (a) providing a plurality of images of a scene; (b) generating a disparity map for each of at least two pairs of images from the plurality of images; (c) transforming each of the disparity maps into a common coordinate system; and (d) merging the transformed disparity maps to provide a single representation of the depth information of the scene.
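By way of non-limiting illustration only, steps (c) and (d) might be sketched as follows. The sketch assumes rectified pinhole stereo pairs, so that depth is related to disparity by Z = f·B/d; it assumes the disparity maps of step (b) are already available; and it assumes a known rigid transform (rotation and translation) from each pair's reference camera to the common coordinate system. None of these assumptions is stated above. The function names, the parameters (`focal`, `baseline`, `cx`, `cy`) and the merging rule (per-pixel averaging of depth in the common view) are hypothetical choices made for the example and are not the only way to carry out the method.

```python
import numpy as np


def disparity_to_points(disparity, focal, baseline, cx, cy):
    """Back-project a disparity map into 3D points in its own camera frame.

    Assumes a rectified pinhole pair: Z = focal * baseline / disparity.
    Pixels with non-positive disparity are treated as invalid and skipped.
    """
    v, u = np.indices(disparity.shape)
    valid = disparity > 0
    z = focal * baseline / disparity[valid]
    x = (u[valid] - cx) * z / focal
    y = (v[valid] - cy) * z / focal
    return np.stack([x, y, z], axis=1)


def to_common_frame(points, rotation, translation):
    """Step (c): apply a rigid transform into the common coordinate system."""
    return points @ rotation.T + translation


def merge_depth(point_sets, focal, cx, cy, shape):
    """Step (d): project all points into one common view and average depth.

    Averaging is an illustrative merging rule; pixels that receive no
    point are left as NaN.
    """
    depth_sum = np.zeros(shape)
    count = np.zeros(shape)
    for pts in point_sets:
        pts = pts[pts[:, 2] > 0]  # keep points in front of the common camera
        u = np.rint(pts[:, 0] * focal / pts[:, 2] + cx).astype(int)
        v = np.rint(pts[:, 1] * focal / pts[:, 2] + cy).astype(int)
        inside = (u >= 0) & (u < shape[1]) & (v >= 0) & (v < shape[0])
        np.add.at(depth_sum, (v[inside], u[inside]), pts[inside, 2])
        np.add.at(count, (v[inside], u[inside]), 1)
    merged = np.full(shape, np.nan)
    seen = count > 0
    merged[seen] = depth_sum[seen] / count[seen]
    return merged


# Synthetic example: a fronto-parallel plane seen by two image pairs.
shape, focal, cx, cy = (4, 6), 100.0, 3.0, 2.0

# Pair A: its reference camera is taken as the common coordinate system.
disp_a = np.full(shape, 5.0)   # baseline 0.1
pts_a = disparity_to_points(disp_a, focal, 0.1, cx, cy)
pts_a = to_common_frame(pts_a, np.eye(3), np.zeros(3))

# Pair B: reference camera 0.5 units behind A's along the optical axis.
disp_b = np.full(shape, 8.0)   # baseline 0.2
pts_b = disparity_to_points(disp_b, focal, 0.2, cx, cy)
pts_b = to_common_frame(pts_b, np.eye(3), np.array([0.0, 0.0, -0.5]))

merged = merge_depth([pts_a, pts_b], focal, cx, cy, shape)
```

In this synthetic case both pairs observe the same plane, so the transformed point sets agree in the common coordinate system and the merged map holds a single consistent depth. With real data the per-pair estimates would differ, and a different merging rule (for example a confidence-weighted average or a nearest-surface test) might be preferred.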