An image capturing device has a user interface in the form of a screen, wherein the user interface allows a user to provide instructions regarding positioning of the image capturing device to include at least one predetermined view point. The instructions are in the form of at least one of voice, text, and an image. The image capturing device can also have a storage device configured to (i) record the instructions and (ii) store a captured image, and a processor programmed to retrieve the instructions from the storage device when requested by a user via the user interface and to create boundaries around key objects that are being built.
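
The arrangement of storage device and processor described above can be illustrated with a minimal Python sketch. It is not the disclosed implementation. All names (`Instruction`, `Storage`, `Processor`, `create_boundary`) are hypothetical. The sketch assumes that a key object is supplied as a list of pixel coordinates and that a boundary is an axis-aligned rectangle enclosing them. How key objects are actually detected is not stated in the text and is not modeled here.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Tuple


class InstructionForm(Enum):
    """The three instruction forms named in the text."""
    VOICE = "voice"
    TEXT = "text"
    IMAGE = "image"


@dataclass
class Instruction:
    form: InstructionForm
    payload: bytes    # raw audio, encoded text, or image bytes (assumed encoding)
    view_point: str   # hypothetical label for a predetermined view point


@dataclass
class Boundary:
    label: str
    x: int
    y: int
    width: int
    height: int


@dataclass
class Storage:
    """Stands in for the storage device: records instructions, stores images."""
    instructions: List[Instruction] = field(default_factory=list)
    images: List[bytes] = field(default_factory=list)

    def record_instruction(self, instruction: Instruction) -> None:
        self.instructions.append(instruction)

    def store_image(self, image: bytes) -> None:
        self.images.append(image)


class Processor:
    """Stands in for the processor: retrieves instructions, creates boundaries."""

    def __init__(self, storage: Storage) -> None:
        self.storage = storage

    def retrieve_instructions(self, view_point: str) -> List[Instruction]:
        # Return every recorded instruction for the requested view point.
        return [i for i in self.storage.instructions if i.view_point == view_point]

    def create_boundary(self, label: str, points: List[Tuple[int, int]]) -> Boundary:
        # Assumption: the boundary is the smallest axis-aligned rectangle
        # that contains all pixel coordinates of the key object.
        if not points:
            raise ValueError("a key object needs at least one point")
        xs = [p[0] for p in points]
        ys = [p[1] for p in points]
        return Boundary(label, min(xs), min(ys), max(xs) - min(xs), max(ys) - min(ys))
```

In this sketch, a request made through the user interface would call `retrieve_instructions` with the label of the desired view point, and `create_boundary` would be called once per key object in a captured image.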