A vehicular vision system includes a plurality of cameras disposed at a vehicle and having respective exterior fields of view, and a display screen for displaying images derived from captured image data in a surround view format where captured image data is merged to provide a single composite display image from a virtual viewing position. A control includes a processor that processes image data captured by the cameras to detect an object present in the field of view of at least one of the cameras. During a driving maneuver of the vehicle, the display screen displays surround view video images and responsive to detection of the object, the display screen displays an enlarged view of the detected object.