A vehicle entertainment system is controlled in response to gestures made by a passenger of the vehicle. The vehicle entertainment system includes a display device, at least one gesture control camera, and a processor. The at least one gesture control camera generates a camera signal responsive to light reflected from at least one object within its field of view. The processor analyzes the camera signal to identify a gesture made by the passenger moving the at least one object, and controls at least one operation of the vehicle entertainment system responsive to the identified gesture.
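The processing chain described above (camera signal, gesture identification, operation control) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the disclosure: the camera signal is assumed to have already been reduced to a track of normalized object positions, the gesture set is limited to four swipes, and the names `identify_gesture`, `EntertainmentSystem`, `process_camera_signal`, and the `min_travel` threshold are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Hypothetical representation: (x, y) position of the tracked object in one
# camera frame, normalized to 0..1, with y increasing downward (image coordinates).
Point = Tuple[float, float]


def identify_gesture(track: List[Point], min_travel: float = 0.3) -> Optional[str]:
    """Label a track as a swipe from its net displacement (illustrative only)."""
    if len(track) < 2:
        return None
    dx = track[-1][0] - track[0][0]
    dy = track[-1][1] - track[0][1]
    # Ignore small movements so incidental motion is not treated as a gesture.
    if max(abs(dx), abs(dy)) < min_travel:
        return None
    if abs(dx) >= abs(dy):
        return "swipe_right" if dx > 0 else "swipe_left"
    return "swipe_down" if dy > 0 else "swipe_up"


@dataclass
class EntertainmentSystem:
    """Hypothetical stand-in for the controlled system; the mapping is assumed."""
    channel: int = 1
    volume: int = 5

    def control(self, gesture: Optional[str]) -> None:
        if gesture == "swipe_right":
            self.channel += 1
        elif gesture == "swipe_left":
            self.channel = max(1, self.channel - 1)
        elif gesture == "swipe_up":
            self.volume = min(10, self.volume + 1)
        elif gesture == "swipe_down":
            self.volume = max(0, self.volume - 1)
        # Unrecognized or absent gestures leave the system unchanged.


def process_camera_signal(system: EntertainmentSystem, track: List[Point]) -> Optional[str]:
    """Identify a gesture in the track and apply the corresponding operation."""
    gesture = identify_gesture(track)
    system.control(gesture)
    return gesture
```

In this sketch, calling `process_camera_signal(EntertainmentSystem(), [(0.1, 0.5), (0.6, 0.5)])` would classify the left-to-right movement as a swipe and advance the channel, while a track with little net movement would be ignored. An actual system would derive the track from the reflected-light camera signal and would likely recognize a richer set of gestures.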