A pair of video frames may be used for simultaneous localization and mapping (SLAM) initialization. The pair of frames may be determined according to a translation threshold. Whether the translation threshold is met may be determined by obtaining the pair of video frames and estimating the translation between the frames.