A method of controlling an intelligent security device can include capturing a video; collecting voice information included in the video; in response to determining that the voice information includes a wake-up word corresponding to a predetermined basic wake-up word for the intelligent security device, transmitting a spoken utterance included in the voice information to a smart device; receiving a command from the smart device, the command being generated based on information related to the spoken utterance; and executing an operation of the intelligent security device based on the command.