An apparatus for contextual execution comprises a processor, and a memory containing instructions, which when executed by the processor, cause the apparatus to receive, from a user terminal, a control input associated with an intent, obtain location data associated with a location of the user terminal, and determine a scored set of execution options associated with the control input. Further, the instructions, when executed by the processor cause the apparatus to obtain a contextual label associated with the location data, the label determined based on the application of one or more adapted pretrained deep learning models to the location data.