Patent 11687720 was granted and assigned to Snap Inc. on June, 2023 by the United States Patent and Trademark Office.
A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use a visual attention based mechanism to generate a visual context representation from an image and caption. The system can use the visual context representation to identify one or more terms of the caption as a named entity.