Patent attributes
The disclosure relates to AI-based machine-learning and natural language modeling to identify semantic similarities between sets of content having natural language text. For example, a system may generate a relevance classification that indicates whether content such as articles are non-specifically relevant to charities without identifying a particular charity. If the content is non-specifically relevant to charities, the system may apply a natural language model to generate sentence embeddings based on the content and determine a level similarity between the sentence embeddings and a query embedding generated from a charity query. The charity query may itself be generated from a full description of the charity through an encoder-decoder architecture with reinforcement learning.