Patent attributes
Embodiments of the invention include a system for automated persona feature selection. Soft clusters of entities are received, each entity having a history of features. Each feature has a general prevalence coefficient representing the prevalence of entities having the respective feature in their history. A feature list is generated for each cluster, each feature having an in-cluster coefficient representing the prevalence of entities in the cluster having the feature in their history. Features whose in-cluster coefficient differs from their general prevalence coefficient are selected. A variance across the clusters is determined for each selected feature. A discriminating feature list of high-variance features is generated for each cluster. Clusters are selected for an entity by comparing the features of the entity's history to the features of the clusters' discriminating feature lists. Content is customized according to the selected clusters and sent to the entity.
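The steps summarized above can be illustrated with a minimal Python sketch. It is not the claimed implementation, and every name, threshold, and data value in it is hypothetical. It makes several assumptions the abstract does not state: soft clusters are modeled as possibly overlapping sets of entity identifiers, a coefficient is the fraction of entities having the feature, "different" is read narrowly as exceeding the general coefficient by a margin (`min_diff`), "high variance" means a population variance of at least `min_variance`, and an entity is matched to a cluster when it shares at least `min_overlap` features with that cluster's discriminating list.

```python
from statistics import pvariance


def prevalence(histories):
    """Fraction of entities whose history contains each feature."""
    histories = list(histories)
    counts = {}
    for feats in histories:
        for f in set(feats):
            counts[f] = counts.get(f, 0) + 1
    return {f: c / len(histories) for f, c in counts.items()}


def discriminating_features(histories, clusters, min_diff=0.05, min_variance=0.05):
    """Build one discriminating feature list per cluster (hypothetical thresholds)."""
    general = prevalence(histories.values())
    in_cluster = {
        name: prevalence(histories[e] for e in members)
        for name, members in clusters.items()
    }
    # Assumption: keep features over-represented in the cluster by a margin.
    selected = {
        name: {f for f, c in coeffs.items() if c - general[f] > min_diff}
        for name, coeffs in in_cluster.items()
    }
    # Variance of each selected feature's in-cluster coefficient across clusters.
    variance = {
        f: pvariance([in_cluster[name].get(f, 0.0) for name in clusters])
        for feats in selected.values()
        for f in feats
    }
    return {
        name: sorted(
            (f for f in feats if variance[f] >= min_variance),
            key=lambda f: (-variance[f], f),
        )
        for name, feats in selected.items()
    }


def select_clusters(history, feature_lists, min_overlap=1):
    """Select clusters whose discriminating list overlaps the entity's history."""
    feats = set(history)
    return [
        name
        for name, flist in feature_lists.items()
        if len(feats & set(flist)) >= min_overlap
    ]


# Hypothetical data: six entities, two overlapping (soft) clusters.
histories = {
    "e1": ["sports", "news", "cars"],
    "e2": ["sports", "news"],
    "e3": ["sports", "news"],
    "e4": ["news", "cooking"],
    "e5": ["cooking", "news", "travel"],
    "e6": ["cooking", "news"],
}
clusters = {"A": ["e1", "e2", "e3", "e4"], "B": ["e3", "e4", "e5", "e6"]}

feature_lists = discriminating_features(histories, clusters)
print(feature_lists)  # {'A': ['sports'], 'B': ['cooking']}
print(select_clusters(["sports", "news"], feature_lists))  # ['A']
```

In this toy data, "news" appears in every history, so its in-cluster coefficient equals its general coefficient and it is never selected. "cars" is selected for cluster A but dropped because its coefficient varies little across the clusters. The selected cluster names could then key a lookup of customized content to send to the entity, a step the sketch leaves out.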