Patent attributes
A system for data processing using machine learning processing and distributed architecture is described. Specifically, proprietary data transformation rules to be applied for the data processing may be stored at edge computing devices, while the bulk of data processing may be performed at a central computing node that houses the databases. A subset of a data set, in the database, may be sent from the central computing node to the edge computing node. The edge computing node may generate a second data set based on applying data transformation rules to the subset of the data set. The central computing node may determine, using a machine learning (ML) algorithm and based on the subset of the data set and the second data set, the data transformation rules, which may then be applied to the rest of the data set.