Patent 10355946 was granted and assigned to Hortonworks on July, 2019 by the United States Patent and Trademark Office.
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a plurality of software components deployed on a cluster of computing nodes, each component operating on one or more respective nodes within the cluster to process a workload, each component utilizing at least the following resources of the one or more respective nodes: storage, processing power, memory, and disk and network input and output operations; determining, for each identified component, current and historical usages of the resources of the one or more respective nodes by the component when processing the workload; receiving data specifying a future workload; and calculating, from the current and historical usages of each component, an estimated capacity required for the cluster to process the future workload, where the estimated capacity specifies, at least, how many nodes should be in the cluster to process the future workload.