AutoScale Your Clusters
Scaling is the ability to easily increase or decrease a resource to accommodate heavier or lighter loads. In Azure CycleCloud, jobs can be easily scaled up when the load increases, or scaled down to conserve cost. This can be done automatically or manually.
When creating a new cluster via the GUI, the Compute Backend tab allows you to choose to auto-scale your cluster and add execute hosts as required. Check the box to allow CycleCloud to start and stop execute nodes as required, and set the number of initial and maximum cores allowed.
Setting a Max Cores number will limit the number of nodes started to ensure your workload does not run unfetted. You can also set up an usage alert when submitting your job to ensure a budget is not exceeded.
Auto-Scaling in Cluster Template
By default, auto-scaling is disabled for all new clusters. To enable it, add the following to your cluster template:
Autoscale = true ... MaxCoreCount = xx