Configure Horizontal Pod AutoScaler (HPA)

Deploy the Metrics Server

Metrics Server is a cluster-wide aggregator of resource usage data. These metrics will drive the scaling behavior of the deployments. We will deploy the metrics server using Helm configured in a previous module

# create the metrics-service namespace first
kubectl create namespace metrics

helm install metrics-server \
    stable/metrics-server \
    --version 2.9.0 \
    --namespace metrics

Edit 2020-04-22: some versions of EKS may get the following error once you create the HPA in the next step

kubectl describe hpa
... failed to get cpu utilization: unable to get metrics for resource cpu

To fix, set the following values for the metric-server helm release (as described here):

  - --kubelet-preferred-address-types=InternalIP

Confirm the Metrics API is available.

Return to the terminal in the Cloud9 Environment

kubectl get apiservice -o yaml

If all is well, you should see a status message similar to the one below in the response

  - lastTransitionTime: "2020-02-18T21:33:26Z"
    message: all checks passed
    reason: Passed
    status: "True"
    type: Available

We are now ready to scale a deployed application