Autoscale your infrastructure with Kubernetes

Preparing for a peak demand event requires time: you have to establish your baselines, assess your service level agreements, and align all your teams around the same data. Scaling your infrastructure has similar considerations, but also involves projecting system needs against cost. How do you balance the two without sacrificing performance? Where is the threshold for good enough?

One solution is horizontal pod autoscaling (HPA), which is built with our Kubernetes integration with Pixie. Once you've set up HPA, your Kubernetes cluster automatically allocates more pods when demand peaks, then deallocates when demand falls. This allows you to sidestep cost concerns when anticipating demand.

Diagram demonstrating how HPA interacts with New Relic at a high level.

The tutorial assumes that you're using Kubernetes clusters. To set up HPA, you'll need:

A Kubernetes cluster running a supported version.
Your New Relic user
No other external metrics adapter installed in the cluster

Objectives

This tutorial walks you through setting up a demo environment for HPA. You will:

Install the New Relic Kubernetes integration
Set up an example environment to test autoscaling

Forward metrics to Kubernetes

Clone our New Relic Pixie lab repo

Clone the following repo from Github:

bash

$git clone https://github.com/newrelic-experimental/pixie-lab-materials
$cd pixie-lab-materials/main
$./setup.sh

The setup.sh script spins up a new minikube cluster using the Pixie-supported hyperkit driver. It then configures your network memory and CPU for optimal performance with Pixie and creates all the pods and services that make up the demo application.

In a new terminal window, open a minikube tunnel:

bash

$minikube tunnel -p minikube-pixie-lab

You should have two terminals:

Your tunnel, which remains open to access your demo application.
A place to run commands for the tutorial.

Install the Kubernetes integration with Pixie

Follow our guided install to install the New Relic Kubernetes integration. This connects New Relic to your Kubernetes cluster. Make sure to:

Check Instant service-level insights, Full-body requests, and Application profiles through Pixie to enable Pixie.
Keep all other default checked items
Once you click continue, copy and paste that command into your dev environment.

Install New Relic Metrics Adapter

To install the New Relic Metrics Adapter, use the newrelic-k8s-metrics-adapter Helm chart. If you've used the nri-bundle-chart to deploy any New Relic Kubernetes components, then you have access to this Helm chart.

bash

$helm upgrade --install newrelic newrelic/nri-bundle \
>  --namespace newrelic --create-namespace --reuse-values \
>  --set metrics-adapter.enabled=true \
>  --set newrelic-k8s-metrics-adapter.personalAPIKey=YOUR_NEW_RELIC_PERSONAL_API_KEY \
>  --set newrelic-k8s-metrics-adapter.config.accountID=YOUR_NEW_RELIC_ACCOUNT_ID \
>  --set newrelic-k8s-metrics-adapter.config.externalMetrics.manipulate_average_requests.query="FROM Metric SELECT average(http.server.duration) WHERE instrumentation.provider='pixie'"

Here's what these flags do:

metrics-adapter.enabled: Sets to true to install the metrics adapter chart
newrelic-k8s-metrics-adapter.personalAPIKey: Sets your New Relic API key.
newrelic-k8s-metrics-adapter.accountID: The ID of the account that forwards metrics.
newrelic-k8s-metrics-adapter.config.externalMetrics.external_metric_name.query: Adds a new external metric with the following information:
- external_metric_name: The metric name.
- query: The base NRQL query for the metric.

Confirm your NRQL query is accurate

You should test your query before sending metrics to the autoscaler from New Relic. Go to one.newrelic.com > Query your Data, then copy and paste the following NRQL query:

FROM Metric SELECT average(http.server.duration) WHERE instrumentation.provider='pixie'

Configure your autoscaler

From the pixie-lab-materials/main/kube directory, create a new file called hpa.yml. The New Relic metrics adapter sends data to the controller manager, which is defined by the HPA definition in this YAML file.

kind: HorizontalPodAutoscaler
apiVersion: autoscaling/v2beta2
metadata:
    name: manipulate-scaler
spec:
    scaleTargetRef:
        apiVersion: apps/v1
        kind: Deployment
        name: manipulation-service
    minReplicas: 1
    maxReplicas: 10
    metrics:
        - type: External
            external:
                metric:
                    name: manipulate_average_requests
                target:
                    type: Value
                    value: 100

Apply the new YAML file by running:

bash

$cd pixie-lab-materials/main/kube
$kubectl apply -f hpa.yaml

Check your work: Add load to trigger autoscaling

Navigate to your site deployment with kubectl get services.
Open the EXTERNAL-IP from your frontend-service in your browser.
Install hey and Go v1.17 with brew install hey.
Send GET requests to the EXTERNAL-IP with hey -n 10 -c 2 -m GET http://<EXTERNAL-IP>.
Watch your HPA autoscalling with watch kubnectl get hpa.

You've successfully set up HPA if the pod autoscales the number of replicas as the average HTTP request time increases. You can adjust the configuration for your own services so that New Relic and HPA automatically autoscale as needed.

1Get started

Get data about your architecture with APM and infrastructure agents

2Create service levels for gameday

Create service levels informed by your baseline

3Reduce noise with quality alerts

Evaluate your alerts with alert quality management

4Align your teams with workloads

Align your teams around the same data

5Autoscale your infrastructure with Kubernetes

Scale your resources as demand peaks

You are here

Autoscale your infrastructure with Kubernetes

Objectives .css-21sua1{background:none;border:none;width:0;padding:0;}

Forward metrics to Kubernetes

Clone our New Relic Pixie lab repo

Install the Kubernetes integration with Pixie

Install New Relic Metrics Adapter

Confirm your NRQL query is accurate

Configure your autoscaler

Check your work: Add load to trigger autoscaling

2Create service levels for gameday

3Reduce noise with quality alerts

4Align your teams with workloads

5Autoscale your infrastructure with Kubernetes

Objectives