Test drive Dynamic Application Sizing
Enterprise Only
The functionality described here is available only in Nomad Enterprise with the Multi-Cluster & Efficiency module. To explore Nomad Enterprise features, you can sign up for a free 30-day trial from here.
Prometheus Required
Currently, Prometheus is the only APM supported for Dynamic Application Sizing.
Using a Vagrant virtual machine, you will deploy a simple environment containing:
- An APM, specifically Prometheus, to collect metric data.
- Nomad Autoscaler Enterprise.
- A sample job, which will be configured to enable DAS recommendations with:
- one NGINX instance used as a TCP load balancer.
- three Redis instances to service requests.
- A sample dispatch job to create load on the Redis nodes.
Prerequisites
Familiarity with the Dynamic Application Sizing Concepts tutorial.
This Vagrantfile, which creates a suitable environment for running the demonstration.
The Vagrantfile provisions one Ubuntu 20.04 VM preinstalled with:
- Nomad Enterprise v1.0.0 beta 2
- The current package version of Consul
- The current package version of Docker
Start and connect to the Vagrant environment
Download the Vagrantfile. Start the test-drive environment by running vagrant up.
$ vagrant up
Once the environment is provisioned and you are returned to your command prompt, connect to the Vagrant instance.
$ vagrant ssh
Once you are at the vagrant@ubuntu-focal:~$ prompt, you are ready to continue.
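Optionally, confirm that the Nomad agent inside the VM is up before continuing. This check is an extra sanity step, not part of the original environment setup; expect a single node in the ready state.

$ nomad node status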
Verify Nomad telemetry configuration
Nomad needs to be configured to enable telemetry publishing. You need to enable allocation and node metrics. Since this tutorial also uses Prometheus as its APM, you need to set prometheus_metrics to true.
The Nomad configuration inside the test-drive already has the appropriate telemetry settings. View the configuration by running cat /etc/nomad.d/nomad.hcl and note that the following stanza is included.
telemetry {
  publish_allocation_metrics = true
  publish_node_metrics       = true
  prometheus_metrics         = true
}
Given this configuration, Nomad generates node and allocation metrics and makes them available in a format that Prometheus can consume. If you are using this test-drive with your own Nomad cluster, add this telemetry block to the configuration for every Nomad node in your cluster and restart the agents to load the new configuration.
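To confirm telemetry is flowing before moving on, you can query Nomad's metrics endpoint directly in Prometheus format. This quick check is an addition to the walkthrough and assumes the default API port of 4646; the output should contain metric names prefixed with nomad_.

$ curl 'http://127.0.0.1:4646/v1/metrics?format=prometheus'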
Return to the vagrant user's home directory if you changed away from it.
$ cd /home/vagrant
Start Prometheus
The autoscaler configuration in this test-drive uses Prometheus to retrieve historical metrics when starting to track a new target. In this beta, Prometheus is also used for ongoing monitoring metrics, but this is currently being shifted to using Nomad's metrics API. The first step is to run an instance of Prometheus for the Nomad Autoscaler to use. The simplest way to do this is to run Prometheus as a Nomad job. The environment contains a complete Prometheus job file to get started with.
You can create a file called prometheus.nomad with the following content, or you can copy prometheus.nomad from the ~/nomad-autoscaler/jobs folder when logged in to the vagrant user's shell inside the VM.
job "prometheus" { datacenters = ["dc1"] group "prometheus" { count = 1 network { port "prometheus_ui" { static = 9090 } } task "prometheus" { driver = "docker" config { image = "prom/prometheus:v2.18.1" args = [ "--config.file=/etc/prometheus/config/prometheus.yml", "--storage.tsdb.path=/prometheus", "--web.console.libraries=/usr/share/prometheus/console_libraries", "--web.console.templates=/usr/share/prometheus/consoles", ] volumes = [ "local/config:/etc/prometheus/config", ] ports = ["prometheus_ui"] } template { data = <<EOH---global: scrape_interval: 1s evaluation_interval: 1sscrape_configs: - job_name: nomad metrics_path: /v1/metrics params: format: ['prometheus'] static_configs: - targets: ['{{ env "attr.unique.network.ip-address" }}:4646'] - job_name: consul metrics_path: /v1/agent/metrics params: format: ['prometheus'] static_configs: - targets: ['{{ env "attr.unique.network.ip-address" }}:8500']EOH change_mode = "signal" change_signal = "SIGHUP" destination = "local/config/prometheus.yml" } resources { cpu = 100 memory = 256 } service { name = "prometheus" port = "prometheus_ui" check { type = "http" path = "/-/healthy" interval = "10s" timeout = "2s" } } } }}
Run the job in Nomad.
$ nomad job run prometheus.nomad
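Once the Prometheus allocation is running, you can optionally verify that it has discovered its scrape targets. This check is not part of the original steps; it assumes the static UI port of 9090 from the job file above and uses Prometheus's standard targets API.

$ curl -s 'http://127.0.0.1:9090/api/v1/targets'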
Start the autoscaler
The next step is to run the Nomad Autoscaler. For the beta, an enterprise version of the Nomad Autoscaler is provided that includes the DAS plugins. The simplest approach is to run the autoscaler as a Nomad job; however, you can download the Nomad Autoscaler and run it as a standalone process.
This test-drive Vagrant environment comes with Consul. The supplied Nomad job specification uses this Consul to discover the Nomad and Prometheus URLs. Should you want to use this specification in a cluster without Consul, you can supply the URLs yourself and remove the checks.
You can create a file called das-autoscaler.nomad with the following content, or you can copy das-autoscaler.nomad from the ~/nomad-autoscaler/jobs folder when logged in to the vagrant user's shell inside the VM.
job "das-autoscaler" { datacenters = ["dc1"] group "autoscaler" { count = 1 task "autoscaler" { driver = "docker" config { image = "hashicorp/nomad-autoscaler-enterprise:0.2.0-beta2" command = "bin/nomad-autoscaler" args = [ "agent", "-config", "${NOMAD_TASK_DIR}/autoscaler.hcl", "-http-bind-address", "0.0.0.0", ] ports = ["http"] } template { destination = "${NOMAD_TASK_DIR}/autoscaler.hcl" data = <<EOH// Set the log level so we can see some more interesting output at the expense// of chattiness.log_level = "debug"// Set the address of the Nomad agent. This can be omitted and in this example// is set to the default for clarity.nomad { // Use Consul service discovery for the Nomad client IP and Port. address = "{{ with service "nomad-client" }}{{ with index . 0 }}http://{{.Address}}:{{.Port}}{{ end }}{{ end }}" // Use the splat operator so the autoscaler monitors scaling policies from // all Nomad namespaces. If you wish to have it only monitor a single // namespace, update this param to match the desired name. namespace = "*" // If Nomad ACLs are in use, the following line should be uncommented and // updated to include an ACL token. // token = ""}// Setup the Prometheus APM so that the autoscaler can pull historical and// point-in-time metrics regarding task resource usage.apm "prometheus" { driver = "prometheus" config = { // Use Consul service discovery for the Prometheus IP and Port. address = "{{ with service "prometheus" }}{{ with index . 0 }}http://{{.Address}}:{{.Port}}{{ end }}{{ end }}" }}policy_eval { // Lower the evaluate interval so we can reproduce recommendations after only // 5 minutes, rather than having to wait for 24hrs as is the default. evaluate_after = "5m" // Disable the horizontal application and horizontal cluster workers. This // helps reduce log noise during the demo. workers = { cluster = 0 horizontal = 0 }}EOH } resources { cpu = 1024 memory = 512 } } network { port "http" { to = 8080 } } service { name = "nomad-autoscaler" port = "http" check { type = "http" path = "/v1/health" interval = "5s" timeout = "2s" } } }}
Run the job in Nomad.
$ nomad job run das-autoscaler.nomad
Upon starting, the autoscaler loads the DAS-specific plugins and launches workers to evaluate vertical policies. You can view the logs in the Nomad UI or with the nomad alloc logs ... command:
[INFO] agent.plugin_manager: successfully launched and dispensed plugin: plugin_name=app-sizing-percentile
[INFO] agent.plugin_manager: successfully launched and dispensed plugin: plugin_name=nomad-target
[INFO] agent.plugin_manager: successfully launched and dispensed plugin: plugin_name=app-sizing-nomad
[INFO] agent.plugin_manager: successfully launched and dispensed plugin: plugin_name=prometheus
[INFO] agent.plugin_manager: successfully launched and dispensed plugin: plugin_name=app-sizing-avg
[INFO] agent.plugin_manager: successfully launched and dispensed plugin: plugin_name=app-sizing-max
[INFO] policy_eval.worker: starting worker: id=f6d205b3-9e48-ba9d-a230-9d3e8f2bdf81 queue=vertical_cpu
[INFO] policy_eval.worker: starting worker: id=750bcea7-47af-94b3-820c-1770c757ed07 queue=vertical_mem
If there are already jobs configured with vertical policies, the autoscaler begins dispatching policy evaluations from the broker to the workers; otherwise, this occurs when vertical policies are added to a job specification:
[DEBUG] policy_eval.broker: dequeue eval: queue=vertical_mem
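If you prefer to follow the autoscaler's log stream from the CLI, you can tail its allocation logs. The allocation ID below is a placeholder; find yours with nomad job status das-autoscaler. The -stderr flag is included because agent logs typically go to stderr; drop it if that stream is empty.

$ nomad alloc logs -f -stderr <autoscaler-alloc-id>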
Note
The autoscaler does not immediately register recommendations.
The evaluate_after field in the autoscaler configuration indicates the amount of historical metrics that must be available before a recommendation is made for a task. This prevents recommendations based on insufficient historical information; without representative data, appropriate recommendations cannot be made, which could result in under-provisioning a task. For the purpose of evaluating the feature, this interval can be reduced. For more production-like environments, it should be long enough to capture a representative sample of metrics. The default interval is 24 hours.
Deploy the sample job
Create a job named example.nomad.hcl with the following content.
job "example" { datacenters = ["dc1"] group "cache-lb" { count = 1 network { port "lb" { to = 6379 } } service { name = "redis-lb" port = "lb" address_mode = "host" check { type = "tcp" port = "lb" interval = "10s" timeout = "2s" } } task "nginx" { driver = "docker" config { image = "nginx" ports = ["lb"] volumes = [ # It's safe to mount this path as a file because it won't re-render. "local/nginx.conf:/etc/nginx/nginx.conf", # This path hosts files that will re-render with Consul Template. "local/nginx:/etc/nginx/conf.d" ] } # This template overwrites the embedded nginx.conf file so it loads # conf.d/*.conf files outside of the `http` block. template { data = <<EOFuser nginx;worker_processes 1;error_log /var/log/nginx/error.log warn;pid /var/run/nginx.pid;events { worker_connections 1024;}include /etc/nginx/conf.d/*.conf;EOF destination = "local/nginx.conf" } # This template creates a TCP proxy to Redis. template { data = <<EOFstream { server { listen 6379; proxy_pass backend; } upstream backend { {{ range service "redis" }} server {{ .Address }}:{{ .Port }}; {{ else }}server 127.0.0.1:65535; # force a 502 {{ end }} }}EOF destination = "local/nginx/nginx.conf" change_mode = "signal" change_signal = "SIGHUP" } resources { cpu = 50 memory = 10 } } } group "cache" { count = 3 network { port "db" { to = 6379 } } service { name = "redis" port = "db" address_mode = "host" check { type = "tcp" port = "db" interval = "10s" timeout = "2s" } } task "redis" { driver = "docker" config { image = "redis:3.2" ports = ["db"] } resources { cpu = 500 memory = 256 } } }}
Add DAS to the sample job
To enable sizing recommendations for a Nomad job task, add a scaling stanza for CPU and another for memory. When placed within a job specification's task stanza, these stanzas configure the task for both CPU and memory recommendations.
To enable application-sizing for multiple tasks with DAS, you need to add this scaling block to every new or additional task in the job spec. Add the following scaling policies inside both tasks: the nginx task in the cache-lb group and the redis task in the cache group. You can verify your changes against the completed example.nomad.hcl file in the ~/nomad-autoscaler/jobs directory.
scaling "cpu" { policy { cooldown = "1m" evaluation_interval = "1m" check "95pct" { strategy "app-sizing-percentile" { percentile = "95" } } } } scaling "mem" { policy { cooldown = "1m" evaluation_interval = "1m" check "max" { strategy "app-sizing-max" {} } } }
Note
These scaling policies are extremely aggressive and provide "flappy" recommendations, making them unsuitable for production. They are set with low cooldown and evaluation_interval values in order to quickly generate recommendations for this test drive. Consult the Dynamic Application Sizing Concepts tutorial for how to determine suggested production values.
Reregister the example.nomad.hcl file by running the nomad job run example.nomad.hcl command.
$ nomad job run example.nomad.hcl
Once the job has been registered with its updated specification, the Nomad Autoscaler automatically detects the new scaling policies and starts the required internal processes.
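If you want to confirm that the vertical policies were registered, you can list them from the CLI. This check is an addition to the walkthrough; the output should include entries targeting the example job.

$ nomad scaling policy list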
Further details on the individual parameters and available strategies can be found in the Nomad documentation, including information on how you can further customize the application-sizing block to your needs (percentile, cooldown periods, sizing strategies).
Review DAS recommendations
Once the autoscaler has generated recommendations, you can review them in the Nomad UI or using the Nomad API and accept or dismiss the recommendations.
Select the Optimize option in the Workload section of the sidebar. When there are DAS recommendations, they appear here.
Clicking Accept applies the recommendation, updating the job with resized tasks. Dismissing the recommendation causes it to disappear. However, the autoscaler continues to monitor and eventually makes additional recommendations for the job until the vertical scaling policy is removed from the job specification.
Click the Accept button to accept the suggestion.
You also receive a suggestion for the cache-lb task. Click the Accept button to accept this suggestion as well.
Use curl to access the List Recommendations API.
$ curl 'http://127.0.0.1:4646/v1/recommendations?pretty'
You should receive recommendations for both the cache and the cache-lb tasks.
[ { "ID": "1308e937-63b1-fa43-67e9-3187c954e417", "Region": "global", "Namespace": "default", "JobID": "example", "JobVersion": 0, "Group": "cache-lb", "Task": "nginx", "Resource": "CPU", "Value": 57, "Current": 50, "Meta": { "window_size": 300000000000.0, "nomad_policy_id": "dd393d4b-99d7-7b72-132c-7e70f1b6b2dc", "num_evaluated_windows": 11.0 }, "Stats": { "max": 20.258468627929688, "mean": 0.21294420193006963, "min": 0.0, "p99": 20.258468627929688 }, "EnforceVersion": false, "SubmitTime": 1604353860521108002, "CreateIndex": 350, "ModifyIndex": 350 }, { "ID": "b9331de3-299f-cd74-bf6d-77aa36a3e147", "Region": "global", "Namespace": "default", "JobID": "example", "JobVersion": 0, "Group": "cache", "Task": "redis", "Resource": "CPU", "Value": 57, "Current": 500, "Meta": { "window_size": 300000000000.0, "nomad_policy_id": "1b63f7bd-c995-d61e-cf4f-b49a8d777b65", "num_evaluated_windows": 12.0 }, "Stats": { "p99": 32.138671875, "max": 32.138671875, "mean": 2.5897381649120943, "min": 0.06250959634780884 }, "EnforceVersion": false, "SubmitTime": 1604353860521659719, "CreateIndex": 352, "ModifyIndex": 352 }, { "ID": "f91454d6-8df8-ce64-696b-b21c758cfb3b", "Region": "global", "Namespace": "default", "JobID": "example", "JobVersion": 0, "Group": "cache", "Task": "redis", "Resource": "MemoryMB", "Value": 10, "Current": 256, "Meta": { "nomad_policy_id": "9153e45b-618c-a7e4-6aa3-c720fd20184f", "num_evaluated_windows": 12.0, "window_size": 300000000000.0, "nomad_autoscaler.count.capped": true, "nomad_autoscaler.count.original": 2.0, "nomad_autoscaler.reason_history": [] }, "Stats": { "max": 2.01171875, "mean": 1.9451913759689923, "min": 1.9375, "p99": 1.984375 }, "EnforceVersion": false, "SubmitTime": 1604353860521511567, "CreateIndex": 351, "ModifyIndex": 351 }]
You can accept them by using the Apply and Dismiss Recommendations API endpoint. Replace the IDs in the command with the recommendation IDs you received when you queried the List Recommendations API.
$ curl 'http://127.0.0.1:4646/v1/recommendations/apply?pretty' \
    --request POST \
    --data '{"Apply":["1308e937-63b1-fa43-67e9-3187c954e417", "b9331de3-299f-cd74-bf6d-77aa36a3e147"]}'
{
  "Errors": [],
  "LastIndex": 0,
  "RequestTime": 0,
  "UpdatedJobs": [
    {
      "EvalCreateIndex": 403,
      "EvalID": "5a1c5f5e-6a82-17a9-ca2d-b053e4f418f2",
      "JobID": "example",
      "JobModifyIndex": 403,
      "Namespace": "default",
      "Recommendations": [
        "1308e937-63b1-fa43-67e9-3187c954e417",
        "b9331de3-299f-cd74-bf6d-77aa36a3e147"
      ],
      "Warnings": ""
    }
  ]
}
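The same endpoint also accepts a Dismiss list if you would rather discard a recommendation than apply it. The following sketch uses the remaining memory recommendation ID from the earlier list output; substitute an ID from your own cluster.

$ curl 'http://127.0.0.1:4646/v1/recommendations/apply?pretty' \
    --request POST \
    --data '{"Dismiss":["f91454d6-8df8-ce64-696b-b21c758cfb3b"]}'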
Verify recommendation is applied
Watch for the deployment to complete and then verify that the job is now using the recommended values instead of the ones initially supplied. You can do this in the Nomad UI or by using the nomad alloc status command for a cache and a cache-lb allocation listed in the nomad job status example output.
Navigate to the example job's detail screen in the Nomad UI.
Note that the Task Groups section shows the updated values for Reserved CPU and Reserved Memory given by the autoscaler.
List out the allocations for the example job by running nomad job status example.
$ nomad job status example
ID            = example
Name          = example
Submit Date   = 2020-11-02T16:28:52Z
Type          = service
Priority      = 50
Datacenters   = dc1
Namespace     = default
Status        = running
Periodic      = false
Parameterized = false

Summary
Task Group  Queued  Starting  Running  Failed  Complete  Lost
cache       0       0         3        0       3         0
cache-lb    0       0         1        0       1         0

Latest Deployment
ID          = c3ee5e5d
Status      = successful
Description = Deployment completed successfully

Deployed
Task Group  Desired  Placed  Healthy  Unhealthy  Progress Deadline
cache       3        3       3        0          2020-11-02T16:39:30Z
cache-lb    1        1       1        0          2020-11-02T16:39:06Z

Allocations
ID        Node ID   Task Group  Version  Desired  Status    Created    Modified
5a35ffec  c442fcaa  cache       2        run      running   4m49s ago  4m30s ago
8ceec492  c442fcaa  cache-lb    2        run      running   5m7s ago   4m35s ago
ceb84c32  c442fcaa  cache       2        run      running   5m7s ago   4m50s ago
2de4ff81  c442fcaa  cache       2        run      running   5m16s ago  4m57s ago
2ffa9be6  c442fcaa  cache-lb    1        stop     complete  37m9s ago  5m7s ago
528156b1  c442fcaa  cache       0        stop     complete  37m9s ago  5m15s ago
04645e48  c442fcaa  cache       0        stop     complete  37m9s ago  5m7s ago
2d9fc1f2  c442fcaa  cache       0        stop     complete  37m9s ago  4m48s ago
From the job status output, a cache allocation has allocation ID 5a35ffec. Run the nomad alloc status 5a35ffec command to get the Task Resources information about this allocation.
$ nomad alloc status 5a35ffec
ID                  = 5a35ffec-2af3-d36f-6dd8-d8453407d6a5
Eval ID             = 564f00df
Name                = example.cache[2]
Node ID             = c442fcaa
Node Name           = ubuntu-focal
Job ID              = example
Job Version         = 2
Client Status       = running
Client Description  = Tasks are running
Desired Status      = run
Desired Description = <none>
Created             = 6m55s ago
Modified            = 6m36s ago
Deployment ID       = c3ee5e5d
Deployment Health   = healthy

Allocation Addresses
Label  Dynamic  Address
*db    yes      10.0.2.15:25465 -> 6379

Task "redis" is "running"
Task Resources
CPU       Memory          Disk     Addresses
3/57 MHz  992 KiB/10 MiB  300 MiB

Task Events:
Started At     = 2020-11-02T16:29:11Z
Finished At    = N/A
Total Restarts = 0
Last Restart   = N/A

Recent Events:
Time                  Type        Description
2020-11-02T16:29:11Z  Started     Task started by client
2020-11-02T16:29:11Z  Task Setup  Building Task Directory
2020-11-02T16:29:11Z  Received    Task received by client
Note that the Task Resources section shows the updated values for memory and CPU given by the autoscaler.
From the earlier job status output, a cache-lb allocation has allocation ID 8ceec492. Run the nomad alloc status 8ceec492 command to get the Task Resources information about this allocation.
$ nomad alloc status 8ceec492
ID                  = 8ceec492-9549-e563-40d9-bf76a47940f2
Eval ID             = f0c24365
Name                = example.cache-lb[0]
Node ID             = c442fcaa
Node Name           = ubuntu-focal
Job ID              = example
Job Version         = 2
Client Status       = running
Client Description  = Tasks are running
Desired Status      = run
Desired Description = <none>
Created             = 7m44s ago
Modified            = 7m12s ago
Deployment ID       = c3ee5e5d
Deployment Health   = healthy

Allocation Addresses
Label  Dynamic  Address
*lb    yes      10.0.2.15:29363 -> 6379

Task "nginx" is "running"
Task Resources
CPU       Memory          Disk     Addresses
0/57 MHz  1.5 MiB/10 MiB  300 MiB

Task Events:
Started At     = 2020-11-02T16:28:54Z
Finished At    = N/A
Total Restarts = 0
Last Restart   = N/A

Recent Events:
Time                  Type        Description
2020-11-02T16:29:24Z  Signaling   Template re-rendered
2020-11-02T16:29:16Z  Signaling   Template re-rendered
2020-11-02T16:29:13Z  Signaling   Template re-rendered
2020-11-02T16:29:04Z  Signaling   Template re-rendered
2020-11-02T16:28:57Z  Signaling   Template re-rendered
2020-11-02T16:28:55Z  Signaling   Template re-rendered
2020-11-02T16:28:54Z  Started     Task started by client
2020-11-02T16:28:53Z  Driver      Downloading image
2020-11-02T16:28:53Z  Task Setup  Building Task Directory
2020-11-02T16:28:52Z  Received    Task received by client
Here, also, the Task Resources section shows the updated values for memory and CPU given by the autoscaler.
Generate load to create new recommendations
Create a parameterized dispatch job to generate load in your cluster. Create a file named das-load-test.nomad with the following content. You can also copy this file from the ~/nomad-autoscaler/jobs folder in the Vagrant instance.
job "das-load-test" { datacenters = ["dc1"] type = "batch" parameterized { payload = "optional" meta_optional = ["requests", "clients"] } group "redis-benchmark" { task "redis-benchmark" { driver = "docker" config { image = "redis:3.2" command = "redis-benchmark" args = [ "-h","${HOST}", "-p","${PORT}", "-n","${REQUESTS}", "-c","${CLIENTS}", ] } template { destination = "secrets/env.txt" env = true data = <<EOF{{ with service "redis-lb" }}{{ with index . 0 -}}HOST={{.Address}}PORT={{.Port}}{{- end }}{{ end }}REQUESTS={{ or (env "NOMAD_META_requests") "100000" }}CLIENTS={{ or (env "NOMAD_META_clients") "50" }}EOF } resources { cpu = 100 memory = 128 } } }}
Register the dispatch job with the nomad job run das-load-test.nomad command.
$ nomad job run das-load-test.nomad
Job registration successful
Now, dispatch instances of the load-generation task by running the following:
$ nomad job dispatch das-load-test
Dispatched Job ID = das-load-test/dispatch-1604336299-70a3923e
Evaluation ID     = 1793fe23

==> Monitoring evaluation "1793fe23"
    Evaluation triggered by job "das-load-test/dispatch-1604336299-70a3923e"
    Allocation "589a1825" created: node "c442fcaa", group "redis-benchmark"
    Evaluation status changed: "pending" -> "complete"
==> Evaluation "1793fe23" finished with status "complete"
Each run of this job creates 100,000 requests against your Redis cluster using 50 Redis clients.
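Because the job declares requests and clients as optional metadata, you can override the defaults at dispatch time with the -meta flag. The values below are only illustrative; larger numbers generate a longer, heavier burst of load.

$ nomad job dispatch -meta requests=500000 -meta clients=100 das-load-test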
Once you have run the job, watch the Optimize view for new suggestions based on the latest activity.
Exit and clean up
Exit the shell session on the Vagrant VM by typing exit. Run the vagrant destroy command to stop and remove the virtual machine. Delete the Vagrantfile once you no longer want to use the test-drive environment.
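For example (vagrant destroy prompts for confirmation; add -f to skip the prompt):

$ exit
$ vagrant destroy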
Learn more
If you have not already, review the Dynamic Application Sizing Concepts tutorial for more information about the individual parameters and available strategies.
You can also find more information in the Nomad Autoscaler Scaling Policies documentation, including how you can further customize the application-sizing block to your needs (percentile, cooldown periods, and sizing strategies).