VM Anti-Affinity

For certain deployment jobs, you might want to distribute the instances across multiple physical resources of the IaaS. Even though an IaaS abstracts away the underlying hardware, most IaaSes provide APIs for configuring VM affinity and anti-affinity rules.

One popular example of a deployment job that needs this type of configuration is a Hadoop Datanode. If multiple Datanode instances are placed on the same physical machine, the replicated data becomes unavailable when that machine fails. To make replication useful in this scenario, BOSH lets you configure VM anti-affinity rules for an IaaS through the cloud_properties sub-block of the resource pool (or VM extension) used by a deployment job in your deployment manifest.

Currently only vSphere and OpenStack CPIs provide a way to do so.
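
Regardless of the CPI, the anti-affinity settings always go under cloud_properties; only the IaaS-specific keys differ. The minimal sketch below (the names my-pool and my-anti-affinity are illustrative) only shows where the settings live; the concrete keys for vSphere and OpenStack are covered in the sections that follow.
# Illustrative placement only; IaaS-specific keys are shown in the sections below

resource_pools:
- name: my-pool
  cloud_properties: {} # vSphere or OpenStack anti-affinity settings go here

vm_extensions:
- name: my-anti-affinity
  cloud_properties: {} # the same settings can also be attached through a VM extension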


vSphere Configuration

The vSphere VM-VM Affinity Rules feature allows you to specify whether VMs should run on the same host or be kept on separate hosts. As of BOSH version 101 (stemcell 2693), you can configure the vSphere CPI to include all VMs of a specified BOSH resource pool within a single DRS rule and separate the VMs among multiple hosts.

The following VM extension and instance group configuration manifest example instructs BOSH to:

  • Create two MySQL VMs in the cl vSphere cluster.
  • Create a keep-mysql-on-different-hosts DRS rule in the cl vSphere cluster.
  • Configure the DRS rule with a type that separates the associated VMs onto different hosts (separate_vms).
  • Associate the VMs with the DRS rule.
# Assuming that a MySQL release is used...

vm_extensions:
- name: anti-affinity-mysql
  cloud_properties:
    datacenters:
    - name: dc
      clusters:
      - cl:
          drs_rules:
          - name: keep-mysql-on-different-hosts
            type: separate_vms

instance_groups:
- name: mysql
  instances: 2
  vm_extensions:
  - anti-affinity-mysql

If the vSphere CPI does not place the VMs on different hosts, check that you have done the following:

  • Associated multiple healthy hosts with the vSphere cluster.
  • Enabled DRS for the vSphere cluster. You can modify the DRS automation level in the cluster settings.
  • Enabled a DRS rule and associated it with the appropriate VMs.
  • Given DRS enough time to move the VMs to different hosts.

Notes:

  • The vSphere CPI currently only supports one DRS rule per BOSH VM Extension.
  • If a BOSH VM Extension is applied to only one VM, the vSphere CPI does not create a DRS rule. After BOSH adds a second VM, the vSphere CPI will create and apply a DRS rule to all VMs in the BOSH VM Extension.
  • You can also use YAML anchors in the configuration, for example:
# Assuming there are 2 clusters which need same DRS rule...

vm_extensions:
- name: hadoop-datanodes
  cloud_properties:
    datacenters:
    - name: my-dc
      clusters:
      - my-vsphere-cluster1:
          drs_rules: &default_drs_rule
          - name: separate-hadoop-datanodes-rule
            type: separate_vms
      - my-vsphere-cluster2:
          drs_rules: *default_drs_rule

OpenStack Configuration

OpenStack's Filter Scheduler allows you to customize the compute node selection algorithm that determines where new VMs are placed. To enforce anti-affinity among VMs, the ServerGroupAntiAffinityFilter is available:

ServerGroupAntiAffinityFilter - This filter implements anti-affinity for a server group. First you must create a server group with a policy of 'anti-affinity' via the server groups API. Then, when you boot a new server, provide a scheduler hint of 'group=<uuid>', where <uuid> is the UUID of the server group you created. This will result in the server getting added to the group. When the server gets scheduled, anti-affinity will be enforced among all servers in that group.
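
The server group itself is created outside of BOSH, before the deployment. As a sketch, assuming the python-openstackclient CLI is configured for your project (the group name hadoop-datanodes is illustrative), you could create the group and look up its UUID like this:
# Create an anti-affinity server group and find its UUID (illustrative)
openstack server group create --policy anti-affinity hadoop-datanodes
openstack server group list

The UUID printed by these commands is what goes into the scheduler_hints group property in the example below.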

The following resource pool and job configuration manifest example:

  • Assumes that a server group with an 'anti-affinity' policy has already been created and that its UUID is af09abf2-2283-47d6-f2bd-2932a9ae949c.
  • Instructs BOSH to create seven hadoop-datanode VMs and add them to that server group.
# Assuming that a Hadoop release is used...

resource_pools:
- name: hadoop-datanodes
  cloud_properties:
    instance_type: m3.xlarge
    scheduler_hints:
      group: af09abf2-2283-47d6-f2bd-2932a9ae949c

jobs:
- name: hadoop-datanode
  templates:
  - {name: hadoop-datanode, release: hadoop}
  instances: 7
  resource_pool: hadoop-datanodes
  persistent_disk: 10_240
  networks:
  - name: default