Skip to main content

CF app network loss

Last updated on

CF app network loss is a Cloud Foundry chaos fault that drops packetLossPercentage percent of egress packets from one or more app instances of app in organization/space. You can restrict the loss to specific destination hosts, IP ranges, source ports, or destination ports. The fault lasts for duration seconds.

Use this fault to test how the app and its callers handle a flaky or congested network: TCP retransmissions, application-level retries, timeouts, circuit-breaker decisions, and downstream error propagation.

Run your first experiment

If you have not configured the chaos infrastructure yet, go to Quickstart to install the Linux chaos infrastructure and run an experiment end to end.


Use cases

  • Retry resilience: Confirm the app retries lost requests within budget instead of giving up immediately or piling on.
  • Timeout tuning: Validate timeouts are set high enough to absorb the added retransmissions.
  • Circuit-breaker thresholds: Confirm the breaker opens at the configured loss/error rate.
  • Targeted partition: Drop traffic only to a specific dependency to test single-dependency failure modes.

Before you begin

  • Chaos infrastructure: A Linux chaos infrastructure (LCI) installed in one of the supported deployment models.
  • CF and BOSH credentials: The LCI host has CF_*, UAA_SERVER_ENDPOINT, and BOSH_* credentials configured.
  • Target identifiers: You know the organization, space, app, and the boshDeployment.
  • Network interface: You know the interface name inside the container (typically eth0).
  • Destinations: You know the host(s), IP range(s), or port(s) you want to disrupt. Leave blank to affect all egress.

Supported environments

PlatformSupport status
Cloud Foundry (TAS, PCF, open-source) running on BOSH-managed Diego cellsSupported

Permissions required

ActionRequirement
List apps the CF user can accessSpaceDeveloper, SpaceAuditor, OrgManager, or OrgAuditor; scopes cloud_controller.read or cloud_controller.admin
List BOSH deploymentsBOSH user with bosh.read scope
SSH to a Diego cell via BOSHBOSH UAA token with bosh.ssh or bosh.admin scope
Apply traffic-shaping rules inside the target container's namespaceOperator with sudo or root on the cell host

Authentication

LayerWhere to provideTunables
Cloud Foundry API + BOSH director/etc/linux-chaos-infrastructure/cf.env on the LCI hostCF_API_ENDPOINT, CF_USERNAME, CF_PASSWORD, UAA_SERVER_ENDPOINT, BOSH_CLIENT, BOSH_CLIENT_SECRET, BOSH_CA_CERT, BOSH_ENVIRONMENT
vSphere (only when faultInjectorLocation: vSphere)/etc/linux-chaos-infrastructure/vsphere.envGOVC_URL, GOVC_USERNAME, GOVC_PASSWORD, GOVC_INSECURE, VM_NAME, VM_USERNAME, VM_PASSWORD

Fault tunables

Required parameters

TunableDescriptionDefault
deploymentModelLCI placement model. One of model-1 or model-2.(required)
organizationCF organization that owns the app.(required)
spaceCF space within the organization.(required)
appApp whose egress traffic loses packets.(required)

Chaos parameters

TunableDescriptionDefault
packetLossPercentagePercentage of egress packets to drop (0-100).100
networkInterfaceNetwork interface inside the container.eth0
destinationHostsComma-separated list of hostnames to affect. Leave empty to affect all egress.""
destinationIPsComma-separated list of destination IPs or CIDRs to affect.""
sourcePortsComma-separated list of source ports to affect.""
destinationPortsComma-separated list of destination ports to affect.""
instanceAffectedPercentagePercentage of instances to target. 0 targets exactly one.0
boshDeploymentBOSH deployment name. Required for deploymentModel: model-2.""
faultInjectorLocationlocal or vSphere. Required for deploymentModel: model-2.local
faultInjectorPortLocal port used by the fault-injector.50320
durationTotal chaos duration.30s
skipSSLValidationSkip SSL validation when calling CF APIs.false
rampTimeWait period in seconds before and after the fault.0

Tunables that apply to every fault are documented in common tunables for all faults.


Fault execution in brief

Authenticates to Cloud Foundry and BOSH, locates the target app instance(s), and installs a traffic-shaping rule on networkInterface inside the container's network namespace that drops packetLossPercentage% of packets matching the configured destinations/ports. The rule is removed when duration elapses.


Expected behavior during fault execution

  • TCP connections to affected destinations experience retransmissions; round-trip time may rise.
  • Application-level retries fire if configured; otherwise requests fail with timeout or connection-reset errors.
  • After the fault ends, packet loss returns to baseline.

Signals to watch

  • Caller error rate: Use an HTTP probe and assert error rate stays within SLO.
  • TCP retransmissions: Inspect host-level network metrics on the cell hosting the affected instance.

Recovery and cleanup

  • The traffic-shaping rule is removed at the end of duration.
  • If the experiment is aborted, the fault still attempts to remove the rule before exiting.

Limitations

  • Operates on the egress side only. Ingress packet loss is not simulated.
  • Without a destination filter, ALL egress traffic loses packets, which can break control-plane communication.
  • At packetLossPercentage: 100, the destination is effectively unreachable; use CF app stop or CF app route unmap for cleaner outage simulation.

Troubleshooting

CF app network loss: no observable failures in Harness Chaos Engineering

Confirm packetLossPercentage is set (the default of 100 drops all matching traffic). Confirm networkInterface matches the interface inside the container. Confirm destination filters match actual traffic.

App becomes completely unreachable during the experiment

With no destination filters and packetLossPercentage of 100, all egress (including health checks and platform calls) is dropped. Add destinationHosts or destinationIPs to scope the loss to your dependency.


Common configurations

Drop 50% of traffic to one downstream

apiVersion: litmuchaos.io/v1alpha1
kind: LinuxFault
metadata:
name: cf-app-network-loss
labels:
name: app-network-loss
spec:
cfAppNetworkChaos/inputs:
duration: 60s
deploymentModel: model-2
faultInjectorLocation: vSphere
app: cf-app
organization: dev-org
space: dev-space
boshDeployment: cf
networkInterface: eth0
packetLossPercentage: 50
destinationHosts: payments-api.internal

CF secrets

The following Cloud Foundry secrets reside on the same machine where the chaos infrastructure is executed. These secrets are provided in the /etc/linux-chaos-infrastructure/cf.env file in the following format:

CF_API_ENDPOINT=XXXXXXXXXXXXXXXXXXX
CF_USERNAME=XXXXXXXXXXXXXXXXXXXXXXX
CF_PASSWORD=XXXXXXXXXXXXXXXXXXXXXXX
UAA_SERVER_ENDPOINT=XXXXXXXXXXXXXXX
BOSH_CLIENT=XXXXXXXXXXXXXXXXXXXXXXX
BOSH_CLIENT_SECRET=XXXXXXXXXXXXXXXX
BOSH_CA_CERT=XXXXXXXXXXXXXXXXXXXXXX
BOSH_ENVIRONMENT=XXXXXXXXXXXXXXXXXX
info

If the secrets file is not provided, the secrets are attempted to be derived from environment variables and the config file by the fault-injector.

ENV nameDescriptionExample
CF_API_ENDPOINTAPI endpoint for the CF setuphttps://api.system.cf-setup.com
CF_USERNAMEUsername for the CF userusername
CF_PASSWORDPassword for the CF userpassword
UAA_SERVER_ENDPOINTAPI endpoint for the UAA server for the CF setuphttps://uaa.system.cf-setup.com
BOSH_CLIENTUsed by the bosh CLI, the BOSH clientadmin
BOSH_CLIENT_SECRETUsed by the bosh CLI, the BOSH client secretUBu9Fu3oW35sO6fw12auPH76gsRTy7
BOSH_CA_CERTUsed by the bosh CLI, the file path for BOSH CA certificate/root/root_ca_certificate
BOSH_ENVIRONMENTUsed by the bosh CLI, the BOSH environmentbosh.corp.local

Fault injector ENVs and config file

If /etc/linux-chaos-infrastructure/cf.env file is not provided, fault-injector attempts to derive the secrets from environment variables or a configuration file. Any secret that is re-declared will be overridden in the following order of decreasing precedence:

  1. /etc/linux-chaos-infrastructure/cf.env file
  2. Environment variables
  3. Configuration file

The configuration file should be provided at /etc/linux-chaos-infrastructure/cf-fault-injector.yaml:

cf-api-endpoint: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
username: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
password: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
uaa-server-endpoint: XXXXXXXXXXXXXXXXXXXXXXXXXX
bosh-client: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
bosh-client-secret: XXXXXXXXXXXXXXXXXXXXXXXXXXX
bosh-ca-cert: XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
bosh-environment: XXXXXXXXXXXXXXXXXXXXXXXXXXXXX

A mapping between all the three formats for providing the secrets is as follows:

cf.envENVcf-fault-injector.yaml
CF_API_ENDPOINTCF_API_ENDPOINTcf-api-endpoint
CF_USERNAMEUSERNAMEusername
CF_PASSWORDPASSWORDpassword
UAA_SERVER_ENDPOINTUAA_SERVER_ENDPOINTuaa-server-endpoint
BOSH_CLIENTBOSH_CLIENTbosh-client
BOSH_CLIENT_SECRETBOSH_CLIENT_SECRETbosh-client-secret
BOSH_CA_CERTBOSH_CA_CERTbosh-ca-cert
BOSH_ENVIRONMENTBOSH_ENVIRONMENTbosh-environment

vSphere secrets

These secrets are provided only if vSphere is used as the deployment platform for CF.

The following vSphere secrets reside on the same machine where the chaos infrastructure is executed. These secrets are provided in the /etc/linux-chaos-infrastructure/vsphere.env file in the following format:

GOVC_URL=XXXXXXXXXXXXXXXXXXXXXX
GOVC_USERNAME=XXXXXXXXXXXXXXXXX
GOVC_PASSWORD=XXXXXXXXXXXXXXXXX
GOVC_INSECURE=XXXXXXXXXXXXXXXXX
VM_NAME=XXXXXXXXXXXXXXXXXXXXXXX
VM_USERNAME=XXXXXXXXXXXXXXXXXXX
VM_PASSWORD=XXXXXXXXXXXXXXXXXXX
ENV Name Description Notes
GOVC_URL Endpoint for vSphere For example, 192.168.214.244
GOVC_USERNAME Username for the vSphere user For example, username
GOVC_PASSWORD Password for the vSphere user For example, password
GOVC_INSECURE Skip SSL validation for govc commands For example, true
VM_NAME Name of the vSphere VM where the fault-injector utility is installed For example, cf-vm
VM_USERNAME Username for the VM guest user For example, root
VM_PASSWORD Password for the VM guest user For example, password