Skip to content

Latest commit

 

History

History
67 lines (46 loc) · 2.43 KB

README.adoc

File metadata and controls

67 lines (46 loc) · 2.43 KB

OCP production best practices

1. Cluster configuration

  • ❏ Deploy logging component

  • ❏ Creating infrastructure machine sets

  • ❏ Creating infrastructure Nodes

  • ❏ Disable workload from running on infra nodes by applying taints on infra nodes.

  • ❏ Move Infra components to Infra nodes

    • ❏ The default router

    • ❏ The container image registry

    • ❏ The cluster metrics collection (elasticsearch, kibana, curator, fluentd)

    • ❏ Cluster aggregated logging

  • ❏ Deploy Cluster Autoscaler

  • ❏ Deploy Machine Autoscaler

Resource requests and overcommitment https://docs.openshift.com/container-platform/4.6/post_installation_configuration/node-tasks.html#nodes-cluster-resource-override_post-install-node-tasks * [ ] Enable network policies to restrict communication between namespaces

2. Etcd configuration

  • ❏ Enabling etcd encryption

  • ❏ Etcd disk performance (etcd_disk_wal_fsync_duration_seconds_bucket metric, 99th percentile should complete in under 10ms for storage to be considered fast enough) TOBE ADDED to Detail: histogram_quantile(0.99, irate(etcd_disk_wal_fsync_duration_seconds_bucket[5m]))

  • ❏ Defragmenting etcd every month

3. Node configuration

  • ❏ Deploy machine health checks

  • ❏ Creating a KubeletConfig CRD to configure the Pods per Node (podsPerCore and maxPods)

tip: when workers during initial install not activated, check the 'oc get csr' output on Pending.