-
Define a standard checklist to be followed for Production ready OCP 4
-
K8s
-
OpenShift
-
Etcd
This post is an effort to create a list of opinionated checklist for a production ready OCP 4 cluster.
-
❏ Deploy logging component
-
❏ Creating infrastructure machine sets
-
❏ Creating infrastructure Nodes
-
❏ Disable workload from running on infra nodes by applying taints on infra nodes.
-
❏ Move Infra components to Infra nodes
-
❏ The default router
-
❏ The container image registry
-
❏ The cluster metrics collection (elasticsearch, kibana, curator, fluentd)
-
❏ Cluster aggregated logging
-
-
❏ Deploy Cluster Autoscaler
-
❏ Deploy Machine Autoscaler
Resource requests and overcommitment https://docs.openshift.com/container-platform/4.6/post_installation_configuration/node-tasks.html#nodes-cluster-resource-override_post-install-node-tasks * [ ] Enable network policies to restrict communication between namespaces
-
❏ Enabling etcd encryption
-
❏ Etcd disk performance (etcd_disk_wal_fsync_duration_seconds_bucket metric, 99th percentile should complete in under 10ms for storage to be considered fast enough) TOBE ADDED to Detail: histogram_quantile(0.99, irate(etcd_disk_wal_fsync_duration_seconds_bucket[5m]))
-
❏ Defragmenting etcd every month