To launch a cluster with the monitoring platform installed, set up a GCE cluster using its instructions, and enable the monitoring platform before bringing up your cluster by adding the following to gcp-dev/myvars.yaml:
openshift_monitoring_deploy: true
# openshift_cluster_monitoring_operator_image: quay.io/dmace/cluster-monitoring-operator:7c7e214
# openshift_cluster_monitoring_operator_alertmanager_config: |+
#   global:
#     resolve_timeout: 5m
#   route:
#     group_wait: 30s
#     group_interval: 5m
#     repeat_interval: 12h
#     receiver: default
#     routes:
#     - match:
#         alertname: DeadMansSwitch
#       repeat_interval: 5m
#       receiver: deadmansswitch
#   receivers:
#   - name: default
#   - name: deadmansswitch
# openshift_monitor_availability_install: true
# openshift_monitor_app_create_image: quay.io/dmace/openshift-monitor-project-lifecycle:0d619bf
# openshift_monitor_app_create_run_interval: 30s
# openshift_monitor_app_create_log_level: 2
To log into the Prometheus or Alertmanager consoles, you need a user with some broad permissions:
oc adm policy add-cluster-role-to-user cluster-reader dmace
The consoles are here (WHAT is the prefix you used in the GCP make up command):
https://prometheus-k8s-openshift-monitoring.apps.$WHAT.origin-gce.dev.openshift.com
https://alertmanager-main-openshift-monitoring.apps.$WHAT.origin-gce.dev.openshift.com
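As a small illustration, both console URLs can be derived from the prefix in a shell session. This is only a sketch: `dmace` is the example prefix used elsewhere in these notes, and the `DOMAIN`, `PROMETHEUS_URL`, and `ALERTMANAGER_URL` variable names are made up for the example.

```shell
# WHAT is the prefix passed to the GCP make target (example default; use your own).
WHAT="${WHAT:-dmace}"
DOMAIN="apps.${WHAT}.origin-gce.dev.openshift.com"

# Route hostnames follow the pattern shown in the two URLs above.
PROMETHEUS_URL="https://prometheus-k8s-openshift-monitoring.${DOMAIN}"
ALERTMANAGER_URL="https://alertmanager-main-openshift-monitoring.${DOMAIN}"

echo "$PROMETHEUS_URL"
echo "$ALERTMANAGER_URL"
```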
Make changes in openshift-ansible, and create a new image:
docker build -f images/installer/Dockerfile -t ironcladlou/openshift-ansible:latest .
Update any variables you want in gcp-dev/vars.yaml, then run the ad-hoc playbooks:
$ make WHAT=dmace OPENSHIFT_ANSIBLE_IMAGE=ironcladlou/openshift-ansible:latest sh
# Now, inside the container...
$ ansible-playbook playbooks/openshift-monitoring/install-gcp.yml
The Cluster Monitoring Operator runs as a Kubernetes deployment:
oc get -n openshift-monitoring deployments/cluster-monitoring-operator
To deploy new images quickly, just patch the deployment.
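One way to do that patch, sketched under a few assumptions: the helper name `set_operator_image` and the `OC` override variable are hypothetical, and the `*=` form of `oc set image` is used so that no container name has to be assumed. Check the container names in your deployment before relying on it.

```shell
# Point the operator deployment at a different image.
# $1: full image reference, e.g. one you built and pushed yourself.
# OC can be overridden (for instance OC="echo oc") to preview the command.
set_operator_image() {
  ${OC:-oc} set image -n openshift-monitoring \
    deployments/cluster-monitoring-operator "*=$1"
}

# Example call (the image reference is a placeholder):
#   set_operator_image quay.io/example/cluster-monitoring-operator:dev
```

Setting `OC="echo oc"` before calling the function prints the command instead of running it, which is handy for a quick review.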
If the ServiceMonitor is new and not included in the image, just apply it directly:
oc apply -n openshift-monitoring -f /some/servicemonitor.yaml
If modifying one included in the image, direct edits may be overwritten by the Cluster Monitoring Operator, so deploy a new image to make changes persistent.
Deploy a new Cluster Monitoring Operator image.
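For the first case above (a new ServiceMonitor that is not bundled in the image), a manifest might look roughly like the one written out below. This is a minimal sketch: the name `example-app`, the `app: example-app` label, and the `metrics` port are hypothetical placeholders, and whether Prometheus picks the object up depends on how the operator's ServiceMonitor selection is configured in your cluster.

```shell
# Write a minimal ServiceMonitor manifest to a temporary file.
# The name, label, and port name are placeholders, not values from this project.
MANIFEST="$(mktemp)"
cat > "$MANIFEST" <<'EOF'
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: example-app
spec:
  selector:
    matchLabels:
      app: example-app
  endpoints:
  - port: metrics
    interval: 30s
EOF

# Print the apply command for review rather than running it here.
echo "oc apply -n openshift-monitoring -f $MANIFEST"
```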
Do I understand correctly that I need a GCE account? I don't think I have one. Could I use AWS instead?