I needed a simple check that would tell me if a drive suddenly failed/disappeared from the server.
This is accomplished by counting the number of mounted partitions. You can then set up a monitor that triggers if the count decreases.
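The counting idea can be sketched in plain Python. This is a minimal illustration, not the contents of `disk_partition.py`: the function name `count_mounted_partitions` is hypothetical, and it assumes a Linux-style mount table (the format of `/proc/mounts`) in which real partitions appear as `/dev/...` devices.

```python
def count_mounted_partitions(mounts_text):
    """Count distinct /dev/ devices in /proc/mounts-style text.

    Assumption: each line looks like
    "device mountpoint fstype options dump pass",
    and only devices under /dev/ are treated as real partitions.
    """
    devices = set()
    for line in mounts_text.splitlines():
        fields = line.split()
        # Skip blank or malformed lines and pseudo-filesystems (proc, tmpfs, ...).
        if len(fields) >= 2 and fields[0].startswith("/dev/"):
            # A set keeps bind mounts of the same device from being counted twice.
            devices.add(fields[0])
    return len(devices)


def read_mount_table(path="/proc/mounts"):
    """Return the current mount table as text (Linux only)."""
    with open(path) as handle:
        return handle.read()
```

Calling `count_mounted_partitions(read_mount_table())` on a Linux host gives the current count. If a drive drops out, the number goes down, which is the condition the monitor watches for.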
- put `disk_partition.py` in your datadog's `checks.d` directory
- put `disk_partition.yaml` in your datadog's `conf.d` directory

Restart the datadog agent.

- https://stackoverflow.com/a/25777174/2785592
- https://docs.datadoghq.com/guides/agent_checks/