Hardware-observer is a subordinate machine charm that provides monitoring and alerting of hardware resources on bare-metal infrastructure. This charm leverages the following exporters to provide detailed metrics:
- Prometheus Hardware Exporter: For collecting metrics from BMCs and RAID controllers.
- Smartctl Exporter: For collecting SMART metrics from storage devices.
Hardware-observer collects and exports Prometheus metrics from BMCs (using IPMI and the newer Redfish protocol) and from various SAS and RAID controllers via the prometheus-hardware-exporter project. It also configures Prometheus alert rules that fire when a monitored metric reports a suboptimal status.
Appropriate collectors and alert rules are installed based on the availability of one or more of the RAID/SAS controllers mentioned below:
- Broadcom MegaRAID controller
- Dell PowerEdge RAID Controller
- LSI SAS-2 controller
- LSI SAS-3 controller
- HPE Smart Array controller
The Smartctl Exporter integrates with Hardware-observer to monitor storage device health via SMART data. Metrics are collected and exported to Prometheus by smartctl-exporter.
This charm is ideal for monitoring hardware resources when used in conjunction with the Canonical Observability Stack.
To manage third-party hardware resources, vendor-specific CLI tools must be uploaded via juju attach-resource.
The Resources page on Charmhub lists the name of each resource along with its download URL.
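As a rough sketch of the upload step, the command below follows the general `juju attach-resource <application> <resource>=<path>` form. The application name `hardware-observer`, the resource name `vendor-tool`, and the file path are placeholders assumed for illustration; the real resource names are the ones listed on the Resources page.

```shell
# Placeholder values: substitute the resource name shown on the Charmhub
# Resources page and the path to the vendor tool you downloaded.
APP="hardware-observer"          # assumed application name in the model
RESOURCE="vendor-tool"           # hypothetical resource name, not a real one
TOOL_PATH="./vendor-tool.deb"    # hypothetical local path to the vendor tool

# Build and print the command first so it can be reviewed before running.
ATTACH_CMD="juju attach-resource ${APP} ${RESOURCE}=${TOOL_PATH}"
echo "${ATTACH_CMD}"

# Uncomment to run against a live Juju model:
# juju attach-resource "${APP}" "${RESOURCE}=${TOOL_PATH}"
```

Printing the command before running it makes it easy to confirm that the resource name matches the one on Charmhub.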
See the Juju SDK documentation for more information about developing and improving charms.