Integration of IFL-based metrics into the Prometheus Operator of the openshift-monitoring stack #2432
Labels
lifecycle/stale
Denotes an issue or PR has remained open with no activity and has become stale.
Proposal
We have a Z-specific operator that collects IFL-related metrics using hyptop on the hypervisor (z/VM or KVM). These metrics help with cost management for the system.
For KVM:
A service runs on the hypervisor and exposes the hyptop data; the network is configured so that the service can reach the OpenShift cluster running on top of the hypervisor. The default Prometheus then monitors the hyptop data.
For z/VM:
The service monitor runs on every node of the cluster and exposes the hyptop data to Prometheus.
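As a rough illustration of what "exposing the hyptop data to Prometheus" could look like, here is a minimal sketch of an HTTP endpoint that serves an `ifl_usage` gauge in the Prometheus text exposition format. It is not the operator's actual implementation. The `system` label, the port, the `read_hyptop_samples` helper, and the sample values are all hypothetical placeholders; a real exporter would read the values from hyptop.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer


def escape_label(value):
    """Escape a label value per the Prometheus text exposition format."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def render_metrics(samples):
    """Render a {system_name: value} mapping as Prometheus text format."""
    lines = [
        "# HELP ifl_usage IFL usage collected via hyptop.",
        "# TYPE ifl_usage gauge",
    ]
    for system, value in sorted(samples.items()):
        # The "system" label is an assumption made for this sketch.
        lines.append(f'ifl_usage{{system="{escape_label(system)}"}} {value}')
    return "\n".join(lines) + "\n"


def read_hyptop_samples():
    """Hypothetical stand-in for the real hyptop collection step.

    The names and numbers below are made-up placeholders, not real data.
    """
    return {"guest-a": 0.5, "guest-b": 1.25}


class MetricsHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Prometheus scrapes /metrics by default.
        if self.path != "/metrics":
            self.send_error(404)
            return
        body = render_metrics(read_hyptop_samples()).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; version=0.0.4; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


if __name__ == "__main__":
    # Port 8000 is an arbitrary choice for this sketch.
    HTTPServer(("0.0.0.0", 8000), MetricsHandler).serve_forever()
```

With an endpoint of this shape, the open question in this proposal is how the scrape target should be registered with the cluster monitoring stack.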
Example metric and screenshot:
ifl_usage:
metric values:
We would like to know whether these metrics can be integrated with the existing Prometheus-based cluster monitoring stack that is deployed on OpenShift. Any input is welcome!