Integration with BMC Helix Operations Management

Hardware Sentry integrates with BMC Helix Operations Management to expose hardware health, performance metrics, and sustainability indicators in Helix Dashboards. The integration relies on the standard Prometheus Remote Write protocol, which Hardware Sentry uses to push metrics to the BMC Helix platform.

Hardware Sentry integration with BMC Helix

Prerequisites

  1. Download and install Hardware Sentry
  2. Configure the monitoring of the host(s)
  3. Verify that the metrics are stored in the Prometheus Server
  4. Download the latest version of the Hardware Sentry Dashboards for BMC Helix from the Sentry Software website
  5. Import the Hardware Sentry dashboards into BMC Helix

Configuration

First, connect to BMC Helix Operations Management, go to the Administration > Repository page, and click the Copy API Key button to get your API Key.

Copy API Key

Then, declare the exporter in the service:pipelines section of otel/otel-config.yaml:

service:
  extensions: [health_check]
  pipelines:
    metrics:
      receivers: [prometheus_exec/hws-exporter,prometheus/internal]
      processors: [memory_limiter,batch,metricstransform]
      exporters: [prometheusremotewrite/helix] # Your helix config must be listed here

Edit the exporters section of the otel/otel-config.yaml configuration file as in the example below:

exporters:
  prometheusremotewrite/helix:
    endpoint: https://<your-helix-env>.onbmc.com/metrics-gateway-service/api/v1.0/prometheus
    headers:
      Authorization: Bearer <apiToken>
    resource_to_telemetry_conversion:
      enabled: true

where:

  • <your-helix-env> is the name of your BMC Helix environment, that is, the subdomain that precedes onbmc.com in its host name
  • <apiToken> is the API Key of your BMC Helix environment
  • resource_to_telemetry_conversion converts all the resource attributes to metric labels when enabled
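As an illustration of how the placeholders map to the final settings, here is a minimal Python sketch that assembles the same exporter entry as a dictionary. The function name `helix_exporter_config` and its arguments are hypothetical and are not part of Hardware Sentry; only the endpoint pattern, the Authorization header, and the resource_to_telemetry_conversion setting come from the example above.

```python
def helix_exporter_config(helix_env: str, api_key: str) -> dict:
    """Build the prometheusremotewrite/helix exporter entry as a dict.

    helix_env and api_key are hypothetical argument names standing in for
    the <your-helix-env> and <apiToken> placeholders.
    """
    if not helix_env or not api_key:
        raise ValueError("both the environment name and the API Key are required")
    return {
        "prometheusremotewrite/helix": {
            # Endpoint pattern taken from the configuration example
            "endpoint": (
                f"https://{helix_env}.onbmc.com"
                "/metrics-gateway-service/api/v1.0/prometheus"
            ),
            # The API Key is sent as a Bearer token
            "headers": {"Authorization": f"Bearer {api_key}"},
            # Converts resource attributes to metric labels
            "resource_to_telemetry_conversion": {"enabled": True},
        }
    }
```

Printing the result for a given environment name makes it easy to compare against the exporters section you edited by hand.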

Using Hardware Sentry dashboards in BMC Helix

Hardware Sentry comes with three dashboards that expose health, performance, and sustainability metrics and indicators:

  • Main: Global hardware and sustainability information for your entire monitored infrastructure
  • Site: Deep insight into a specific site (server room)
  • Host: Detailed metrics for a host and its internal components

Dashboards are organized into panels exposing systems' health and performance metrics, real-time and projected data on energy consumption and costs, as well as carbon emissions for your monitored infrastructure.

From the BMC Helix Dashboards Home Page, browse the Dashboards menu and select the Hardware Sentry folder to access the Hardware Sentry dashboards.

Helix Dashboards - Accessing Hardware Sentry dashboards

Understanding the monitoring coverage of your IT infrastructure

The ability to monitor your entire IT infrastructure is essential if you want to prevent or fix performance issues before they affect your customers and business. Reaching 100% coverage is a legitimate goal. The Coverage panels of the Hardware Sentry - Main and Hardware Sentry - Site dashboards expose the percentage of hosts that are actually monitored.
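To make the coverage figure concrete, the following Python sketch computes the percentage of monitored hosts from two counts. It is only an illustration of the arithmetic behind "percentage of hosts that are actually monitored"; the function and parameter names are hypothetical, and the way Hardware Sentry counts hosts internally is not described here.

```python
def coverage_percentage(monitored_hosts: int, total_hosts: int) -> float:
    """Return the share of hosts that are monitored, as a percentage."""
    if total_hosts <= 0:
        raise ValueError("total_hosts must be positive")
    if not 0 <= monitored_hosts <= total_hosts:
        raise ValueError("monitored_hosts must be between 0 and total_hosts")
    return 100.0 * monitored_hosts / total_hosts


# Example: 45 monitored hosts out of 50 gives a coverage of 90.0
print(coverage_percentage(45, 50))
```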

A coverage percentage below 100% indicates that some hosts are not monitored adequately. In such a case, you can:

  • Check the Hardware Sentry Agents Status panel at the bottom of the Main dashboard to make sure that the Hardware Sentry Agents responsible for collecting data are fully operational. The panel displays the agents' hostnames and indicates if the agent and connector versions are up-to-date. The Last Seen column indicates the last time an agent was seen over the past 6 hours.

    An agent that has not been seen for more than 2 minutes may indicate a problem with the host, the connection, or the agent configuration.

  • Open each Site dashboard to spot the hosts with no collected data. Then, from the host page, verify the status of the configured Protocol(s) and matching Connector(s). If their status is Down or Failed, open the config/hws-config.yaml file and verify the host configuration. If you have manually defined the connectors, check the selectedConnectors and excludedConnectors parameters (see Configuring the Hardware Sentry Agent).
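The 2-minute rule for the Last Seen column can be expressed as a simple check. The sketch below is hypothetical: it assumes you already have the last-seen timestamp of each agent in a dictionary (for example, copied from the panel), and it does not query BMC Helix or Hardware Sentry.

```python
from datetime import datetime, timedelta, timezone

# Threshold taken from the text: an agent unseen for more than 2 minutes
STALE_AFTER = timedelta(minutes=2)


def stale_agents(last_seen: dict[str, datetime], now: datetime) -> list[str]:
    """Return the hostnames of agents not seen within STALE_AFTER, sorted."""
    return sorted(
        host for host, seen in last_seen.items() if now - seen > STALE_AFTER
    )


now = datetime(2024, 1, 1, 12, 0, 0, tzinfo=timezone.utc)
agents = {
    "agent-a": now - timedelta(seconds=30),  # seen recently
    "agent-b": now - timedelta(minutes=5),   # stale
}
print(stale_agents(agents, now))
```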

Note: A low coverage value automatically increases the Margin of Error, which reduces the accuracy of the estimated trends reported in the Power, Costs, and CO₂ Emissions panel.

Monitoring energy usage and carbon emissions

In addition to detailed operational data, Hardware Sentry dashboards expose unified, accurate, and continuous views of power consumption, energy costs, and carbon emissions. These indicators allow IT administrators to assess the current situation, identify excessive energy usage or other issues, and take action to improve efficiency.

The Power, Cost, and CO₂ emissions section of the Hardware Sentry - Main and Hardware Sentry - Site dashboards displays live and historical metrics on the energy (in kWh) that your infrastructure consumes daily, monthly, and yearly, as well as the associated costs and carbon emissions.
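The relationship between power, energy, cost, and emissions can be sketched with basic arithmetic. The Python example below assumes a constant power draw, a flat electricity price, a fixed carbon intensity, and a 30-day month; all names and values are hypothetical, and this is not the estimation method used by Hardware Sentry.

```python
# Number of days per reporting period (30-day month is an assumption)
DAYS_PER_PERIOD = {"daily": 1, "monthly": 30, "yearly": 365}


def energy_estimates(
    power_watts: float, price_per_kwh: float, kg_co2_per_kwh: float
) -> dict[str, dict[str, float]]:
    """Estimate energy (kWh), cost, and CO2 (kg) per period at constant power."""
    estimates = {}
    for period, days in DAYS_PER_PERIOD.items():
        # Energy in kWh = watts x hours / 1000
        kwh = power_watts * 24 * days / 1000
        estimates[period] = {
            "kwh": kwh,
            "cost": kwh * price_per_kwh,
            "co2_kg": kwh * kg_co2_per_kwh,
        }
    return estimates


# Example: a 1000 W load, 0.25 per kWh, 0.5 kg of CO2 per kWh
for period, values in energy_estimates(1000, 0.25, 0.5).items():
    print(period, values)
```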

Helix Dashboards - Monitoring energy usage and carbon emissions

The Margin of Error panel indicates the level of confidence in the estimated values. The lower the Margin of Error, the more accurate the estimate.

Hardware Sentry also reports the power consumption, energy costs, and CO₂ emissions of each monitored host in the corresponding Hardware Sentry - Host dashboard.

Helix Dashboards - Monitoring energy usage and carbon emissions

The Power per Device Type panel shows the distribution of the energy consumed by the internal components, by type, for the monitored host.

Identifying the top consumer sites

Operating servers that no longer meet high efficiency standards can result in high electricity consumption, costs, and carbon emission levels. Hardware Sentry can help you identify the most energy-intensive sites and hosts to reduce your electricity bills and carbon emissions.

The Total Power column of the Sites section in the Hardware Sentry - Main dashboard displays the total power consumption of the monitored hosts by site. The sites with the highest Total Power consume the most energy.

Helix Dashboards - Identifying the top consumer sites

Scroll down to the Top Consumers panel available in the Power and Host Information section. This panel lists the ten most energy-intensive hosts in your entire infrastructure.
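The two views described above, total power by site and the ten most energy-intensive hosts, amount to a group-by sum and a sorted top-N selection. The Python sketch below illustrates that logic on hypothetical sample data; the host names, site names, and function names are made up, and the code does not reflect how the dashboards query their data.

```python
from collections import defaultdict


def total_power_by_site(hosts: list[tuple[str, str, float]]) -> dict[str, float]:
    """Sum the power (watts) of (host, site, power) records per site."""
    totals: dict[str, float] = defaultdict(float)
    for _host, site, power in hosts:
        totals[site] += power
    return dict(totals)


def top_consumers(
    hosts: list[tuple[str, str, float]], limit: int = 10
) -> list[tuple[str, float]]:
    """Return up to `limit` (host, power) pairs, highest power first."""
    ranked = sorted(hosts, key=lambda record: record[2], reverse=True)
    return [(host, power) for host, _site, power in ranked[:limit]]


# Hypothetical sample data: (host, site, power in watts)
sample = [
    ("srv-01", "paris", 350.0),
    ("srv-02", "paris", 420.0),
    ("srv-03", "london", 610.0),
]
print(total_power_by_site(sample))
print(top_consumers(sample, limit=2))
```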

Helix Dashboards - Identifying the top consumer sites

Note: Metrics are automatically updated according to the collection interval (2 minutes by default).

Replacing older equipment with more efficient models or moving servers to a site with a lower electricity cost can help you meet your sustainability goals and reduce your energy costs.

Click the histogram bar of a host to view detailed information about that host.

Optimizing a site's temperature

Data centers are energy-intensive facilities. This energy is converted into heat that must be dissipated away from the equipment racks to maintain an optimal room temperature. Therefore, the hardware devices' temperature is a critical parameter that must be closely monitored.

The Heating Margin panel exposes, for each site, the number of degrees Celsius (°C) remaining before the temperature of a monitored host reaches its closest warning threshold.

Note: Some hardware devices do not expose their overall temperature and are, therefore, not included in the heating margin computation. Hardware Sentry reports the Monitoring Confidence percentage for each site in the Site dashboard.
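One way to read the heating margin is as the smallest gap between a host's current temperature and its warning threshold across the site. The Python sketch below implements that interpretation and skips hosts without a temperature reading, as described in the note. The function name and input layout are hypothetical, and the exact computation performed by Hardware Sentry may differ.

```python
from typing import Optional


def heating_margin(
    readings: list[tuple[Optional[float], Optional[float]]]
) -> Optional[float]:
    """Return the smallest (warning threshold - temperature) gap in °C.

    Each reading is a (temperature, warning_threshold) pair for one host.
    Hosts missing either value are ignored. Returns None when no host
    exposes both values.
    """
    margins = [
        threshold - temperature
        for temperature, threshold in readings
        if temperature is not None and threshold is not None
    ]
    return min(margins) if margins else None


# Hypothetical site with three hosts, one of which reports no temperature
site = [(25.0, 40.0), (30.0, 38.0), (None, 45.0)]
print(heating_margin(site))
```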

Helix Dashboards - Viewing potential savings

By combining the temperature indicators collected from each monitored server and component, Hardware Sentry calculates the Recommended Site Temperature, which is the temperature at which the site can safely operate. Increasing the temperature of a server room can generate substantial savings on your electricity bill (Potential Yearly Savings) and significantly reduce your facility's carbon emissions (Potential Yearly CO₂ Reduction).
