
Hardware GPU

PATROL Class: MS_HW_GPU - Monitor Type: MONITOR - Monitor Category: System
No CDM Class defined.

Monitors the Graphics Processing Unit (GPU).

Attributes (Parameters)

Name PATROL Name Units Default Thresholds Description
Corrected Error Count CorrectedErrorCount errors None

Number of errors that were detected and corrected. A rising count can help predict an impending failure.

Value set by gpuColl.

Type: Statistics

Decoder Used Time Percent DecoderUsedTimePercent % None

Percentage of time spent by the GPU decoding videos.

Value set by gpuColl.

Type: Statistics - Spikes expected

Encoder Used Time Percent EncoderUsedTimePercent % None

Percentage of time spent by the GPU encoding videos.

Value set by gpuColl.

Type: Statistics - Spikes expected

Error Count ErrorCount errors

 ALARM
if ≥ 1 errors

Number of errors encountered by the GPU since the last reinitialization or the last manual reset. This parameter is incremented each time an error occurs, and an alert is raised as soon as the first error is detected. The operator can reset the counter and clear the alert with the Acknowledge Error Count Alerts and Reset menu command. The ErrorCount parameter can also be configured to reset automatically.

Value set by gpuColl.

Type: Statistics
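
The counter-and-reset behavior described for ErrorCount can be illustrated with a minimal Python sketch. The class and method names (ErrorCounter, record_error, acknowledge_and_reset) are hypothetical and are not part of any PATROL API; only the default threshold (ALARM if ≥ 1 errors) is taken from the table above.

```python
class ErrorCounter:
    """Hypothetical model of the ErrorCount parameter (not a PATROL API)."""

    ALARM_THRESHOLD = 1  # documented default: ALARM if >= 1 errors

    def __init__(self):
        self.count = 0

    def record_error(self):
        # The parameter is incremented each time an error occurs.
        self.count += 1

    @property
    def state(self):
        # An alert is raised from the first detected error.
        return "ALARM" if self.count >= self.ALARM_THRESHOLD else "OK"

    def acknowledge_and_reset(self):
        # Models the operator acknowledging the alert and resetting the counter.
        self.count = 0


counter = ErrorCounter()
counter.record_error()
print(counter.state)   # ALARM
counter.acknowledge_and_reset()
print(counter.state)   # OK
```

Because the threshold is 1, the parameter stays in ALARM until the counter is reset, whether manually or automatically.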

Memory Utilization MemoryUtilization %

 WARN
if ∈ [90 %, 100 %]

N/A

Type: Statistics

Power Consumption PowerConsumption Watts None

Power consumption of the GPU.

Value set by gpuColl.

Type: Statistics

Predicted Failure PredictedFailure 0 = OK
1 = Failure Predicted

 WARN
if = 1 (Failure Predicted)

Reports the predicted failure analysis performed by the GPU itself.

Value set by gpuColl.

Type: Availability

Present Present 0 = Missing
1 = Present

 ALARM
if = 0 (Missing)

Availability of the GPU.

Value set by gpuColl.

Type: Availability

Received Bytes Rate ReceivedBytesRate MB/s None

GPU incoming traffic.

Value set by gpuColl.

Type: Statistics - Spikes expected

Status Status 0 = OK
1 = Degraded
2 = Failed

 WARN
if = 1 (Degraded)

 ALARM
if = 2 (Failed)

GPU status.

Value set by gpuColl.

Type: Availability - Default graph
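
As an illustration of how the Status values map to alert levels, here is a short Python sketch. The function name describe_status and the dictionaries are hypothetical; the value labels and default thresholds are those listed above. Rejecting values outside 0 to 2 is an assumption of this sketch, since the source does not say how such values are handled.

```python
# Value labels and default alert levels as documented for the Status parameter.
STATUS_LABELS = {0: "OK", 1: "Degraded", 2: "Failed"}
STATUS_ALERTS = {1: "WARN", 2: "ALARM"}  # 0 (OK) raises no alert


def describe_status(value):
    """Return (label, alert) for a Status value; alert is None when no alert applies."""
    if value not in STATUS_LABELS:
        # Assumption: undocumented values are treated as invalid input.
        raise ValueError(f"unexpected Status value: {value!r}")
    return STATUS_LABELS[value], STATUS_ALERTS.get(value)


for v in (0, 1, 2):
    print(v, describe_status(v))
```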

Status Information StatusInformation None None

Additional (textual) information about the current status of the GPU.

Value set by gpuColl.

Type: Text

Transmitted Bytes Rate TransmittedBytesRate MB/s None

GPU outgoing traffic.

Value set by gpuColl.

Type: Statistics - Spikes expected

Used Time Percent UsedTimePercent %

 WARN
if ∈ [80 %, 90 %]

 ALARM
if ∈ [90 %, 100 %]

Percentage of time spent by the GPU doing any work.

Value set by gpuColl.

Type: Statistics
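
The default thresholds for UsedTimePercent can be sketched as a small Python function. The function name used_time_alert is hypothetical. Note that both documented ranges include 90 %; this sketch assumes that ALARM takes precedence at exactly 90 %, which the source does not state.

```python
def used_time_alert(percent):
    """Classify a UsedTimePercent value against the documented default thresholds."""
    if not 0 <= percent <= 100:
        raise ValueError(f"percentage out of range: {percent!r}")
    if percent >= 90:
        # ALARM range [90 %, 100 %]; assumed to win over WARN at exactly 90 %.
        return "ALARM"
    if percent >= 80:
        # WARN range [80 %, 90 %].
        return "WARN"
    return "OK"


for sample in (42.0, 85.0, 90.0, 97.5):
    print(sample, used_time_alert(sample))
```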
