4. To display HWdiag help, type:
■ hwdiag to display a list of options and main commands.
■ hwdiag -h to display a list of main commands and their subcommands.
■ hwdiag -h main command to display help specific to main command.
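The three help forms follow one pattern, which a script that prepares HWdiag command lines might capture as shown here. This is a minimal sketch only: the function name hwdiag_help_command, its parameters, and the placeholder value example_main are hypothetical and are not part of HWdiag. Only the hwdiag and hwdiag -h forms come from the step above.

```python
from typing import Optional


def hwdiag_help_command(show_subcommands: bool = False,
                        main_command: Optional[str] = None) -> str:
    """Build the HWdiag help command line for the requested level of detail.

    Hypothetical helper for illustration; it only formats a string and
    does not run HWdiag.
    """
    if main_command is not None:
        # Help specific to one main command: hwdiag -h <main command>
        return f"hwdiag -h {main_command}"
    if show_subcommands:
        # List of main commands and their subcommands
        return "hwdiag -h"
    # List of options and main commands
    return "hwdiag"


# "example_main" is a placeholder, not a real HWdiag main command.
commands = [
    hwdiag_help_command(),
    hwdiag_help_command(show_subcommands=True),
    hwdiag_help_command(main_command="example_main"),
]
```

Substitute a main command name reported by hwdiag -h for the placeholder before using the third form.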
Fault Detection and Diagnostics Overview
The server supports multiple fault detection and diagnostics tools. Fault detection tools, such
as the Oracle ILOM Fault Manager, automatically poll the system to detect hardware faults and
adverse environmental conditions. Diagnostics tools, such as Oracle VTS, must be run manually
and can assist you in troubleshooting server issues. The following table provides an overview of
the fault detection and diagnostics tools supported by the server.
Tool: Oracle ILOM Fault Manager
Description: The fault manager automatically detects system hardware faults and environmental conditions on the server. If a problem occurs on the server, Oracle ILOM identifies the problem in the Open Problems table and logs information about the fault in the Event log. The Oracle ILOM Fault Manager is part of the Oracle ILOM firmware embedded on the server service processor (SP).
Documentation: Refer to Protecting Against Hardware Faults: Oracle ILOM Fault Manager, Oracle ILOM User's Guide for System Monitoring and Diagnostics, Firmware Release 4.0.x at: http://www.oracle.com/goto/ilom/docs

Tool: Oracle Linux Fault Management Architecture (FMA)
Description: Oracle Linux FMA can be used to manage faults detected at the operating system (OS) level in much the same way that you manage faults in Oracle ILOM. Fault diagnosis messages from Linux FMA are maintained on a fault management database, which is shared with Oracle ILOM. Oracle Linux FMA software can be optionally installed on the server through Oracle Hardware Management Pack.
Documentation: Refer to the Oracle Hardware Management Pack Linux Fault Management Architecture User's Guide at: http://www.oracle.com/goto/ohmp/docs

Tool: Oracle Solaris Fault Management Architecture (FMA)
Description: The fault manager receives data related to hardware and software errors, automatically diagnoses the underlying problem, and responds by trying to take faulty components offline. Oracle Solaris FMA is included with the Oracle Solaris operating system (OS).
Documentation: Refer to Oracle Solaris Administration: Common Tasks at: http://docs.oracle.com/cd/E23824_01/index.html

Tool: Auto Service Request (ASR)
Description: ASR is an optional support service for Oracle hardware. ASR collects hardware telemetry data from telemetry sources (such as Oracle ILOM) on ASR-enabled systems in your data center. ASR filters this telemetry data and forwards what it determines to be potential faults directly to Oracle, and then automatically initiates a service request. You can configure features of the ASR service from Oracle ILOM.
Documentation: Go to: http://www.oracle.com/us/support/auto-service-request/index.html
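
The ASR description above follows a collect, filter, forward sequence, which can be pictured with a small model. The following is a conceptual sketch only and does not reflect how ASR is implemented: the TelemetryEvent and ServiceRequest classes, the is_potential_fault flag, the function names, and the sample data are all hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class TelemetryEvent:
    source: str               # telemetry source, for example "Oracle ILOM"
    component: str            # hypothetical component label
    is_potential_fault: bool  # hypothetical stand-in for the ASR filtering decision


@dataclass(frozen=True)
class ServiceRequest:
    source: str
    component: str


def filter_potential_faults(events: Iterable[TelemetryEvent]) -> List[TelemetryEvent]:
    """Keep only the telemetry events judged to be potential faults."""
    return [event for event in events if event.is_potential_fault]


def open_service_requests(events: Iterable[TelemetryEvent]) -> List[ServiceRequest]:
    """Create one service request for each forwarded potential fault."""
    return [
        ServiceRequest(source=event.source, component=event.component)
        for event in filter_potential_faults(events)
    ]


# Hypothetical sample telemetry; the component names are illustrative only.
sample_events = [
    TelemetryEvent("Oracle ILOM", "fan module", True),
    TelemetryEvent("Oracle ILOM", "temperature sensor", False),
]
sample_requests = open_service_requests(sample_events)
```

In this model, only the first sample event passes the filter, so a single service request is created. Routine telemetry is collected but never forwarded.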
76 Oracle X7 Series Servers Administration Guide • September 2017