CPU errors exceeded acceptable levels
Type
- Fault
Severity
- Major
Description
- The number of errors associated with this CPU has exceeded acceptable levels.
Automated Response
- The fault manager will attempt to remove the affected CPU from service.
Impact
- System performance may be affected.
Suggested Action for System Administrator
- Schedule a repair procedure to replace the affected CPU. To identify it, run fmdump -v -u <EVENT-ID>
to view the results of the diagnosis and the specific Field Replaceable
Unit (FRU) identified for repair.
The event ID can be found in the EVENT-ID field of the message.
For example:
EVENT-ID: eb847cd6-168b-cc6b-97b7-87bcba2ee179
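When fault messages are collected from logs, the lookup step can be scripted. The following is a minimal Python sketch, not part of the Fault Manager tooling: the helper names extract_event_id and fmdump_command are hypothetical, and the pattern assumes the EVENT-ID always has the UUID shape shown in the example above.

```python
import re

# Assumption: EVENT-ID values are UUID-shaped, as in the example message.
EVENT_ID_RE = re.compile(
    r"EVENT-ID:\s*([0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12})",
    re.IGNORECASE,
)


def extract_event_id(message):
    """Return the EVENT-ID found in a fault message, or None if absent."""
    match = EVENT_ID_RE.search(message)
    return match.group(1) if match else None


def fmdump_command(event_id):
    """Build the argument list for `fmdump -v -u <EVENT-ID>` (hypothetical helper)."""
    return ["fmdump", "-v", "-u", event_id]


message = "EVENT-ID: eb847cd6-168b-cc6b-97b7-87bcba2ee179"
event_id = extract_event_id(message)
if event_id is not None:
    # Print the command rather than running it, so the sketch works anywhere.
    print(" ".join(fmdump_command(event_id)))
```

On a Solaris host, the list returned by fmdump_command could be passed to subprocess.run to execute the command directly.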
Details
- The Message ID SUN4U-8000-AC indicates that the Solaris[TM] Fault Manager
has received reports from a CPU that one or more errors associated with the
Level 2 cache (L2SRAM) have been detected. The Diagnostic Engine (DE) has
triggered an automatic response to disable and isolate this CPU from the
configuration in order to prevent repeat errors and increase the
system's overall availability.
Arrangements should be made to replace the Field Replaceable Unit (FRU)
on which the suspect CPU is located. Refer to the information displayed
in the FRU: field of the fmdump output. For example:
% fmdump -v -u eb847cd6-168b-cc6b-97b7-87bcba2ee179
TIME                 UUID                                 SUNW-MSG-ID
Sep 06 21:03:05.4129 eb847cd6-168b-cc6b-97b7-87bcba2ee179 SUN4U-8000-AC
  100%  fault.cpu.ultraSPARC-IIIi.l2cachedata
        FRU: hc:///component=Slot A
       rsrc: cpu:///cpuid=0/serial=D27081B5443
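
To pull the FRU out of this output programmatically, the suspect lines can be parsed with a short script. The Python sketch below is illustrative only: parse_suspects is a hypothetical helper, the sample text is copied from the example above, and the parser assumes each suspect starts with a "<certainty>% <fault class>" line followed by FRU: and rsrc: lines. Other fmdump output layouts are not handled.

```python
# Sample text copied from the fmdump example above.
SAMPLE = """\
TIME                 UUID                                 SUNW-MSG-ID
Sep 06 21:03:05.4129 eb847cd6-168b-cc6b-97b7-87bcba2ee179 SUN4U-8000-AC
  100%  fault.cpu.ultraSPARC-IIIi.l2cachedata
        FRU: hc:///component=Slot A
       rsrc: cpu:///cpuid=0/serial=D27081B5443
"""


def parse_suspects(text):
    """Collect certainty, fault class, FRU and resource for each suspect."""
    suspects = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        # A suspect begins with a line such as "100%  fault.cpu...".
        if "%" in line and line.split("%", 1)[0].isdigit():
            certainty, fault_class = line.split("%", 1)
            current = {"certainty": int(certainty), "class": fault_class.strip()}
            suspects.append(current)
        elif current is not None and line.startswith("FRU:"):
            current["fru"] = line[len("FRU:"):].strip()
        elif current is not None and line.startswith("rsrc:"):
            current["rsrc"] = line[len("rsrc:"):].strip()
    return suspects


for suspect in parse_suspects(SAMPLE):
    print(suspect["certainty"], suspect["class"], suspect.get("fru"))
```

The FRU value, here hc:///component=Slot A, identifies the unit to schedule for replacement.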