The Message ID:
SUN4V-8001-MR
indicates a problem
in the interconnect between the UltraSPARC T2 Plus processors.
No data has been lost. However, a data lane has been taken
out of service which will reduce the system's ability to correct
transmission errors and may impact system performance. The system
is at increased risk of incurring an uncorrectable error, which
will cause a service interruption, until the problem is resolved.
Use the command fmdump -v -u <EVENT_ID>
with the <EVENT_ID> from the PSH console message
to locate the suspected faulty components. For this fault, the fault manager identifies each thread in the
processor as faulty as shown in the example below.
# fmdump -v -u 4a6ee8b0-129a-eaae-847d-c6c62860a110
TIME UUID SUNW-MSG-ID
Jan 08 11:21:17.7669 4a6ee8b0-129a-eaae-847d-c6c62860a110 SUN4V-8001-MR
100% fault.cpu.ultraSPARC-T2plus.lfu-f
Problem in: cpu:///cpuid=63/serial=FAD806CD143516C
Affects: cpu:///cpuid=63/serial=FAD806CD143516C
FRU: hc://:serial=100048:part=501784702/motherboard=0
Location: MB
100% fault.cpu.ultraSPARC-T2plus.lfu-f
Problem in: cpu:///cpuid=62/serial=FAD806CD143516C
Affects: cpu:///cpuid=62/serial=FAD806CD143516C
FRU: hc://:serial=100048:part=501784702/motherboard=0
Location: MB
.
.
.
100% fault.cpu.ultraSPARC-T2plus.lfu-f
Problem in: cpu:///cpuid=0/serial=FAD806CD143516C
Affects: cpu:///cpuid=0/serial=FAD806CD143516C
FRU: hc://:serial=100048:part=501784702/motherboard=0
Location: MB
Note the FRU called out for each faulty thread is the same. In the
example above, the FRU is the motherboard. Also note the FRU part
number and location is included:
part=501784702
Location: MB