LINUX.ORG.RU

История изменений

Исправление sl-project, (текущая версия) :

Cтолкнулся с ворохом каких то ошибок по модулям IO. Попробовал искать что это может быть, но неудачно.

кто то может сталкивался с этим и знает в чем причина? напряжения и температуры в норме. Сначала запустил сервер, старт прошел успешно, а так как автозагрузка отключена, дал команду на загрузку, Но она не прошла, рестартовал сервер, были кучи ошибок по модулям IO, Это второй блок ошибок после холодной перезагрузки.

Nov 03 06:38:40 sf4800 Platform.SC: Data Parity error polling failed. Board will no longer be polled: JtagController.tapIssueCmd:  ConsoleBus ERROR:  errorCode=00008100 (CM_PRER) ack=00
	
	I/O request: RP2.sdc.b0 (138000b0) offset=038000b0 window=83000004 P=0 DD=3 space=4
	Error address: SSC0.cbh.330 (13e00330)
Hardware error occurred during Interconnect testing: sun.serengeti.HpuFailedException: /N0/IB6: L1ICT.pass1CheckInterConnectTest: sun.serengeti.FailedHwException: DoubleErrorHandler.checkForErrors: : PCI I/O Board at /N0/IB6
Nov 03 06:38:40 sf4800 Platform.SC: Data Parity error polling failed. Board will no longer be polled: JtagController.tapIssueCmd:  ConsoleBus ERROR:  errorCode=00008800 (CM_EACK) ack=ee
	
	I/O request: RP0.sdc.b0 (134000b0) offset=034000b0 window=83000004 P=0 DD=3 space=4
	Error address: IB6.ar.50 (12c80050)
Hardware error occurred during Interconnect testing: sun.serengeti.HpuFailedException:  setSlaveSync: DoubleErrorHandler.checkForErrors: : PCI I/O Board at /N0/IB6
Nov 03 06:38:40 sf4800 Domain-A.SC: Excluded unusable, unlicensed, failed or disabled board: /N0/IB6
Testing IO Boards ...
Nov 03 06:38:57 sf4800 Platform.SC: Device voltage problem: /N0/IB6 abnormal state for device: Board 0 1.5 VDC 0 Value: 0.03 Volts DC
Nov 03 06:38:57 sf4800 Platform.SC: Device voltage problem: /N0/IB6 abnormal state for device: Board 0 3.3 VDC 0 Value: 0.49 Volts DC
Nov 03 06:38:57 sf4800 Platform.SC: Device voltage problem: /N0/IB6 abnormal state for device: Board 0 5 VDC 0 Value: 0.29 Volts DC
Nov 03 06:38:57 sf4800 Platform.SC: Device voltage problem: /N0/IB6 abnormal state for device: Board 0 12 VDC 0 Value: 0.08 Volts DC
Nov 03 06:38:57 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: (SdcAsic)Asic.getTemp: Path broken between CBH and SDC: IB6.sdc.10 (12c00010)
Nov 03 06:38:57 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: (ArAsic)Asic.getTemp: Path broken between CBH and SDC: IB6.ar.10 (12c80010)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: /partition0/domain0/IB6/dx0: DxAsic.getTemp: sun.serengeti.jtag.JtagException: JtagController.tapWait:  Path broken between CBH and SDC: IB6.sdc.b0 (12c000b0)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: /partition0/domain0/IB6/dx1: DxAsic.getTemp: sun.serengeti.jtag.JtagException: JtagController.tapWait:  Path broken between CBH and SDC: IB6.sdc.b0 (12c000b0)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: (RepeaterSbbcAsic)Asic.getTemp: Path broken between CBH and SDC: IB6.sbbc0.regs.10 (11800010)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: I2cComm.readCmd:  Path broken between CBH and SDC: IB6.sbbc0.regs.c0 (118000c0)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: I2cComm.readCmd:  Path broken between CBH and SDC: IB6.sbbc0.regs.c0 (118000c0)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: /N0/IB6, sensor status, outside acceptable limits (7,1,0x501060d00050000)
Nov 03 06:38:58 sf4800 Platform.SC: /N0/IB6, sensor status, outside acceptable limits (7,1,0x501060d00070000)
Nov 03 06:38:58 sf4800 Platform.SC: /N0/IB6, sensor status, outside acceptable limits (7,1,0x501060d00080000)
Nov 03 06:38:58 sf4800 Platform.SC: /N0/IB6, sensor status, outside acceptable limits (7,1,0x501060d00090000)
Loading the test table from board IB8 PROM 0 ...
Copying IO PROM to CPU DRAM
.Nov 03 06:39:05 sf4800 Platform.SC: ErrorMonitor: Domain A has a SYSTEM ERROR
Nov 03 06:39:05 sf4800 Domain-A.SC: ErrorMonitor: Domain A has a SYSTEM ERROR
Nov 03 06:39:05 sf4800 Domain-A.SC: /N0/IB8 encountered the first error
Nov 03 06:39:05 sf4800 Domain-A.SC: ArAsic reported first error on /N0/IB8
Nov 03 06:39:05 sf4800 Domain-A.SC: 
/partition0/domain0/IB8/ar0: 
>>> L2CheckError[0x6150] : 0x00009e1e
	     CMDVSyncErr [12:09] : 0xf Ports [9:6] command valid mismatched against internal expected command valid
	     PreqSyncErr [04:01] : 0xf Ports [9:6] prereq mismatched against internal expected prereq
	              FE [15:15] : 0x1 

Nov 03 06:39:05 sf4800 Platform.SC: [AD] Event: SF4800
     CSN: 203M20F6 DomainID: A ADInfo: 1.SCAPP.20.9
     Time: Sun Nov 03 06:39:05 PST 2019
     FRU-List-Count: 0; FRU-PN:  ; FRU-SN:  ; FRU-LOC: UNRESOLVED
     Recommended-ActioNov 03 06:39:05 sf4800 Domain-A.SC: [AD] Event: SF4n: Service action re800
     CSN: 203M2quired
0F6 DomainID
: A ADInfo: 1.SCAPP.20.9
     Time: Sun Nov 03 06:39:05 PST 2019
     FRU-List-Count: 0; FRU-PN:  ; FRU-SN:  ; FRU-LOC: UNRESOLVNov 03 06:39:05 sf4800 Platform.SC: A fatal condition is detected on Domain A.ED
     Recommended-Action: Service action required

Исходная версия sl-project, :

ошибки модулей IO

Cтолкнулся с ворохом каких то ошибок по модулям IO. Попробовал искать что это может быть, но неудачно.

кто то может сталкивался с этим и знает в чем причина? напряжения и температуры в норме.

Nov 03 06:38:40 sf4800 Platform.SC: Data Parity error polling failed. Board will no longer be polled: JtagController.tapIssueCmd:  ConsoleBus ERROR:  errorCode=00008100 (CM_PRER) ack=00
	
	I/O request: RP2.sdc.b0 (138000b0) offset=038000b0 window=83000004 P=0 DD=3 space=4
	Error address: SSC0.cbh.330 (13e00330)
Hardware error occurred during Interconnect testing: sun.serengeti.HpuFailedException: /N0/IB6: L1ICT.pass1CheckInterConnectTest: sun.serengeti.FailedHwException: DoubleErrorHandler.checkForErrors: : PCI I/O Board at /N0/IB6
Nov 03 06:38:40 sf4800 Platform.SC: Data Parity error polling failed. Board will no longer be polled: JtagController.tapIssueCmd:  ConsoleBus ERROR:  errorCode=00008800 (CM_EACK) ack=ee
	
	I/O request: RP0.sdc.b0 (134000b0) offset=034000b0 window=83000004 P=0 DD=3 space=4
	Error address: IB6.ar.50 (12c80050)
Hardware error occurred during Interconnect testing: sun.serengeti.HpuFailedException:  setSlaveSync: DoubleErrorHandler.checkForErrors: : PCI I/O Board at /N0/IB6
Nov 03 06:38:40 sf4800 Domain-A.SC: Excluded unusable, unlicensed, failed or disabled board: /N0/IB6
Testing IO Boards ...
Nov 03 06:38:57 sf4800 Platform.SC: Device voltage problem: /N0/IB6 abnormal state for device: Board 0 1.5 VDC 0 Value: 0.03 Volts DC
Nov 03 06:38:57 sf4800 Platform.SC: Device voltage problem: /N0/IB6 abnormal state for device: Board 0 3.3 VDC 0 Value: 0.49 Volts DC
Nov 03 06:38:57 sf4800 Platform.SC: Device voltage problem: /N0/IB6 abnormal state for device: Board 0 5 VDC 0 Value: 0.29 Volts DC
Nov 03 06:38:57 sf4800 Platform.SC: Device voltage problem: /N0/IB6 abnormal state for device: Board 0 12 VDC 0 Value: 0.08 Volts DC
Nov 03 06:38:57 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: (SdcAsic)Asic.getTemp: Path broken between CBH and SDC: IB6.sdc.10 (12c00010)
Nov 03 06:38:57 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: (ArAsic)Asic.getTemp: Path broken between CBH and SDC: IB6.ar.10 (12c80010)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: /partition0/domain0/IB6/dx0: DxAsic.getTemp: sun.serengeti.jtag.JtagException: JtagController.tapWait:  Path broken between CBH and SDC: IB6.sdc.b0 (12c000b0)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: /partition0/domain0/IB6/dx1: DxAsic.getTemp: sun.serengeti.jtag.JtagException: JtagController.tapWait:  Path broken between CBH and SDC: IB6.sdc.b0 (12c000b0)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: (RepeaterSbbcAsic)Asic.getTemp: Path broken between CBH and SDC: IB6.sbbc0.regs.10 (11800010)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: I2cComm.readCmd:  Path broken between CBH and SDC: IB6.sbbc0.regs.c0 (118000c0)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: PCI I/O Board at /N0/IB6 Device poll caused: sun.serengeti.FailedHwException: I2cComm.readCmd:  Path broken between CBH and SDC: IB6.sbbc0.regs.c0 (118000c0)
Nov 03 06:38:58 sf4800 Platform.SC: Device will not be polled
Nov 03 06:38:58 sf4800 Platform.SC: /N0/IB6, sensor status, outside acceptable limits (7,1,0x501060d00050000)
Nov 03 06:38:58 sf4800 Platform.SC: /N0/IB6, sensor status, outside acceptable limits (7,1,0x501060d00070000)
Nov 03 06:38:58 sf4800 Platform.SC: /N0/IB6, sensor status, outside acceptable limits (7,1,0x501060d00080000)
Nov 03 06:38:58 sf4800 Platform.SC: /N0/IB6, sensor status, outside acceptable limits (7,1,0x501060d00090000)
Loading the test table from board IB8 PROM 0 ...
Copying IO PROM to CPU DRAM
.Nov 03 06:39:05 sf4800 Platform.SC: ErrorMonitor: Domain A has a SYSTEM ERROR
Nov 03 06:39:05 sf4800 Domain-A.SC: ErrorMonitor: Domain A has a SYSTEM ERROR
Nov 03 06:39:05 sf4800 Domain-A.SC: /N0/IB8 encountered the first error
Nov 03 06:39:05 sf4800 Domain-A.SC: ArAsic reported first error on /N0/IB8
Nov 03 06:39:05 sf4800 Domain-A.SC: 
/partition0/domain0/IB8/ar0: 
>>> L2CheckError[0x6150] : 0x00009e1e
	     CMDVSyncErr [12:09] : 0xf Ports [9:6] command valid mismatched against internal expected command valid
	     PreqSyncErr [04:01] : 0xf Ports [9:6] prereq mismatched against internal expected prereq
	              FE [15:15] : 0x1 

Nov 03 06:39:05 sf4800 Platform.SC: [AD] Event: SF4800
     CSN: 203M20F6 DomainID: A ADInfo: 1.SCAPP.20.9
     Time: Sun Nov 03 06:39:05 PST 2019
     FRU-List-Count: 0; FRU-PN:  ; FRU-SN:  ; FRU-LOC: UNRESOLVED
     Recommended-ActioNov 03 06:39:05 sf4800 Domain-A.SC: [AD] Event: SF4n: Service action re800
     CSN: 203M2quired
0F6 DomainID
: A ADInfo: 1.SCAPP.20.9
     Time: Sun Nov 03 06:39:05 PST 2019
     FRU-List-Count: 0; FRU-PN:  ; FRU-SN:  ; FRU-LOC: UNRESOLVNov 03 06:39:05 sf4800 Platform.SC: A fatal condition is detected on Domain A.ED
     Recommended-Action: Service action required