Hardware Failure with new kernel [message #39907] |
Sun, 20 June 2010 02:05 |
edeis
Messages: 3 Registered: June 2010
|
Junior Member |
|
|
I recently upgrade to the latest OpenVZ kernel for Centos and am now experiencing hardware errors and a crash. If I run the previous kernel, I don't get the errors.
I amm have run memtest86+ to check if it is bad RAM. But it passes no problem.
kernel: 2.6.18-194.3.1.el5.028stab069.6
Error generated (is detected every hour while cronjob runs)
# mcelog
MCE 0
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 2 BANK 8 MISC 41
MCG status:
MCi status:
Error overflow
Uncorrected error
MCi_MISC register valid
Processor context corrupt
MCA: MEMORY CONTROLLER AC_CHANNEL0_ERR
Transaction: Address/Command error
Memory address parity error
Memory corrected error count (CORE_ERR_CNT): 911
Memory transaction Tracker ID (RTId): 41
Memory DIMM ID of error: 0
Memory channel ID of error: 0
Memory ECC syndrome: 0
STATUS ea10e3c0008000b0 MCGSTATUS 0
MCE 0
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 2 BANK 8 MISC 41
MCG status:
MCi status:
Error overflow
Uncorrected error
MCi_MISC register valid
Processor context corrupt
MCA: MEMORY CONTROLLER AC_CHANNEL0_ERR
Transaction: Address/Command error
Memory address parity error
Memory corrected error count (CORE_ERR_CNT): 7970
Memory transaction Tracker ID (RTId): 41
Memory DIMM ID of error: 0
Memory channel ID of error: 0
Memory ECC syndrome: 0
STATUS ea17c880008000b0 MCGSTATUS 0
Using previous Kernel: 2.6.18-164.15.1.el5.028stab068.9
seems to be okay.
help? is it bad hardware or something wrong with the kernel?
[Updated on: Sun, 20 June 2010 02:08] Report message to a moderator
|
|
|
|