*SOLVED* 028test10.1 same sda error on 3 of 5 machines [message #9618] |
Sat, 13 January 2007 11:39 |
|
Hello!
We have a strange problem with some of our servers.
Before we installed OpenVZ (using CentOS 4.4 as basis) on our machines every server worked without any problem. Now - after the installation of OpenVZ 028test10.1 we got strange hardware errors. The error is the same on ALL machines (including sector numbers):
end_request: I/O error, dev sda, sector 5100005.
The strange on that fact is that every machine has different hard drive vendors. So it seems to be impossible that every machine has the same error. But all (5 of 5) machines still have the same main board. I think it is not a hardware bug but a kernel problem because all machines worked well with ubuntu 6.06 and its official ubuntu server kernel.
After the error occurs the machine is only reachable by ping but not via ssh. Console access is not possible too.
btw. We need the 2.6.18 kernel series because we need nfs support in the VEs.
I really hope someone can give us a hint on how to solve that problem.
Thank you for reading
Bernhard
[Updated on: Fri, 04 May 2007 07:24] by Moderator Report message to a moderator
|
|
|
Re: 028test10.1 same sda error on 3 of 5 machines [message #9620 is a reply to message #9618] |
Sat, 13 January 2007 12:30 |
|
Notice:
In one of the servers we changed the hard disk already -> still the same error.
The systems are using a VIA VT6420 SATA Controller. And I already found some links to other people that have similar problems.
For example this one:
http://lkml.org/lkml/2006/12/30/79
They concludde that this is a device error but I still think this is wrong because we have exactly the same error on many machines.
Maybe the driver for VT6420 has an error. I'll try looking deeper into it but I'm not a kernel hacker so my knowledge of the kernel is limited. Hopefully someone can help out.
[Updated on: Sat, 13 January 2007 12:49] Report message to a moderator
|
|
|
|
|
|
Re: 028test10.1 same sda error on 3 of 5 machines [message #12444 is a reply to message #11391] |
Sat, 28 April 2007 10:36 |
|
Hi!
I'll try the new stable RHEL5 kernel on RHEL4 (CentOS) and will report if the problem is still there.
At least the normal 2.6.18-ovz releases still have the same problem. Also kernel 2.6.20 does not work on that machines btw. I'll post more about that in another thread.
Bernhard
|
|
|
|
|
|