OpenVZ Forum


Home » General » Support » Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel (Can't use new released kernel on heavy loaded HNs.)
icon5.gif  Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel [message #42220] Sun, 20 March 2011 19:17 Go to next message
AnVir is currently offline  AnVir
Messages: 20
Registered: October 2009
Location: Russia
Junior Member

I did kernel upgrade on two production HN. Each is 8 cores intel 2.5 GHz, one has 16 GB ram with x86_64 CentOS 5.5, another has 8 GB ram with i686 CentOS 5.5 (enterprise kernel).
On the first, 64 bit HN started about 80 VEs, each vith 1-2 java process, total ~15000 threads on HN.
On the second, 32 bit HN started about 40 VEs, each vith 1 proccess of HLDS or SRCDS game server, total ~2500 threads.
Each HN has 2x500 GB SATA HDD without RAID of any kind.
Memory load is about 50%, so IOwait not raises more than 15% even in primetime.
Kernel was upgraded from 028stab081.1 to 028stab085.2.
About 60% of VEs started as always (with high IOwait, but still good disk IO responce time - it is usual during HN boot, while disk cache is filling up), then two of eight cores on each HN show me 100% IOwait state, and other six - 0% load at all. At the same time LA start to raise from usual 50-200 value to ~4000 (maybe, higher, but i reboot HN at this point, ~30 vinutes of uptime).
I tryed to perform some actions, for example, to find processes in "D..." state trough "ps axuww | grep ' D'". I got some free breath when I killed all syslogd and crond processes, but not for long, after 3-5 minutes system hang again with the same symtomths.
So, I was forced to revert kernels to 028stab081.1, did reboot - and both nodes started as usual, with low IOwait when disk cache was filled.

I have some specific settings on file systems, in sysctl (/proc) and disk scheduler, which I got by tuning up them for about a year on 12 hardware nodes with compareble load. I tryed to change them to much lower/higher values, hope to obtain the reason of this activity, and got no effect. Some of this settings you can see below:

In /etc/fstab:
/dev/sda3 /vz/private ext3 defaults,noatime,nodiratime,commit=29,data=journal 1 1
Commit interval on each FS is different, about 30 seconds, and it is simple number (divides only by 1 and by self: 23, 29, 31 etc).

In /etc/sysctl.conf:
kernel.vcpu_sched_timeslice = 5
kernel.vcpu_hot_timeslice = 4
kernel.fairsched-max-latency = 20
kernel.sched_interactive = 0
kernel.hung_task_timeout_secs = 60
kernel.max_lock_depth = 1024
kernel.sysrq = 0
kernel.exec-shield = 0
kernel.randomize_va_space = 0
kernel.pid_max = 65534
vm.swappiness = 1
vm.pagecache = 90
vm.vfs_cache_pressure = 1000
vm.flush_mmap_pages = 0
vm.dirty_background_ratio = 10
vm.dirty_writeback_centisecs = 3013
vm.dirty_expire_centisecs = 30031
vm.dirty_ratio = 30
vm.max_writeback_pages = 65536
I repeat: all of them i tried to change while solving the problem. It always worked fine, and more - it gains performance with my tasks.

In /etc/rc.local:
blockdev --setra 2048 /dev/sda
blockdev --setra 2048 /dev/sdb
echo 4096 > /sys/block/sda/queue/nr_requests
echo 4096 > /sys/block/sdb/queue/nr_requests
echo 'cfq' > /sys/block/sda/queue/scheduler
echo 'cfq' > /sys/block/sdb/queue/scheduler
Changing length of queue, RA or cfq scheduler to deadline or noop has no effect.

I'll post any additional information if you require any to help me. Now, I installed 028stab087.1 testing kernel on 64-bit node that I described above, and plan to boot it in next 2-3 hours. I see a lot of fixes in changelog - maybe, someone fixed my issue already... Anyway, thank you for any reply Smile

...and sorry for my ugly english spelling.


Welcome to xfes.ru OpenVZ repository mirror
Re: Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel [message #42223 is a reply to message #42220] Sun, 20 March 2011 21:05 Go to previous messageGo to next message
AnVir is currently offline  AnVir
Messages: 20
Registered: October 2009
Location: Russia
Junior Member

AnVir wrote on Sun, 20 March 2011 15:17
Now, I installed 028stab087.1 testing kernel on 64-bit node that I described above, and plan to boot it in next 2-3 hours. I see a lot of fixes in changelog - maybe, someone fixed my issue already...


So... It seems to be all right now.

top - 00:04:47 up 46 min, 2 users, load average: 260.67, 281.87, 204.49
Tasks: 1454 total, 1 running, 1416 sleeping, 0 stopped, 37 zombie
Cpu(s): 5.5%us, 3.3%sy, 0.0%ni, 50.0%id, 40.7%wa, 0.0%hi, 0.5%si, 0.0%st
Mem: 16352112k total, 10590872k used, 5761240k free, 609116k buffers
Swap: 8385888k total, 0k used, 8385888k free, 5706920k cached

$ uname -a
Linux xxxxxxx.ru 2.6.18-238.5.1.el5.028stab087.1 #1 SMP Wed Mar 16 23:55:12 MSK 2011 x86_64 x86_64 x86_64 GNU/Linux

I'll perform some tests and then write, how it works now.


Welcome to xfes.ru OpenVZ repository mirror
Re: Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel [message #42311 is a reply to message #42220] Wed, 30 March 2011 20:43 Go to previous messageGo to next message
AnVir is currently offline  AnVir
Messages: 20
Registered: October 2009
Location: Russia
Junior Member

Crying or Very Sad UP: problem is still actual.
Node is full operational on 2.6.18-238.5.1.el5.028stab087.1, but the same bug is on 2.6.18-238.5.1.el5.028stab085.3. Guys, it's really early to move it to stable, I checked it on very popular server platforms - Intel SR1630 and Intel SR1530, with different CPUs, 8-16 GB RAM and 32/64 bit archetivture. It always does weird IO load when ~60th VE with 1-2 java process started.
Please, do something... I'am relying on you Smile


Welcome to xfes.ru OpenVZ repository mirror
Re: Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel [message #42328 is a reply to message #42311] Fri, 01 April 2011 09:42 Go to previous messageGo to next message
meto is currently offline  meto
Messages: 1
Registered: April 2011
Location: Polska
Junior Member
I also encountered IO problems and almost started replacing drives. Now I'm running 2.6.18-238.5.1.el5.028stab088.1 and it seams fine now. I think, that one of newer kernel should be pushed to stable to fix that issue.

Here you have munin graph from container:
index.php?t=getfile&id=905&private=0
  • Attachment: cpu-week.png
    (Size: 41.86KB, Downloaded 1003 times)
Re: Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel [message #42330 is a reply to message #42220] Fri, 01 April 2011 10:18 Go to previous messageGo to next message
AnVir is currently offline  AnVir
Messages: 20
Registered: October 2009
Location: Russia
Junior Member

Ok, i'll try it too.
Also, sometimes LA raises to values below and even above total number of running processes. Now i am fixing that changing CPU scheduler interactivity 0 -> 2 and back 2 -> 0 after 15 minutes. I am not sure that this is the same bug... But if anyone sees such peaks - report, please.


Welcome to xfes.ru OpenVZ repository mirror
Re: Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel [message #42418 is a reply to message #42220] Thu, 14 April 2011 15:39 Go to previous messageGo to next message
seanfulton is currently offline  seanfulton
Messages: 105
Registered: May 2007
Senior Member
Anyone using 89.1 on this issue? Success?
sean
Re: Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel [message #42423 is a reply to message #42418] Fri, 15 April 2011 08:58 Go to previous messageGo to next message
AnVir is currently offline  AnVir
Messages: 20
Registered: October 2009
Location: Russia
Junior Member

028stab088.1 works fine, but it is testing kernel.
I'll check it only with new stable kernel, I don't want reboot production server too often.


Welcome to xfes.ru OpenVZ repository mirror
Re: Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel [message #42429 is a reply to message #42220] Sat, 16 April 2011 10:39 Go to previous messageGo to next message
koct9i is currently offline  koct9i
Messages: 51
Registered: February 2008
Member
Did you use cpu-limits? Then it is the bug which was fixed in 028stab087.1
Re: Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel [message #42966 is a reply to message #42220] Wed, 22 June 2011 21:48 Go to previous messageGo to next message
seanfulton is currently offline  seanfulton
Messages: 105
Registered: May 2007
Senior Member
Just a follow-up--are you guys running 91.1 or any of the newer stable kernels?

sean
Re: Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel [message #42967 is a reply to message #42429] Wed, 22 June 2011 22:49 Go to previous messageGo to next message
AnVir is currently offline  AnVir
Messages: 20
Registered: October 2009
Location: Russia
Junior Member

koct9i wrote on Sat, 16 April 2011 06:39
Did you use cpu-limits? Then it is the bug which was fixed in 028stab087.1

Sure we did. So, I think, it was scheduler-related issue.


Welcome to xfes.ru OpenVZ repository mirror
Re: Weird IOwait raising using 028stab085.2 x86_64 and i686 kernel [message #42968 is a reply to message #42966] Wed, 22 June 2011 22:56 Go to previous message
AnVir is currently offline  AnVir
Messages: 20
Registered: October 2009
Location: Russia
Junior Member

seanfulton, now I use current stable kernel on x86_64 with 11 HNs and it's all ok. There is at least two or three last stable kernels with this bug fixed.
2 openvz team: thank you, guys, you are the best Very Happy


Welcome to xfes.ru OpenVZ repository mirror
Previous Topic: Mount --bind CIFS OpenVZ - bad performance
Next Topic: Container networking breaks after a while
Goto Forum:
  


Current Time: Sat Jul 05 19:04:28 GMT 2025

Total time taken to generate the page: 0.02030 seconds