OpenVZ Forum


load spikes to 150-290 on HN [message #28060] Fri, 07 March 2008 11:52
sara3
Messages: 38
Registered: February 2008
Member
Hello,
I have sudden, unexplained load spikes on the hardware node.
Here are my beancounters:

cat /proc/user_beancounters
Version: 2.5
       uid  resource           held    maxheld    barrier      limit    failcnt
      136:  kmemsize        5838686   10451272 2147483647 2147483647          0
            lockedpages           0          0 2147483647 2147483647          0
            privvmpages       94335     120403 2147483647 2147483647          0
            shmpages            219        731 2147483647 2147483647          0
            dummy                 0          0          0          0          0
            numproc              62        134        267        268          0
            physpages         29767      48439          0 2147483647          0
            vmguarpages           0          0       6144 2147483647          0
            oomguarpages      43634      53789       6144 2147483647          0
            numtcpsock           28         66 2147483647 2147483647          0
            numflock              6         13 2147483647 2147483647          0
            numpty                0          0        255        255          0
            numsiginfo            5         22       1024       1024          0
            tcpsndbuf        274256     926936 2147483647 2147483647          0
            tcprcvbuf        458752     696492 2147483647 2147483647          0
            othersockbuf      17760     605356 2147483647 2147483647          0
            dgramrcvbuf           0      11960   47483647  147483647          0
            numothersock         18         49 2147483647 2147483647          0
            dcachesize       852338     900178 2147483647 2147483647          0
            numfile            2982       4502 2147483647 2147483647          0
            dummy                 0          0          0          0          0
            dummy                 0          0          0          0          0
            dummy                 0          0          0          0          0
            numiptent           398        400        400        400        245
      138:  kmemsize        6259105   21805746 2147483647 2147483647          0
            lockedpages           0          0 2147483647 2147483647          0
            privvmpages       62541     240161 2147483647 2147483647          0
            shmpages             14         14 2147483647 2147483647          0
            dummy                 0          0          0          0          0
            numproc              74        328      32567      32567          0
            physpages         14785     121721          0 2147483647          0
            vmguarpages           0          0       6144 2147483647          0
            oomguarpages      22566     122026       6144 2147483647          0
            numtcpsock           38        228 2147483647 2147483647          0
            numflock             10         95 2147483647 2147483647          0
            numpty                0          1        255        255          0
            numsiginfo            0        150       1024       1024          0
            tcpsndbuf        459540    2982872 2147483647 2147483647          0
            tcprcvbuf        622592    3777732 2147483647 2147483647          0
            othersockbuf      13320    1177960 2147483647 2147483647          0
            dgramrcvbuf           0      82080   47483647  147483647          0
            numothersock         16        281 2147483647 2147483647          0
            dcachesize       375978     592910 2147483647 2147483647          0
            numfile            2995       7685 2147483647 2147483647          0
            dummy                 0          0          0          0          0
            dummy                 0          0          0          0          0
            dummy                 0          0          0          0          0
            numiptent            33         33        400        400          0
      144:  kmemsize        7330763   39453623  182802841  201083125          0
            lockedpages           0          0       8925       8925          0
            privvmpages       40093     654747     305911     336502      33610
            shmpages            149        805      30591      30591          0
            dummy                 0          0          0          0          0
            numproc              82        485       8000       8000          0
            physpages         19434     228719          0 2147483647          0
            vmguarpages           0          0     305911 2147483647          0
            oomguarpages      19509     232475     305911 2147483647          0
            numtcpsock          103        298       8000       8000          0
            numflock             56        377       1000       1100          0
            numpty                0          1        512        512          0
            numsiginfo           11        387       1024       1024          0
            tcpsndbuf        959040    3285600   28166280   60934280          0
            tcprcvbuf       1303748    3583756   28166280   60934280          0
            othersockbuf       2220    2862648   14083140   46851140          0
            dgramrcvbuf           0      10416   14083140   14083140          0
            numothersock          2        342       8000       8000          0
            dcachesize       279314    1100958   39924069   41121792          0
            numfile            2461      14010      71392      71392          0
            dummy                 0          0          0          0          0
            dummy                 0          0          0          0          0
            dummy                 0          0          0          0          0
            numiptent            33         48        200        200          0
      137:  kmemsize        2233967    6569195 2147483647 2147483647          0
            lockedpages           0          0 2147483647 2147483647          0
            privvmpages       24608      41516 2147483647 2147483647          0
            shmpages              0          0 2147483647 2147483647          0
            dummy                 0          0          0          0          0
            numproc              22         76      32567      32567          0
            physpages          4661      21212          0 2147483647          0
            vmguarpages           0          0       6144 2147483647          0
            oomguarpages      11166      23002       6144 2147483647          0
            numtcpsock            9         34 2147483647 2147483647          0
            numflock              6         63 2147483647 2147483647          0
            numpty                0          1        255        255          0
            numsiginfo            0          3       1024       1024          0
            tcpsndbuf         91020     435120 2147483647 2147483647          0
            tcprcvbuf        147456          0 2147483647 2147483647          0
            othersockbuf      17760      73944 2147483647 2147483647          0
            dgramrcvbuf           0      26640   47483647  147483647          0
            numothersock         12         38 2147483647 2147483647          0
            dcachesize       110152     196700 2147483647 2147483647          0
            numfile            1372       2894 2147483647 2147483647          0
            dummy                 0          0          0          0          0
            dummy                 0          0          0          0          0
            dummy                 0          0          0          0          0
            numiptent            14         14        400        400          0
      139:  kmemsize        4940371   13141172 2147483647 2147483647          0
            lockedpages           0          0 2147483647 2147483647          0
            privvmpages       54594     138526 2147483647 2147483647          0
            shmpages            223       2159 2147483647 2147483647          0
            dummy                 0          0          0          0          0
            numproc              65        177      32567      32567          0
            physpages          9785      75007          0 2147483647          0
            vmguarpages           0          0       6144 2147483647          0
            oomguarpages      19144      75041       6144 2147483647          0
            numtcpsock           29        116 2147483647 2147483647          0
            numflock              8         12 2147483647 2147483647          0
            numpty                0          5        255        255          0
            numsiginfo            0         94       1024       1024          0
            tcpsndbuf        404040    2069040 2147483647 2147483647          0
            tcprcvbuf        475136    2457580 2147483647 2147483647          0
            othersockbuf      39532     223184 2147483647 2147483647          0
            dgramrcvbuf           0      26640   47483647  147483647          0
            numothersock         30         77 2147483647 2147483647          0
            dcachesize       381036     486200 2147483647 2147483647          0
            numfile            2436       3530 2147483647 2147483647          0
            dummy                 0          0          0          0          0
            dummy                 0          0          0          0          0
            dummy                 0          0          0          0          0
            numiptent            33         33        400        400          0
      149:  kmemsize        3370556    3984508   30138368   3
...

Re: load spikes to 150-290 on HN [message #28061 is a reply to message #28060] Fri, 07 March 2008 12:15
ugob
Messages: 271
Registered: March 2007
Senior Member
What is top showing when that happens?

Are you doing a suspend or backup at this time? Is it always around the same time?

Is the server slowed down a lot during this period?


Please read the manual before asking questions:
http://download.openvz.org/doc/OpenVZ-Users-Guide.pdf

Please have a look at the wiki before asking questions:
http://wiki.openvz.org/Main_Page
Re: load spikes to 150-290 on HN [message #28063 is a reply to message #28061] Fri, 07 March 2008 12:21
sara3
Hello,
thanks a lot for your fast reply.
1- top showed no abnormal processes, but the task count was above 1300; normally it is around 700-800
2- no backup, cp, tar, gzip or any other process that could consume a lot of resources by itself was running
3- yes, the hardware node and all VPSes slow down
Re: load spikes to 150-290 on HN [message #28064 is a reply to message #28063] Fri, 07 March 2008 12:25
ugob
Which processes use the most CPU? Around what time?

What does 'vmstat 5 5' show? Anything in the 'swap' columns?


Re: load spikes to 150-290 on HN [message #28067 is a reply to message #28064] Fri, 07 March 2008 12:29
sara3
Hi again,
thanks for your help.
1- I cannot see any single process that consumes all the CPU, but it seems that one or two VEs suddenly begin to have 300 processes running at once
2- free RAM at that time was 250 MB and swap usage was about 500 MB
3- I was not running vmstat 5 5 at that time
Waiting for your follow-up :)
Re: load spikes to 150-290 on HN [message #28068 is a reply to message #28067] Fri, 07 March 2008 12:34
ugob
OK, and what kind of processes are those VEs suddenly spawning? You could do a vzctl exec or use vzps (available from download.openvz.org).

You could use a script like this:

============
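# List the processes of every running VE, one VE after another.
for i in $(vzlist -o veid -H)
do
    echo "VPS $i"
    # BSD-style "ps aux"; procps prints a warning for the dashed form "ps -aux"
    vzctl exec "$i" ps aux
done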
============
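
A full ps listing for every VE can be long. As a rough sketch of the same idea, the helper below (the name top_commands is made up for illustration) only counts command names, so a VE that suddenly runs hundreds of copies of one program stands out. It assumes a POSIX shell with awk, sort and head on the HN.

```shell
#!/bin/sh
# Count how often each command name appears on stdin (one name per line)
# and print the N most frequent, highest count first. N defaults to 5.
top_commands() {
    awk 'NF { n[$1]++ } END { for (c in n) print n[c], c }' |
        sort -k1,1nr -k2,2 |
        head -n "${1:-5}"
}

# On the HN this could be fed from each running VE, for example:
#   for i in $(vzlist -o veid -H); do
#       echo "VE $i"; vzctl exec "$i" ps -eo comm= | top_commands
#   done
```

Here "ps -eo comm=" prints only the command name of each process, without a header line.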

Also, you should use sar and read sar reports. What distro is the HN?


Re: load spikes to 150-290 on HN [message #28069 is a reply to message #28068] Fri, 07 March 2008 13:15
sara3
Hello,
I have CentOS 4.4 on the HN.
I can also assure you there are no abnormal processes in the VEs.
When I see a high CPU load average, I go to the VE that causes the issue and find nothing strange.

Please look at my beancounters: do they say anything, or do you have any advice on how to tweak this?
I had a similar problem that was fixed when I set cpulimit to zero, but now it is zero on all VEs and it still doesn't help.
Re: load spikes to 150-290 on HN [message #28072 is a reply to message #28069] Fri, 07 March 2008 13:26
ugob
Well you should:

- Raise numiptent on 136
- Raise privvmpages on 144 (do you know why it is allocating that much memory?)
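
Both counters are the ones with a non-zero failcnt in the beancounters posted at the top of the thread. A minimal sketch for spotting such rows without reading the whole table follows; the function name ubc_failures is hypothetical, and the script assumes the seven-column layout shown in that output.

```shell
#!/bin/sh
# List every beancounter whose failcnt (last column) is above zero.
# Reads the file named in $1, or /proc/user_beancounters when none is given.
ubc_failures() {
    awk '
        # A row that opens a new VE carries an extra leading "uid:" field.
        { off = ($1 ~ /^[0-9]+:$/) ? 1 : 0 }
        off { veid = substr($1, 1, length($1) - 1) }
        # Resource rows have exactly six fields after the optional uid.
        NF == 6 + off && $NF ~ /^[0-9]+$/ && $NF + 0 > 0 {
            printf "VE %s: %s failcnt=%s held=%s limit=%s\n",
                   veid, $(1 + off), $NF, $(2 + off), $(5 + off)
        }
    ' "${1:-/proc/user_beancounters}"
}
```

Run as root on the HN, a plain "ubc_failures" call reads the live counters. Since failcnt only ever grows, comparing two runs shows which limits are still being hit.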

You should run 'vzmemcheck -vA' on the HN when high load happens. You'll see how much memory is used by each VE.

How much RAM do you have?

If not installed, install sysstat and wait one day, then go into /var/log/sa/ to see the performance reports per date (sarXX).
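
Once sysstat has collected data, the load history can also be read back with "sar -q". The sketch below is only an illustration: the helper name peak_load is invented, and it assumes the text output has a header row containing a column called ldavg-1, which it uses to find the 1-minute load average.

```shell
#!/bin/sh
# Read "sar -q" text output on stdin and report the highest 1-minute
# load average together with the time of the sample.
peak_load() {
    awk '
        # The header row names the columns; remember where ldavg-1 sits.
        { for (i = 1; i <= NF; i++) if ($i == "ldavg-1") { col = i; next } }
        # Data rows: ignore the "Average:" summary and keep the maximum.
        col && NF >= col && $1 != "Average:" && $col + 0 > max {
            max = $col + 0
            when = $1 (($2 == "AM" || $2 == "PM") ? " " $2 : "")
        }
        END { if (col) printf "peak ldavg-1 %.2f at %s\n", max, when }
    '
}

# Possible use on the HN, where 07 stands for the day of the month:
#   sar -q -f /var/log/sa/sa07 | peak_load
```

Knowing the time of the peak makes it easier to match the spike against the Apache logs and cron jobs inside the VEs.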

You're saying that some VEs have 300+ processes when it happens. It is important to see what kind of processes they are. httpd? sendmail? java?


Re: load spikes to 150-290 on HN [message #28082 is a reply to message #28072] Fri, 07 March 2008 17:16
sara3
Hello,
I have been hit again by the high load, which reached 200, and I couldn't do anything in the VE because of the "-bash: fork: Cannot allocate memory" error.
The attached file contains the output of top, ps auxf, vzmemcheck -vA and vmstat 5 5.

The problem was solved when I restarted the VPS, but these high load spikes happen too many times per day. Please advise.

  • Attachment: log.txt
    (Size: 161.85KB)
Re: load spikes to 150-290 on HN [message #28090 is a reply to message #28082] Sat, 08 March 2008 03:21
ugob
In which VE? 144? It looks like it is spawning many processes, and they are using up all the memory available for the VE.

Re: load spikes to 150-290 on HN [message #28094 is a reply to message #28090] Sat, 08 March 2008 10:06
sara3
Yes,
VE 144 is causing the problem.
Suddenly its NPROC in vzlist goes over 300 and I cannot enter it because of the "fork: Cannot allocate memory" error.
At that time the load average climbs to 150-300, and top on the HN shows a lot of apache processes and a total task count above 1300, while the normal count all day long is only 700-800. I also see that it is using swap. The load returns to normal only when I restart the VE, and even that does not work the first time: the first stop attempt times out, and the VE only responds to the stop command the second time.

Please help: this issue forces me to stay in front of the PC 24/7, and it happens 1-2 times every 3 hours.
Re: load spikes to 150-290 on HN [message #28100 is a reply to message #28094] Sat, 08 March 2008 15:08
ugob
The first step is to find out whether you can increase privvmpages for this VE. Try vzsplit -n1 | grep PRIVVM and let us know what you get.

If you can't, consider increasing the physical RAM in the server.

At the same time, you should try to find what is causing this. You could start by using tail -f /var/log/httpd/access_log to see what is hitting your server.

Finally, you could try to tweak apache so that you have a max number of apache processes, or use threads instead of forks (doesn't work with php, though, I think), or reduce the memory footprint of the individual httpd processes.

What is 144 running exactly?
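
For the "max number of apache processes" suggestion, a common rule of thumb is to divide the memory you are willing to give httpd by the size of one httpd child and use the result as an upper bound for the prefork MaxClients directive. The sketch below only does that division. The function name max_clients and both input numbers are assumptions for illustration; the real per-child size has to be measured inside the VE.

```shell
#!/bin/sh
# Rough upper bound for Apache's MaxClients directive:
# memory budget for httpd (MiB) divided by the size of one httpd child (MiB).
max_clients() {
    budget_mib=$1
    per_child_mib=$2
    if [ "$per_child_mib" -le 0 ]; then
        echo "per-child size must be positive" >&2
        return 1
    fi
    echo $(( budget_mib / per_child_mib ))
}

# Example with assumed numbers: a 1000 MiB budget and 25 MiB per child.
max_clients 1000 25
```

With these assumed inputs the script prints 40. One way to estimate the per-child size is "ps -C httpd -o rss=" inside the VE, keeping in mind that RSS counts shared pages in every child and therefore overstates the real cost.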


Re: load spikes to 150-290 on HN [message #28101 is a reply to message #28100] Sat, 08 March 2008 15:26
sara3
Hello,
thanks for your follow-up.

# vzsplit -n1 | grep PRIVVM
PRIVVMPAGES="466631:513294"


The VE is running one vB forum that has 150-250 or fewer users online.


The Apache access log has this:
127.0.0.1 - - [08/Mar/2008:17:23:56 +0200] "OPTIONS * HTTP/1.0" 200 -
127.0.0.1 - - [08/Mar/2008:17:24:17 +0200] "OPTIONS * HTTP/1.0" 200 -
127.0.0.1 - - [08/Mar/2008:17:24:18 +0200] "OPTIONS * HTTP/1.0" 200 -
127.0.0.1 - - [08/Mar/2008:17:24:19 +0200] "OPTIONS * HTTP/1.0" 200 -
127.0.0.1 - - [08/Mar/2008:17:24:20 +0200] "OPTIONS * HTTP/1.0" 200 -
127.0.0.1 - - [08/Mar/2008:17:24:21 +0200] "OPTIONS * HTTP/1.0" 200 -
127.0.0.1 - - [08/Mar/2008:17:24:22 +0200] "OPTIONS * HTTP/1.0" 200 -
127.0.0.1 - - [08/Mar/2008:17:24:23 +0200] "OPTIONS * HTTP/1.0" 200 -
127.0.0.1 - - [08/Mar/2008:17:25:20 +0200] "OPTIONS * HTTP/1.0" 200 -
127.0.0.1 - - [08/Mar/2008:17:25:34 +0200] "OPTIONS * HTTP/1.0" 200 -

Re: load spikes to 150-290 on HN [message #28102 is a reply to message #28101] Sat, 08 March 2008 15:54
ugob
Ok, what is your current value for PRIVVMPAGES for 144? How much RAM do you have on the HN?

What package is running the forum? phpbb?


Re: load spikes to 150-290 on HN [message #28103 is a reply to message #28102] Sat, 08 March 2008 16:23
sara3
Thanks again for your reply, I do appreciate your help.

1- the forum software is vBulletin
2- the HN has 4 GB of memory in total
3- ve 144 has PRIVVMPAGES="305911:336502"
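
For orientation, privvmpages is counted in memory pages, so the barrier:limit pairs quoted in this thread can be turned into MiB. The sketch below assumes 4 KiB pages, which is the usual size on x86 hardware; the function name pages_to_mib is made up for this example.

```shell
#!/bin/sh
# Convert a beancounter value given in memory pages to MiB,
# assuming 4 KiB pages (the usual x86 page size).
pages_to_mib() {
    echo $(( $1 * 4 / 1024 ))
}

# Current barrier:limit of VE 144, then the pair suggested by vzsplit -n1.
for pages in 305911 336502 466631 513294; do
    echo "$pages pages = $(pages_to_mib "$pages") MiB"
done
```

Under that assumption VE 144 currently has a barrier of about 1194 MiB and a limit of about 1314 MiB, while the vzsplit figures correspond to about 1822 and 2005 MiB. The usual way to apply a new pair is "vzctl set 144 --privvmpages BARRIER:LIMIT --save". How much can be given to 144 depends on what the other VEs on the 4 GB node need.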
Re: load spikes to 150-290 on HN [message #28236 is a reply to message #28103] Wed, 12 March 2008 11:43
sara3
Hello,
could anybody please help?
Re: load spikes to 150-290 on HN [message #28238 is a reply to message #28236] Wed, 12 March 2008 12:17
ugob
Have you taken a look at:

http://forum.openvz.org/index.php?t=rview&goto=28236#msg_28100


Re: load spikes to 150-290 on HN [message #28336 is a reply to message #28238] Fri, 14 March 2008 10:13
sara3
Yes,
and I replied in http://forum.openvz.org/index.php?t=msg&th=5620&goto=28100#msg_28100
Re: load spikes to 150-290 on HN [message #28339 is a reply to message #28060] Fri, 14 March 2008 10:32
ugob
I made 3 suggestions in this post, and you only replied to questions.

You can increase the RAM.

You can increase the PRIVVMPAGES setting for 144.

You can check in the apache logs of 144 to see if something is hammering apache indeed. A web log analyser like awstats or webalizer could help.
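
Short of installing a log analyser, a quick first look at the access log is possible with awk. The sketch below counts requests per minute and prints the busiest minutes. The function name busiest_minutes is hypothetical, and it assumes log lines in the format posted earlier in the thread, where the fourth field starts with "[08/Mar/2008:17:23:56".

```shell
#!/bin/sh
# Count requests per minute in an Apache access log ($1) and print the
# N busiest minutes, highest count first. N ($2) defaults to 5.
busiest_minutes() {
    awk '
        # Field 4 looks like "[08/Mar/2008:17:23:56"; keep day, hour, minute.
        $4 ~ /^\[/ { hits[substr($4, 2, 17)]++ }
        END { for (m in hits) print hits[m], m }
    ' "$1" | sort -k1,1nr -k2,2 | head -n "${2:-5}"
}

# Possible use inside VE 144:
#   busiest_minutes /var/log/httpd/access_log 10
```

If the busiest minutes line up with the load spikes, the requests logged in those minutes are the place to look next.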

