|
Re: high load spikes average of 80 [message #7220 is a reply to message #7219] |
Sat, 07 October 2006 14:46 |
HaroldB
Messages: 61 Registered: June 2006
|
Member |
|
|
This isn't a vz specific question, realistically you should read about how to monitor system resources and rates in the Linux OS. Obviously, some metric is exhausted. cpu power, memory, disk io, etc.
send us the output of:
vmstat 5 (let it run for a minute)
iostat -k 5 (run for a minute)
free -m
I can probably point you in the right direction.
If you are asking why one vps could be overrunning the entire node, send us the output of:
vzmemcheck -vA
vzcpucheck -v
[Updated on: Sat, 07 October 2006 14:50] Report message to a moderator
|
|
|
|
|
|
|
|
Re: high load spikes average of 80 [message #7227 is a reply to message #7226] |
Sat, 07 October 2006 17:54 |
mora
Messages: 37 Registered: October 2006
|
Member |
|
|
here is the readings when the load is high
vzmemcheck -vA
Output values in Mbytes
veid LowMem LowMem RAM MemSwap MemSwap Alloc Alloc Alloc
util commit util util commit util commit limit
108 4.54 8332.65 118.61 190.95 8356.65 688.22 8356.65 8396940.65
109 5.74 8237.28 254.52 318.75 8261.28 507.80 8261.28 8396845.28
------------------------------------------------------------ -------------
Summary: 10.28 16569.94 373.13 509.70 16617.94 1196.02 16617.94 16793785.93
326.00 326.00 976.00 4977.00 4977.00 4977.00 4977.00 4977.00
vzcpucheck -v
vpsid units
-----------------------
0 1000
109 100000
108 100000
Current CPU utilization: 201000
Power of the node: 319144
the top of thje hardware node says
- 13:56:34 up 23:48, 3 users, load average: 94.03, 93.03, 87.88
Tasks: 398 total, 63 running, 317 sleeping, 0 stopped, 18 zombie
Cpu(s): 41.3% us, 58.0% sy, 0.0% ni, 0.0% id, 0.0% wa, 0.3% hi, 0.3% si
Mem: 1000068k total, 942376k used, 57692k free, 20612k buffers
Swap: 4096532k total, 254020k used, 3842512k free, 357000k cached
the uptime for the second vps says
19:58:56 up 23:11, 0 users, load average: 32.91, 32.94, 30.75
the ps auxf for it says
9 ps auxf
USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
root 1 0.0 0.0 1624 512 ? Ss Oct06 0:00 init [3]
root 12057 0.0 0.0 1528 572 ? Ss Oct06 0:00 syslogd -m 0
root 12133 0.0 0.0 1456 404 ? S Oct06 0:00 /usr/sbin/courierlogger -pid=/var/spool/authdaemon/pid -facility=mail -start /usr/libexec/courier-authlib/authdaemond
root 12134 0.0 0.0 1800 480 ? S Oct06 0:00 \_ /usr/libexec/courier-authlib/authdaemond
root 12149 0.0 0.0 1800 580 ? S Oct06 0:00 \_ /usr/libexec/courier-authlib/authdaemond
root 5222 0.0 0.3 9828 3556 ? S 09:26 0:00 | \_ /etc/authlib/authProg
root 12150 0.0 0.0 1800 588 ? S Oct06 0:00 \_ /usr/libexec/courier-authlib/authdaemond
root 1951 0.0 0.3 9828 3352 ? S 13:23 0:00 | \_ /etc/authlib/authProg
root 12151 0.0 0.0 1800 552 ? S Oct06 0:00 \_ /usr/libexec/courier-authlib/authdaemond
root 1832 0.0 0.2 9828 2964 ? S 13:22 0:00 | \_ /etc/authlib/authProg
root 12153 0.0 0.0 1800 584 ? S Oct06 0:00 \_ /usr/libexec/courier-authlib/authdaemond
root 9266 0.0 0.6 9832 6932 ? S 18:41 0:00 | \_ /etc/authlib/authProg
root 12154 0.0 0.0 1800 532 ? S Oct06 0:00 \_ /usr/libexec/courier-authlib/authdaemond
nobody 12251 0.0 0.1 4356 1164 ? SLs Oct06 0:00 proftpd: (accepting connections)
root 17843 0.0 0.1 4016 1224 ? Ss Oct06 0:00 /usr/sbin/sshd
root 17905 0.0 0.2 9756 2536 ? S Oct06 0:01 chkservd
root 17916 0.0 0.0 1460 468 ? S Oct06 0:00 /usr/sbin/courierlogger -pid=/var/run/imapd.pid -start -name=imapd /usr/lib/courier-imap/libexec/couriertcpd -address=0 -maxprocs=40 -maxperip=30 -nodnslookup -noidentlookup 143 /usr/lib/courier-imap/sbin/imaplogin /usr/lib/courier-imap/bin/imapd Maildir
root 17917 0.0 0.0 1568 484 ? S Oct06 0:00 \_ /usr/lib/courier-imap/libexec/couriertcpd -address=0 -maxprocs=40 -maxperip=30 -nodnslookup -noidentlookup 143 /usr/lib/courier-imap/sbin/imaplogin /usr/lib/courier-imap/bin/imapd Maildir
root 17923 0.0 0.0 1452 340 ? S Oct06 0:00 /usr/sbin/courierlogger -pid=/var/run/imapd-ssl.pid -start -name=imapd-ssl /usr/lib/courier-imap/libexec/couriertcpd -address=0 -maxprocs=40 -maxperip=30 -nodnslookup -noidentlookup 993 /usr/lib/courier-imap/bin/couriertls -server -tcpd /usr/lib/courier-imap/sbin/imaplogin /usr/lib/courier-imap/bin/imapd Maildir
root 17924 0.0 0.0 1560 404 ? S Oct06 0:00 \_ /usr/lib/courier-imap/libexec/couriertcpd -address=0 -maxprocs=40 -maxperip=30 -nodnslookup -noidentlookup 993 /usr/lib/courier-imap/bin/couriertls -server -tcpd /usr/lib/courier-imap/sbin/imaplogin /usr/lib/courier-imap/bin/imapd Maildir
root 17929 0.0 0.0 1452 464 ? S Oct06 0:00 /usr/sbin/courierlogger -pid=/var/run/pop3d.pid -start -name=pop3d /usr/lib/courier-imap/libexec/couriertcpd -address=0 -maxprocs=40 -maxperip=30 -nodnslookup -noidentlookup 110 /usr/lib/courier-imap/sbin/pop3login /usr/lib/courier-imap/bin/pop3d Maildir
root 17930 0.0 0.0 1560 480 ? S Oct06 0:00 \_ /usr/lib/courier-imap/libexec/couriertcpd -address=0 -maxprocs=40 -maxperip=30 -nodnslookup -noidentlookup 110 /usr/lib/courier-imap/sbin/pop3login /usr/lib/courier-imap/bin/pop3d Maildir
root 17936 0.0 0.0 1452 340 ? S Oct06 0:00 /usr/sbin/courierlogger -pid=/var/run/pop3d-ssl.pid -start -name=pop3d-ssl /usr/lib/courier-imap/libexec/couriertcpd -address=0 -maxprocs=40 -maxperip=30 -nodnslookup -noidentlookup 995 /usr/lib/courier-imap/bin/couriertls -server -tcpd /usr/lib/courier-imap/sbin/pop3login /usr/lib/courier-imap/bin/pop3d Maildir
root 17937 0.0 0.0 1560 404 ? S Oct06 0:00 \_ /usr/lib/courier-imap/libexec/couriertcpd -address=0 -maxprocs=40 -maxperip=30 -nodnslookup -noidentlookup 995 /usr/lib/courier-imap/bin/couriertls -server -tcpd /usr/lib/courier-imap/sbin/pop3login /usr/lib/courier-imap/bin/pop3d Maildir
root 18105 0.0 0.3 9604 3520 ? Ss Oct06 0:10 /usr/local/apache/bin/httpd -DSSL
nobody 28001 0.1 1.1 17148 11988 ? S 13:50 0:40 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28002 0.2 1.2 17388 12116 ? S 13:50 0:45 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28003 0.1 1.2 17584 12456 ? S 13:50 0:40 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28004 0.1 1.2 17440 12276 ? S 13:50 0:37 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28005 0.1 1.2 16976 12024 ? S 13:50 0:36 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28007 0.1 1.1 17008 11844 ? S 13:50 0:37 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28008 0.1 1.2 17084 12152 ? S 13:50 0:28 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28009 0.2 1.2 17576 12580 ? S 13:50 0:52 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28015 0.1 1.2 17512 12440 ? S 13:50 0:41 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28016 0.1 1.2 17348 12056 ? S 13:50 0:41 \_ /usr/local/apache/bin/httpd -DSSL
nobody 19766 0.2 1.2 17252 12440 ? S 17:58 0:16 \_ /usr/local/apache/bin/httpd -DSSL
nobody 29910 0.0 0.8 13944 8792 ? S 19:09 0:01 \_ /usr/local/apache/bin/httpd -DSSL
nobody 17544 0.0 0.8 13088 8004 ? S 19:42 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 24016 0.0 0.5 10944 5800 ? S 19:52 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 25795 0.1 0.8 13736 8528 ? S 19:54 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 26095 0.6 0.8 13924 8732 ? S 19:54 0:01 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27671 0.2 0.6 11644 6624 ? S 19:56 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27673 0.6 0.8 13808 8620 ? S 19:56 0:01 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27674 0.0 0.7 12336 7188 ? S 19:56 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27800 0.5 0.8 13928 8716 ? S 19:56 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27809 0.0 0.3 9604 3704 ? S 19:56 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27814 0.1 0.8 13604 8276 ? S 19:56 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27969 0.4 0.8 13804 8552 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27976 0.2 0.8 13356 8100 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27977 0.0 0.6 12120 6792 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27979 0.4 0.8 13888 8688 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27980 0.0 0.3 9604 3704 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27981 0.8 0.8 13740 8540 ? S 19:57 0:01 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27982 0.5 0.7 13096 7892 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27983 0.7 1.1 16880 11676 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27984 0.4 0.7 12564 7312 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27985 0.0 0.3 9612 3752 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27986 0.4 0.9 14332 9012 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 27988 0.2 0.6 11304 6216 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28020 0.1 0.8 13436 8108 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28021 0.0 0.3 9604 3712 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28023 1.2 0.8 13968 8832 ? S 19:57 0:01 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28026 0.0 0.3 9604 3704 ? S 19:57 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28078 0.3 0.0 0 0 ? Z 19:57 0:00 \_ [httpd] <defunct>
nobody 28344 0.0 0.3 9604 3704 ? S 19:58 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28350 0.3 0.8 13488 8208 ? S 19:58 0:00 \_ /usr/local/apache/bin/httpd -DSSL
nobody 28351 0.8 0.8 14156 8904 ? S 19:58 0:00 \_ /usr/local/apache/bin/ht
...
[Updated on: Sat, 07 October 2006 18:03] Report message to a moderator
|
|
|
|
Re: high load spikes average of 80 [message #7230 is a reply to message #7228] |
Sat, 07 October 2006 19:54 |
mora
Messages: 37 Registered: October 2006
|
Member |
|
|
hello
after applying this and restarting vz and both vps i got this only after few seconds
Hardware Node
16:01:20 up 1:43, 4 users, load average: 22.78, 11.85, 6.71
VE 101
22:01:21 up 11 min, 0 users, load average: 1.13, 0.56, 0.34
VE 102
22:01:28 up 11 min, 0 users, load average: 20.34, 10.38, 4.86
also how can i gurantee 128 MB ram with maximum of 250 MB ram for each vps and how can i calculate how much Mb is their current usage ? i know i ask too many questions but many many thanks for your help
[Updated on: Sat, 07 October 2006 20:39] Report message to a moderator
|
|
|
|
|
|
Re: high load spikes average of 80 [message #7243 is a reply to message #7219] |
Mon, 09 October 2006 03:07 |
HaroldB
Messages: 61 Registered: June 2006
|
Member |
|
|
As I said before, and as Dev verified, the load value of the HN is not a primary indicator of the server being *slow*, overrun, overcommited, or generally a problem situation. Sinces your VEs have a low cpu limit, their running procs get queued. Since the load is just the number of processes that are "trying to run at the same time", this should seem normal. Think if one VE had a cpulimit of 2 (very very low) and launched 50 concurrent running processes. This would absolutely add 50 to your HN load while using very little cpu resources. In this case, despite your HN having a load of 50, your server is extremely responsive, the other VEs are uneffected, and overall there is no general problem.
You put the cfq scheduler definition in the grub.conf as a boot argument to your kernel.
|
|
|
|
|
|
|
|
|