Free tool for displaying useful uninteruptible process information [message #2848] |
Sun, 23 April 2006 17:40 data:image/s3,"s3://crabby-images/02509/0250970494a0ed54d28de4a7129473ea6b156d86" alt="Go to next message Go to next message" |
RapidVPS
Messages: 32 Registered: January 2006
|
Member |
|
|
Source code: https://www.rapidvps.com/devel/unint_scan-1.5.tar.gz
Uninteruptible process scanner/display
Introduction
------------
This is version 1.5 of unint_scan, a console application which monitors uninteruptible processes and displays the responsible VEID on a Virtuozzo server in real time. From my experience, when the number of unint processes grows beyond ~30 and stay unint for longer than 3-10 seconds, the system performance will be hindered as the system load increases in proportion to the number of unint processes. 100 unint processes older than 3-10 seconds will result in the system load being 100 + avg, from my experience, causing the system performace to crawl.
unint_scan is a tool which locates uninteruptible process, and displays the VEID and process name responsible. What you do to fix the problem of unints (stop the veid, disable the program) is your choice. Often some applications will cause heavy disk I/O. This will not solve that problem but at least may display the responsible VEID, pid, and process name which will allow you to take reactive measures against the problem.
Requirements:
------------
-server running VZ/OpenVZ Kernel tested on 2.4.20-021stab028.17.777-enterprise and 2.6.8-022stab070.1-enterprise (should work on any vz kernel)
-c compiler (gcc)
-ncurses library and header files (ncurses,ncurses-devel packages)
#ie Redhat/Fedora/CentOS:
#yum install ncurses ncurses-devel
Example output:
------------
........................................
. Unint Processes: 3 .
. OpenVZ Detected .
. Load: 3.47 3.61 3.60 .
. Time: 1:52:29 .
. PRESS q TO END .
. .
. PID VEID PROGRAM .
. 2736 1024 mysqld .
. 283 1 qmail-send .
. 29362 0 syslogd .
. .
. .
. .
........................................
In this case, pids 2736, 283, and 29362 are in the state of unint. VEID of 0 is the host system. Although this does not tell you what the overall problem is (if one exists), you can spot problem VEs quite quickly. If the load is higher than average and one VEID has 5+ unint processes, this VE may have a software problem. Also, the PROGRAM column is very important, as it will tell you what program is caught in the unint state.
-Rick
[Updated on: Sun, 23 April 2006 20:55] by Moderator Report message to a moderator
|
|
|
|
|
|