I have a host node (HN) running 20 VEs.
From time to time the HN just hangs: no ping, no SSH response. I have to power-cycle it to bring it back.
It's running kernel 2.6.18-128.2.1.el5.028stab064.4PAE on CentOS 5.2.
As you can see from /var/log/messages below, the server doesn't respond from 10:49 until 11:25 (reboot time).
Where do I look? The same thing happens from time to time on 2-3 of my servers.
Sep 4 10:31:05 perun kernel: Fatal resource shortage: privvmpages, UB 6640.
Sep 4 10:31:05 perun kernel: Fatal resource shortage: privvmpages, UB 6640.
Sep 4 10:32:59 perun kernel: CT: 6830: stopped
Sep 4 10:33:23 perun kernel: CT: 6830: started
Sep 4 10:49:03 perun kernel: CT: 6830: stopped
Sep 4 10:49:12 perun kernel: CT: 6830: started
Sep 4 11:02:52 perun kernel: CT: 6830: stopped
Sep 4 11:03:23 perun kernel: CT: 6830: started
Sep 4 11:25:05 perun syslogd 1.4.1: restart.
Sep 4 11:25:05 perun kernel: klogd 1.4.1, log source = /proc/kmsg started.
Sep 4 11:25:05 perun kernel: Linux version 2.6.18-128.2.1.el5.028stab064.4PAE (root@rh5-build-x64) (gcc version 4.1.2 20071124 (Red Hat 4.1.2-42)) #1 SMP Wed Jul 22 00:38:32 MSD 2009
Sep 4 11:25:05 perun kernel: BIOS-provided physical RAM map:
Sep 4 11:25:05 perun kernel: **********************************************************
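
To see which containers hit limits before a hang, one option is to tally the "Fatal resource shortage" lines per UB. This is only a rough sketch: the function name count_shortages is made up for illustration, and the pattern assumes the messages look exactly like the two privvmpages lines in the excerpt above.

```python
import re
from collections import Counter

# Assumed message format, taken from the log excerpt:
#   "... kernel: Fatal resource shortage: privvmpages, UB 6640."
SHORTAGE = re.compile(r"Fatal resource shortage: (\w+), UB (\d+)")


def count_shortages(lines):
    """Count 'Fatal resource shortage' messages per (resource, UB id)."""
    counts = Counter()
    for line in lines:
        match = SHORTAGE.search(line)
        if match:
            counts[(match.group(1), int(match.group(2)))] += 1
    return counts


if __name__ == "__main__":
    # Path as used in the post; adjust if the log is rotated or stored elsewhere.
    with open("/var/log/messages", errors="replace") as log:
        for (resource, ub), n in count_shortages(log).most_common():
            print(f"UB {ub}: {n} x {resource}")
```

This only summarizes what the kernel already logged; it does not explain the hang itself, but it shows whether one container keeps running out of a resource shortly before the host stops responding.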