Checkpoint bug [message #8818] |
Thu, 07 December 2006 10:17 |
dagr
Messages: 83 Registered: February 2006
|
Member |
|
|
Again i get strange results with checkpointing
2.6.9-023stab032 SMP - RHEL4 x86 - vzctl version 3.0.13
//////////////////////////////////////////////////
[dagr@ws-ca dagr]$ sudo vzctl start 555
Starting VE ...
VE is mounted
Adding IP address(es): 10.0.0.55
Setting CPU units: 1000
Set hostname: 10.0.0.55
VE start in progress...
[dagr@ws-ca dagr]$ sudo vzctl chkpnt 555 --suspend
Setting up checkpoint...
suspend...
get context...
Checkpointing completed succesfully
[dagr@ws-ca dagr]$ sudo vzctl chkpnt 555 --dump --dumpfile ./img
Setting up checkpoint...
join context..
dump...
Checkpointing completed succesfully
[dagr@ws-ca dagr]$ sudo vzctl status 555
VEID 555 exist mounted running
[dagr@ws-ca dagr]$ sudo vzctl chkpnt 555 --kill
Killing...
[dagr@ws-ca dagr]$ sudo vzctl status 555
VEID 555 exist mounted down
[dagr@ws-ca dagr]$ sudo vzctl restore 555 --undump --dumpfile ./img
Restoring VE ...
Starting VE ...
VE is mounted
undump...
Adding IP address(es): 10.0.0.55
Setting CPU units: 1000
get context...
VE start in progress...
Restoring completed succesfully
[dagr@ws-ca dagr]$ sudo vzctl enter 555
enter into VE 555 failed
[dagr@ws-ca dagr]$ sudo vzctl stop 555
Stopping VE ...
Unable to stop VE: operation timed out
//////////////////////////////////////////////////////////// //
After this there is no way to stop VPS without rebooting HN. Also /etc/init.d/vz stop is not able to stop it and just continuosly tries , hanging the whole server, so only hard reset helps !!
After reboot vps is ok to run, start,stop,enter , until next try to dump and restore - i checked , its "restore" section which make it hang.
//////////////////////////////////////////////////////////// //
Before that i tried to restore dump on another hn - an got this
Adding IP address(es): 10.0.0.55
Setting CPU units: 1000
Error: undump failed: No such file or directory
Restoring failed:
rst_file: -2 20496
rst_files: -2
make_baby: -2
rst_clone_children
VE start failed
Stopping VE ...
VE was stopped
VE is unmounted
///////////////////////////////////////////
Same error if i do all on same hn , but dont kill after making dump , but resume and then stop . Also i noticed that after killing - it doesnt umounts VPS , tried also umount after killing - didnt help .
///////////////////////////////////////////////////////
At least tell me - if the any way top stop it without restarting the HN ?
|
|
|
|
|
|
|
|
|
|
|
|