OpenVZ Forum


Home » General » Support » checkpoint-restore fails on undump
checkpoint-restore fails on undump [message #42440] Mon, 18 April 2011 14:45 Go to next message
astnwt is currently offline  astnwt
Messages: 5
Registered: April 2011
Junior Member
Hey there Smile

We are using openVZ on different HNs which all mount /var/lib/vz from
a shared storage to avoid long migration times and allow us to do fast
HN-to-HN transitions. All CTs have veth devices that are bridged to VLANs on the HN.

However, we are not able to resume a machine that's been suspended
and dumped, since the undump command will not succeed.

Kernel: 2.6.32-5-openvz-amd64 #1 SMP Mon Mar 7 22:25:57 UTC 2011 x86_64 GNU/Linux
HN: Debian 6.0.1 (latest patches)
CT: Debian 6.0.1 (latest patches).


What we actually do:
Example: Moving a CT from HN-A to HN-B

HN-A# vzctl chkpnt 2000 --suspend
Setting up checkpoint...
	suspend...
	get context...
Checkpointing completed succesfully

HN-A# vzctl chkpnt 2000 --dump --dumpfile /vz/dump/2000.dump
Setting up checkpoint...
	join context..
	dump...
Checkpointing completed succesfully

HN-A# vzctl chkpnt 2000 --kill
Killing...

HN-B# vzctl restore 2000 --undump --dumpfile /vz/dump/2000.dump
Restoring container ...
Starting container ...
VPS_MOUNT_SCRIPT: will bridge veth2000.0 to br100 as soon as CT is in running state..
Container is mounted
        undump...
Setting CPU units: 1000
Configure veth devices: veth2000.0 
Error: undump failed: No such file or directory
Restoring failed:
Error: can't open file /var/run/apache2/.nfs000000000054309600000037
Error: rst_file: -2 58400
Error: rst_files: -2
Error: make_baby: -2
Error: rst_clone_children
Container start failed
Stopping container ...
Container was stopped
Container is unmounted



/usr/lib/vz (aka. /vz) is mounted from the same storage share, so the dump
file written to /vz/dump is available on all three HNs after performing the dump command.

Could the veth device be the problem as it will be created at CT startup?
We would really appreciate any help on this.

All the best,

creo
Re: checkpoint-restore fails on undump [message #42490 is a reply to message #42440] Thu, 21 April 2011 20:23 Go to previous message
astnwt is currently offline  astnwt
Messages: 5
Registered: April 2011
Junior Member
Sorry for the double-posting guys,
the posting that describes the problem best with our latest information is here:

"undump fails because of .nfs file in Container" (Topic ID 42451)

Maybe a mod wants to delete this one Smile
I'll update the other posting with what we found out up 'til now.
Previous Topic: openvz cpanel high usage
Next Topic: undump fails because of .nfs file in Container
Goto Forum:
  


Current Time: Sun Jul 14 19:29:56 GMT 2024

Total time taken to generate the page: 0.03014 seconds