| Checkpoint bug [message #8818] | 
			Thu, 07 December 2006 10:17   | 
		 
		
			
				
				
				
					
						  
						dagr
						 Messages: 83 Registered: February 2006 
						
					 | 
					Member  | 
					 | 
		 
		 
	 | 
 
	
		Again i get strange results with checkpointing 
 
2.6.9-023stab032 SMP - RHEL4 x86 - vzctl version 3.0.13   
 
////////////////////////////////////////////////// 
[dagr@ws-ca dagr]$  sudo vzctl start 555  
Starting VE ...                           
VE is mounted                                                     
Adding IP address(es): 10.0.0.55                                  
Setting CPU units: 1000                                           
Set hostname: 10.0.0.55                                           
VE start in progress...                                           
[dagr@ws-ca dagr]$  sudo vzctl chkpnt 555 --suspend               
Setting up checkpoint...                                          
        suspend...                                                
        get context...                                            
Checkpointing completed succesfully                               
[dagr@ws-ca dagr]$  sudo vzctl chkpnt 555 --dump --dumpfile ./img 
Setting up checkpoint...                                          
        join context..                                            
        dump...                                                   
Checkpointing completed succesfully                               
[dagr@ws-ca dagr]$  sudo vzctl status 555        
VEID 555 exist mounted running                   
[dagr@ws-ca dagr]$  sudo vzctl chkpnt 555 --kill 
Killing...                                       
[dagr@ws-ca dagr]$  sudo vzctl status 555        
VEID 555 exist mounted down          
[dagr@ws-ca dagr]$ sudo vzctl restore 555   --undump --dumpfile ./img 
Restoring VE ...                                                      
Starting VE ...                                                       
VE is mounted                                                         
        undump...                                                     
Adding IP address(es): 10.0.0.55                                      
Setting CPU units: 1000                                               
        get context...                                                
VE start in progress...                                               
Restoring completed succesfully                                       
[dagr@ws-ca dagr]$  sudo vzctl enter 555                              
enter into VE 555 failed                                              
[dagr@ws-ca dagr]$  sudo vzctl stop 555                               
Stopping VE ...                                                       
Unable to stop VE: operation timed out                                            
 //////////////////////////////////////////////////////////// // 
After this there is no way to stop VPS without rebooting HN. Also  /etc/init.d/vz stop is not able to stop it and just continuosly tries , hanging the whole server, so  only hard reset helps !! 
After reboot vps is ok to run, start,stop,enter , until next try to dump and restore - i checked , its "restore" section which make it hang. 
 //////////////////////////////////////////////////////////// // 
Before that i tried to restore dump on another hn - an got this  
 
Adding IP address(es): 10.0.0.55                  
Setting CPU units: 1000                           
Error: undump failed: No such file or directory   
Restoring failed:                                 
rst_file: -2 20496                                
rst_files: -2                                     
make_baby: -2                                     
rst_clone_children                                
VE start failed                                   
Stopping VE ...                                   
VE was stopped                                    
VE is unmounted    
 
/////////////////////////////////////////// 
Same error if i do all on same hn , but dont kill after making dump , but resume and then stop . Also i noticed that after killing - it doesnt umounts VPS , tried also umount after killing  - didnt help .  
 
/////////////////////////////////////////////////////// 
 
At least tell me - if the any way top stop it without restarting the HN ? 
  
 
 
 
		
		
		
 |  
	| 
		
	 | 
 
 
 | 
	| 
		
 | 
	| 
		
 | 
	| 
		
 | 
	| 
		
 | 
	| 
		
 | 
	| 
		
 | 
	| 
		
 | 
	| 
		
 | 
	| 
		
 |