| Home » Mailing lists » Devel » Container Test Campaign Goto Forum:
	| 
		
			| Container Test Campaign [message #3633] | Wed, 07 June 2006 14:20  |  
			| 
				
				
					|  Clement Calmels Messages: 11
 Registered: June 2006
 | Junior Member |  |  |  
	| Hello ! 
 I'm part of a team of IBMers working on lightweight containers and we
 are going to start a new test campaign. Candidates are vserver,
 vserver context, namespaces (being pushed upstream), openvz, mcr (our
 simple container dedicated to migration) and eventually xen.
 
 We will focus on the performance overhead but we are also interested in
 checkpoint/restart and live migration. A last topic would be how well
 the
 resource managment criteria are met, but that's extra for the moment.
 
 We plan on measuring performance overhead by comparing the results on
 a vanilla kernel with a partial and with a complete virtual
 environment. By partial, we mean the patched kernel and a 'namespace'
 virtualisation.
 
 Test tools
 ----------
 o For network performance :
 
 * netpipe (http://www.scl.ameslab.gov/netpipe/)
 * netperf (http://www.netperf.org/netperf/NetperfPage.html)
 * tbench (http://samba.org/ftp/tridge/dbench/README)
 
 o Filesystem :
 
 * dbench (http://samba.org/ftp/tridge/dbench/README)
 * iozone (http://www.iozone.org/)
 
 o General
 
 * kernbench (http://ck.kolivas.org/kernbench/) stress cpu and
 filesystem through kernel compilation
 * More 'real world' application could be used, feel free to submit
 candidates...
 
 We have experience on C/R and migration so we'll start with our own
 scenario, migrating oracle under load. The load is generated by DOTS
 (http://ltp.sourceforge.net/dotshowto.php).
 
 If you could provided us some material on what has already been done :
 URL, bench tools, scenarios. We'll try to compile them in. configuration
 hints and tuning are most welcome if they are reasonable.
 
 Results, tools, scenarios will be published on lxc.sf.net . We will
 set up the testing environment so as to be able to accept new
 versions, patches, test tools and rerun the all on demand. Results,
 tools, scenarios will be published on lxc.sf.net.
 
 thanks !
 
 Clement,
 |  
	|  |  |  
	| 
		
			| RE: Container Test Campaign [message #3871 is a reply to message #3633] | Wed, 21 June 2006 19:25   |  
			| 
				
				
					|  mef Messages: 6
 Registered: June 2006
 | Junior Member |  |  |  
	| Hi Clement, 
 You mention that testing isolation properties is more of an extra than an
 immediate criteria.  Based on our experience, this actually is a fairly
 important criteria.  Without decent isolation (both from a namespace and
 resource perspective) it is rather difficult to support lots of concurrent
 users.  As our paper states, we run anywhere from 30-90 vservers per machine
 (each machine usually with a 2GHz processor and 1GB of RAM).
 
 We are interested in checkpoint/restart too, but have nothing to test /
 contribute.  I've forwarded your message to Jason Nieh @ Columbia.  He has a
 relatively long history of working in that area.  I saw a demo of their
 checkpoint/restart/migration support last December (live video migrated
 between servers within a single IBM blade system). Their latest paper
 published at USENIX LISA also states that they can migrate from one linux
 kernel version to another.  This enables "live" system upgrade, which IMHO
 is just as important as load balancing.
 
 Another area we are quite interested in is "network virtualization" (private
 route tables, ip tables, etc).  We are aware that other container based
 systems (e.g., openvz) have support for this, but we (i.e., PlanetLab) are
 pretty much a vserver shop at the moment.  We added our own support to
 safely share a single, public IPv4 address between multiple containers,
 while simultaneously support raw sockets etc.  This is an absolute
 requirement for PlanetLab, and I'd argue (but not here) that it also is
 important for desktop usage scenarios that involve containers and want to
 avoid the use of NAT.
 
 Best regards,
 Marc
 |  
	|  |  |  
	| 
		
			| RE: Container Test Campaign [message #3872 is a reply to message #3633] | Wed, 21 June 2006 19:25   |  
			| 
				
				
					|  mef Messages: 6
 Registered: June 2006
 | Junior Member |  |  |  
	| Hello Clement, 
 Sorry for the late response, as I have been on vacation.
 
 We are interested in this test campaign.  Our work so far has focused on
 performance, scalability, and isolation properties of vserver compared with
 xen.  My guess is that you cc'd me due to the posting of our paper comparing
 vserver with xen (attached for those of you who have not seen it yet).  In
 what way can be participate/contribute (i.e., where do we start)?  We could
 share our test setup (except SpecWeb 99) that we used for our paper with
 everyone. Also, we'd appreciate if the folks participating in this test
 campaign could skim our paper and give us some feedback wrt the evaluation
 section and the appendix where we describe in reasonable the kernel vars,
 lvm partition setup, etc., we've used to eliminate differences between
 systems.
 
 Best regards,
 Marc
 
 
 > -----Original Message-----
 > From: Clement Calmels [mailto:clement.calmels@fr.ibm.com]
 > Sent: Wednesday, June 07, 2006 10:20 AM
 > To: devel@openvz.org; vserver@list.linux-vserver.org
 > Cc: kir@openvz.org; dev@openvz.org; sam.vilain@catalyst.net.nz;
 > mef@CS.Princeton.EDU; clg@fr.ibm.com; serue@us.ibm.com;
 > haveblue@us.ibm.com; dlezcano@fr.ibm.com
 > Subject: Container Test Campaign
 >
 >
 > Hello !
 >
 > I'm part of a team of IBMers working on lightweight containers and we
 > are going to start a new test campaign. Candidates are vserver,
 > vserver context, namespaces (being pushed upstream), openvz, mcr (our
 > simple container dedicated to migration) and eventually xen.
 >
 > We will focus on the performance overhead but we are also interested in
 > checkpoint/restart and live migration. A last topic would be how well
 > the
 > resource managment criteria are met, but that's extra for the moment.
 >
 > We plan on measuring performance overhead by comparing the results on
 > a vanilla kernel with a partial and with a complete virtual
 > environment. By partial, we mean the patched kernel and a 'namespace'
 > virtualisation.
 >
 > Test tools
 > ----------
 > o For network performance :
 >
 >  * netpipe (http://www.scl.ameslab.gov/netpipe/)
 >  * netperf (http://www.netperf.org/netperf/NetperfPage.html)
 >  * tbench (http://samba.org/ftp/tridge/dbench/README)
 >
 > o Filesystem :
 >
 >   * dbench (http://samba.org/ftp/tridge/dbench/README)
 >   * iozone (http://www.iozone.org/)
 >
 > o General
 >
 >   * kernbench (http://ck.kolivas.org/kernbench/) stress cpu and
 >     filesystem through kernel compilation
 >   * More 'real world' application could be used, feel free to submit
 >     candidates...
 >
 > We have experience on C/R and migration so we'll start with our own
 > scenario, migrating oracle under load. The load is generated by DOTS
 > (http://ltp.sourceforge.net/dotshowto.php).
 >
 > If you could provided us some material on what has already been done :
 > URL, bench tools, scenarios. We'll try to compile them in. configuration
 > hints and tuning are most welcome if they are reasonable.
 >
 > Results, tools, scenarios will be published on lxc.sf.net . We will
 > set up the testing environment so as to be able to accept new
 > versions, patches, test tools and rerun the all on demand. Results,
 > tools, scenarios will be published on lxc.sf.net.
 >
 > thanks !
 >
 > Clement,
 
	
	 Attachment: paper.pdf (Size: 296.38KB, Downloaded 810 times)
 |  
	|  |  |  
	| 
		
			| Re: Container Test Campaign [message #3877 is a reply to message #3633] | Thu, 22 June 2006 11:33   |  
			| 
				
				
					|  serue Messages: 750
 Registered: February 2006
 | Senior Member |  |  |  
	| Quoting Serge E. Hallyn (serue@us.ibm.com): > OTOH, perhaps we can collaborate on a test wrapper.  This would require
 > details to be filled in by each implementation's owner, but would save
 > us from each having to come up with the boundary conditions.  For
 > instance, my testcase for the utsname patches which are in -mm is
 > attached.  While the testcase is specific to that implementation, by
 > abstracting the "start a new container" command into a variable which
 > can be filled in for vserver, openvz, etc, we might be able to come up
 > with a generic utsname resource testing shell which can be easily filled
 > in to work for each implementation.
 >
 > Just a thought.
 
 Oh yeah, and I meant to point out that once we have isolation testcases
 for each implementation, then I think Clement would be able to easily
 run these testcases along with the performance tests.  (Clement, correct
 me if I'm wrong :)
 
 -serge
 |  
	|  |  |  
	| 
		
			| Re: Container Test Campaign [message #3878 is a reply to message #3871] | Thu, 22 June 2006 11:31   |  
			| 
				
				
					|  serue Messages: 750
 Registered: February 2006
 | Senior Member |  |  |  
	| Quoting Marc E. Fiuczynski (mef@CS.Princeton.EDU): > Hi Clement,
 >
 > You mention that testing isolation properties is more of an extra than an
 > immediate criteria.  Based on our experience, this actually is a fairly
 > important criteria.  Without decent isolation (both from a namespace and
 
 As we develop our own patches for upstream inclusion, we will also be
 writing testcases to verify isolation, but obviously sitting down and
 writing such testcases for every c/r implementation is not something we
 can commit to  :)
 
 OTOH, perhaps we can collaborate on a test wrapper.  This would require
 details to be filled in by each implementation's owner, but would save
 us from each having to come up with the boundary conditions.  For
 instance, my testcase for the utsname patches which are in -mm is
 attached.  While the testcase is specific to that implementation, by
 abstracting the "start a new container" command into a variable which
 can be filled in for vserver, openvz, etc, we might be able to come up
 with a generic utsname resource testing shell which can be easily filled
 in to work for each implementation.
 
 Just a thought.
 
 -serge
 
	
	 Attachment: utstest.c (Size: 6.48KB, Downloaded 294 times)
 |  
	|  |  |  
	| 
		
			| RE: Container Test Campaign [message #3887 is a reply to message #3872] | Thu, 22 June 2006 16:33   |  
			| 
				
				
					|  Clement Calmels Messages: 11
 Registered: June 2006
 | Junior Member |  |  |  
	| Hi, 
 We have currently set up tests. Our goal is to provide a huge variety of
 test cases and measurements (to reduce imprecise plot). We started with
 same kind of microbenchmark: ltp, dbench, tbench and more complex
 benchmarks: kernel compilation. Tests are launched outside and inside a
 container. Besides we want to make tests with different number of
 container within a real node to get some clues on the different
 solutions's scalabilities.
 After taking a glance at your paper, it seems we got same kind of
 results.
 It would be a good idea to share our test setup. It seems there are easy
 ways to disadvantage one container solution against another one (Xen
 using loop device instead of a dedicated LVM partition for example). Or
 for example, the way we "share" a node between different containers
 could have some consequences on test results. I would prefer "fair"
 benchmarks.
 In my opinion, a "fight" between the different container solutions would
 be as useless as a "Google Fight". But finding real world cases where a
 solution seems better than the others may result in more accurate
 conclusions.
 
 Concerning the checkpoint/restart/migration topic, we (IBM) owned a
 solution called Metacluster. The main goal of Metacluster was the
 migration issue... but as a result it brought isolation in some areas
 (pid for example). We will use this solution and make some performance
 measures during the migration of well known application (Oracle under
 different workloads...). Openvz and Xen should be included in such
 benchs.
 
 Best regards,
 Clement.
 
 Le mercredi 21 juin 2006 à 15:25 -0400, Marc E. Fiuczynski a écrit :
 > Hello Clement,
 >
 > Sorry for the late response, as I have been on vacation.
 >
 > We are interested in this test campaign.  Our work so far has focused on
 > performance, scalability, and isolation properties of vserver compared with
 > xen.  My guess is that you cc'd me due to the posting of our paper comparing
 > vserver with xen (attached for those of you who have not seen it yet).  In
 > what way can be participate/contribute (i.e., where do we start)?  We could
 > share our test setup (except SpecWeb 99) that we used for our paper with
 > everyone. Also, we'd appreciate if the folks participating in this test
 > campaign could skim our paper and give us some feedback wrt the evaluation
 > section and the appendix where we describe in reasonable the kernel vars,
 > lvm partition setup, etc., we've used to eliminate differences between
 > systems.
 >
 > Best regards,
 > Marc
 >
 >
 > > -----Original Message-----
 > > From: Clement Calmels [mailto:clement.calmels@fr.ibm.com]
 > > Sent: Wednesday, June 07, 2006 10:20 AM
 > > To: devel@openvz.org; vserver@list.linux-vserver.org
 > > Cc: kir@openvz.org; dev@openvz.org; sam.vilain@catalyst.net.nz;
 > > mef@CS.Princeton.EDU; clg@fr.ibm.com; serue@us.ibm.com;
 > > haveblue@us.ibm.com; dlezcano@fr.ibm.com
 > > Subject: Container Test Campaign
 > >
 > >
 > > Hello !
 > >
 > > I'm part of a team of IBMers working on lightweight containers and we
 > > are going to start a new test campaign. Candidates are vserver,
 > > vserver context, namespaces (being pushed upstream), openvz, mcr (our
 > > simple container dedicated to migration) and eventually xen.
 > >
 > > We will focus on the performance overhead but we are also interested in
 > > checkpoint/restart and live migration. A last topic would be how well
 > > the
 > > resource managment criteria are met, but that's extra for the moment.
 > >
 > > We plan on measuring performance overhead by comparing the results on
 > > a vanilla kernel with a partial and with a complete virtual
 > > environment. By partial, we mean the patched kernel and a 'namespace'
 > > virtualisation.
 > >
 > > Test tools
 > > ----------
 > > o For network performance :
 > >
 > >  * netpipe (http://www.scl.ameslab.gov/netpipe/)
 > >  * netperf (http://www.netperf.org/netperf/NetperfPage.html)
 > >  * tbench (http://samba.org/ftp/tridge/dbench/README)
 > >
 > > o Filesystem :
 > >
 > >   * dbench (http://samba.org/ftp/tridge/dbench/README)
 > >   * iozone (http://www.iozone.org/)
 > >
 > > o General
 > >
 > >   * kernbench (http://ck.kolivas.org/kernbench/) stress cpu and
 > >     filesystem through kernel compilation
 > >   * More 'real world' application could be used, feel free to submit
 > >     candidates...
 > >
 > > We have experience on C/R and migration so we'll start with our own
 > > scenario, migrating oracle under load. The load is generated by DOTS
 > > (http://ltp.sourceforge.net/dotshowto.php).
 > >
 > > If you could provided us some material on what has already been done :
 > > URL, bench tools, scenarios. We'll try to compile them in. configuration
 > > hints and tuning are most welcome if they are reasonable.
 > >
 > > Results, tools, scenarios will be published on lxc.sf.net . We will
 > > set up the testing environment so as to be able to accept new
 > > versions, patches, test tools and rerun the all on demand. Results,
 > > tools, scenarios will be published on lxc.sf.net.
 > >
 > > thanks !
 > >
 > > Clement,
 |  
	|  |  |  
	| 
		
			| Re: Container Test Campaign [message #3888 is a reply to message #3871] | Thu, 22 June 2006 21:51   |  
			| 
				
				
					|  Cedric Le Goater Messages: 443
 Registered: February 2006
 | Senior Member |  |  |  
	| Hi marc ! 
 Marc E. Fiuczynski wrote:
 
 > You mention that testing isolation properties is more of an extra than an
 > immediate criteria.  Based on our experience, this actually is a fairly
 > important criteria.  Without decent isolation (both from a namespace and
 > resource perspective) it is rather difficult to support lots of concurrent
 > users.  As our paper states, we run anywhere from 30-90 vservers per machine
 > (each machine usually with a 2GHz processor and 1GB of RAM).
 
 is that a common setup for planet lab or a maximum ? how many vservers/
 vcontext do you think we should try to reach ?
 
 > We are interested in checkpoint/restart too, but have nothing to test /
 > contribute.  I've forwarded your message to Jason Nieh @ Columbia.  He has a
 > relatively long history of working in that area.  I saw a demo of their
 > checkpoint/restart/migration support last December (live video migrated
 > between servers within a single IBM blade system).
 
 we've worked a few years with a zap guy. I only wished they were bit more
 open (source) about what they've been doing since crak.
 
 > Their latest paper
 > published at USENIX LISA also states that they can migrate from one linux
 > kernel version to another.  This enables "live" system upgrade, which IMHO
 > is just as important as load balancing.
 
 this feature is one the *major* features of mobile containers but it will
 require specific kernel APIs to make it maintainable on the long term.
 
 > Another area we are quite interested in is "network virtualization" (private
 > route tables, ip tables, etc).  We are aware that other container based
 > systems (e.g., openvz) have support for this, but we (i.e., PlanetLab) are
 > pretty much a vserver shop at the moment.  We added our own support to
 > safely share a single, public IPv4 address between multiple containers,
 > while simultaneously support raw sockets etc.  This is an absolute
 > requirement for PlanetLab, and I'd argue (but not here) that it also is
 > important for desktop usage scenarios that involve containers and want to
 > avoid the use of NAT.
 
 Did you contribute that feature to vserver ?
 
 So you have different containers exposing the same IP address ? How do you
 assign incoming packets to a container ?
 
 thanks,
 
 C.
 |  
	|  |  |  
	| 
		
			| Re: Container Test Campaign [message #3890 is a reply to message #3878] | Thu, 22 June 2006 23:39   |  
			| 
				
				
					|  Sam Vilain Messages: 73
 Registered: February 2006
 | Member |  |  |  
	| Serge E. Hallyn wrote: > Quoting Marc E. Fiuczynski (mef@CS.Princeton.EDU):
 >> Hi Clement,
 >>
 >> You mention that testing isolation properties is more of an extra than an
 >> immediate criteria.  Based on our experience, this actually is a fairly
 >> important criteria.  Without decent isolation (both from a namespace and
 >
 > As we develop our own patches for upstream inclusion, we will also be
 > writing testcases to verify isolation, but obviously sitting down and
 > writing such testcases for every c/r implementation is not something we
 > can commit to  :)
 >
 > OTOH, perhaps we can collaborate on a test wrapper.  This would require
 > details to be filled in by each implementation's owner, but would save
 > us from each having to come up with the boundary conditions.  For
 > instance, my testcase for the utsname patches which are in -mm is
 > attached.  While the testcase is specific to that implementation, by
 > abstracting the "start a new container" command into a variable which
 > can be filled in for vserver, openvz, etc, we might be able to come up
 > with a generic utsname resource testing shell which can be easily filled
 > in to work for each implementation.
 >
 > Just a thought.
 >
 > -serge
 
 You might like to consider making the output of these tests use the
 Perl Test::TAP output; eg:
 
 1..5 # declare how many tests you expect to run - 0..0 means unknown
 ok 1 # pass test one
 not ok 2 # this one failed - perhaps say why in comments
 ok 3 # SKIP somereason - this test was skipped
 ok 4
 ok 5
 
 This makes it easier to run the tests inside Harnesses.
 
 Sam.
 
 >
 >  ------------------------------------------------------------ ----------
 >
 > /*
 >  * Copyright (C) 2005 IBM
 >  * Author: Serge Hallyn <serue@us.ibm.com>
 >  * Compile using "gcc -o utstest utstest.c"
 >  * Run using "for i in `seq 1 5`; do  ./utstest $i; done"
 >  *
 >  * test1:
 >     P1: A=gethostname
 >     P2: B=gethostname
 >     Ensure(A==B)
 >
 >  * test2:
 >     P1: sethostname(newname); A=gethostname
 >     P2: (wait); B=gethostname
 >     Ensure (A==B)
 >
 >  * test3:
 >     P1: A=gethostname; unshare(utsname); sethostname(newname);
 C=gethostname
 >     P2: B=gethostname; (wait); (wait); D=gethostname
 >     Ensure (A==B && A==D && C!=D)
 >
 >  * test4:
 >     P1: A=gethostname; unshare(utsname); (wait); C=gethostname
 >     P2: B=gethostname; (wait); sethostname(newname); D=gethostname
 >     Ensure (A==B && A==C && C!=D)
 >
 >  * test5:
 >     P1: A=gethostname; unshare(utsname) without suff. perms; (wait);
 C=gethostname
 >     P2: B=gethostname; (wait); sethostname(newname); D=gethostname
 >     Ensure (A==B==C==D) and state is ok.
 >  *
 >  */
 >
 > #include <sys/wait.h>
 > #include <assert.h>
 > #include <stdio.h>
 > #include <stdlib.h>
 > #include <unistd.h>
 > #include <string.h>
 > #include <errno.h>
 >
 > int drop_root()
 > {
 >     int ret;
 >     ret = setresuid(1000, 1000, 1000);
 >     if (ret) {
 >         perror("setresuid");
 >         exit(4);
 >     }
 >     return 1;
 > }
 >
 > #include <linux/unistd.h>
 >
 > static inline _syscall1 (int,  unshare, int, flags)
 >
 > #ifndef CLONE_NEWUTS
 > #define CLONE_NEWUTS            0x04000000      /* New utsname group? */
 > #endif
 >
 > int p1fd[2], p2fd[2];
 > pid_t cpid;
 > int testnum;
 >
 > #define HLEN 100
 > #define NAME1 "serge1"
 > #define NAME2 "serge2"
 >
 > void picknewhostname(char *orig, char *new)
 > {
 >     memset(new, 0, HLEN);
 >     if (strcmp(orig, NAME1) == 0)
 >         strcpy(new, NAME2);
 >     else
 >         strcpy(new, NAME1);
 > }
 >
 > void P1(void)
 > {
 >     char hostname[HLEN], newhostname[HLEN], rhostname[HLEN];
 >     int err;
 >     int len;
 >
 >     close(p1fd[1]);
 >     close(p2fd[0]);
 >
 >     switch(testnum) {
 >     case 1:
 >         gethostname(hostname, HLEN);
 >         len = read(p1fd[0], rhostname, HLEN);
 >         if (len == strlen(hostname) &&
 >             strncmp(hostname, rhostname, len) == 0) {
 >             printf("test 1: success\n");
 >             exit(0);
 >         }
 >         printf("test 1: fail\n");
 >         printf("Proc 1: hostname %s\n", hostname);
 >         printf("test 2: hostname %s\n", rhostname);
 >         exit(1);
 >     case 2:
 >         gethostname(hostname, HLEN);
 >         picknewhostname(hostname, newhostname);
 >         err = sethostname(newhostname, strlen(newhostname));
 >         write(p2fd[1], "1", 1);
 >         if (err == -1) { perror("sethostname"); exit(1); }
 >         len = read(p1fd[0], rhostname, HLEN);
 >         if (len == strlen(newhostname) &&
 >                 strncmp(newhostname, rhostname, len) == 0) {
 >             printf("test 2: success\n");
 >             exit(0);
 >         }
 >         printf("test 2: fail\n");
 >         printf("Proc 1: hostname %s\n", newhostname);
 >         printf("test 2: hostname %s\n", rhostname);
 >         exit(1);
 >     case 3:
 >         gethostname(hostname, HLEN);
 >         picknewhostname(hostname, newhostname);
 >         err = unshare(CLONE_NEWUTS);
 >         printf("unshare returned %d (should be 0)\n", err);
 >         err = sethostname(newhostname, strlen(newhostname));
 >         write(p2fd[1], "1", 1);
 >         if (err == -1) { perror("sethostname"); exit(1); }
 >
 >         len = read(p1fd[0], rhostname, HLEN);
 >         if (len == strlen(newhostname) &&
 >                 strncmp(newhostname, rhostname, len) == 0) {
 >             printf("test 3: fail\n");
 >             printf("Proc 1: hostname %s\n", newhostname);
 >             printf("test 2: hostname %s\n", rhostname);
 >             printf("These should have been different\n");
 >             exit(1);
 >         }
 >         if (len == strlen(hostname) &&
 >                 strncmp(hostname, rhostname, len) == 0) {
 >             printf("test 3: success\n");
 >             exit(0);
 >         }
 >         printf("test 3: fail\n");
 >         printf("Proc 1: original hostname %s\n", hostname);
 >         printf("Proc 2: hostname %s\n", rhostname);
 >         printf("These should have been the same\n");
 >         exit(1);
 >
 >     case 4:
 >         gethostname(hostname, HLEN);
 >         err = unshare(CLONE_NEWUTS);
 >         printf("unshare returned %d (should be 0)\n", err);
 >         write(p2fd[1], "1", 1); /* tell p2 to go ahead and sethostname */
 >         len = read(p1fd[0], rhostname, HLEN);
 >         gethostname(newhostname, HLEN);
 >         if (strcmp(hostname, newhostname) != 0) {
 >             printf("test 4: fail\n");
 >             printf("Proc 1: hostname %s\n", hostname);
 >             printf("Proc 1: new hostname %s\n", newhostname);
 >             printf("These should have been the same\n");
 >             exit(1);
 >         }
 >         if (strncmp(hostname, rhostname, len)==0) {
 >             printf("test 4: fail\n");
 >             printf("Proc 1: hostname %s\n", hostname);
 >             printf("Proc 2: new hostname %s\n", rhostname);
 >             printf("These should have been different\n");
 >             exit(1);
 >         }
 >         printf("test 4: success\n");
 >         exit(0);
 >     case 5:
 >         /* drop CAP_SYS_ADMIN, then do same as case 4 but check
 >          * that hostname != newhostname && rhostname == newhostname */
 >         if (!drop_root()) {
 >             printf("failed to drop root.\n");
 >             exit(3);
 >         }
 >         gethostname(hostname, HLEN);
 >         err = unshare(CLONE_NEWUTS);
 >         printf("unshare returned %d (should be -1)\n", err);
 >         write(p2fd[1], "1", 1); /* tell p2 to go ahead and sethostname */
 >         len = read(p1fd[0], rhostname, HLEN);
 >         gethostname(newhostname, HLEN);
 >         if (strncmp(newhostname, rhostname, len)!=0) {
 >             printf("test 5: fail\n");
 >             printf("Proc 1: newhostname %s\n", newhostname);
 >             printf("Proc 2: new hostname %s\n", rhostname);
 >             printf("These should have been the same\n");
 >             exit(1);
 >         }
 >         if (strcmp(hostname, newhostname) == 0) {
 >             printf("test 5: fail\n");
 >             printf("Proc 1: hostname %s\n", hostname);
 >             printf("Proc 1: new hostname %s\n", newhostname);
 >             printf("These should have been different\n");
 >             exit(1);
 >         }
 >         printf("test 5: success\n");
 >         exit(0);
 >     default:
 >         break;
 >     }
 >     return;
 > }
 >
 > void P2(void)
 > {
 >     char hostname[HLEN], newhostname[HLEN];
 >     int len;
 >     int err;
 >
 >     close(p1fd[0]);
 >     close(p2fd[1]);
 >
 >     switch(testnum) {
 >     case 1:
 >         gethostname(hostname, HLEN);
 >         write(p1fd[1], hostname, strlen(hostname));
 >         break;
 >     case 2:
 >     case 3:
 >         len = 0;
 >         while (!len) {
 >             len = read(p2fd[0], hostname, 1);
 >         }
 >         gethostname(hostname, HLEN);
 >         write(p1fd[1], hostname, strlen(hostname));
 >         break;
 >     case 4:
 >     case 5:
 >         len = 0;
 >         while (!len) {
 >             len = read(p2fd[0], hostname, 1);
 >         }
 >         gethostname(hostname, HLEN);
 >         picknewhostname(hostname, newhostname);
 >         sethostname(newhostname, strlen(newhostname));
 >         write(p1fd[1], newhostname, strlen(newhostname));
 >         break;
 >     default:
 >         printf("undefined test: %d\n", testnum);
 >         break;
 >     }
 >     return;
 > }
 >
 > int main(int argc, char *argv[])
 > {
 >     if (argc != 2) {
 >         printf("Usage: %s <testnum>\n", argv[0]);
 >         printf(" where testnum is between 1 and 5 inclusive\n");
 >         exit(2);
 >     }
 >     if (pipe(p1fd) == -1) { perror("pipe"); exit(EXIT_FAILURE); }
 >     if (pipe(p2fd) == -1) { perror("pipe"); exit(EXIT_FAILURE); }
 >
 >     testnum = atoi(argv[1]);
 >     if (testnum < 1 || testnum > 5) {
 >         printf("testnum should be between 1 and 5 inclusive.\n");
 >         exit(2);
 >     }
 >
 >     cpid = fork();
 >
 >     if (cpid == -1) { perror("fork"); exit(EXIT_FAILURE); }
 >
 >     if (cpid == 0)
 >         P1();
 >     else
 >         P2();
 >
 >     return 0;
 > }
 
 
 --
 Sam Vilain, Catalyst IT (NZ) Ltd.
 phone: +64 4 499 2267        cell:  +64 21 55 40 50
 DDI:   +64 4 803 2342        PGP ID: 0x66B25843
 |  
	|  |  |  
	| 
		
			| Re: Container Test Campaign [message #3891 is a reply to message #3872] | Thu, 22 June 2006 21:39   |  
			| 
				
				
					|  Cedric Le Goater Messages: 443
 Registered: February 2006
 | Senior Member |  |  |  
	| Marc E. Fiuczynski wrote: 
 > Sorry for the late response, as I have been on vacation.
 
 a good thing indeed :)
 
 > We are interested in this test campaign.  Our work so far has focused on
 > performance, scalability, and isolation properties of vserver compared with
 > xen.  My guess is that you cc'd me due to the posting of our paper comparing
 > vserver with xen (attached for those of you who have not seen it yet).
 
 yes.
 
 > In what way can be participate/contribute (i.e., where do we start)?  We could
 > share our test setup (except SpecWeb 99) that we used for our paper with
 > everyone. Also, we'd appreciate if the folks participating in this test
 > campaign could skim our paper and give us some feedback wrt the evaluation
 > section and the appendix where we describe in reasonable the kernel vars,
 > lvm partition setup, etc., we've used to eliminate differences between
 > systems.
 
 OK, first half is great. Not finished yet but some of your results confirm
 ours : vserver is great, openvz will be great, xen overhead is important
 and does not scale as well. What we expect from you is also feedback on the
 test and their environment to make sure they are relevant.
 
 Xen is not our major focus but if we have time we'll torture it a bit.
 
 thanks,
 
 C.
 |  
	|  |  |  
	| 
		
			| Re: Container Test Campaign [message #3893 is a reply to message #3872] | Fri, 23 June 2006 03:40   |  
			| 
				
				
					|  Sam Vilain Messages: 73
 Registered: February 2006
 | Member |  |  |  
	| Marc E. Fiuczynski wrote: > Hello Clement,
 >
 > Sorry for the late response, as I have been on vacation.
 >
 > We are interested in this test campaign.  Our work so far has
 > focused on performance, scalability, and isolation properties of
 > vserver compared with xen.  My guess is that you cc'd me due to the
 > posting of our paper comparing vserver with xen (attached for those
 > of you who have not seen it yet).  In what way can be
 > participate/contribute (i.e., where do we start)?  We could share
 > our test setup (except SpecWeb 99) that we used for our paper with
 > everyone. Also, we'd appreciate if the folks participating in this
 > test campaign could skim our paper and give us some feedback wrt
 > the evaluation section and the appendix where we describe in
 > reasonable the kernel vars, lvm partition setup, etc., we've used
 > to eliminate differences between systems.
 
 One area it would be interesting to see benchmarks for is the
 performance impact of filesystem unification and a lot of vservers -
 for instance, a system with 10 vservers, each running apache and
 actively serving pages, I'd expect to see more cache hits at the L2
 and/or L3 CPU cache layers on account of the fact that, eg, C
 libraries are not being paged out to load in other (identical) C
 libraries.
 
 My guess is that you just can't leverage that kind of benefit from a
 hypervisor approach, but I don't really know enough about how they
 work under the hood to be able to say.
 
 Sam.
 
 >> -----Original Message----- From: Clement Calmels
 >> [mailto:clement.calmels@fr.ibm.com] Sent: Wednesday, June 07,
 >> 2006 10:20 AM To: devel@openvz.org;
 >> vserver@list.linux-vserver.org Cc: kir@openvz.org;
 >> dev@openvz.org; sam.vilain@catalyst.net.nz; mef@CS.Princeton.EDU;
 >> clg@fr.ibm.com; serue@us.ibm.com; haveblue@us.ibm.com;
 >> dlezcano@fr.ibm.com Subject: Container Test Campaign
 >>
 >>
 >> Hello !
 >>
 >> I'm part of a team of IBMers working on lightweight containers
 >> and we are going to start a new test campaign. Candidates are
 >> vserver, vserver context, namespaces (being pushed upstream),
 >> openvz, mcr (our simple container dedicated to migration) and
 >> eventually xen.
 >>
 >> We will focus on the performance overhead but we are also
 >> interested in checkpoint/restart and live migration. A last topic
 >> would be how well the resource managment criteria are met, but
 >> that's extra for the moment.
 >>
 >> We plan on measuring performance overhead by comparing the
 >> results on a vanilla kernel with a partial and with a complete
 >> virtual environment. By partial, we mean the patched kernel and a
 >> 'namespace' virtualisation.
 >>
 >> Test tools ---------- o For network performance :
 >>
 >> * netpipe (http://www.scl.ameslab.gov/netpipe/) * netperf
 >> (http://www.netperf.org/netperf/NetperfPage.html) * tbench
 >> (http://samba.org/ftp/tridge/dbench/README)
 >>
 >> o Filesystem :
 >>
 >> * dbench (http://samba.org/ftp/tridge/dbench/README) * iozone
 >> (http://www.iozone.org/)
 >>
 >> o General
 >>
 >> * kernbench (http://ck.kolivas.org/kernbench/) stress cpu and
 >> filesystem through kernel compilation * More 'real world'
 >> application could be used, feel free to submit candidates...
 >>
 >> We have experience on C/R and migration so we'll start with our
 >> own scenario, migrating oracle under load. The load is generated
 >> by DOTS (http://ltp.sourceforge.net/dotshowto.php).
 >>
 >> If you could provided us some material on what has already been
 >> done : URL, bench tools, scenarios. We'll try to compile them in.
 >> configuration hints and tuning are most welcome if they are
 >> reasonable.
 >>
 >> Results, tools, scenarios will be published on lxc.sf.net . We
 >> will set up the testing environment so as to be able to accept
 >> new versions, patches, test tools and rerun the all on demand.
 >> Results, tools, scenarios will be published on lxc.sf.net.
 >>
 >> thanks !
 >>
 >> Clement,
 |  
	|  |  |  
	|  |  
	|  |  
	| 
		
			| Re: Container Test Campaign [message #3936 is a reply to message #3888] | Fri, 23 June 2006 17:31   |  
			| 
				
				
					|  Mark Huang Messages: 1
 Registered: June 2006
 | Junior Member |  |  |  
	| Cedric Le Goater wrote: > Did you contribute that feature to vserver ?
 
 The feature is fairly specific to our needs and would not be very useful to the
 most common vserver use case (shared hosting).
 
 > So you have different containers exposing the same IP address ? How do you
 > assign incoming packets to a container ?
 
 We wrote a kernel module that leverages netfilter hooks and ip_conntrack. You're
 only allowed to send IP, but you can send IP packets through any type of socket
 (TCP, UDP, raw IP, or even raw packet). As Marc mentioned, this flexibility was
 an absolute requirement.
 
 The kernel module sits in the input and output path of the stack, and associates
 every incoming and outgoing packet with an ip_conntrack struct (to which we
 added container IDs). Once a container sends out an outgoing packet, it is
 entitled to receive incoming packets associated with that connection. A
 container may also receive incoming packets associated with ports that it has
 reserved by calling bind() (the kernel module keeps track of bind() calls). You
 can think of the kernel module as a local stateful firewall for sockets.
 
 To users, it looks like they can run pretty much anything root would be able to,
 including programs that use raw IP sockets (ping and traceroute), programs that
 use raw packet sockets (tcpdump), and regular server apps (Apache, MySQL, etc.).
 When they run tcpdump, they of course only see packets that they "own" (i.e.,
 packets that are associated with their active connections).
 
 There's technical documentation for the kernel module on our website:
 
 http://www.planet-lab.org/doc/vnet.php
 
 The kernel module does a lot more than this as well, which is another reason
 that it hasn't been merged into mainline vserver. Recent features include
 virtualized TUN/TAP and IP aliasing support.
 
 Lastly, you're of course free to browse the code:
 
 http://cvs.planet-lab.org/cvs/vnet/
 |  
	|  |  |  
	|  |  
	|  |  
	| 
		
			| Re:  Container Test Campaign [message #4183 is a reply to message #3633] | Fri, 30 June 2006 17:28   |  
			| 
				
				
					|  Clement Calmels Messages: 11
 Registered: June 2006
 | Junior Member |  |  |  
	| Hi, 
 A first round about virtualisation benchmarks can be found here:
 http://lxc.sourceforge.net/bench/
 These benchmarks run with vanilla kernels and the patched versions of
 well know virtualisation solutions: VServer and OpenVZ. Some benchs also
 run inside the virtual 'guest' but we ran into trouble trying to run
 some of them... probably virtual 'guest' configuration issues... we will
 trying to fix them...
 The metacluster migration solution (formely a Meiosys company produt)
 was added as it seems that the checkpoint/restart topic is close to the
 virtualisation's one (OpenVZ now provides a checkpoint/restart
 capability).
 For the moment, benchmarks only ran on xeon platform but we expect more
 architecture soon. Besides the 'classic' benchs used, more network
 oriented benchs will be added. Netpipe between two virtual 'guests' for
 example. We hope we will be able to provide results concerning the
 virtual 'guest' scalability, running several 'guest' at the same time.
 
 Best regards,
 
 
 Le mercredi 07 juin 2006 à 16:20 +0200, Clement Calmels a écrit :
 > Hello !
 >
 > I'm part of a team of IBMers working on lightweight containers and we
 > are going to start a new test campaign. Candidates are vserver,
 > vserver context, namespaces (being pushed upstream), openvz, mcr (our
 > simple container dedicated to migration) and eventually xen.
 >
 > We will focus on the performance overhead but we are also interested in
 > checkpoint/restart and live migration. A last topic would be how well
 > the
 > resource managment criteria are met, but that's extra for the moment.
 >
 > We plan on measuring performance overhead by comparing the results on
 > a vanilla kernel with a partial and with a complete virtual
 > environment. By partial, we mean the patched kernel and a 'namespace'
 > virtualisation.
 >
 > Test tools
 > ----------
 > o For network performance :
 >
 >  * netpipe (http://www.scl.ameslab.gov/netpipe/)
 >  * netperf (http://www.netperf.org/netperf/NetperfPage.html)
 >  * tbench (http://samba.org/ftp/tridge/dbench/README)
 >
 > o Filesystem :
 >
 >   * dbench (http://samba.org/ftp/tridge/dbench/README)
 >   * iozone (http://www.iozone.org/)
 >
 > o General
 >
 >   * kernbench (http://ck.kolivas.org/kernbench/) stress cpu and
 >     filesystem through kernel compilation
 >   * More 'real world' application could be used, feel free to submit
 >     candidates...
 >
 > We have experience on C/R and migration so we'll start with our own
 > scenario, migrating oracle under load. The load is generated by DOTS
 > (http://ltp.sourceforge.net/dotshowWe ran into trouble trying to run sto.php).
 >
 > If you could provided us some material on what has already been done :
 > URL, bench tools, scenarios. We'll try to compile them in. configuration
 > hints and tuning are most welcome if they are reasonable.
 >
 > Results, tools, scenarios will be published on lxc.sf.net . We will
 > set up the testing environment so as to be able to accept new
 > versions, patches, test tools and rerun the all on demand. Results,
 > tools, scenarios will be published on lxc.sf.net.
 >
 > thanks !
 >
 > Clement,
 >
 --
 Clément Calmels <clement.calmels@fr.ibm.com>
 |  
	|  |  |  
	| 
		
			| Re: [Vserver] Re:  Container Test Campaign [message #4202 is a reply to message #4183] | Sun, 02 July 2006 15:32   |  
			| 
				
				
					|  Herbert Poetzl Messages: 239
 Registered: February 2006
 | Senior Member |  |  |  
	| On Fri, Jun 30, 2006 at 07:28:06PM +0200, Clément Calmels wrote: > Hi,
 >
 > A first round about virtualisation benchmarks can be found here:
 > http://lxc.sourceforge.net/bench/
 
 very interesting results, tx ...
 
 > These benchmarks run with vanilla kernels and the patched versions of
 > well know virtualisation solutions: VServer and OpenVZ. Some benchs
 > also run inside the virtual 'guest' but we ran into trouble trying to
 > run some of them... probably virtual 'guest' configuration issues...
 > we will trying to fix them... The metacluster migration solution
 > (formely a Meiosys company produt) was added as it seems that the
 > checkpoint/restart topic is close to the virtualisation's one (OpenVZ
 > now provides a checkpoint/restart capability).
 
 from the tests:
 "For benchs inside real 'guest' nodes (OpenVZ/VServer) you should
 take into account that the FS tested is not the 'host' node one's."
 
 at least for Linux-VServer it should not be hard to avoid the
 chroot/filesystem namespace part and have it run on the host fs.
 a bind mount into the guest might do the trick too, if you need
 help to accomplish that, just let me know ...
 
 > For the moment, benchmarks only ran on xeon platform but we expect
 > more architecture soon. Besides the 'classic' benchs used, more
 > network oriented benchs will be added. Netpipe between two virtual
 > 'guests' for example. We hope we will be able to provide results
 > concerning the virtual 'guest' scalability, running several 'guest'
 > at the same time.
 
 best,
 Herbert
 
 > Best regards,
 >
 > Le mercredi 07 juin 2006 à 16:20 +0200, Clement Calmels a écrit :
 > > Hello !
 > >
 > > I'm part of a team of IBMers working on lightweight containers and we
 > > are going to start a new test campaign. Candidates are vserver,
 > > vserver context, namespaces (being pushed upstream), openvz, mcr (our
 > > simple container dedicated to migration) and eventually xen.
 > >
 > > We will focus on the performance overhead but we are also interested in
 > > checkpoint/restart and live migration. A last topic would be how well
 > > the
 > > resource managment criteria are met, but that's extra for the moment.
 > >
 > > We plan on measuring performance overhead by comparing the results on
 > > a vanilla kernel with a partial and with a complete virtual
 > > environment. By partial, we mean the patched kernel and a 'namespace'
 > > virtualisation.
 > >
 > > Test tools
 > > ----------
 > > o For network performance :
 > >
 > >  * netpipe (http://www.scl.ameslab.gov/netpipe/)
 > >  * netperf (http://www.netperf.org/netperf/NetperfPage.html)
 > >  * tbench (http://samba.org/ftp/tridge/dbench/README)
 > >
 > > o Filesystem :
 > >
 > >   * dbench (http://samba.org/ftp/tridge/dbench/README)
 > >   * iozone (http://www.iozone.org/)
 > >
 > > o General
 > >
 > >   * kernbench (http://ck.kolivas.org/kernbench/) stress cpu and
 > >     filesystem through kernel compilation
 > >   * More 'real world' application could be used, feel free to submit
 > >     candidates...
 > >
 > > We have experience on C/R and migration so we'll start with our own
 > > scenario, migrating oracle under load. The load is generated by DOTS
 > > (http://ltp.sourceforge.net/dotshowWe ran into trouble trying to run sto.php).
 > >
 > > If you could provided us some material on what has already been done :
 > > URL, bench tools, scenarios. We'll try to compile them in. configuration
 > > hints and tuning are most welcome if they are reasonable.
 > >
 > > Results, tools, scenarios will be published on lxc.sf.net . We will
 > > set up the testing environment so as to be able to accept new
 > > versions, patches, test tools and rerun the all on demand. Results,
 > > tools, scenarios will be published on lxc.sf.net.
 > >
 > > thanks !
 > >
 > > Clement,
 > >
 > --
 > Clément Calmels <clement.calmels@fr.ibm.com>
 >
 > _______________________________________________
 > Vserver mailing list
 > Vserver@list.linux-vserver.org
 > http://list.linux-vserver.org/mailman/listinfo/vserver
 |  
	|  |  |  
	| 
		
			| Re:  Container Test Campaign [message #4218 is a reply to message #4183] | Mon, 03 July 2006 07:49   |  
			| 
				
				
					|  Kirill Korotaev Messages: 137
 Registered: January 2006
 | Senior Member |  |  |  
	| >From what I see just just after 1 minute check of your results: 
 DBench:
 - different disk I/O schedulers are compared. This makes comparison useless
 (not virtualization technologies are compared itself).
 - the fact that there is too much difference between measurements
 (e.g. vserver makes linux faster :lol:) makes me believe that you use large disk partiion,
 where data blocks allocation on the disk influence your results.
 To make these measurements correct the same partition with the size closest to the required
 max disk space should be used in all DBench tests.
 
 TBench:
 - when running inside VE, please, make sure, that /proc/user_beancounters doesn't show you
 resource allocation fails (failcnt column).
 Resource limits set by default can be just too small to finish your test case.
 And this doesn't mean your conclusion 'Concerning the results, obviously more isolation brings more overhead.'.
 I'm really suprised to see such statements.
 
 I also noticed that you do the measurements with different HZ settings.
 This influences the results as well...
 
 BTW, do you plan to do functional testing in addition to performance?
 
 Thanks,
 Kirill
 
 
 > Hi,
 >
 > A first round about virtualisation benchmarks can be found here:
 > http://lxc.sourceforge.net/bench/
 > These benchmarks run with vanilla kernels and the patched versions of
 > well know virtualisation solutions: VServer and OpenVZ. Some benchs also
 > run inside the virtual 'guest' but we ran into trouble trying to run
 > some of them... probably virtual 'guest' configuration issues... we will
 > trying to fix them...
 > The metacluster migration solution (formely a Meiosys company produt)
 > was added as it seems that the checkpoint/restart topic is close to the
 > virtualisation's one (OpenVZ now provides a checkpoint/restart
 > capability).
 > For the moment, benchmarks only ran on xeon platform but we expect more
 > architecture soon. Besides the 'classic' benchs used, more network
 > oriented benchs will be added. Netpipe between two virtual 'guests' for
 > example. We hope we will be able to provide results concerning the
 > virtual 'guest' scalability, running several 'guest' at the same time.
 >
 > Best regards,
 >
 >
 > Le mercredi 07 juin 2006 à 16:20 +0200, Clement Calmels a écrit :
 >
 >>Hello !
 >>
 >>I'm part of a team of IBMers working on lightweight containers and we
 >>are going to start a new test campaign. Candidates are vserver,
 >>vserver context, namespaces (being pushed upstream), openvz, mcr (our
 >>simple container dedicated to migration) and eventually xen.
 >>
 >>We will focus on the performance overhead but we are also interested in
 >>checkpoint/restart and live migration. A last topic would be how well
 >>the
 >>resource managment criteria are met, but that's extra for the moment.
 >>
 >>We plan on measuring performance overhead by comparing the results on
 >>a vanilla kernel with a partial and with a complete virtual
 >>environment. By partial, we mean the patched kernel and a 'namespace'
 >>virtualisation.
 >>
 >>Test tools
 >>----------
 >>o For network performance :
 >>
 >> * netpipe (http://www.scl.ameslab.gov/netpipe/)
 >> * netperf (http://www.netperf.org/netperf/NetperfPage.html)
 >> * tbench (http://samba.org/ftp/tridge/dbench/README)
 >>
 >>o Filesystem :
 >>
 >>  * dbench (http://samba.org/ftp/tridge/dbench/README)
 >>  * iozone (http://www.iozone.org/)
 >>
 >>o General
 >>
 >>  * kernbench (http://ck.kolivas.org/kernbench/) stress cpu and
 >>    filesystem through kernel compilation
 >>  * More 'real world' application could be used, feel free to submit
 >>    candidates...
 >>
 >>We have experience on C/R and migration so we'll start with our own
 >>scenario, migrating oracle under load. The load is generated by DOTS
 >>(http://ltp.sourceforge.net/dotshowWe ran into trouble trying to run sto.php).
 >>
 >>If you could provided us some material on what has already been done :
 >>URL, bench tools, scenarios. We'll try to compile them in. configuration
 >>hints and tuning are most welcome if they are reasonable.
 >>
 >>Results, tools, scenarios will be published on lxc.sf.net . We will
 >>set up the testing environment so as to be able to accept new
 >>versions, patches, test tools and rerun the all on demand. Results,
 >>tools, scenarios will be published on lxc.sf.net.
 >>
 >>thanks !
 >>
 >>Clement,
 >>
 |  
	|  |  |  
	| 
		
			| Re:  Container Test Campaign [message #4219 is a reply to message #4183] | Mon, 03 July 2006 08:14   |  
			|  |  
	| Clement, 
 Thanks for sharing the results! A few comments...
 
 (1) General
 
 1.1 It would be nice to run vmstat (say, vmstat 10) for the duration of
 the tests, and put the vmstat output logs to the site.
 
 1.2 Can you tell how you run the tests. I am particularly interested in
 - how many iterations do you do?
 - what result do you choose from those iterations?
 - how reproducible are the results?
 - are you rebooting the box between the iterations?
 - are you reformatting the partition used for filesystem testing?
 - what settings are you using (such as kernel vm params)?
 - did you stop cron daemons before running the test?
 - are you using the same test binaries across all the participants?
 - etc. etc...
 
 Basically, the detailed description of a process would be nice to have,
 in order to catch possible problems. There are a lot of tiny things
 which are influencing the results. For example, in linux kernels 2.4
 binding the NIC IRQ to a single CPU on an SMP system boosts network
 performance by about 15%! Sure this is not relevant here, it's just an
 example.
 
 1.3 Would be nice to have diffs between different kernel configs.
 
 (2) OpenVZ specifics
 
 2.1 Concerning the tests running inside an OpenVZ VE, the problem is
 there is a (default) set of resource limits applied to each VE.
 Basically one should tailor those limits to suit the applications
 running, OR, for the purpose of testing, just set those limits to some
 very high values so they will never be reached.
 
 For example, the tbench test is probably failed to finish because it
 hits the limits for privvmpages, tcpsndbuf and tcprcvbuf. I have
 increased the limits for those parameters and the test was finished
 successfully. Also, dbench test could hit the disk quota limit for a VE.
 
 Some more info is available at http://wiki.openvz.org/Resource_management
 
 2.2 For OpenVZ specifically, it would be nice to collect
 /proc/user_beancounters output before and after the test.
 
 Clément Calmels wrote:
 > Hi,
 >
 > A first round about virtualisation benchmarks can be found here:
 > http://lxc.sourceforge.net/bench/
 > These benchmarks run with vanilla kernels and the patched versions of
 > well know virtualisation solutions: VServer and OpenVZ. Some benchs also
 > run inside the virtual 'guest' but we ran into trouble trying to run
 > some of them... probably virtual 'guest' configuration issues... we will
 > trying to fix them...
 > The metacluster migration solution (formely a Meiosys company produt)
 > was added as it seems that the checkpoint/restart topic is close to the
 > virtualisation's one (OpenVZ now provides a checkpoint/restart
 > capability).
 > For the moment, benchmarks only ran on xeon platform but we expect more
 > architecture soon. Besides the 'classic' benchs used, more network
 > oriented benchs will be added. Netpipe between two virtual 'guests' for
 > example. We hope we will be able to provide results concerning the
 > virtual 'guest' scalability, running several 'guest' at the same time.
 >
 > Best regards,
 >
 >
 > Le mercredi 07 juin 2006 à 16:20 +0200, Clement Calmels a écrit :
 >
 >> Hello !
 >>
 >> I'm part of a team of IBMers working on lightweight containers and we
 >> are going to start a new test campaign. Candidates are vserver,
 >> vserver context, namespaces (being pushed upstream), openvz, mcr (our
 >> simple container dedicated to migration) and eventually xen.
 >>
 >> We will focus on the performance overhead but we are also interested in
 >> checkpoint/restart and live migration. A last topic would be how well
 >> the
 >> resource managment criteria are met, but that's extra for the moment.
 >>
 >> We plan on measuring performance overhead by comparing the results on
 >> a vanilla kernel with a partial and with a complete virtual
 >> environment. By partial, we mean the patched kernel and a 'namespace'
 >> virtualisation.
 >>
 >> Test tools
 >> ----------
 >> o For network performance :
 >>
 >>  * netpipe (http://www.scl.ameslab.gov/netpipe/)
 >>  * netperf (http://www.netperf.org/netperf/NetperfPage.html)
 >>  * tbench (http://samba.org/ftp/tridge/dbench/README)
 >>
 >> o Filesystem :
 >>
 >>   * dbench (http://samba.org/ftp/tridge/dbench/README)
 >>   * iozone (http://www.iozone.org/)
 >>
 >> o General
 >>
 >>   * kernbench (http://ck.kolivas.org/kernbench/) stress cpu and
 >>     filesystem through kernel compilation
 >>   * More 'real world' application could be used, feel free to submit
 >>     candidates...
 >>
 >> We have experience on C/R and migration so we'll start with our own
 >> scenario, migrating oracle under load. The load is generated by DOTS
 >> (http://ltp.sourceforge.net/dotshowWe ran into trouble trying to run sto.php).
 >>
 >> If you could provided us some material on what has already been done :
 >> URL, bench tools, scenarios. We'll try to compile them in. configuration
 >> hints and tuning are most welcome if they are reasonable.
 >>
 >> Results, tools, scenarios will be published on lxc.sf.net . We will
 >> set up the testing environment so as to be able to accept new
 >> versions, patches, test tools and rerun the all on demand. Results,
 >> tools, scenarios will be published on lxc.sf.net.
 >>
 >> thanks !
 >>
 >> Clement,
 >>
 |  
	|  |  |  
	| 
		
			| RE:  Container Test Campaign [message #4243 is a reply to message #4218] | Mon, 03 July 2006 18:23   |  
			| 
				
				
					|  mef Messages: 6
 Registered: June 2006
 | Junior Member |  |  |  
	| Hi Kirill, 
 Thanks for the feedback.  Not sure whether you are referring to Clement's work or our paper.  I'll assume you are referring to our paper.
 
 > >From what I see just just after 1 minute check of your results:
 >
 > DBench:
 > - different disk I/O schedulers are compared. This makes
 >   comparison useless (not virtualization technologies are
 >   compared itself).
 
 Correct.  As you can image, it is not easy to pick benchmarks.  The intention was simply to repeat measurements presented by the Xen paper published in 2003 Symposium of Operating Sytems Principles, and show how Vserver compares on those.  The point of the paper is to show how "container based virtualization" compares to hypervisor based systems.  We do not have the time nor expertise to show this with OpenVZ.  It would be great if someon from the openvz community could step in and show how OpenVZ compares to Xen.
 
 > - the fact that there is too much difference between measurements
 >   (e.g. vserver makes linux faster :lol:)
 
 This is an interesting, odd, and likely laughable result.  We just reported what we observed.  It is quite possible that we made a mistake somewhere.  However, I believe that the problem lies more with the dbench benchmark than with our setup.  We did try to eliminate as many variables as possible.  Please take a peek on the last page of the paper to see our discussion wrt normalizing the configurations.  We are open to further suggestions to eliminate further variables.
 
 > I also noticed that you do the measurements with different HZ settings.
 > This influences the results as well...
 
 Of course.  My assumption is that it would negatively affect Vserver.  Are you suggesting that it can positively affect the benchmark results to run at 1000HZ vs. 100HZ, as the Xen Domains are configured to do?
 
 > BTW, do you plan to do functional testing in addition to performance?
 
 Please clarify what you mean here?  From what I gather, the main thing that Vserver lacks is the degree of network virtualization that OpenVZ supports.  Is there anything else?  From my perspective, the comparison will have to be with Xen/UML rather than, say, a contest between container based systems.  I say this because it appears that the majority of the LKML community is believes that container-based systems don't add much above and beyond what Xen/UML/QEMU/VMware already offer today.
 
 Best regards,
 Marc
 
 > -----Original Message-----
 > From: Kirill Korotaev [mailto:dev@openvz.org]
 > Sent: Monday, July 03, 2006 3:50 AM
 > To: ? Calmels
 > Cc: devel@openvz.org; vserver@list.linux-vserver.org;
 > sam.vilain@catalyst.net.nz; serue@us.ibm.com; DLEZCANO@fr.ibm.com;
 > mef@CS.Princeton.EDU
 > Subject: Re: [Devel] Container Test Campaign
 >
 >
 > >From what I see just just after 1 minute check of your results:
 >
 > DBench:
 > - different disk I/O schedulers are compared. This makes
 > comparison useless
 >   (not virtualization technologies are compared itself).
 > - the fact that there is too much difference between measurements
 >   (e.g. vserver makes linux faster :lol:) makes me believe that
 > you use large disk partiion,
 >   where data blocks allocation on the disk influence your results.
 >   To make these measurements correct the same partition with the
 > size closest to the required
 >   max disk space should be used in all DBench tests.
 >
 > TBench:
 > - when running inside VE, please, make sure, that
 > /proc/user_beancounters doesn't show you
 >   resource allocation fails (failcnt column).
 >   Resource limits set by default can be just too small to finish
 > your test case.
 >   And this doesn't mean your conclusion 'Concerning the results,
 > obviously more isolation brings more overhead.'.
 >   I'm really suprised to see such statements.
 >
 > I also noticed that you do the measurements with different HZ settings.
 > This influences the results as well...
 >
 > BTW, do you plan to do functional testing in addition to performance?
 >
 > Thanks,
 > Kirill
 >
 >
 > > Hi,
 > >
 > > A first round about virtualisation benchmarks can be found here:
 > > http://lxc.sourceforge.net/bench/
 > > These benchmarks run with vanilla kernels and the patched versions of
 > > well know virtualisation solutions: VServer and OpenVZ. Some benchs also
 > > run inside the virtual 'guest' but we ran into trouble trying to run
 > > some of them... probably virtual 'guest' configuration issues... we will
 > > trying to fix them...
 > > The metacluster migration solution (formely a Meiosys company produt)
 > > was added as it seems that the checkpoint/restart topic is close to the
 > > virtualisation's one (OpenVZ now provides a checkpoint/restart
 > > capability).
 > > For the moment, benchmarks only ran on xeon platform but we expect more
 > > architecture soon. Besides the 'classic' benchs used, more network
 > > oriented benchs will be added. Netpipe between two virtual 'guests' for
 > > example. We hope we will be able to provide results concerning the
 > > virtual 'guest' scalability, running several 'guest' at the same time.
 > >
 > > Best regards,
 > >
 > >
 > > Le mercredi 07 juin 2006 à 16:20 +0200, Clement Calmels a écrit :
 > >
 > >>Hello !
 > >>
 > >>I'm part of a team of IBMers working on lightweight containers and we
 > >>are going to start a new test campaign. Candidates are vserver,
 > >>vserver context, namespaces (being pushed upstream), openvz, mcr (our
 > >>simple container dedicated to migration) and eventually xen.
 > >>
 > >>We will focus on the performance overhead but we are also interested in
 > >>checkpoint/restart and live migration. A last topic would be how well
 > >>the
 > >>resource managment criteria are met, but that's extra for the moment.
 > >>
 > >>We plan on measuring performance overhead by comparing the results on
 > >>a vanilla kernel with a partial and with a complete virtual
 > >>environment. By partial, we mean the patched kernel and a 'namespace'
 > >>virtualisation.
 > >>
 > >>Test tools
 > >>----------
 > >>o For network performance :
 > >>
 > >> * netpipe (http://www.scl.ameslab.gov/netpipe/)
 > >> * netperf (http://www.netperf.org/netperf/NetperfPage.html)
 > >> * tbench (http://samba.org/ftp/tridge/dbench/README)
 > >>
 > >>o Filesystem :
 > >>
 > >>  * dbench (http://samba.org/ftp/tridge/dbench/README)
 > >>  * iozone (http://www.iozone.org/)
 > >>
 > >>o General
 > >>
 > >>  * kernbench (http://ck.kolivas.org/kernbench/) stress cpu and
 > >>    filesystem through kernel compilation
 > >>  * More 'real world' application could be used, feel free to submit
 > >>    candidates...
 > >>
 > >>We have experience on C/R and migration so we'll start with our own
 > >>scenario, migrating oracle under load. The load is generated by DOTS
 > >>(http://ltp.sourceforge.net/dotshowWe ran into trouble trying
 > to run sto.php).
 > >>
 > >>If you could provided us some material on what has already been done :
 > >>URL, bench tools, scenarios. We'll try to compile them in. configuration
 > >>hints and tuning are most welcome if they are reasonable.
 > >>
 > >>Results, tools, scenarios will be published on lxc.sf.net . We will
 > >>set up the testing environment so as to be able to accept new
 > >>versions, patches, test tools and rerun the all on demand. Results,
 > >>tools, scenarios will be published on lxc.sf.net.
 > >>
 > >>thanks !
 > >>
 > >>Clement,
 > >>
 |  
	|  |  |  
	| 
		
			| Re:  Container Test Campaign [message #4261 is a reply to message #4219] | Tue, 04 July 2006 09:42   |  
			| 
				
				
					|  Clement Calmels Messages: 11
 Registered: June 2006
 | Junior Member |  |  |  
	| Hi, 
 > 1.1 It would be nice to run vmstat (say, vmstat 10) for the duration of
 > the tests, and put the vmstat output logs to the site.
 
 Our benchmark framework allows us to use oprofile during test...
 couldn't it be better than vmstat?
 
 > Basically, the detailed description of a process would be nice to have,
 > in order to catch possible problems. There are a lot of tiny things
 > which are influencing the results. For example, in linux kernels 2.4
 > binding the NIC IRQ to a single CPU on an SMP system boosts network
 > performance by about 15%! Sure this is not relevant here, it's just an
 > example.
 
 I agree. Actually, I always try to use 'default' configuration or
 installation but I will try to describe the tests in details.
 
 > 1.3 Would be nice to have diffs between different kernel configs.
 
 The different configs used are available in the lxc site. You will
 notice that I used a minimal config file for most of the test, but for
 Openvz I had to use the one I found in the OpenVZ site because I faced
 kernel build error (some CONFIG_NET... issues). I think that the
 differences are more dealing with network stuff.
 
 > For example, the tbench test is probably failed to finish because it
 > hits the limits for privvmpages, tcpsndbuf and tcprcvbuf. I have
 > increased the limits for those parameters and the test was finished
 > successfully. Also, dbench test could hit the disk quota limit for a VE.
 > Some more info is available at http://wiki.openvz.org/Resource_management
 
 I already used this page. I had to increase 'diskinodes' and 'diskspace'
 resources in order to run some test properly (the disk errors were more
 selfexplicit).
 I'm wondering why a default 'guest' creation implies some resources
 restrictions? Couldn't the resources be unlimited? I understand the need
 for resource management, but the default values look a little bit
 tiny...
 
 > 2.2 For OpenVZ specifically, it would be nice to collect
 > /proc/user_beancounters output before and after the test.
 
 For sure... I will take a look at how integrating it in our automatic
 test environment.
 
 Best regards,
 
 --
 Clément Calmels <clement.calmels@fr.ibm.com>
 |  
	|  |  |  
	| 
		
			| Re: [Vserver] Re:  Container Test Campaign [message #4262 is a reply to message #4261] | Tue, 04 July 2006 12:19   |  
			|  |  
	| Clément, 
 Thanks for addressing my concerns! See comments below.
 
 Clément Calmels wrote:
 > Hi,
 >
 >
 >> 1.1 It would be nice to run vmstat (say, vmstat 10) for the duration of
 >> the tests, and put the vmstat output logs to the site.
 >>
 >
 > Our benchmark framework allows us to use oprofile during test...
 > couldn't it be better than vmstat?
 >
 Good idea.
 >> Basically, the detailed description of a process would be nice to have,
 >> in order to catch possible problems. There are a lot of tiny things
 >> which are influencing the results. For example, in linux kernels 2.4
 >> binding the NIC IRQ to a single CPU on an SMP system boosts network
 >> performance by about 15%! Sure this is not relevant here, it's just an
 >> example.
 >>
 >
 > I agree. Actually, I always try to use 'default' configuration or
 > installation but I will try to describe the tests in details.
 >
 >> 1.3 Would be nice to have diffs between different kernel configs.
 >>
 > The different configs used are available in the lxc site. You will
 > notice that I used a minimal config file for most of the test, but for
 > Openvz I had to use the one I found in the OpenVZ site because I faced
 > kernel build error (some CONFIG_NET... issues).
 We are trying to eliminate those, so a bug report would be nice.
 >  I think that the
 > differences are more dealing with network stuff.
 >
 >> For example, the tbench test is probably failed to finish because it
 >> hits the limits for privvmpages, tcpsndbuf and tcprcvbuf. I have
 >> increased the limits for those parameters and the test was finished
 >> successfully. Also, dbench test could hit the disk quota limit for a VE.
 >> Some more info is available at http://wiki.openvz.org/Resource_management
 >>
 >
 > I already used this page. I had to increase 'diskinodes' and 'diskspace'
 > resources in order to run some test properly (the disk errors were more
 > selfexplicit).
 > I'm wondering why a default 'guest' creation implies some resources
 > restrictions? Couldn't the resources be unlimited? I understand the need
 > for resource management, but the default values look a little bit
 > tiny...
 >
 The reason is security. A guest is untrusted by default, though sane
 limits are applied. Same as ulimit which has some sane defaults (check
 output of ulimit -a). Same as those kernel settings from /proc/sys --
 should /proc/sys/fs/file-max be 'unlimited' by default?
 
 In fact, those limits are taken from a sample configuration file during
 "vzctl create" stage. Sample file is specified in global OpenVZ config
 file (/etc/vz/vz.conf, parameter name is CONFIGFILE, default is to take
 configuration from /etc/vz/conf/ve-vps.basic.conf-sample).
 
 There are several ways to change that default configuration:
 
 1. (globally) Put another sample config and specify it in /etc/vz/vz.conf
 2. (globally) Edit the existing sample config
 (/etc/vz/conf/ve-vps.basic.conf-sample)
 3. (per VE) Specify another config during vzctl create stage, like this:
 vzctl create VEID [--config name]
 4. (per VE) Tune the specific parameters using vzctl set [--param value
 ...] --save
 >
 >> 2.2 For OpenVZ specifically, it would be nice to collect
 >> /proc/user_beancounters output before and after the test.
 >>
 >
 > For sure... I will take a look at how integrating it in our automatic
 > test environment.
 >
 > Best regards,
 >
 >
 |  
	|  |  |  
	|  |  
	|  |  
	| 
		
			| Re: [Vserver] Re:  Container Test Campaign [message #4269 is a reply to message #4267] | Tue, 04 July 2006 14:32   |  
			| 
				
				
					|  dev Messages: 1693
 Registered: September 2005
 Location: Moscow
 | Senior Member |  
 |  |  
	| >>from the tests: >> "For benchs inside real 'guest' nodes (OpenVZ/VServer) you should
 >>  take into account that the FS tested is not the 'host' node one's."
 >>
 >>at least for Linux-VServer it should not be hard to avoid the
 >>chroot/filesystem namespace part and have it run on the host fs.
 >>a bind mount into the guest might do the trick too, if you need
 >>help to accomplish that, just let me know ...
 >
 >
 > For the moment I just use the chcontext command to get rid of filesystem
 > part. But even if the tested filesystem is not the host filesystem, I
 > just keep in mind that all applications running inside a 'guest' will
 > use _this_ filesystem and not the host one.
 >>From what I understand about VServer, it looks flexible enougth to let
 > us test different 'virtualisation' parts. A 'guest' looks like a stack
 > of different 'virtualisation' layers (chcontext + ipv4root + chroot).
 > But it's not the case for all solutions.
 
 For OpenVZ it is also possible to test different subsytems separately (virtualization/isolation,
 resource management, disk quota, CPU scheduler).
 I would notice also, that in OpenVZ all these features are ON by default.
 
 So I probably miss something, but why we test other technologies in modes when not
 all the features are ON? in this case we compare not the real overhead,
 but the one minimized for this concrete benchmark. It's just like comparing with
 Xen Domain0 which doesn't have any overhead, but not because it is a good technology, but
 rather because it doesn't do anything valuable.
 
 BTW, comparing with Xen would be interesting as well. Just to show the difference.
 
 Thanks,
 Kirill
 |  
	|  |  |  
	| 
		
			| Re: [Vserver] Re:  Container Test Campaign [message #4270 is a reply to message #4265] | Tue, 04 July 2006 14:52   |  
			|  |  
	| Clément Calmels wrote: > Hi,
 >
 >
 >>> I'm wondering why a default 'guest' creation implies some resources
 >>> restrictions? Couldn't the resources be unlimited? I understand the need
 >>> for resource management, but the default values look a little bit
 >>> tiny...
 >>>
 >>>
 >> The reason is security. A guest is untrusted by default, though sane
 >> limits are applied. Same as ulimit which has some sane defaults (check
 >> output of ulimit -a). Same as those kernel settings from /proc/sys --
 >> should /proc/sys/fs/file-max be 'unlimited' by default?
 >>
 >
 > Ok. So as our benchmarks have no security concern, you will see no
 > objection if I set all the parameters in the 'guest' to their value in
 > the host, won't you?
 Sure.
 
 In case you are testing performance (but not, say, isolation), you can
 definitely set all the UBCs to unlimited values (i.e. both barrier and
 limit for each parameter should be set to MAX_LONG). The only issues is
 with vmguarpages parameter, because this is a guarantee but not limit --
 but unless you are doing something weird it should be OK to set to to
 MAX_LONG as well.
 
 Another approach is to generate sample config (for the given server)
 using vzsplit utility with the number of VEs set to 1, like this:
 # vzsplit -f one-ve -n 1 [-s xxx]
 and use it for new VE creation:
 # vzctl create 123 --config one-ve
 |  
	|  |  |  
	| 
		
			| Re: [Vserver] Re:  Container Test Campaign [message #4273 is a reply to message #4269] | Tue, 04 July 2006 15:28   |  
			| 
				
				
					|  Cedric Le Goater Messages: 443
 Registered: February 2006
 | Senior Member |  |  |  
	| Kirill Korotaev wrote: 
 > For OpenVZ it is also possible to test different subsytems separately
 > (virtualization/isolation, resource management, disk quota, CPU scheduler).
 > I would notice also, that in OpenVZ all these features are ON by default.
 
 hmm, we didn't realize that. Good, it will make the results even more
 relevant. Any pointers on how these light virtualized/isolated environment,
 similar to vserver chontext, can be set up ?
 
 > So I probably miss something, but why we test other technologies in
 > modes when not all the features are ON ? in this case we compare not
 > the real overhead, but the one minimized for this concrete benchmark.
 
 you must know how much time it takes to set up such an environment. it's
 just a beginning. Clement is currently running tests on openvz stable, full
 vserver and adding new tests on dbench to take into account your remarks.
 
 we want to add tests and also namespace patchsets to the matrix. If you
 have some time, please send us some of your material, that'll be nice.
 
 > It's just like comparing with Xen Domain0 which doesn't have any overhead,
 > but not because it is a good technology, but rather because it doesn't do
 > anything valuable.
 
 Xen Domain0 has overhead.
 
 > BTW, comparing with Xen would be interesting as well. Just to show the
 > difference.
 
 We've done that in the OLS paper, soon to come, but we'll add the results
 to lxc.sf.net before ols. You can also check this paper :
 
 http://list.linux-vserver.org/archive/vserver/msg13234.html
 
 thanks,
 
 C.
 |  
	|  |  |  
	|  |  
	|  |  
	|  |  
	| 
		
			| Re: [Vserver] Re:  Container Test Campaign [message #4279 is a reply to message #4276] | Tue, 04 July 2006 16:30   |  
			|  |  
	| See my comments below. 
 In general - please don't get the impression I try to be fastidious. I'm
 just trying to help you create a system in which results can be
 reproducible and trusted. There are a lot of factors that influence the
 performance; some of those are far from being obvious.
 
 Clément Calmels wrote:
 > A basic 'patch' test looks like:
 > o build the appropriate kernel (2.6.16-026test014-x86_64-smp for
 > example)
 > o reboot
 > o run dbench on /tmp with 8 processes
 >
 IMO you should add a reboot here, in between _different_ tests, just
 because previous tests should not influence the following ones.
 Certainly you do not need a reboot before iterations of the same test.
 > o run tbench with 8 processes
 > o run lmbench
 > o run kernbench
 >
 > For test inside a 'guest' I just do something like:
 > o build the appropriate kernel (2.6.16-026test014-x86_64-smp for
 > example)
 > o reboot
 >
 Here you do not have to reboot. OpenVZ tools does not require OpenVZ
 kernel to be built.
 > o build the utilities (vztcl+vzquota for example)
 > o reboot
 > o launch a guest
 >
 Even this part is tricky! You haven't specified whether you create the
 guest before or after the reboot. Let me explain.
 
 If you create a guest before the reboot, the performance (at least at
 the first iteration) could be a bit higher than if you create a guest
 after the reboot. The reason is in the second case the buffer cache will
 be filled with OS template data (which is several hundred megs).
 > o run in the guest dbench ...
 >
 Again, a clean reboot is needed IMO.
 > o run in the guest tbench ...
 > ....
 >
 > -The results are the average value of several iterations of each set of
 > these kind of tests.
 Hope you do not recompile the kernels before the iterations (just to
 speed things up).
 >  I will try to update the site with the numbers of
 > iterations behind each values.
 >
 Would be great to have that data (as well as the results of the
 individual iterations, and probably graphs for the individual iterations
 -- to see the "warming" progress, discrepancy between iterations,
 degradation over iterations (if that takes place) etc).
 
 Based on that data, one can decide to further tailor the testing
 process. For example, if there are visible signs of "warming" for a
 first few iterations (i.e. the performance is worse) it makes sense to
 unconditionally exclude those from the results. If there is a sign of
 degradation, something is wrong. And so on...
 > - For the filesystem testing, the partition is not reformatted. I can
 > change this behaviour...
 >
 Disk layout is influencing the results of the test which do heavy I/O.
 Just a single example: if you try to test the performance of a web
 server, results will decrease over time. The reason of degradation is
 ... web server's access_log file! It grows over time, and write
 operation takes a bit longer (due to several different reasons).
 
 The same will happen with most of the other tests involving I/O. Thus,
 test results will be non-accurate. To achieve more accuracy and exclude
 the impact of the disk and filesystem layout to the results, you should
 reformat the partition you use for testing each time before the test.
 Note that you don't have to reinstall everything from scratch -- just
 use a separate partition (mounted to say /mnt/temptest) and make sure
 most of the I/O during the test happens on that partition.
 > - For the settings of the guest I tried to use the default settings (I
 > had to change some openvz guest settings) just following the HOWTO on
 > vserver or openvz site.
 > For the kernel parameters, did you mean kernel config file tweaking?
 >
 No I mean those params from /proc/sys (== /etc/sysctl.conf). For
 example, if you want networking for OpenVZ guests, you have to turn on
 ip_forwarding. There are some params affecting network performance, such
 as various gc_thresholds. For the big number of guests, you have to tune
 some system-wide parameters as well.
 
 So I am leading to the proposition that all such changes should be
 documented in test results.
 > - Cron are stopped during tests.
 >
 Hope you do that for the guest as well... :)
 > - All binaries are always build in the test node.
 >
 I assuming you are doing your tests on the same system (i.e. same
 compiler/libs/whatever else), and you do not change that system over
 time (i.e. you do not upgrade gcc on it in between the tests).
 > Feel free to provide me different scenario which you think are more
 > relevant.
 |  
	|  |  |  
	| 
		
			| Re: [Vserver] Re:  Container Test Campaign [message #4295 is a reply to message #4279] | Wed, 05 July 2006 07:40   |  
			| 
				
				
					|  Clement Calmels Messages: 11
 Registered: June 2006
 | Junior Member |  |  |  
	| Hi, 
 > In general - please don't get the impression I try to be fastidious. I'm
 > just trying to help you create a system in which results can be
 > reproducible and trusted. There are a lot of factors that influence the
 > performance; some of those are far from being obvious.
 
 Don't get me wrong I'm looking for such remarks :)
 
 > IMO you should add a reboot here, in between _different_ tests, just
 > because previous tests should not influence the following ones.
 > Certainly you do not need a reboot before iterations of the same test.
 
 I don't do this first because I didn't want to get test nodes wasting
 their time rebooting instead of running test. What do you think of
 something like this:
 o reboot
 o run dbench (or wathever) X times
 o reboot
 
 > > For test inside a 'guest' I just do something like:
 > > o build the appropriate kernel (2.6.16-026test014-x86_64-smp for
 > > example)
 > > o reboot
 > >
 > Here you do not have to reboot. OpenVZ tools does not require OpenVZ
 > kernel to be built.
 
 You got me... I was still believing the VZKERNEL_HEADERS variable was
 needed. Things have changed since vzctl 3.0.0-4...
 
 > > o build the utilities (vztcl+vzquota for example)
 > > o reboot
 > > o launch a guest
 > >
 > Even this part is tricky! You haven't specified whether you create the
 > guest before or after the reboot. Let me explain.
 >
 > If you create a guest before the reboot, the performance (at least at
 > the first iteration) could be a bit higher than if you create a guest
 > after the reboot. The reason is in the second case the buffer cache will
 > be filled with OS template data (which is several hundred megs).
 can
 I can split the "launch a guest" part into 2 parts:
 o guest creation
 o reboot
 o guest start-up
 Do you feel comfortable with that?
 
 > > -The results are the average value of several iterations of each set of
 > > these kind of tests.
 > Hope you do not recompile the kernels before the iterations (just to
 > speed things up).
 > >  I will try to update the site with the numbers of
 > > iterations behind each values.
 > >
 > Would be great to have that data (as well as the results of the
 > individual iterations, and probably graphs for the individual iterations
 > -- to see the "warming" progress, discrepancy between iterations,
 > degradation over iterations (if that takes place) etc).
 
 I will try to get/show those datas.
 
 > The same will happen with most of the other tests involving I/O. Thus,
 > test results will be non-accurate. To achieve more accuracy and exclude
 > the impact of the disk and filesystem layout to the results, you should
 > reformat the partition you use for testing each time before the test.
 > Note that you don't have to reinstall everything from scratch -- just
 > use a separate partition (mounted to say /mnt/temptest) and make sure
 > most of the I/O during the test happens on that partition.
 
 It would be possible for 'host' node... inside the 'guest' node, I don't
 know if it makes sense. Just adding an 'external' partition to the
 'guest' for I/O test purpose? For example in an OpenVZ guest, creating a
 new and empty simfs partition in order to run test on it?
 
 > > - For the settings of the guest I tried to use the default settings (I
 > > had to change some openvz guest settings) just following the HOWTO on
 > > vserver or openvz site.
 > > For the kernel parameters, did you mean kernel config file tweaking?
 > >
 > No I mean those params from /proc/sys (== /etc/sysctl.conf). For
 > example, if you want networking for canOpenVZ guests, you have to turn on
 > ip_forwarding. There are some params affecting network performance, such
 > as various gc_thresholds. For the big number of guests, you have to tune
 > some system-wide parameters as well.
 
 For the moment, I just follow the available documentation:
 http://wiki.openvz.org/Quick_installation#Configuring_sysctl _settings
 Do you think these paramenters can hardly affect network performance?
 >From what I understand lot of them are needed.
 
 > > - All binaries are always build in the test node.
 > >
 > I assuming you are doing your tests on the same system (i.e. same
 > compiler/libs/whatever else), and you do not change that system over
 > time (i.e. you do not upgrade gcc on it in between the tests).
 
 I hope! :)
 --
 Clément Calmels <clement.calmels@fr.ibm.com>
 |  
	|  |  |  
	| 
		
			| Re: [Vserver] Re:  Container Test Campaign [message #4296 is a reply to message #4295] | Wed, 05 July 2006 08:34   |  
			|  |  
	| Clément Calmels wrote: > What do you think of
 > something like this:
 > o reboot
 > o run dbench (or wathever) X times
 > o reboot
 >
 Perfectly fine with me.
 >> Here you do not have to reboot. OpenVZ tools does not require OpenVZ
 >> kernel to be built.
 >>
 >
 > You got me... I was still believing the VZKERNEL_HEADERS variable was
 > needed. Things have changed since vzctl 3.0.0-4..
 Yes, we get rid off that dependency, to ease the external packages
 maintenance.
 > can I can split the "launch a guest" part into 2 parts:
 > o guest creation
 > o reboot
 > o guest start-up
 > Do you feel comfortable with that?
 >
 Perfectly fine. Same scenario applies to other cases: the rule of thumb
 is if your test preparation involves a lot of I/O, you'd better reboot
 in between preparation and the actual test.
 >> The same will happen with most of the other tests involving I/O. Thus,
 >> test results will be non-accurate. To achieve more accuracy and exclude
 >> the impact of the disk and filesystem layout to the results, you should
 >> reformat the partition you use for testing each time before the test.
 >> Note that you don't have to reinstall everything from scratch -- just
 >> use a separate partition (mounted to say /mnt/temptest) and make sure
 >> most of the I/O during the test happens on that partition.
 >>
 > It would be possible for 'host' node... inside the 'guest' node, I don't
 > know if it makes sense. Just adding an 'external' partition to the
 > 'guest' for I/O test purpose? For example in an OpenVZ guest, creating a
 > new and empty simfs partition in order to run test on it?
 >
 simfs is not a real filesystem, it is kinda 'pass-though' fake FS which
 works on top of a real FS (like ext2 or ext3).
 So, in order to have a new fresh filesystem for guests, you can create
 some disk partition, mkfs and mount it to /vz. If you want to keep
 templates, just
 change the TEMPLATE variable in /etc/vz/vz.conf from /vz/template to
 something outside of /vz. There are other ways possible, and I think the
 same applies to VServer.
 >>> - For the settings of the guest I tried to use the default settings (I
 >>> had to change some openvz guest settings) just following the HOWTO on
 >>> vserver or openvz site.
 >>> For the kernel parameters, did you mean kernel config file tweaking?
 >>>
 >>>
 >> No I mean those params from /proc/sys (== /etc/sysctl.conf). For
 >> example, if you want networking for canOpenVZ guests, you have to turn on
 >> ip_forwarding. There are some params affecting network performance, such
 >> as various gc_thresholds. For the big number of guests, you have to tune
 >> some system-wide parameters as well.
 >>
 >
 > For the moment, I just follow the available documentation:
 >  http://wiki.openvz.org/Quick_installation#Configuring_sysctl _settings
 > Do you think these paramenters can hardly affect network performance?
 > From what I understand lot of them are needed.
 >
 OK. Still, such stuff should be documented on the test results pages.
 |  
	|  |  |  
	|  |  
	| 
		
			| Re: [Vserver] Re:  Container Test Campaign [message #4314 is a reply to message #4269] | Wed, 05 July 2006 12:13   |  
			| 
				
				
					|  Herbert Poetzl Messages: 239
 Registered: February 2006
 | Senior Member |  |  |  
	| On Tue, Jul 04, 2006 at 06:32:04PM +0400, Kirill Korotaev wrote: >>> from the tests:
 >>> "For benchs inside real 'guest' nodes (OpenVZ/VServer) you should
 >>>  take into account that the FS tested is not the 'host' node one's."
 >>>
 >>> at least for Linux-VServer it should not be hard to avoid the
 >>> chroot/filesystem namespace part and have it run on the host fs.
 >>> a bind mount into the guest might do the trick too, if you need
 >>> help to accomplish that, just let me know ...
 
 >> For the moment I just use the chcontext command to get rid of
 >> filesystem part. But even if the tested filesystem is not the host
 >> filesystem, I just keep in mind that all applications running inside
 >> a 'guest' will use _this_ filesystem and not the host one.
 
 >> From what I understand about VServer, it looks flexible enougth to let
 >> us test different 'virtualisation' parts. A 'guest' looks like a stack
 >> of different 'virtualisation' layers (chcontext + ipv4root + chroot).
 >> But it's not the case for all solutions.
 
 > For OpenVZ it is also possible to test different subsytems separately
 > (virtualization/isolation, resource management, disk quota, CPU
 > scheduler).
 
 > I would notice also, that in OpenVZ all these features are ON by
 > default.
 
 which is the same as for a complete guest setup, what I
 think (and you already mentioned too, IIRC) is that it is
 very important to have identical test setups with and
 without virtualization enabled, which means that the
 following conditions are met:
 
 - filesystem and involved devices are identical
 - used resources and limits are identical/very close
 - number of processes and cache state as close as possible
 - kernel/system state as close as possible
 
 > So I probably miss something, but why we test other technologies in
 > modes when not all the features are ON?
 
 doesn't make much sense, and is not what I actually
 suggested in the first place, it might be interesting
 to narrow down possible issues by putting together
 a complete 'stack' slice by slice if that allows to
 remove a single slice from that equation
 
 > in this case we compare not the real overhead, but the one minimized
 > for this concrete benchmark.
 
 actually all cases are interesting as none of them
 is supposed to add measurable overhead which is also
 true for the whole thing which is at least the sum
 of all of them
 
 HTC,
 Herbert
 
 > It's just like comparing with Xen Domain0 which doesn't have any
 > overhead, but not because it is a good technology, but rather because
 > it doesn't do anything valuable.
 >
 > BTW, comparing with Xen would be interesting as well. Just to show the
 > difference.
 >
 > Thanks,
 > Kirill
 |  
	|  |  |  
	|  |  
	|  |  
	|  |  
	|  |  
	| 
		
			| Re: [Vserver] Re:  Container Test Campaign [message #4342 is a reply to message #4333] | Thu, 06 July 2006 10:44   |  
			| 
				
				
					|  dev Messages: 1693
 Registered: September 2005
 Location: Moscow
 | Senior Member |  
 |  |  
	| Gerrit, 
 >>>>I assuming you are doing your tests on the same system (i.e. same
 >>>>compiler/libs/whatever else), and you do not change that system over
 >>>>time (i.e. you do not upgrade gcc on it in between the tests).
 >>>
 >>>
 >>>I hope! :)
 >>
 >>All binaries should be built statically to work the same way inside host/guest or
 >>you need to make sure that you have exactly the same versions of glibc and other
 >>system libraries. At least glibc can affect perforamnce very much :/
 >
 >
 > Ick - no one builds binaries statically in the real world.  And,
 > when you build binaries statically, you lose all ability to fix
 > security problems in base libraries by doing an update of that library.
 > Instead, all applications need to be rebuilt.
 >
 > Performance tests should reflect real end user usage - not contrived
 > situations that make a particular solution look better or worse.
 > If glibc can affect performance, that should be demonstrated in the
 > real performance results - it is part of the impact of the solution and
 > may need an additional solution or discussion.
 What I tried to say is that performance results done in different
 environments are not comparable so have no much meaning. I don't want us
 to waste our time digging in why one environment is a bif faster or slower than another.
 I hope you don't want too.
 
 Now, to have the same environment there are at least 2 ways:
 - make static binaries (not that good, but easiest way)
 - have exactly the same packages in host/VPS for all test cases.
 
 BTW, I also prefer 2nd way, but it is harder.
 
 Thanks,
 Kirill
 |  
	|  |  | 
 
 
 Current Time: Sat Oct 25 12:35:37 GMT 2025 
 Total time taken to generate the page: 0.12163 seconds |