From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from lists.gentoo.org ([140.105.134.102] helo=robin.gentoo.org) by nuthatch.gentoo.org with esmtp (Exim 4.62) (envelope-from ) id 1GsWrl-0008CB-88 for garchives@archives.gentoo.org; Fri, 08 Dec 2006 03:57:41 +0000 Received: from robin.gentoo.org (localhost [127.0.0.1]) by robin.gentoo.org (8.13.8/8.13.8) with SMTP id kB83unWo031609; Fri, 8 Dec 2006 03:56:49 GMT Received: from ece06.nas.nasa.gov (ece06.nas.nasa.gov [129.99.139.32]) by robin.gentoo.org (8.13.8/8.13.8) with ESMTP id kB83ultc028460 for ; Fri, 8 Dec 2006 03:56:48 GMT Received: from ece06 (localhost [127.0.0.1]) by ece06.nas.nasa.gov (8.13.7/8.13.4) with ESMTP id kB83ukCo022749 for ; Thu, 7 Dec 2006 19:56:46 -0800 Message-Id: <200612080356.kB83ukCo022749@ece06.nas.nasa.gov> To: gentoo-cluster@lists.gentoo.org Subject: Re: [gentoo-cluster] examples of (large) Gentoo clusters In-reply-to: Your message of "Thu, 07 Dec 2006 08:12:57 EST." <17784.4953.845451.460955@gs105.sicortex.com> X-url: http://www.nas.nasa.gov/~bgreen/ Date: Thu, 07 Dec 2006 19:56:46 -0800 From: Bryan Green Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Id: Gentoo Linux mail X-BeenThere: gentoo-cluster@gentoo.org Reply-to: gentoo-cluster@lists.gentoo.org X-Archives-Salt: c672ea4c-70df-44e0-8c57-f7c8f0cb8c58 X-Archives-Hash: 6ca66e41d6446d4206f7d5d6ef57781c "John R. Dunning" writes: > From: Bryan Green > > It is encouraging to hear that you are willing to base a product on Lustre > 1.6. > > There are problems either way, but based on my experience, I believe 1.6 is a > better choice, at least for the kind of situation I'm expecting to see. > That's based partly on the fact that in my testing I've seen a pretty small > quotient of out-and-out bugs (though there are a couple which are pretty > annoying) and partly on the fact that configuration and management-wise, 1.6 > is way easier to deal with. Part of what I expect will be happening in > deployments is to be building lustrefs's on the fly, under control of some > kind of configurator thingie. For that kind of task, 1.4 would be much more > difficult to deal with. > >>From my limited experience with 1.6, and even more limited experience with 1.4, I wholeheartedly agree with your assessment. Version 1.4 looks like a real headache to configure. By comparison, 'mount -t lustre' pretty much characterizes the simplicity of 1.6. > > Are you by any chance willing to share some of your knowledge about > installing Lustre on Gentoo with others? :) > > Sure. > > Are you worrying about the kernel patching and other software installation > issues, or about how to set up the fs itself once you've got the software > together? Kernel patching. For software installation, the lustre ebuild that was put on this list recently seemed to do the trick for me, and setup was pretty easy. I was able to patch the kernel, but the server was somewhat unstable. Actually, my memory is hazy. I used the 'lustre-sources' ebuild, which effectively packaged up the patches. It was a 2.6.15 kernel. I also tried to make a custom kernel for lustre 1.4, but ultimately hit too many roadblocks. I did learn a bit about how to use 'quilt' though. > > Very briefly, the kernel-patching issue is an ongoing headache. Lustre > patches vfs in non-trivial ways. Unfortunately, everybody else does too. It > becomes a fairly ugly patch-merging problem. If you want, I can detail the > process I've settled on for coming up with a kernel patchset, but you won't > like it. There are similar issues around ldiskfs and other bits, but they're > simpler, at least by comparison. I'd be interested in some of the details - off-list if that is more appropriate, though it might be of interest to others on the list as well. Once you download a 1.6 beta, how do you produce a kernel for Gentoo? Do you patch a gentoo-sources kernel, a vanilla-sources kernel, or something else? The ideal would perhaps be to have a 'lustre-sources' ebuild in the gentoo-science overlay. :) > > Perhaps I could make > self-support an option, if it looked like it would be reliable. > > Well, obviously, you should test the bejeezus out of your configuration before > you declare open season on it. So far I haven't found reason to believe > lustre is substantially worse than any of the other open-source software > packages which are used in production situations. I think that constitutes a > qualified "yes" :-} Are you considering getting support from CFS at some point? Sorry, you don't have to answer if that is a sensitive question. But part of this thread has been the topic of encouraging CFS to support Gentoo. Interestingly, my colleague, who is in charge of installing Lustre (1.4) on our test system, is talking to CFS about supporting a vanilla kernel configuration. The reason? We can't make the system stable with a SLES kernel. It was stable for a long time with Gentoo. Now they seem to have gotten it stable with SLES plus a vanilla 2.6.19 kernel (which of course does not have the Lustre patches). So they want Suse to provide a newer SLES kernel with the Lustre patches, and CFS to support that configuration. -bryan -- gentoo-cluster@gentoo.org mailing list