* [gentoo-dev] Six month major project on Gentoo @ 2011-12-14 11:06 Gaurav Saxena 2011-12-14 18:05 ` Christian Ruppert ` (3 more replies) 0 siblings, 4 replies; 28+ messages in thread From: Gaurav Saxena @ 2011-12-14 11:06 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 472 bytes --] Hello all, I am interested in doing my final year computer scence project on gentoo. I would be having a duration of six months to work on the project. Could you please suggest me some good project ideas that would be helpful to me as well as gentoo. I am interested in parallel computing, data structures , operating system. I am well versed in C/C++. I think there might be projects which need to be done, I would like to work on them. -- Thanks and Regards , Gaurav [-- Attachment #2: Type: text/html, Size: 511 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-14 11:06 [gentoo-dev] Six month major project on Gentoo Gaurav Saxena @ 2011-12-14 18:05 ` Christian Ruppert 2011-12-14 18:14 ` "C. Bergström" 2011-12-15 18:07 ` Gaurav Saxena 2011-12-14 18:29 ` Alec Warner ` (2 subsequent siblings) 3 siblings, 2 replies; 28+ messages in thread From: Christian Ruppert @ 2011-12-14 18:05 UTC (permalink / raw To: gentoo-dev; +Cc: Gaurav Saxena [-- Attachment #1: Type: text/plain, Size: 895 bytes --] On Wednesday 14 December 2011 16:36:42 Gaurav Saxena wrote: > Hello all, > I am interested in doing my final year computer scence project on gentoo. I > would be having a duration of six months to work on the project. Could you > please suggest me some good project ideas that would be helpful to me as > well as gentoo. I am interested in parallel computing, data structures , > operating system. I am well versed in C/C++. I think there might be > projects which need to be done, I would like to work on them. What about OpenRC? :) We could need some help. http://www.gentoo.org/proj/en/base/openrc/ #openrc or #gentoo-base (IRC) via FreeNode Or openrc@gentoo.org. Let me know if you're interested or need more details :) -- Regards, Christian Ruppert Gentoo Linux developer, Bugzilla administrator and Infrastructure member Fingerprint: EEB1 C341 7C84 B274 6C59 F243 5EAB 0C62 B427 ABC8 [-- Attachment #2: This is a digitally signed message part. --] [-- Type: application/pgp-signature, Size: 490 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-14 18:05 ` Christian Ruppert @ 2011-12-14 18:14 ` "C. Bergström" 2011-12-15 18:07 ` Gaurav Saxena 1 sibling, 0 replies; 28+ messages in thread From: "C. Bergström" @ 2011-12-14 18:14 UTC (permalink / raw To: gentoo-dev; +Cc: Christian Ruppert, Gaurav Saxena On 12/15/11 01:05 AM, Christian Ruppert wrote: > On Wednesday 14 December 2011 16:36:42 Gaurav Saxena wrote: >> Hello all, >> I am interested in doing my final year computer scence project on gentoo. I >> would be having a duration of six months to work on the project. Could you >> please suggest me some good project ideas that would be helpful to me as >> well as gentoo. I am interested in parallel computing, data structures , >> operating system. I am well versed in C/C++. I think there might be >> projects which need to be done, I would like to work on them. Not directly gentoo, but certainly would impact all gentoo users - toolchain/compilers/path64 There's a number of ways the compiler could be improved for better SIMD vectorization and parallel computing /* I'm biased and work on this project - ping me on irc if you're interested */ ./C #pathscale ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-14 18:05 ` Christian Ruppert 2011-12-14 18:14 ` "C. Bergström" @ 2011-12-15 18:07 ` Gaurav Saxena 2011-12-19 17:46 ` Christian Ruppert 1 sibling, 1 reply; 28+ messages in thread From: Gaurav Saxena @ 2011-12-15 18:07 UTC (permalink / raw To: Christian Ruppert; +Cc: gentoo-dev Hello all , Thanks a lot for your replies. Christian, I am interested in Open RC, it sounds interesting to me, I would like to know more details regarding what type of projects are there that could be done. On Wed, Dec 14, 2011 at 11:35 PM, Christian Ruppert <idl0r@gentoo.org> wrote: > On Wednesday 14 December 2011 16:36:42 Gaurav Saxena wrote: >> Hello all, >> I am interested in doing my final year computer scence project on gentoo. I >> would be having a duration of six months to work on the project. Could you >> please suggest me some good project ideas that would be helpful to me as >> well as gentoo. I am interested in parallel computing, data structures , >> operating system. I am well versed in C/C++. I think there might be >> projects which need to be done, I would like to work on them. > > What about OpenRC? :) > We could need some help. > http://www.gentoo.org/proj/en/base/openrc/ > #openrc or #gentoo-base (IRC) via FreeNode > Or openrc@gentoo.org. > > Let me know if you're interested or need more details :) > > -- > Regards, > Christian Ruppert > Gentoo Linux developer, Bugzilla administrator and Infrastructure member > Fingerprint: EEB1 C341 7C84 B274 6C59 F243 5EAB 0C62 B427 ABC8 -- Thanks and Regards , Gaurav ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-15 18:07 ` Gaurav Saxena @ 2011-12-19 17:46 ` Christian Ruppert 0 siblings, 0 replies; 28+ messages in thread From: Christian Ruppert @ 2011-12-19 17:46 UTC (permalink / raw To: gentoo-dev; +Cc: Gaurav Saxena [-- Attachment #1: Type: text/plain, Size: 2233 bytes --] On Thursday 15 December 2011 23:37:10 Gaurav Saxena wrote: > Hello all , Thanks a lot for your replies. > > Christian, I am interested in Open RC, it sounds interesting to me, I > would like to know more details regarding what type of projects are > there that could be done. > > On Wed, Dec 14, 2011 at 11:35 PM, Christian Ruppert <idl0r@gentoo.org> wrote: > > On Wednesday 14 December 2011 16:36:42 Gaurav Saxena wrote: > >> Hello all, > >> I am interested in doing my final year computer scence project on > >> gentoo. I would be having a duration of six months to work on the > >> project. Could you please suggest me some good project ideas that > >> would be helpful to me as well as gentoo. I am interested in parallel > >> computing, data structures , operating system. I am well versed in > >> C/C++. I think there might be projects which need to be done, I > >> would like to work on them. > > > > What about OpenRC? :) > > We could need some help. > > http://www.gentoo.org/proj/en/base/openrc/ > > #openrc or #gentoo-base (IRC) via FreeNode > > Or openrc@gentoo.org. > > > > Let me know if you're interested or need more details :) > > > > -- > > Regards, > > Christian Ruppert > > Gentoo Linux developer, Bugzilla administrator and Infrastructure member > > Fingerprint: EEB1 C341 7C84 B274 6C59 F243 5EAB 0C62 B427 ABC8 Hi Gaurav, you may know OpenRC already from your Gentoo machines, it's the service management fronted of "sysvinit" that handles the startup/shutdown of services in various runlevels incl. dependencies. What I can say is: The rc_parallel (so starting services parallel) feature needs some love. There are some issues re service dependencies, locking etc. https://bugs.gentoo.org/391945 https://bugs.gentoo.org/360013 and some more. We also have some issues with links of init scripts: There are a lot of other "major" and minor bugs. See http://preview.tinyurl.com/openrc-bugs You can also ask us via IRC if you want (just stay longer, remind the different timezones so it may take some time till one replies :P) -- Regards, Christian Ruppert Gentoo Linux developer, Bugzilla administrator and Infrastructure member Fingerprint: EEB1 C341 7C84 B274 6C59 F243 5EAB 0C62 B427 ABC8 [-- Attachment #2: This is a digitally signed message part. --] [-- Type: application/pgp-signature, Size: 490 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-14 11:06 [gentoo-dev] Six month major project on Gentoo Gaurav Saxena 2011-12-14 18:05 ` Christian Ruppert @ 2011-12-14 18:29 ` Alec Warner 2011-12-15 18:09 ` Gaurav Saxena 2011-12-22 4:43 ` Donnie Berkholz 2011-12-18 17:02 ` [gentoo-dev] " Petteri Räty 2011-12-19 18:14 ` Sébastien Fabbro 3 siblings, 2 replies; 28+ messages in thread From: Alec Warner @ 2011-12-14 18:29 UTC (permalink / raw To: gentoo-dev On Wed, Dec 14, 2011 at 3:06 AM, Gaurav Saxena <grvsaxena419@gmail.com> wrote: > Hello all, > I am interested in doing my final year computer scence project on gentoo. I > would be having a duration of six months to work on the project. Could you > please suggest me some good project ideas that would be helpful to me as > well as gentoo. I am interested in parallel computing, data structures , > operating system. I am well versed in C/C++. I think there might be > projects which need to be done, I would like to work on them. The only idea I can think of for parallel computing / distributed systems would be at the build level. distcc-ng, a farm of user-controlled machines that compile your code in a p2p fashion. a distributed hash table of input, output tuples (basically .o caching so users can fetch the .o from the DHT) Both of these have *massive* trust issues. When random guys on the internet are compiling your code you have to be very careful about how you verify and execute that code. When you fetch .o files from a DHT you have the same problem. Almost every other problem I can think of at the Gentoo OS level can fit on a meager sized machine (i.e. it is not a distributed systems nor a parallel computing problem.) Many of the annoying parts of Gentoo are merely tools problems; the existing tools are poor / under-maintained or standard tools do not exist (so users / developers roll their own.) You may have success in the tools arena if you talk to mgorny or portage-utils@; I know mgorny has written a few C tools and might have sufficient 'gentoo' C libraries you could utilize; the portage-utils alias holds the portage-utils authors (portage-utils being another set of tools written in C for gentoo.) I actually liked cbergstrom's idea of toolchain-type stuff; but I'm not really sure how easy it is to on-board with those communities (lord knows in my senior year of CS I would have been useless working on a compiler.) > > -- > Thanks and Regards , > Gaurav ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-14 18:29 ` Alec Warner @ 2011-12-15 18:09 ` Gaurav Saxena 2011-12-22 4:43 ` Donnie Berkholz 1 sibling, 0 replies; 28+ messages in thread From: Gaurav Saxena @ 2011-12-15 18:09 UTC (permalink / raw To: gentoo-dev Hello Alec, I am interested in a distributed compiler like this ,this involves quite a good project it seems. I would like to work on it, if possible could you please give me some pointers regarding more details about the project so that I could decide what to work upon. On Wed, Dec 14, 2011 at 11:59 PM, Alec Warner <antarus@gentoo.org> wrote: > On Wed, Dec 14, 2011 at 3:06 AM, Gaurav Saxena <grvsaxena419@gmail.com> wrote: >> Hello all, >> I am interested in doing my final year computer scence project on gentoo. I >> would be having a duration of six months to work on the project. Could you >> please suggest me some good project ideas that would be helpful to me as >> well as gentoo. I am interested in parallel computing, data structures , >> operating system. I am well versed in C/C++. I think there might be >> projects which need to be done, I would like to work on them. > > The only idea I can think of for parallel computing / distributed > systems would be at the build level. > > distcc-ng, a farm of user-controlled machines that compile your code > in a p2p fashion. > a distributed hash table of input, output tuples (basically .o caching > so users can fetch the .o from the DHT) > > Both of these have *massive* trust issues. When random guys on the > internet are compiling your code you have to be very careful about how > you verify and execute that code. When you fetch .o files from a DHT > you have the same problem. > > Almost every other problem I can think of at the Gentoo OS level can > fit on a meager sized machine (i.e. it is not a distributed systems > nor a parallel computing problem.) > > Many of the annoying parts of Gentoo are merely tools problems; the > existing tools are poor / under-maintained or standard tools do not > exist (so users / developers roll their own.) You may have success in > the tools arena if you talk to mgorny or portage-utils@; I know mgorny > has written a few C tools and might have sufficient 'gentoo' C > libraries you could utilize; the portage-utils alias holds the > portage-utils authors (portage-utils being another set of tools > written in C for gentoo.) > > I actually liked cbergstrom's idea of toolchain-type stuff; but I'm > not really sure how easy it is to on-board with those communities > (lord knows in my senior year of CS I would have been useless working > on a compiler.) > >> >> -- >> Thanks and Regards , >> Gaurav > -- Thanks and Regards , Gaurav ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-14 18:29 ` Alec Warner 2011-12-15 18:09 ` Gaurav Saxena @ 2011-12-22 4:43 ` Donnie Berkholz 2011-12-22 10:52 ` Rich Freeman 1 sibling, 1 reply; 28+ messages in thread From: Donnie Berkholz @ 2011-12-22 4:43 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 1234 bytes --] On 10:29 Wed 14 Dec , Alec Warner wrote: > On Wed, Dec 14, 2011 at 3:06 AM, Gaurav Saxena <grvsaxena419@gmail.com> wrote: > > Hello all, > > I am interested in doing my final year computer scence project on gentoo. I > > would be having a duration of six months to work on the project. Could you > > please suggest me some good project ideas that would be helpful to me as > > well as gentoo. I am interested in parallel computing, data structures , > > operating system. I am well versed in C/C++. I think there might be > > projects which need to be done, I would like to work on them. > > The only idea I can think of for parallel computing / distributed > systems would be at the build level. > > distcc-ng, a farm of user-controlled machines that compile your code > in a p2p fashion. I looked into this 6 or 7 years ago. It wasn't feasible unless you were on an extremely high-speed, low-latency network, beyond what was typically accessible at the time outside of universities and LANs. Could be worth exploring again now that 25-100 mbps connections are becoming more common. -- Thanks, Donnie Donnie Berkholz Council Member / Sr. Developer Gentoo Linux Blog: http://dberkholz.com [-- Attachment #2: Type: application/pgp-signature, Size: 198 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-22 4:43 ` Donnie Berkholz @ 2011-12-22 10:52 ` Rich Freeman 2011-12-22 11:11 ` Francesco Riosa 0 siblings, 1 reply; 28+ messages in thread From: Rich Freeman @ 2011-12-22 10:52 UTC (permalink / raw To: gentoo-dev On Wed, Dec 21, 2011 at 11:43 PM, Donnie Berkholz <dberkholz@gentoo.org> wrote: > I looked into this 6 or 7 years ago. It wasn't feasible unless you were > on an extremely high-speed, low-latency network, beyond what was > typically accessible at the time outside of universities and LANs. Could > be worth exploring again now that 25-100 mbps connections are becoming > more common. I tried messing around with this with Amazon EC2. The problem was that due to latency I only really saw the benefit for VERY high levels of parallelization (think -j25+).. However, make isn't actually "distcc-aware" so it just runs 25 jobs of anything in parallel. So, anytime a makefile launched a ton of java or python jobs the host ground to a halt as it wasn't distributed and it was way more than the host could handle (especially java - which swapped like there was no tomorrow). If somebody were to do a distcc-ng for a large cluster one of the problems to solve would be having it not run jobs in parallel if it couldn't actually distribute them. Rich ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-22 10:52 ` Rich Freeman @ 2011-12-22 11:11 ` Francesco Riosa 2011-12-22 15:55 ` Michał Górny 0 siblings, 1 reply; 28+ messages in thread From: Francesco Riosa @ 2011-12-22 11:11 UTC (permalink / raw To: gentoo-dev 2011/12/22 Rich Freeman <rich0@gentoo.org>: > On Wed, Dec 21, 2011 at 11:43 PM, Donnie Berkholz <dberkholz@gentoo.org> wrote: >> I looked into this 6 or 7 years ago. It wasn't feasible unless you were >> on an extremely high-speed, low-latency network, beyond what was >> typically accessible at the time outside of universities and LANs. Could >> be worth exploring again now that 25-100 mbps connections are becoming >> more common. > > I tried messing around with this with Amazon EC2. The problem was > that due to latency I only really saw the benefit for VERY high levels > of parallelization (think -j25+).. However, make isn't actually > "distcc-aware" so it just runs 25 jobs of anything in parallel. So, > anytime a makefile launched a ton of java or python jobs the host > ground to a halt as it wasn't distributed and it was way more than the > host could handle (especially java - which swapped like there was no > tomorrow). > > If somebody were to do a distcc-ng for a large cluster one of the > problems to solve would be having it not run jobs in parallel if it > couldn't actually distribute them. > > Rich > Just wanted to point out that (if there is enough memory) recent kernels manage much better parallelism, even excess of it, once reached the maximum load augmenting threads only bring minimal loss of "real" time. ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-22 11:11 ` Francesco Riosa @ 2011-12-22 15:55 ` Michał Górny 2011-12-22 16:09 ` Rich Freeman 0 siblings, 1 reply; 28+ messages in thread From: Michał Górny @ 2011-12-22 15:55 UTC (permalink / raw To: gentoo-dev; +Cc: vivo75 [-- Attachment #1: Type: text/plain, Size: 954 bytes --] On Thu, 22 Dec 2011 12:11:32 +0100 Francesco Riosa <vivo75@gmail.com> wrote: > > I tried messing around with this with Amazon EC2. The problem was > > that due to latency I only really saw the benefit for VERY high > > levels of parallelization (think -j25+).. However, make isn't > > actually "distcc-aware" so it just runs 25 jobs of anything in > > parallel. So, anytime a makefile launched a ton of java or python > > jobs the host ground to a halt as it wasn't distributed and it was > > way more than the host could handle (especially java - which > > swapped like there was no tomorrow). > > Just wanted to point out that (if there is enough memory) recent > kernels manage much better parallelism, even excess of it, once > reached the maximum load augmenting threads only bring minimal loss of > "real" time. Does that include handling complete lack of memory and heavy swapping? -- Best regards, Michał Górny [-- Attachment #2: signature.asc --] [-- Type: application/pgp-signature, Size: 316 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-22 15:55 ` Michał Górny @ 2011-12-22 16:09 ` Rich Freeman 2011-12-22 17:40 ` [gentoo-dev] " Duncan 0 siblings, 1 reply; 28+ messages in thread From: Rich Freeman @ 2011-12-22 16:09 UTC (permalink / raw To: gentoo-dev On Thu, Dec 22, 2011 at 10:55 AM, Michał Górny <mgorny@gentoo.org> wrote: >> Just wanted to point out that (if there is enough memory) recent >> kernels manage much better parallelism, even excess of it, once >> reached the maximum load augmenting threads only bring minimal loss of >> "real" time. > > Does that include handling complete lack of memory and heavy swapping? > I think the key was the "if there is enough memory" - which I think is a pretty big issue. Running 25 invocations of java each wanting 100-200MB of RAM will cripple a system relatively quickly. And 25 really is just scratching the surface when most people think "cluster." I think the only way to really fix this would be to integrate distributed behavior into make itself. Rich ^ permalink raw reply [flat|nested] 28+ messages in thread
* [gentoo-dev] Re: Six month major project on Gentoo 2011-12-22 16:09 ` Rich Freeman @ 2011-12-22 17:40 ` Duncan 0 siblings, 0 replies; 28+ messages in thread From: Duncan @ 2011-12-22 17:40 UTC (permalink / raw To: gentoo-dev Rich Freeman posted on Thu, 22 Dec 2011 11:09:16 -0500 as excerpted: > On Thu, Dec 22, 2011 at 10:55 AM, Michał Górny <mgorny@gentoo.org> > wrote: >>> Just wanted to point out that (if there is enough memory) recent >>> kernels manage much better parallelism, even excess of it, once >>> reached the maximum load augmenting threads only bring minimal loss of >>> "real" time. >> >> Does that include handling complete lack of memory and heavy swapping? >> >> > I think the key was the "if there is enough memory" - which I think is a > pretty big issue. User experience but it matters too... The above memory limit-facter being the real limiter was true here when I used to run unlimited local jobs, too (I moderated a decent bit since portage itself parallelizes now, the fascination phase is over, and the kernel no longer requires double-load-factor to stay 95-100% CPU utilized). If the make is setup in such a way that it can sufficiently parallelize, even back on dual-core, I could run hundreds of parallel jobs (say, with the kernel build) surprisingly well -- as long as they didn't require much memory. As I've moderated since then, it's mostly to keep memory usage (and I/O, especially when it compounds with swap) sensible, way more than it is any real problem scheduling the actual loads. Plus, the kernel has gotten rather better at fully utilizing CPU resources; it used to be that you had to run about double (sometimes more, but generally slightly less) your cores in load average to avoid idle CPUs. That's no longer the case as a 25% over-scheduling seems to do what it used to require a 100% over- scheduling to achieve in terms of keeping 95-100% CPU usage, so by 50% over you're just wasting cycles and bytes. What I /really/ wish was that there was a make (and portage) switch parallel to -l, that responded based on memory instead of load average! Then those single-job gig-plus links could limit to say 4 jobs on an 8- gig machine, even if it's an 8-way that would otherwise be -j16 -l10 or -l12 (regardless of the balance or lack thereof, of an 8-way with only a gig of RAM per core... 2 gigs/core seems to be about the sweet-spot I've found for a general purpose gentooer build-station/desktop). That'd be an interesting and very useful parallel-domain task, adding a make switch that responded to memory just as -l responds to load. Maybe a make other than gmake already has such an option? -- Duncan - List replies preferred. No HTML msgs. "Every nonfree program has a lord, a master -- and if you use the program, he is your master." Richard Stallman ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-14 11:06 [gentoo-dev] Six month major project on Gentoo Gaurav Saxena 2011-12-14 18:05 ` Christian Ruppert 2011-12-14 18:29 ` Alec Warner @ 2011-12-18 17:02 ` Petteri Räty 2011-12-18 17:13 ` "Paweł Hajdan, Jr." 2011-12-19 18:14 ` Sébastien Fabbro 3 siblings, 1 reply; 28+ messages in thread From: Petteri Räty @ 2011-12-18 17:02 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 865 bytes --] On 14.12.2011 13:06, Gaurav Saxena wrote: > Hello all, > I am interested in doing my final year computer scence project on > gentoo. I would be having a duration of six months to work on the > project. Could you please suggest me some good project ideas that would > be helpful to me as well as gentoo. I am interested in parallel > computing, data structures , operating system. I am well versed in > C/C++. I think there might be projects which need to be done, I would > like to work on them. > There are parallel computing aspects in libbash for metadata generation, data structures in AST building for bash and it's quite low level. Feel free to contact me off list if you are interested. It would be nice to get that project back on track again after the last GSoC. http://www.gentoo.org/proj/en/libbash/index.xml Regards, Petteri [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 900 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-18 17:02 ` [gentoo-dev] " Petteri Räty @ 2011-12-18 17:13 ` "Paweł Hajdan, Jr." 2011-12-18 17:17 ` [gentoo-dev] libbash licensing Petteri Räty 2011-12-18 17:45 ` [gentoo-dev] Six month major project on Gentoo Michał Górny 0 siblings, 2 replies; 28+ messages in thread From: "Paweł Hajdan, Jr." @ 2011-12-18 17:13 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 549 bytes --] On 12/18/11 6:02 PM, Petteri Räty wrote: > There are parallel computing aspects in libbash for metadata generation, > data structures in AST building for bash and it's quite low level. By the way, I've always wondered why libbash is separate from the "upstream" bash. Have you considered contributing to the upstream bash to convert the shell itself to a more library-oriented design (somewhat similar to LLVM), so that you have a guarantee that the lib and the shell stay in sync? Feel free to change the subject when responding. [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 203 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* [gentoo-dev] libbash licensing 2011-12-18 17:13 ` "Paweł Hajdan, Jr." @ 2011-12-18 17:17 ` Petteri Räty 2011-12-18 17:45 ` [gentoo-dev] Six month major project on Gentoo Michał Górny 1 sibling, 0 replies; 28+ messages in thread From: Petteri Räty @ 2011-12-18 17:17 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 783 bytes --] On 18.12.2011 19:13, "Paweł Hajdan, Jr." wrote: > On 12/18/11 6:02 PM, Petteri Räty wrote: >> There are parallel computing aspects in libbash for metadata generation, >> data structures in AST building for bash and it's quite low level. > > By the way, I've always wondered why libbash is separate from the > "upstream" bash. > > Have you considered contributing to the upstream bash to convert the > shell itself to a more library-oriented design (somewhat similar to > LLVM), so that you have a guarantee that the lib and the shell stay in sync? > The main reason is that Portage is GPL-2 and bash upstream is GPL-3. Unless someone is willing to do the work to get Portage to GPL-3 or FSF to get bash GPL-2 then those two will not mix. Regards, Petteri [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 900 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-18 17:13 ` "Paweł Hajdan, Jr." 2011-12-18 17:17 ` [gentoo-dev] libbash licensing Petteri Räty @ 2011-12-18 17:45 ` Michał Górny 1 sibling, 0 replies; 28+ messages in thread From: Michał Górny @ 2011-12-18 17:45 UTC (permalink / raw To: gentoo-dev; +Cc: phajdan.jr [-- Attachment #1: Type: text/plain, Size: 798 bytes --] On Sun, 18 Dec 2011 18:13:42 +0100 ""Paweł Hajdan, Jr."" <phajdan.jr@gentoo.org> wrote: > On 12/18/11 6:02 PM, Petteri Räty wrote: > > There are parallel computing aspects in libbash for metadata > > generation, data structures in AST building for bash and it's quite > > low level. > > By the way, I've always wondered why libbash is separate from the > "upstream" bash. > > Have you considered contributing to the upstream bash to convert the > shell itself to a more library-oriented design (somewhat similar to > LLVM), so that you have a guarantee that the lib and the shell stay > in sync? I don't think upstream bash would be interested in converting bash into C++. I wouldn't be interested in running such a bash, for instance. -- Best regards, Michał Górny [-- Attachment #2: signature.asc --] [-- Type: application/pgp-signature, Size: 316 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-14 11:06 [gentoo-dev] Six month major project on Gentoo Gaurav Saxena ` (2 preceding siblings ...) 2011-12-18 17:02 ` [gentoo-dev] " Petteri Räty @ 2011-12-19 18:14 ` Sébastien Fabbro 2012-01-02 14:33 ` "Paweł Hajdan, Jr." 3 siblings, 1 reply; 28+ messages in thread From: Sébastien Fabbro @ 2011-12-19 18:14 UTC (permalink / raw To: gentoo-dev Gaurav Saxena <grvsaxena419@gmail.com> wrote: > I am interested in doing my final year computer scence project on gentoo. I > would be having a duration of six months to work on the project. Could you > please suggest me some good project ideas that would be helpful to me as > well as gentoo. I am interested in parallel computing, data structures , > operating system. I am well versed in C/C++. I think there might be > projects which need to be done, I would like to work on them. > One project that could be very useful for Gentoo is an automated stabilization/testing for ebuilds. Obviously it will require some work from the ebuild maintainers, but the ability to distribute the stabilization recipes across a volunteering Gentoo community via something like BOINC could be worth looking at. -- Sébastien ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-19 18:14 ` Sébastien Fabbro @ 2012-01-02 14:33 ` "Paweł Hajdan, Jr." 0 siblings, 0 replies; 28+ messages in thread From: "Paweł Hajdan, Jr." @ 2012-01-02 14:33 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 668 bytes --] On 12/19/11 7:14 PM, Sébastien Fabbro wrote: > One project that could be very useful for Gentoo is an automated > stabilization/testing for ebuilds. Obviously it will require some work > from the ebuild maintainers, but the ability to distribute the > stabilization recipes across a volunteering Gentoo community via > something like BOINC could be worth looking at. I can help with guidance and mentoring here. I'm maintaining a set of scripts <http://git.overlays.gentoo.org/gitweb/?p=proj/arch-tools.git;a=summary> that I use for mass-stabilization and related tasks. If someone wants to extend it so the testing can be distributed, that's great. [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 203 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
[parent not found: <i7USJ-40Q-3@gated-at.bofh.it>]
* Re: [gentoo-dev] Six month major project on Gentoo [not found] <i7USJ-40Q-3@gated-at.bofh.it> @ 2011-12-14 21:25 ` Leho Kraav 2011-12-14 23:43 ` Alec Warner 0 siblings, 1 reply; 28+ messages in thread From: Leho Kraav @ 2011-12-14 21:25 UTC (permalink / raw To: linux.gentoo.dev; +Cc: gentoo-dev i'd be really happy if someone took care of https://bugs.gentoo.org/150031 :> "include more info about binpkg in file name" ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-14 21:25 ` Leho Kraav @ 2011-12-14 23:43 ` Alec Warner 2011-12-14 23:58 ` Mike Frysinger 0 siblings, 1 reply; 28+ messages in thread From: Alec Warner @ 2011-12-14 23:43 UTC (permalink / raw To: gentoo-dev; +Cc: linux.gentoo.dev On Wed, Dec 14, 2011 at 1:25 PM, Leho Kraav <leho@kraav.com> wrote: > i'd be really happy if someone took care of https://bugs.gentoo.org/150031 :> > > "include more info about binpkg in file name" > That is great, but its not a 6 month project... -A ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-14 23:43 ` Alec Warner @ 2011-12-14 23:58 ` Mike Frysinger 2011-12-15 5:39 ` Nirbheek Chauhan 0 siblings, 1 reply; 28+ messages in thread From: Mike Frysinger @ 2011-12-14 23:58 UTC (permalink / raw To: gentoo-dev; +Cc: Alec Warner, linux.gentoo.dev [-- Attachment #1: Type: Text/Plain, Size: 452 bytes --] On Wednesday 14 December 2011 18:43:33 Alec Warner wrote: > On Wed, Dec 14, 2011 at 1:25 PM, Leho Kraav <leho@kraav.com> wrote: > > i'd be really happy if someone took care of > > https://bugs.gentoo.org/150031 :> > > > > "include more info about binpkg in file name" > > That is great, but its not a 6 month project... is it though ? i'm inclined to mark INVALID. hijacking filenames for metadata is a tuuuuuuuuuuurrible idea. -mike [-- Attachment #2: This is a digitally signed message part. --] [-- Type: application/pgp-signature, Size: 836 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-14 23:58 ` Mike Frysinger @ 2011-12-15 5:39 ` Nirbheek Chauhan 2011-12-15 12:27 ` Mike Frysinger 2011-12-15 12:43 ` Rich Freeman 0 siblings, 2 replies; 28+ messages in thread From: Nirbheek Chauhan @ 2011-12-15 5:39 UTC (permalink / raw To: gentoo-dev; +Cc: Alec Warner, linux.gentoo.dev On Thu, Dec 15, 2011 at 5:28 AM, Mike Frysinger <vapier@gentoo.org> wrote: > On Wednesday 14 December 2011 18:43:33 Alec Warner wrote: >> On Wed, Dec 14, 2011 at 1:25 PM, Leho Kraav <leho@kraav.com> wrote: >> > i'd be really happy if someone took care of >> > https://bugs.gentoo.org/150031 :> >> > >> > "include more info about binpkg in file name" >> >> That is great, but its not a 6 month project... > > is it though ? i'm inclined to mark INVALID. hijacking filenames for metadata > is a tuuuuuuuuuuurrible idea. I agree. It's along the same lines as only using file extensions for determining the filetype (and we all know how that turned out...). It *does* have the advantage of being really fast, though. Nevertheless, the basic bug is about changing the distfile repository format in such a way that a single repo can contain several distfiles built with differing build conditions. Putting metadata in the filename is only one way of ensuring that. -- ~Nirbheek Chauhan Gentoo GNOME+Mozilla Team ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-15 5:39 ` Nirbheek Chauhan @ 2011-12-15 12:27 ` Mike Frysinger 2011-12-15 12:42 ` Nirbheek Chauhan 2011-12-15 12:43 ` Rich Freeman 1 sibling, 1 reply; 28+ messages in thread From: Mike Frysinger @ 2011-12-15 12:27 UTC (permalink / raw To: gentoo-dev; +Cc: linux.gentoo.dev [-- Attachment #1: Type: Text/Plain, Size: 1538 bytes --] On Thursday 15 December 2011 00:39:44 Nirbheek Chauhan wrote: > On Thu, Dec 15, 2011 at 5:28 AM, Mike Frysinger <vapier@gentoo.org> wrote: > > On Wednesday 14 December 2011 18:43:33 Alec Warner wrote: > >> On Wed, Dec 14, 2011 at 1:25 PM, Leho Kraav <leho@kraav.com> wrote: > >> > i'd be really happy if someone took care of > >> > https://bugs.gentoo.org/150031 :> > >> > > >> > "include more info about binpkg in file name" > >> > >> That is great, but its not a 6 month project... > > > > is it though ? i'm inclined to mark INVALID. hijacking filenames for > > metadata is a tuuuuuuuuuuurrible idea. > > I agree. It's along the same lines as only using file extensions for > determining the filetype (and we all know how that turned out...). It > *does* have the advantage of being really fast, though. it just doesn't scale though (encoding all metadata into the filename quickly hits filesystem limits on name length), and i think the speed increase is only to a limit. once you get into larger repos, using the already existing "Packages" file would be faster. and since that compresses, it should scale a lot nicer. > Nevertheless, the basic bug is about changing the distfile repository > format in such a way that a single repo can contain several distfiles > built with differing build conditions. Putting metadata in the > filename is only one way of ensuring that. sounds like the summary needs updating then by someone who has waded through all the followup comments ;) -mike [-- Attachment #2: This is a digitally signed message part. --] [-- Type: application/pgp-signature, Size: 836 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-15 12:27 ` Mike Frysinger @ 2011-12-15 12:42 ` Nirbheek Chauhan 0 siblings, 0 replies; 28+ messages in thread From: Nirbheek Chauhan @ 2011-12-15 12:42 UTC (permalink / raw To: gentoo-dev On Thu, Dec 15, 2011 at 5:57 PM, Mike Frysinger <vapier@gentoo.org> wrote: > On Thursday 15 December 2011 00:39:44 Nirbheek Chauhan wrote: >> Nevertheless, the basic bug is about changing the distfile repository >> format in such a way that a single repo can contain several distfiles >> built with differing build conditions. Putting metadata in the >> filename is only one way of ensuring that. > > sounds like the summary needs updating then by someone who has waded through > all the followup comments ;) I didn't read every word, but I think I got the gist. I've changed the subject accordingly. :) -- ~Nirbheek Chauhan Gentoo GNOME+Mozilla Team ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-15 5:39 ` Nirbheek Chauhan 2011-12-15 12:27 ` Mike Frysinger @ 2011-12-15 12:43 ` Rich Freeman 2011-12-15 16:30 ` Mike Frysinger 1 sibling, 1 reply; 28+ messages in thread From: Rich Freeman @ 2011-12-15 12:43 UTC (permalink / raw To: gentoo-dev; +Cc: Alec Warner, linux.gentoo.dev On Thu, Dec 15, 2011 at 12:39 AM, Nirbheek Chauhan <nirbheek@gentoo.org> wrote: > Nevertheless, the basic bug is about changing the distfile repository > format in such a way that a single repo can contain several distfiles > built with differing build conditions. Putting metadata in the > filename is only one way of ensuring that. Well, having the filename vary when the metadata changes is the only way of ensuring that. Putting the metadata in the filename is just one of many ways to make the filename vary. Another solution (which I can already sense the objections to), would be to content-hash the files and use that as the filename. Then use indexes to point to the files. You could use symlink indexes to point to the files so that superficially it looks the same as it does now for the last version emerged. Then people looking for a particular set of metadata could use more detailed indexes to find the right file. Portage could look for an exact match when trying to merge a binpkg since searching indexes is a trivial problem. The indexes could be anything from text files to binary files to databases to a couple of directory trees full of symlinks (like /dev/disk/by-*). The symlinks could get tricky with all the metadata - it might make more sense to just keep it simple and use something more like a database for the full details and symlinks for the basics. There are countless variations on this as well - like sticking a copy of the environment for each package in a separate text file with the same base name so that it is easy to grep/search/etc. You can also make it more user-friendly by keeping the PF in the filename followed by the hash - like gvim-1.23-r1-723ba298d92f. In such a case you probably don't even need to index the PFs. Rich ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-15 12:43 ` Rich Freeman @ 2011-12-15 16:30 ` Mike Frysinger 2011-12-15 16:51 ` Mike Frysinger 0 siblings, 1 reply; 28+ messages in thread From: Mike Frysinger @ 2011-12-15 16:30 UTC (permalink / raw To: gentoo-dev; +Cc: linux.gentoo.dev [-- Attachment #1: Type: Text/Plain, Size: 2260 bytes --] On Thursday 15 December 2011 07:43:26 Rich Freeman wrote: > On Thu, Dec 15, 2011 at 12:39 AM, Nirbheek Chauhan wrote: > > Nevertheless, the basic bug is about changing the distfile repository > > format in such a way that a single repo can contain several distfiles > > built with differing build conditions. Putting metadata in the > > filename is only one way of ensuring that. > > Well, having the filename vary when the metadata changes is the only > way of ensuring that. Putting the metadata in the filename is just > one of many ways to make the filename vary. there is more raw metadata available than fits into a filename. so that is already a non-starter. if people want to post multiple binpkgs with different metadata > Another solution (which I can already sense the objections to), would > be to content-hash the files and use that as the filename. Then use > indexes to point to the files. You could use symlink indexes to point > to the files so that superficially it looks the same as it does now > for the last version emerged. Then people looking for a particular > set of metadata could use more detailed indexes to find the right > file. Portage could look for an exact match when trying to merge a > binpkg since searching indexes is a trivial problem. we already hash all the files (i.e. the CONTENTS file), so using the hash of that file alone wouldn't be a bad idea. although i think that file gets generated on the fly when merging the binpkg (seems like a waste to not cache that in the binary package ...). > There are countless variations on this as well - like sticking a copy > of the environment for each package in a separate text file with the > same base name so that it is easy to grep/search/etc. the env is already "trivial" to extract: qtbz2 -x -O bison-2.5.tbz2 | qxpak -O -x - environment.bz2 | bzgrep foo > You can also make it more user-friendly by keeping the PF in the > filename followed by the hash - like gvim-1.23-r1-723ba298d92f. In > such a case you probably don't even need to index the PFs. portage should be using the generated "Packages" index to look up actual tbz2 filenames, so having ${PF}-<hash>.tbz2 shouldn't be too painful. -mike [-- Attachment #2: This is a digitally signed message part. --] [-- Type: application/pgp-signature, Size: 836 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
* Re: [gentoo-dev] Six month major project on Gentoo 2011-12-15 16:30 ` Mike Frysinger @ 2011-12-15 16:51 ` Mike Frysinger 0 siblings, 0 replies; 28+ messages in thread From: Mike Frysinger @ 2011-12-15 16:51 UTC (permalink / raw To: gentoo-dev; +Cc: linux.gentoo.dev [-- Attachment #1: Type: Text/Plain, Size: 174 bytes --] On Thursday 15 December 2011 11:30:46 Mike Frysinger wrote: > if people want to post multiple binpkgs with different metadata err, half formed thought here ... ignore -mike [-- Attachment #2: This is a digitally signed message part. --] [-- Type: application/pgp-signature, Size: 836 bytes --] ^ permalink raw reply [flat|nested] 28+ messages in thread
end of thread, other threads:[~2012-01-02 14:33 UTC | newest] Thread overview: 28+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2011-12-14 11:06 [gentoo-dev] Six month major project on Gentoo Gaurav Saxena 2011-12-14 18:05 ` Christian Ruppert 2011-12-14 18:14 ` "C. Bergström" 2011-12-15 18:07 ` Gaurav Saxena 2011-12-19 17:46 ` Christian Ruppert 2011-12-14 18:29 ` Alec Warner 2011-12-15 18:09 ` Gaurav Saxena 2011-12-22 4:43 ` Donnie Berkholz 2011-12-22 10:52 ` Rich Freeman 2011-12-22 11:11 ` Francesco Riosa 2011-12-22 15:55 ` Michał Górny 2011-12-22 16:09 ` Rich Freeman 2011-12-22 17:40 ` [gentoo-dev] " Duncan 2011-12-18 17:02 ` [gentoo-dev] " Petteri Räty 2011-12-18 17:13 ` "Paweł Hajdan, Jr." 2011-12-18 17:17 ` [gentoo-dev] libbash licensing Petteri Räty 2011-12-18 17:45 ` [gentoo-dev] Six month major project on Gentoo Michał Górny 2011-12-19 18:14 ` Sébastien Fabbro 2012-01-02 14:33 ` "Paweł Hajdan, Jr." [not found] <i7USJ-40Q-3@gated-at.bofh.it> 2011-12-14 21:25 ` Leho Kraav 2011-12-14 23:43 ` Alec Warner 2011-12-14 23:58 ` Mike Frysinger 2011-12-15 5:39 ` Nirbheek Chauhan 2011-12-15 12:27 ` Mike Frysinger 2011-12-15 12:42 ` Nirbheek Chauhan 2011-12-15 12:43 ` Rich Freeman 2011-12-15 16:30 ` Mike Frysinger 2011-12-15 16:51 ` Mike Frysinger
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox