public inbox for gentoo-dev@lists.gentoo.org
 help / color / mirror / Atom feed
* [gentoo-dev] Six month major project on Gentoo
@ 2011-12-14 11:06 Gaurav Saxena
  2011-12-14 18:05 ` Christian Ruppert
                   ` (3 more replies)
  0 siblings, 4 replies; 28+ messages in thread
From: Gaurav Saxena @ 2011-12-14 11:06 UTC (permalink / raw
  To: gentoo-dev

[-- Attachment #1: Type: text/plain, Size: 472 bytes --]

Hello all,
I am interested in doing my final year computer scence project on gentoo. I
would be having a duration of six months to work on the project. Could you
please suggest me some good project ideas that would be helpful to me as
well as gentoo. I am interested in parallel computing, data structures ,
operating system. I am well versed in C/C++. I think  there might be
projects which need to be done, I would like to work on them.

-- 
Thanks and Regards ,
Gaurav

[-- Attachment #2: Type: text/html, Size: 511 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-14 11:06 [gentoo-dev] Six month major project on Gentoo Gaurav Saxena
@ 2011-12-14 18:05 ` Christian Ruppert
  2011-12-14 18:14   ` "C. Bergström"
  2011-12-15 18:07   ` Gaurav Saxena
  2011-12-14 18:29 ` Alec Warner
                   ` (2 subsequent siblings)
  3 siblings, 2 replies; 28+ messages in thread
From: Christian Ruppert @ 2011-12-14 18:05 UTC (permalink / raw
  To: gentoo-dev; +Cc: Gaurav Saxena

[-- Attachment #1: Type: text/plain, Size: 895 bytes --]

On Wednesday 14 December 2011 16:36:42 Gaurav Saxena wrote:
> Hello all,
> I am interested in doing my final year computer scence project on gentoo. I
> would be having a duration of six months to work on the project. Could you
> please suggest me some good project ideas that would be helpful to me as
> well as gentoo. I am interested in parallel computing, data structures ,
> operating system. I am well versed in C/C++. I think  there might be
> projects which need to be done, I would like to work on them.

What about OpenRC? :)
We could need some help.
http://www.gentoo.org/proj/en/base/openrc/
#openrc or #gentoo-base (IRC) via FreeNode
Or openrc@gentoo.org.

Let me know if you're interested or need more details :)

-- 
Regards,
Christian Ruppert
Gentoo Linux developer, Bugzilla administrator and Infrastructure member
Fingerprint: EEB1 C341 7C84 B274 6C59 F243 5EAB 0C62 B427 ABC8

[-- Attachment #2: This is a digitally signed message part. --]
[-- Type: application/pgp-signature, Size: 490 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-14 18:05 ` Christian Ruppert
@ 2011-12-14 18:14   ` "C. Bergström"
  2011-12-15 18:07   ` Gaurav Saxena
  1 sibling, 0 replies; 28+ messages in thread
From: "C. Bergström" @ 2011-12-14 18:14 UTC (permalink / raw
  To: gentoo-dev; +Cc: Christian Ruppert, Gaurav Saxena

On 12/15/11 01:05 AM, Christian Ruppert wrote:
> On Wednesday 14 December 2011 16:36:42 Gaurav Saxena wrote:
>> Hello all,
>> I am interested in doing my final year computer scence project on gentoo. I
>> would be having a duration of six months to work on the project. Could you
>> please suggest me some good project ideas that would be helpful to me as
>> well as gentoo. I am interested in parallel computing, data structures ,
>> operating system. I am well versed in C/C++. I think  there might be
>> projects which need to be done, I would like to work on them.
Not directly gentoo, but certainly would impact all gentoo users - 
toolchain/compilers/path64
There's a number of ways the compiler could be improved for better SIMD 
vectorization and parallel computing
/* I'm biased and work on this project - ping me on irc if you're 
interested */

./C

#pathscale



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-14 11:06 [gentoo-dev] Six month major project on Gentoo Gaurav Saxena
  2011-12-14 18:05 ` Christian Ruppert
@ 2011-12-14 18:29 ` Alec Warner
  2011-12-15 18:09   ` Gaurav Saxena
  2011-12-22  4:43   ` Donnie Berkholz
  2011-12-18 17:02 ` [gentoo-dev] " Petteri Räty
  2011-12-19 18:14 ` Sébastien Fabbro
  3 siblings, 2 replies; 28+ messages in thread
From: Alec Warner @ 2011-12-14 18:29 UTC (permalink / raw
  To: gentoo-dev

On Wed, Dec 14, 2011 at 3:06 AM, Gaurav Saxena <grvsaxena419@gmail.com> wrote:
> Hello all,
> I am interested in doing my final year computer scence project on gentoo. I
> would be having a duration of six months to work on the project. Could you
> please suggest me some good project ideas that would be helpful to me as
> well as gentoo. I am interested in parallel computing, data structures ,
> operating system. I am well versed in C/C++. I think  there might be
> projects which need to be done, I would like to work on them.

The only idea I can think of for parallel computing / distributed
systems would be at the build level.

distcc-ng, a farm of user-controlled machines that compile your code
in a p2p fashion.
a distributed hash table of input, output tuples (basically .o caching
so users can fetch the .o from the DHT)

Both of these have *massive* trust issues. When random guys on the
internet are compiling your code you have to be very careful about how
you verify and execute that code. When you fetch .o files from a DHT
you have the same problem.

Almost every other problem I can think of at the Gentoo OS level can
fit on a meager sized machine (i.e. it is not a distributed systems
nor a parallel computing problem.)

Many of the annoying parts of Gentoo are merely tools problems; the
existing tools are poor / under-maintained or standard tools do not
exist (so users / developers roll their own.) You may have success in
the tools arena if you talk to mgorny or portage-utils@; I know mgorny
has written a few C tools and might have sufficient 'gentoo' C
libraries you could utilize; the portage-utils alias holds the
portage-utils authors (portage-utils being another set of tools
written in C for gentoo.)

I actually liked cbergstrom's idea of toolchain-type stuff; but I'm
not really sure how easy it is to on-board with those communities
(lord knows in my senior year of CS I would have been useless working
on a compiler.)

>
> --
> Thanks and Regards ,
> Gaurav



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
       [not found] <i7USJ-40Q-3@gated-at.bofh.it>
@ 2011-12-14 21:25 ` Leho Kraav
  2011-12-14 23:43   ` Alec Warner
  0 siblings, 1 reply; 28+ messages in thread
From: Leho Kraav @ 2011-12-14 21:25 UTC (permalink / raw
  To: linux.gentoo.dev; +Cc: gentoo-dev

i'd be really happy if someone took care of https://bugs.gentoo.org/150031 :>

"include more info about binpkg in file name"



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-14 21:25 ` Leho Kraav
@ 2011-12-14 23:43   ` Alec Warner
  2011-12-14 23:58     ` Mike Frysinger
  0 siblings, 1 reply; 28+ messages in thread
From: Alec Warner @ 2011-12-14 23:43 UTC (permalink / raw
  To: gentoo-dev; +Cc: linux.gentoo.dev

On Wed, Dec 14, 2011 at 1:25 PM, Leho Kraav <leho@kraav.com> wrote:
> i'd be really happy if someone took care of https://bugs.gentoo.org/150031 :>
>
> "include more info about binpkg in file name"
>

That is great, but its not a 6 month project...

-A



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-14 23:43   ` Alec Warner
@ 2011-12-14 23:58     ` Mike Frysinger
  2011-12-15  5:39       ` Nirbheek Chauhan
  0 siblings, 1 reply; 28+ messages in thread
From: Mike Frysinger @ 2011-12-14 23:58 UTC (permalink / raw
  To: gentoo-dev; +Cc: Alec Warner, linux.gentoo.dev

[-- Attachment #1: Type: Text/Plain, Size: 452 bytes --]

On Wednesday 14 December 2011 18:43:33 Alec Warner wrote:
> On Wed, Dec 14, 2011 at 1:25 PM, Leho Kraav <leho@kraav.com> wrote:
> > i'd be really happy if someone took care of
> > https://bugs.gentoo.org/150031 :>
> > 
> > "include more info about binpkg in file name"
> 
> That is great, but its not a 6 month project...

is it though ?  i'm inclined to mark INVALID.  hijacking filenames for metadata 
is a tuuuuuuuuuuurrible idea.
-mike

[-- Attachment #2: This is a digitally signed message part. --]
[-- Type: application/pgp-signature, Size: 836 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-14 23:58     ` Mike Frysinger
@ 2011-12-15  5:39       ` Nirbheek Chauhan
  2011-12-15 12:27         ` Mike Frysinger
  2011-12-15 12:43         ` Rich Freeman
  0 siblings, 2 replies; 28+ messages in thread
From: Nirbheek Chauhan @ 2011-12-15  5:39 UTC (permalink / raw
  To: gentoo-dev; +Cc: Alec Warner, linux.gentoo.dev

On Thu, Dec 15, 2011 at 5:28 AM, Mike Frysinger <vapier@gentoo.org> wrote:
> On Wednesday 14 December 2011 18:43:33 Alec Warner wrote:
>> On Wed, Dec 14, 2011 at 1:25 PM, Leho Kraav <leho@kraav.com> wrote:
>> > i'd be really happy if someone took care of
>> > https://bugs.gentoo.org/150031 :>
>> >
>> > "include more info about binpkg in file name"
>>
>> That is great, but its not a 6 month project...
>
> is it though ?  i'm inclined to mark INVALID.  hijacking filenames for metadata
> is a tuuuuuuuuuuurrible idea.

I agree. It's along the same lines as only using file extensions for
determining the filetype (and we all know how that turned out...). It
*does* have the advantage of being really fast, though.

Nevertheless, the basic bug is about changing the distfile repository
format in such a way that a single repo can contain several distfiles
built with differing build conditions. Putting metadata in the
filename is only one way of ensuring that.

-- 
~Nirbheek Chauhan

Gentoo GNOME+Mozilla Team



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-15  5:39       ` Nirbheek Chauhan
@ 2011-12-15 12:27         ` Mike Frysinger
  2011-12-15 12:42           ` Nirbheek Chauhan
  2011-12-15 12:43         ` Rich Freeman
  1 sibling, 1 reply; 28+ messages in thread
From: Mike Frysinger @ 2011-12-15 12:27 UTC (permalink / raw
  To: gentoo-dev; +Cc: linux.gentoo.dev

[-- Attachment #1: Type: Text/Plain, Size: 1538 bytes --]

On Thursday 15 December 2011 00:39:44 Nirbheek Chauhan wrote:
> On Thu, Dec 15, 2011 at 5:28 AM, Mike Frysinger <vapier@gentoo.org> wrote:
> > On Wednesday 14 December 2011 18:43:33 Alec Warner wrote:
> >> On Wed, Dec 14, 2011 at 1:25 PM, Leho Kraav <leho@kraav.com> wrote:
> >> > i'd be really happy if someone took care of
> >> > https://bugs.gentoo.org/150031 :>
> >> > 
> >> > "include more info about binpkg in file name"
> >> 
> >> That is great, but its not a 6 month project...
> > 
> > is it though ?  i'm inclined to mark INVALID.  hijacking filenames for
> > metadata is a tuuuuuuuuuuurrible idea.
> 
> I agree. It's along the same lines as only using file extensions for
> determining the filetype (and we all know how that turned out...). It
> *does* have the advantage of being really fast, though.

it just doesn't scale though (encoding all metadata into the filename quickly 
hits filesystem limits on name length), and i think the speed increase is only 
to a limit.  once you get into larger repos, using the already existing 
"Packages" file would be faster.  and since that compresses, it should scale a 
lot nicer.

> Nevertheless, the basic bug is about changing the distfile repository
> format in such a way that a single repo can contain several distfiles
> built with differing build conditions. Putting metadata in the
> filename is only one way of ensuring that.

sounds like the summary needs updating then by someone who has waded through 
all the followup comments ;)
-mike

[-- Attachment #2: This is a digitally signed message part. --]
[-- Type: application/pgp-signature, Size: 836 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-15 12:27         ` Mike Frysinger
@ 2011-12-15 12:42           ` Nirbheek Chauhan
  0 siblings, 0 replies; 28+ messages in thread
From: Nirbheek Chauhan @ 2011-12-15 12:42 UTC (permalink / raw
  To: gentoo-dev

On Thu, Dec 15, 2011 at 5:57 PM, Mike Frysinger <vapier@gentoo.org> wrote:
> On Thursday 15 December 2011 00:39:44 Nirbheek Chauhan wrote:
>> Nevertheless, the basic bug is about changing the distfile repository
>> format in such a way that a single repo can contain several distfiles
>> built with differing build conditions. Putting metadata in the
>> filename is only one way of ensuring that.
>
> sounds like the summary needs updating then by someone who has waded through
> all the followup comments ;)

I didn't read every word, but I think I got the gist. I've changed the
subject accordingly. :)

-- 
~Nirbheek Chauhan

Gentoo GNOME+Mozilla Team



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-15  5:39       ` Nirbheek Chauhan
  2011-12-15 12:27         ` Mike Frysinger
@ 2011-12-15 12:43         ` Rich Freeman
  2011-12-15 16:30           ` Mike Frysinger
  1 sibling, 1 reply; 28+ messages in thread
From: Rich Freeman @ 2011-12-15 12:43 UTC (permalink / raw
  To: gentoo-dev; +Cc: Alec Warner, linux.gentoo.dev

On Thu, Dec 15, 2011 at 12:39 AM, Nirbheek Chauhan <nirbheek@gentoo.org> wrote:
> Nevertheless, the basic bug is about changing the distfile repository
> format in such a way that a single repo can contain several distfiles
> built with differing build conditions. Putting metadata in the
> filename is only one way of ensuring that.

Well, having the filename vary when the metadata changes is the only
way of ensuring that.  Putting the metadata in the filename is just
one of many ways to make the filename vary.

Another solution (which I can already sense the objections to), would
be to content-hash the files and use that as the filename.  Then use
indexes to point to the files.  You could use symlink indexes to point
to the files so that superficially it looks the same as it does now
for the last version emerged.  Then people looking for a particular
set of metadata could use more detailed indexes to find the right
file.  Portage could look for an exact match when trying to merge a
binpkg since searching indexes is a trivial problem.

The indexes could be anything from text files to binary files to
databases to a couple of directory trees full of symlinks (like
/dev/disk/by-*).  The symlinks could get tricky with all the metadata
- it might make more sense to just keep it simple and use something
more like a database for the full details and symlinks for the basics.

There are countless variations on this as well - like sticking a copy
of the environment for each package in a separate text file with the
same base name so that it is easy to grep/search/etc.

You can also make it more user-friendly by keeping the PF in the
filename followed by the hash - like gvim-1.23-r1-723ba298d92f.  In
such a case you probably don't even need to index the PFs.

Rich



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-15 12:43         ` Rich Freeman
@ 2011-12-15 16:30           ` Mike Frysinger
  2011-12-15 16:51             ` Mike Frysinger
  0 siblings, 1 reply; 28+ messages in thread
From: Mike Frysinger @ 2011-12-15 16:30 UTC (permalink / raw
  To: gentoo-dev; +Cc: linux.gentoo.dev

[-- Attachment #1: Type: Text/Plain, Size: 2260 bytes --]

On Thursday 15 December 2011 07:43:26 Rich Freeman wrote:
> On Thu, Dec 15, 2011 at 12:39 AM, Nirbheek Chauhan wrote:
> > Nevertheless, the basic bug is about changing the distfile repository
> > format in such a way that a single repo can contain several distfiles
> > built with differing build conditions. Putting metadata in the
> > filename is only one way of ensuring that.
> 
> Well, having the filename vary when the metadata changes is the only
> way of ensuring that.  Putting the metadata in the filename is just
> one of many ways to make the filename vary.

there is more raw metadata available than fits into a filename.  so that is 
already a non-starter.

if people want to post multiple binpkgs with different metadata

> Another solution (which I can already sense the objections to), would
> be to content-hash the files and use that as the filename.  Then use
> indexes to point to the files.  You could use symlink indexes to point
> to the files so that superficially it looks the same as it does now
> for the last version emerged.  Then people looking for a particular
> set of metadata could use more detailed indexes to find the right
> file.  Portage could look for an exact match when trying to merge a
> binpkg since searching indexes is a trivial problem.

we already hash all the files (i.e. the CONTENTS file), so using the hash of 
that file alone wouldn't be a bad idea.  although i think that file gets 
generated on the fly when merging the binpkg (seems like a waste to not cache 
that in the binary package ...).

> There are countless variations on this as well - like sticking a copy
> of the environment for each package in a separate text file with the
> same base name so that it is easy to grep/search/etc.

the env is already "trivial" to extract:
	qtbz2 -x -O bison-2.5.tbz2 | qxpak -O -x - environment.bz2 | bzgrep foo

> You can also make it more user-friendly by keeping the PF in the
> filename followed by the hash - like gvim-1.23-r1-723ba298d92f.  In
> such a case you probably don't even need to index the PFs.

portage should be using the generated "Packages" index to look up actual tbz2 
filenames, so having ${PF}-<hash>.tbz2 shouldn't be too painful.
-mike

[-- Attachment #2: This is a digitally signed message part. --]
[-- Type: application/pgp-signature, Size: 836 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-15 16:30           ` Mike Frysinger
@ 2011-12-15 16:51             ` Mike Frysinger
  0 siblings, 0 replies; 28+ messages in thread
From: Mike Frysinger @ 2011-12-15 16:51 UTC (permalink / raw
  To: gentoo-dev; +Cc: linux.gentoo.dev

[-- Attachment #1: Type: Text/Plain, Size: 174 bytes --]

On Thursday 15 December 2011 11:30:46 Mike Frysinger wrote:
> if people want to post multiple binpkgs with different metadata

err, half formed thought here ... ignore
-mike

[-- Attachment #2: This is a digitally signed message part. --]
[-- Type: application/pgp-signature, Size: 836 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-14 18:05 ` Christian Ruppert
  2011-12-14 18:14   ` "C. Bergström"
@ 2011-12-15 18:07   ` Gaurav Saxena
  2011-12-19 17:46     ` Christian Ruppert
  1 sibling, 1 reply; 28+ messages in thread
From: Gaurav Saxena @ 2011-12-15 18:07 UTC (permalink / raw
  To: Christian Ruppert; +Cc: gentoo-dev

Hello all , Thanks a lot for your replies.

Christian, I am interested in Open RC, it sounds interesting to me, I
would like to know more details regarding what type of projects are
there that could be done.

On Wed, Dec 14, 2011 at 11:35 PM, Christian Ruppert <idl0r@gentoo.org> wrote:
> On Wednesday 14 December 2011 16:36:42 Gaurav Saxena wrote:
>> Hello all,
>> I am interested in doing my final year computer scence project on gentoo. I
>> would be having a duration of six months to work on the project. Could you
>> please suggest me some good project ideas that would be helpful to me as
>> well as gentoo. I am interested in parallel computing, data structures ,
>> operating system. I am well versed in C/C++. I think  there might be
>> projects which need to be done, I would like to work on them.
>
> What about OpenRC? :)
> We could need some help.
> http://www.gentoo.org/proj/en/base/openrc/
> #openrc or #gentoo-base (IRC) via FreeNode
> Or openrc@gentoo.org.
>
> Let me know if you're interested or need more details :)
>
> --
> Regards,
> Christian Ruppert
> Gentoo Linux developer, Bugzilla administrator and Infrastructure member
> Fingerprint: EEB1 C341 7C84 B274 6C59 F243 5EAB 0C62 B427 ABC8



-- 
Thanks and Regards ,
Gaurav



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-14 18:29 ` Alec Warner
@ 2011-12-15 18:09   ` Gaurav Saxena
  2011-12-22  4:43   ` Donnie Berkholz
  1 sibling, 0 replies; 28+ messages in thread
From: Gaurav Saxena @ 2011-12-15 18:09 UTC (permalink / raw
  To: gentoo-dev

Hello Alec,
I am interested in a distributed compiler like this ,this involves
quite a good project it seems. I would like to work on it, if possible
could you please give me some pointers regarding more details about
the project so that I could decide what to work upon.

On Wed, Dec 14, 2011 at 11:59 PM, Alec Warner <antarus@gentoo.org> wrote:
> On Wed, Dec 14, 2011 at 3:06 AM, Gaurav Saxena <grvsaxena419@gmail.com> wrote:
>> Hello all,
>> I am interested in doing my final year computer scence project on gentoo. I
>> would be having a duration of six months to work on the project. Could you
>> please suggest me some good project ideas that would be helpful to me as
>> well as gentoo. I am interested in parallel computing, data structures ,
>> operating system. I am well versed in C/C++. I think  there might be
>> projects which need to be done, I would like to work on them.
>
> The only idea I can think of for parallel computing / distributed
> systems would be at the build level.
>
> distcc-ng, a farm of user-controlled machines that compile your code
> in a p2p fashion.
> a distributed hash table of input, output tuples (basically .o caching
> so users can fetch the .o from the DHT)
>
> Both of these have *massive* trust issues. When random guys on the
> internet are compiling your code you have to be very careful about how
> you verify and execute that code. When you fetch .o files from a DHT
> you have the same problem.
>
> Almost every other problem I can think of at the Gentoo OS level can
> fit on a meager sized machine (i.e. it is not a distributed systems
> nor a parallel computing problem.)
>
> Many of the annoying parts of Gentoo are merely tools problems; the
> existing tools are poor / under-maintained or standard tools do not
> exist (so users / developers roll their own.) You may have success in
> the tools arena if you talk to mgorny or portage-utils@; I know mgorny
> has written a few C tools and might have sufficient 'gentoo' C
> libraries you could utilize; the portage-utils alias holds the
> portage-utils authors (portage-utils being another set of tools
> written in C for gentoo.)
>
> I actually liked cbergstrom's idea of toolchain-type stuff; but I'm
> not really sure how easy it is to on-board with those communities
> (lord knows in my senior year of CS I would have been useless working
> on a compiler.)
>
>>
>> --
>> Thanks and Regards ,
>> Gaurav
>



-- 
Thanks and Regards ,
Gaurav



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-14 11:06 [gentoo-dev] Six month major project on Gentoo Gaurav Saxena
  2011-12-14 18:05 ` Christian Ruppert
  2011-12-14 18:29 ` Alec Warner
@ 2011-12-18 17:02 ` Petteri Räty
  2011-12-18 17:13   ` "Paweł Hajdan, Jr."
  2011-12-19 18:14 ` Sébastien Fabbro
  3 siblings, 1 reply; 28+ messages in thread
From: Petteri Räty @ 2011-12-18 17:02 UTC (permalink / raw
  To: gentoo-dev

[-- Attachment #1: Type: text/plain, Size: 865 bytes --]

On 14.12.2011 13:06, Gaurav Saxena wrote:
> Hello all, 
> I am interested in doing my final year computer scence project on
> gentoo. I would be having a duration of six months to work on the
> project. Could you please suggest me some good project ideas that would
> be helpful to me as well as gentoo. I am interested in parallel
> computing, data structures , operating system. I am well versed in
> C/C++. I think  there might be projects which need to be done, I would
> like to work on them.
> 

There are parallel computing aspects in libbash for metadata generation,
data structures in AST building for bash and it's quite low level. Feel
free to contact me off list if you are interested. It would be nice to
get that project back on track again after the last GSoC.

http://www.gentoo.org/proj/en/libbash/index.xml

Regards,
Petteri


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 900 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-18 17:02 ` [gentoo-dev] " Petteri Räty
@ 2011-12-18 17:13   ` "Paweł Hajdan, Jr."
  2011-12-18 17:17     ` [gentoo-dev] libbash licensing Petteri Räty
  2011-12-18 17:45     ` [gentoo-dev] Six month major project on Gentoo Michał Górny
  0 siblings, 2 replies; 28+ messages in thread
From: "Paweł Hajdan, Jr." @ 2011-12-18 17:13 UTC (permalink / raw
  To: gentoo-dev

[-- Attachment #1: Type: text/plain, Size: 549 bytes --]

On 12/18/11 6:02 PM, Petteri Räty wrote:
> There are parallel computing aspects in libbash for metadata generation,
> data structures in AST building for bash and it's quite low level.

By the way, I've always wondered why libbash is separate from the
"upstream" bash.

Have you considered contributing to the upstream bash to convert the
shell itself to a more library-oriented design (somewhat similar to
LLVM), so that you have a guarantee that the lib and the shell stay in sync?

Feel free to change the subject when responding.


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 203 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [gentoo-dev] libbash licensing
  2011-12-18 17:13   ` "Paweł Hajdan, Jr."
@ 2011-12-18 17:17     ` Petteri Räty
  2011-12-18 17:45     ` [gentoo-dev] Six month major project on Gentoo Michał Górny
  1 sibling, 0 replies; 28+ messages in thread
From: Petteri Räty @ 2011-12-18 17:17 UTC (permalink / raw
  To: gentoo-dev

[-- Attachment #1: Type: text/plain, Size: 783 bytes --]

On 18.12.2011 19:13, "Paweł Hajdan, Jr." wrote:
> On 12/18/11 6:02 PM, Petteri Räty wrote:
>> There are parallel computing aspects in libbash for metadata generation,
>> data structures in AST building for bash and it's quite low level.
> 
> By the way, I've always wondered why libbash is separate from the
> "upstream" bash.
> 
> Have you considered contributing to the upstream bash to convert the
> shell itself to a more library-oriented design (somewhat similar to
> LLVM), so that you have a guarantee that the lib and the shell stay in sync?
> 

The main reason is that Portage is GPL-2 and bash upstream is GPL-3.
Unless someone is willing to do the work to get Portage to GPL-3 or FSF
to get bash GPL-2 then those two will not mix.

Regards,
Petteri


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 900 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-18 17:13   ` "Paweł Hajdan, Jr."
  2011-12-18 17:17     ` [gentoo-dev] libbash licensing Petteri Räty
@ 2011-12-18 17:45     ` Michał Górny
  1 sibling, 0 replies; 28+ messages in thread
From: Michał Górny @ 2011-12-18 17:45 UTC (permalink / raw
  To: gentoo-dev; +Cc: phajdan.jr

[-- Attachment #1: Type: text/plain, Size: 798 bytes --]

On Sun, 18 Dec 2011 18:13:42 +0100
""Paweł Hajdan, Jr."" <phajdan.jr@gentoo.org> wrote:

> On 12/18/11 6:02 PM, Petteri Räty wrote:
> > There are parallel computing aspects in libbash for metadata
> > generation, data structures in AST building for bash and it's quite
> > low level.
> 
> By the way, I've always wondered why libbash is separate from the
> "upstream" bash.
> 
> Have you considered contributing to the upstream bash to convert the
> shell itself to a more library-oriented design (somewhat similar to
> LLVM), so that you have a guarantee that the lib and the shell stay
> in sync?

I don't think upstream bash would be interested in converting bash into
C++. I wouldn't be interested in running such a bash, for instance.

-- 
Best regards,
Michał Górny

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 316 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-15 18:07   ` Gaurav Saxena
@ 2011-12-19 17:46     ` Christian Ruppert
  0 siblings, 0 replies; 28+ messages in thread
From: Christian Ruppert @ 2011-12-19 17:46 UTC (permalink / raw
  To: gentoo-dev; +Cc: Gaurav Saxena

[-- Attachment #1: Type: text/plain, Size: 2233 bytes --]

On Thursday 15 December 2011 23:37:10 Gaurav Saxena wrote:
> Hello all , Thanks a lot for your replies.
> 
> Christian, I am interested in Open RC, it sounds interesting to me, I
> would like to know more details regarding what type of projects are
> there that could be done.
> 
> On Wed, Dec 14, 2011 at 11:35 PM, Christian Ruppert <idl0r@gentoo.org> 
wrote:
> > On Wednesday 14 December 2011 16:36:42 Gaurav Saxena wrote:
> >> Hello all,
> >> I am interested in doing my final year computer scence project on
> >> gentoo. I would be having a duration of six months to work on the
> >> project. Could you please suggest me some good project ideas that
> >> would be helpful to me as well as gentoo. I am interested in parallel
> >> computing, data structures , operating system. I am well versed in
> >> C/C++. I think  there might be projects which need to be done, I
> >> would like to work on them.
> > 
> > What about OpenRC? :)
> > We could need some help.
> > http://www.gentoo.org/proj/en/base/openrc/
> > #openrc or #gentoo-base (IRC) via FreeNode
> > Or openrc@gentoo.org.
> > 
> > Let me know if you're interested or need more details :)
> > 
> > --
> > Regards,
> > Christian Ruppert
> > Gentoo Linux developer, Bugzilla administrator and Infrastructure member
> > Fingerprint: EEB1 C341 7C84 B274 6C59 F243 5EAB 0C62 B427 ABC8

Hi Gaurav,

you may know OpenRC already from your Gentoo machines, it's the service 
management fronted of "sysvinit" that handles the startup/shutdown of services 
in various runlevels incl. dependencies.

What I can say is:
The rc_parallel (so starting services parallel) feature needs some love. There 
are some issues re service dependencies, locking etc.
https://bugs.gentoo.org/391945
https://bugs.gentoo.org/360013
and some more.

We also have some issues with links of init scripts:

There are a lot of other "major" and minor bugs. See 
http://preview.tinyurl.com/openrc-bugs

You can also ask us via IRC if you want (just stay longer, remind the different 
timezones so it may take some time till one replies :P)

-- 
Regards,
Christian Ruppert
Gentoo Linux developer, Bugzilla administrator and Infrastructure member
Fingerprint: EEB1 C341 7C84 B274 6C59 F243 5EAB 0C62 B427 ABC8

[-- Attachment #2: This is a digitally signed message part. --]
[-- Type: application/pgp-signature, Size: 490 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-14 11:06 [gentoo-dev] Six month major project on Gentoo Gaurav Saxena
                   ` (2 preceding siblings ...)
  2011-12-18 17:02 ` [gentoo-dev] " Petteri Räty
@ 2011-12-19 18:14 ` Sébastien Fabbro
  2012-01-02 14:33   ` "Paweł Hajdan, Jr."
  3 siblings, 1 reply; 28+ messages in thread
From: Sébastien Fabbro @ 2011-12-19 18:14 UTC (permalink / raw
  To: gentoo-dev

Gaurav Saxena <grvsaxena419@gmail.com> wrote:

> I am interested in doing my final year computer scence project on gentoo. I
> would be having a duration of six months to work on the project. Could you
> please suggest me some good project ideas that would be helpful to me as
> well as gentoo. I am interested in parallel computing, data structures ,
> operating system. I am well versed in C/C++. I think  there might be
> projects which need to be done, I would like to work on them.
>

One project that could be very useful for Gentoo is an automated
stabilization/testing for ebuilds. Obviously it will require some work
from the ebuild maintainers, but the ability to distribute the
stabilization recipes across a volunteering Gentoo community via
something like BOINC could be worth looking at.

-- 
Sébastien



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-14 18:29 ` Alec Warner
  2011-12-15 18:09   ` Gaurav Saxena
@ 2011-12-22  4:43   ` Donnie Berkholz
  2011-12-22 10:52     ` Rich Freeman
  1 sibling, 1 reply; 28+ messages in thread
From: Donnie Berkholz @ 2011-12-22  4:43 UTC (permalink / raw
  To: gentoo-dev

[-- Attachment #1: Type: text/plain, Size: 1234 bytes --]

On 10:29 Wed 14 Dec     , Alec Warner wrote:
> On Wed, Dec 14, 2011 at 3:06 AM, Gaurav Saxena <grvsaxena419@gmail.com> wrote:
> > Hello all,
> > I am interested in doing my final year computer scence project on gentoo. I
> > would be having a duration of six months to work on the project. Could you
> > please suggest me some good project ideas that would be helpful to me as
> > well as gentoo. I am interested in parallel computing, data structures ,
> > operating system. I am well versed in C/C++. I think  there might be
> > projects which need to be done, I would like to work on them.
> 
> The only idea I can think of for parallel computing / distributed
> systems would be at the build level.
> 
> distcc-ng, a farm of user-controlled machines that compile your code
> in a p2p fashion.

I looked into this 6 or 7 years ago. It wasn't feasible unless you were 
on an extremely high-speed, low-latency network, beyond what was 
typically accessible at the time outside of universities and LANs. Could 
be worth exploring again now that 25-100 mbps connections are becoming 
more common.

-- 
Thanks,
Donnie

Donnie Berkholz
Council Member / Sr. Developer
Gentoo Linux
Blog: http://dberkholz.com

[-- Attachment #2: Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-22  4:43   ` Donnie Berkholz
@ 2011-12-22 10:52     ` Rich Freeman
  2011-12-22 11:11       ` Francesco Riosa
  0 siblings, 1 reply; 28+ messages in thread
From: Rich Freeman @ 2011-12-22 10:52 UTC (permalink / raw
  To: gentoo-dev

On Wed, Dec 21, 2011 at 11:43 PM, Donnie Berkholz <dberkholz@gentoo.org> wrote:
> I looked into this 6 or 7 years ago. It wasn't feasible unless you were
> on an extremely high-speed, low-latency network, beyond what was
> typically accessible at the time outside of universities and LANs. Could
> be worth exploring again now that 25-100 mbps connections are becoming
> more common.

I tried messing around with this with Amazon EC2.  The problem was
that due to latency I only really saw the benefit for VERY high levels
of parallelization (think -j25+)..  However, make isn't actually
"distcc-aware" so it just runs 25 jobs of anything in parallel.  So,
anytime a makefile launched a ton of java or python jobs the host
ground to a halt as it wasn't distributed and it was way more than the
host could handle (especially java - which swapped like there was no
tomorrow).

If somebody were to do a distcc-ng for a large cluster one of the
problems to solve would be having it not run jobs in parallel if it
couldn't actually distribute them.

Rich



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-22 10:52     ` Rich Freeman
@ 2011-12-22 11:11       ` Francesco Riosa
  2011-12-22 15:55         ` Michał Górny
  0 siblings, 1 reply; 28+ messages in thread
From: Francesco Riosa @ 2011-12-22 11:11 UTC (permalink / raw
  To: gentoo-dev

2011/12/22 Rich Freeman <rich0@gentoo.org>:
> On Wed, Dec 21, 2011 at 11:43 PM, Donnie Berkholz <dberkholz@gentoo.org> wrote:
>> I looked into this 6 or 7 years ago. It wasn't feasible unless you were
>> on an extremely high-speed, low-latency network, beyond what was
>> typically accessible at the time outside of universities and LANs. Could
>> be worth exploring again now that 25-100 mbps connections are becoming
>> more common.
>
> I tried messing around with this with Amazon EC2.  The problem was
> that due to latency I only really saw the benefit for VERY high levels
> of parallelization (think -j25+)..  However, make isn't actually
> "distcc-aware" so it just runs 25 jobs of anything in parallel.  So,
> anytime a makefile launched a ton of java or python jobs the host
> ground to a halt as it wasn't distributed and it was way more than the
> host could handle (especially java - which swapped like there was no
> tomorrow).
>
> If somebody were to do a distcc-ng for a large cluster one of the
> problems to solve would be having it not run jobs in parallel if it
> couldn't actually distribute them.
>
> Rich
>

Just wanted to point out that (if there is enough memory) recent
kernels manage much better parallelism, even excess of it, once
reached the maximum load augmenting threads only bring minimal loss of
"real" time.



^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-22 11:11       ` Francesco Riosa
@ 2011-12-22 15:55         ` Michał Górny
  2011-12-22 16:09           ` Rich Freeman
  0 siblings, 1 reply; 28+ messages in thread
From: Michał Górny @ 2011-12-22 15:55 UTC (permalink / raw
  To: gentoo-dev; +Cc: vivo75

[-- Attachment #1: Type: text/plain, Size: 954 bytes --]

On Thu, 22 Dec 2011 12:11:32 +0100
Francesco Riosa <vivo75@gmail.com> wrote:

> > I tried messing around with this with Amazon EC2.  The problem was
> > that due to latency I only really saw the benefit for VERY high
> > levels of parallelization (think -j25+)..  However, make isn't
> > actually "distcc-aware" so it just runs 25 jobs of anything in
> > parallel.  So, anytime a makefile launched a ton of java or python
> > jobs the host ground to a halt as it wasn't distributed and it was
> > way more than the host could handle (especially java - which
> > swapped like there was no tomorrow).
> 
> Just wanted to point out that (if there is enough memory) recent
> kernels manage much better parallelism, even excess of it, once
> reached the maximum load augmenting threads only bring minimal loss of
> "real" time.

Does that include handling complete lack of memory and heavy swapping?

-- 
Best regards,
Michał Górny

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 316 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-22 15:55         ` Michał Górny
@ 2011-12-22 16:09           ` Rich Freeman
  2011-12-22 17:40             ` [gentoo-dev] " Duncan
  0 siblings, 1 reply; 28+ messages in thread
From: Rich Freeman @ 2011-12-22 16:09 UTC (permalink / raw
  To: gentoo-dev

On Thu, Dec 22, 2011 at 10:55 AM, Michał Górny <mgorny@gentoo.org> wrote:
>> Just wanted to point out that (if there is enough memory) recent
>> kernels manage much better parallelism, even excess of it, once
>> reached the maximum load augmenting threads only bring minimal loss of
>> "real" time.
>
> Does that include handling complete lack of memory and heavy swapping?
>

I think the key was the "if there is enough memory" - which I think is
a pretty big issue.  Running 25 invocations of java each wanting
100-200MB of RAM will cripple a system relatively quickly.  And 25
really is just scratching the surface when most people think
"cluster."

I think the only way to really fix this would be to integrate
distributed behavior into make itself.

Rich



^ permalink raw reply	[flat|nested] 28+ messages in thread

* [gentoo-dev] Re: Six month major project on Gentoo
  2011-12-22 16:09           ` Rich Freeman
@ 2011-12-22 17:40             ` Duncan
  0 siblings, 0 replies; 28+ messages in thread
From: Duncan @ 2011-12-22 17:40 UTC (permalink / raw
  To: gentoo-dev

Rich Freeman posted on Thu, 22 Dec 2011 11:09:16 -0500 as excerpted:

> On Thu, Dec 22, 2011 at 10:55 AM, Michał Górny <mgorny@gentoo.org>
> wrote:
>>> Just wanted to point out that (if there is enough memory) recent
>>> kernels manage much better parallelism, even excess of it, once
>>> reached the maximum load augmenting threads only bring minimal loss of
>>> "real" time.
>>
>> Does that include handling complete lack of memory and heavy swapping?
>>
>>
> I think the key was the "if there is enough memory" - which I think is a
> pretty big issue.

User experience but it matters too...

The above memory limit-facter being the real limiter was true here when I 
used to run unlimited local jobs, too (I moderated a decent bit since 
portage itself parallelizes now, the fascination phase is over, and the 
kernel no longer requires double-load-factor to stay 95-100% CPU 
utilized).  If the make is setup in such a way that it can sufficiently 
parallelize, even back on dual-core, I could run hundreds of parallel 
jobs (say, with the kernel build) surprisingly well -- as long as they 
didn't require much memory.

As I've moderated since then, it's mostly to keep memory usage (and I/O, 
especially when it compounds with swap) sensible, way more than it is any 
real problem scheduling the actual loads.  Plus, the kernel has gotten 
rather better at fully utilizing CPU resources; it used to be that you 
had to run about double (sometimes more, but generally slightly less) 
your cores in load average to avoid idle CPUs.  That's no longer the case 
as a 25% over-scheduling seems to do what it used to require a 100% over-
scheduling to achieve in terms of keeping 95-100% CPU usage, so by 50% 
over you're just wasting cycles and bytes.

What I /really/ wish was that there was a make (and portage) switch 
parallel to -l, that responded based on memory instead of load average!  
Then those single-job gig-plus links could limit to say 4 jobs on an 8-
gig machine, even if it's an 8-way that would otherwise be -j16 -l10 or
-l12 (regardless of the balance or lack thereof, of an 8-way with only a 
gig of RAM per core... 2 gigs/core seems to be about the sweet-spot I've 
found for a general purpose gentooer build-station/desktop).

That'd be an interesting and very useful parallel-domain task, adding a 
make switch that responded to memory just as -l responds to load.  Maybe 
a make other than gmake already has such an option?

-- 
Duncan - List replies preferred.   No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master."  Richard Stallman




^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [gentoo-dev] Six month major project on Gentoo
  2011-12-19 18:14 ` Sébastien Fabbro
@ 2012-01-02 14:33   ` "Paweł Hajdan, Jr."
  0 siblings, 0 replies; 28+ messages in thread
From: "Paweł Hajdan, Jr." @ 2012-01-02 14:33 UTC (permalink / raw
  To: gentoo-dev

[-- Attachment #1: Type: text/plain, Size: 668 bytes --]

On 12/19/11 7:14 PM, Sébastien Fabbro wrote:
> One project that could be very useful for Gentoo is an automated
> stabilization/testing for ebuilds. Obviously it will require some work
> from the ebuild maintainers, but the ability to distribute the
> stabilization recipes across a volunteering Gentoo community via
> something like BOINC could be worth looking at.

I can help with guidance and mentoring here.

I'm maintaining a set of scripts
<http://git.overlays.gentoo.org/gitweb/?p=proj/arch-tools.git;a=summary>
that I use for mass-stabilization and related tasks. If someone wants to
extend it so the testing can be distributed, that's great.


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 203 bytes --]

^ permalink raw reply	[flat|nested] 28+ messages in thread

end of thread, other threads:[~2012-01-02 14:33 UTC | newest]

Thread overview: 28+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-12-14 11:06 [gentoo-dev] Six month major project on Gentoo Gaurav Saxena
2011-12-14 18:05 ` Christian Ruppert
2011-12-14 18:14   ` "C. Bergström"
2011-12-15 18:07   ` Gaurav Saxena
2011-12-19 17:46     ` Christian Ruppert
2011-12-14 18:29 ` Alec Warner
2011-12-15 18:09   ` Gaurav Saxena
2011-12-22  4:43   ` Donnie Berkholz
2011-12-22 10:52     ` Rich Freeman
2011-12-22 11:11       ` Francesco Riosa
2011-12-22 15:55         ` Michał Górny
2011-12-22 16:09           ` Rich Freeman
2011-12-22 17:40             ` [gentoo-dev] " Duncan
2011-12-18 17:02 ` [gentoo-dev] " Petteri Räty
2011-12-18 17:13   ` "Paweł Hajdan, Jr."
2011-12-18 17:17     ` [gentoo-dev] libbash licensing Petteri Räty
2011-12-18 17:45     ` [gentoo-dev] Six month major project on Gentoo Michał Górny
2011-12-19 18:14 ` Sébastien Fabbro
2012-01-02 14:33   ` "Paweł Hajdan, Jr."
     [not found] <i7USJ-40Q-3@gated-at.bofh.it>
2011-12-14 21:25 ` Leho Kraav
2011-12-14 23:43   ` Alec Warner
2011-12-14 23:58     ` Mike Frysinger
2011-12-15  5:39       ` Nirbheek Chauhan
2011-12-15 12:27         ` Mike Frysinger
2011-12-15 12:42           ` Nirbheek Chauhan
2011-12-15 12:43         ` Rich Freeman
2011-12-15 16:30           ` Mike Frysinger
2011-12-15 16:51             ` Mike Frysinger

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox