From: Troy Dack <troy@tkdack.com>
To: gentoo-dev@lists.gentoo.org
Subject: Re: [gentoo-dev] Idea for the portage maintainers
Date: Tue, 13 Apr 2004 01:12:29 +1000 [thread overview]
Message-ID: <1081782749.25073.6.camel@carbon> (raw)
In-Reply-To: <200404120836.16542.tom@securescience.net>
On Mon, 2004-04-12 at 22:36, Tom St Denis wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> On April 12, 2004 08:23 am, Georgi Georgiev wrote:
> > maillog: 12/04/2004-08:03:13(-0400): Tom St Denis types
> >
> > > Oh, ok so I'll just format my disk, reinstall Gentoo from scratch so that
> > > I can not waste 200M of space on 80k small files.
> >
> > You can always use a loopback device with a filesystem of your choice.
> > http://forums.gentoo.org/viewtopic.php?t=68215
>
> That's a potential solution but a bit out of the way for the user don't you
> think? Isn't the point of quality software to attract users simply by having
> merit?
>
> > > That's not really user friendly. Could have done a JAR like setup for
> > > each dir of the tree. e.g. all of app-text be one huge ZIP file [with no
> > > compression]. Such a setup might be a little slower to add/remove files
> > > but would waste less space.
> > >
> > > The idea would make a little sense though in practice. When I do "emerge
> > > sync" instead of fetching 1000s of small files I just check the timestamp
> > > on the directory zips and download them wholesale. [Ok so maybe
> > > compression makes sense here].
> >
> > How is this method faster or in any way better than rsync?
>
> Less metadata? If I want to download 1000 files off your site I have to say
>
> +GET /file1
> - -HTTP/1.1 200 OK
> ... data
> +GET /file2
> - -HTTP/1.1 200 OK
> ... data
> +GET /file3
> - -HTTP/1.1 200 OK
> ... data
> ETC
>
> The overhead translates on both sides too. E.g. My FS now has to find room
> for and enter in 1000s of files. The server side has to locate 1000s of
> files.
>
> Also the portage files are not compressed. If you take an entire directory
> [again say app-text] and compress it you save bandwidth. A quick check of
> app-text on my box [updated last night] gives a 850K zip file [290K tar.bz2]
> compared to 2.7M of raw data. Obviously sending the entire zip would be
> wasting more bandwidth when only small changes occur.
Rsync compresses the information it sends, additionally (and most
importantly) rsync only sends the differences, not the whole file, so it
is even more efficient. Visit rsync.samba.org and have a read about how
it all works.
Another point against a monolithic zip containing all the ebuilds (or
even per directory zips) is the performance hit that slow machines would
take, not everybody runs gentoo on a 2GHz plus machine (eg: my little
PII-400 in the corner)
--
Troy Dack http://linux.tkdack.com
<troy@tkdack.com> http://webportage.sf.net
Public Key: http://pgp.mit.edu:11371/pks/lookup?op=get&search=0x4D90BE3C
Key fingerprint = 1F3D 6C15 16AA 09D5 0C96 92E5 FD89 16F9 4D90 BE3C
--
gentoo-dev@gentoo.org mailing list
next prev parent reply other threads:[~2004-04-12 15:11 UTC|newest]
Thread overview: 39+ messages / expand[flat|nested] mbox.gz Atom feed top
2004-04-11 11:55 [gentoo-dev] Idea for the portage maintainers Tom St Denis
2004-04-12 10:45 ` Alexander Gretencord
2004-04-12 12:03 ` Tom St Denis
2004-04-12 12:23 ` Georgi Georgiev
2004-04-12 12:36 ` Tom St Denis
2004-04-12 14:18 ` N. Owen Gunden
2004-04-12 15:12 ` Troy Dack [this message]
2004-04-12 15:15 ` Jason Stubbs
2004-04-12 16:22 ` Andrew Gaffney
2004-04-12 16:23 ` Todd Berman
2004-04-12 16:59 ` Andrew Gaffney
2004-04-12 17:03 ` Todd Berman
2004-04-12 17:17 ` Andrew Gaffney
2004-04-12 17:39 ` Todd Berman
2004-04-13 1:04 ` Jason Stubbs
2004-04-13 3:35 ` Todd Berman
2004-04-13 15:39 ` [gentoo-dev] Idea for the portage maintainers - personal experiences with a .zip-db Karl Trygve Kalleberg
2004-04-12 17:09 ` [gentoo-dev] Idea for the portage maintainers Tom St Denis
2004-04-12 17:19 ` Norberto Bensa
2004-04-12 17:21 ` Tom St Denis
2004-04-13 12:18 ` Chris Bainbridge
2004-04-13 16:12 ` Chris Bainbridge
2004-04-12 11:57 ` Senor Rodgman
-- strict thread matches above, loose matches on Subject: below --
2004-04-12 12:46 brettholcomb
2004-04-12 12:59 ` Tom St Denis
2004-04-12 19:55 ` Marius Mauch
2004-04-12 17:39 brettholcomb
2004-04-12 17:51 ` Andrew Gaffney
2004-04-12 20:00 ` Marius Mauch
2004-04-12 20:31 ` Marius Mauch
2004-04-12 20:46 ` Stuart Herbert
2004-04-12 20:58 ` Marius Mauch
2004-04-12 21:17 ` Stuart Herbert
2004-04-12 21:26 ` Spider
2004-04-12 23:44 ` Drake Wyrm
2004-04-12 21:26 ` Marius Mauch
2004-04-12 22:20 ` Stuart Herbert
2004-04-12 22:18 ` Andrew Gaffney
2004-04-12 22:38 ` Stuart Herbert
2004-04-12 22:32 ` Marius Mauch
2004-04-12 22:44 ` Marius Mauch
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1081782749.25073.6.camel@carbon \
--to=troy@tkdack.com \
--cc=gentoo-dev@lists.gentoo.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox