public inbox for gentoo-dev@lists.gentoo.org
 help / color / mirror / Atom feed
Search results ordered by [date|relevance]  view[summary|nested|Atom feed]
thread overview below | download: 
* Re: [gentoo-dev] Having fun with compression
  @ 2006-05-02 15:50 99% ` Ryan Phillips
  0 siblings, 0 replies; 1+ results
From: Ryan Phillips @ 2006-05-02 15:50 UTC (permalink / raw
  To: Patrick Lauer; +Cc: gentoo-dev

[-- Attachment #1: Type: text/plain, Size: 950 bytes --]

Patrick Lauer <patrick@gentoo.org> said:
> Hi all,
> 
> I had this random idea that many of our distfiles are .tar.gz while more
> efficient compression methods exist. So I did some testing for fun:
> 
> We have ~15k .tar.gz in distfiles. ~6500 .tar.bz2, ~2000 others.
> A short run over 477 distfiles spanning 833M gave me 586M of .tar.bz2 -
> roughly 30% more efficient!
> A comparison run with 7zip gave me 590M files, so bzip2 seems to be
> quite good.
> 
> I don't think repackaging every .tar.gz as .tar.bz2 is a reasonable
> option (breaks MD5 digests, we lose the fallback download from the
> homepage), but maybe this motivates people to save bandwidth and migrate
> their packaging to bzip2.

Patrick, 

did you benchmark CPU load?  Often bzip2 takes 3x as long to
uncompress a package than bzip.  Often, the space savings doesn't
justify the cost of how long it takes for the cpu to decompress the
archive.

-ryan

[-- Attachment #2: Type: application/pgp-signature, Size: 187 bytes --]

^ permalink raw reply	[relevance 99%]

Results 1-1 of 1 | reverse | options above
-- pct% links below jump to the message on this page, permalinks otherwise --
2006-04-30 16:30     [gentoo-dev] Having fun with compression Patrick Lauer
2006-05-02 15:50 99% ` Ryan Phillips

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox