* [gentoo-portage-dev] [Fwd: [gentoo-dev] [RFC] DIGESTS metadata variable for cache validation]
@ 2009-02-03 5:46 Zac Medico
From: Zac Medico @ 2009-02-03 5:46 UTC (permalink / raw
To: gentoo-portage-dev
[-- Attachment #2: [gentoo-dev] [RFC] DIGESTS metadata variable for cache validation.eml --]
[-- Type: message/rfc822, Size: 4970 bytes --]
From: Zac Medico <zmedico@gentoo.org>
To: Gentoo Dev <gentoo-dev@lists.gentoo.org>
Subject: [gentoo-dev] [RFC] DIGESTS metadata variable for cache validation
Date: Mon, 02 Feb 2009 12:34:46 -0800
Message-ID: <498758E6.5080609@gentoo.org>
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Hi,
I'd like to add a new metadata cache value called DIGESTS, containing
a space-separated list of digests that can be used to validate the
metadata cache. Like INHERITED and DEFINED_PHASES [1], it will be
generated automatically. The first digest in the list will correspond
to the ebuild. If there are any inherited eclasses, their digests will
follow, in the same order that the eclasses occur in the INHERITED
variable. The value of the DIGESTS variable will be on line 18 of the
metadata cache (just after DEFINED_PHASES).
For the digest format, I suggest that we use the leftmost 10
hexadecimal digits (40 bits) of the SHA-1 digest. The rationale for
limiting it to 10 digits (out of 40 digits) is to save space. Due to
the avalanche effect [2], 10 digits should be sufficient to ensure
that problems resulting from hash collisions are extremely unlikely.
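To make the proposed format concrete, here is a minimal sketch of how
a DIGESTS value could be assembled. The function names (short_digest,
build_digests) and the in-memory mapping of eclass contents are
assumptions made for illustration; they are not existing Portage
APIs.

```python
import hashlib

# Number of leftmost hex digits kept from each SHA-1 digest (per the proposal).
DIGEST_HEX_LEN = 10


def short_digest(data: bytes) -> str:
    """Return the leftmost DIGEST_HEX_LEN hex digits of the SHA-1 of data."""
    return hashlib.sha1(data).hexdigest()[:DIGEST_HEX_LEN]


def build_digests(ebuild: bytes, eclasses: dict, inherited: str) -> str:
    """Build a DIGESTS value: the ebuild digest first, then one digest per
    eclass, in the order the eclass names appear in INHERITED.

    eclasses is a hypothetical mapping of eclass name -> file contents.
    """
    parts = [short_digest(ebuild)]
    for name in inherited.split():
        parts.append(short_digest(eclasses[name]))
    return " ".join(parts)
```

Ten hexadecimal digits carry 40 bits, so, assuming SHA-1 output is
uniformly distributed, an arbitrary change to a file would leave its
truncated digest unchanged with a probability of about 1 in 2^40
(roughly 1 in 1.1 trillion).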
The primary reason to use a digest for cache validation instead of a
timestamp is that it allows the cache validation mechanism to work
even if the tree is distributed with a protocol that does not
preserve timestamps, such as git or subversion. This would make it
possible to distribute the metadata cache directly from git and
subversion repositories (among others). Since a digest is inherently
more expensive to obtain than a timestamp, package managers may use
the Manifest entries as a digest cache, in order to avoid the need
to compute digests of ebuilds during dependency calculations.
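As a rough sketch of the validation step, the following assumes that
Manifest lines have the form "EBUILD <file> <size> <hash name> <hash>
..." and that one of the recorded hashes is SHA1. The helper names
(manifest_sha1, cache_is_valid) and the eclass_data mapping are
hypothetical. Eclass digests are computed directly here, since this
sketch only takes the ebuild digest from the Manifest.

```python
import hashlib

DIGEST_HEX_LEN = 10  # leftmost hex digits compared, per the proposal


def manifest_sha1(manifest_text, filename):
    """Return the full SHA1 hex digest recorded for an ebuild, or None.

    Assumes lines of the form: EBUILD <file> <size> <name> <hash> ...
    """
    for line in manifest_text.splitlines():
        fields = line.split()
        if len(fields) < 3 or fields[0] != "EBUILD" or fields[1] != filename:
            continue
        # After the size, fields alternate between hash name and hash value.
        hashes = dict(zip(fields[3::2], fields[4::2]))
        return hashes.get("SHA1")
    return None


def cache_is_valid(digests_value, inherited, ebuild_sha1, eclass_data):
    """Compare a cached DIGESTS value against the current ebuild and eclasses.

    ebuild_sha1 is a full SHA1 hex digest (for example, from the Manifest),
    so the ebuild itself does not need to be hashed again.
    eclass_data is a hypothetical mapping of eclass name -> file contents.
    """
    if ebuild_sha1 is None:
        return False
    expected = [ebuild_sha1[:DIGEST_HEX_LEN]]
    for name in inherited.split():
        data = eclass_data.get(name)
        if data is None:
            return False  # an inherited eclass is missing: treat as stale
        expected.append(hashlib.sha1(data).hexdigest()[:DIGEST_HEX_LEN])
    return digests_value.split() == expected
```

A cache entry would be treated as stale whenever any digest differs,
or when the number of digests does not match the number of entries in
INHERITED plus one.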
Does the suggested approach seem reasonable? Would anybody like to
suggest any changes?
[1]
http://archives.gentoo.org/gentoo-dev/msg_8c34d8efbc0d31ab28c517403dc83f62.xml
[2] http://en.wikipedia.org/wiki/Avalanche_effect
- --
Thanks,
Zac
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.9 (GNU/Linux)
iEYEARECAAYFAkmHWOQACgkQ/ejvha5XGaOJeQCgouZGO+pbOgJYkzssRVhzMDwt
Cq4AoN6NG7SmJ6XjEked1WnZ+CJPXVWj
=JSDL
-----END PGP SIGNATURE-----