From: Yannick Koehler
Organization: Colubris Networks Inc.
To: gentoo-dev@gentoo.org
Date: Thu, 27 Feb 2003 10:26:52 -0500
Subject: Re: [gentoo-dev] Portage metadata
Message-Id: <200302271026.53568.yannick.koehler@colubris.com>
In-Reply-To: <20030227102305.GB12293@cherenkov.orbis-terrarum.net>

On February 27, 2003 05:23 am, robbat2@orbis-terrarum.net wrote:
> 1. Is there any defined format to the metadata?

Each line is a DB field. The fields are defined inside portage.py:

auxdbkeys=['DEPEND','RDEPEND','SLOT','SRC_URI','RESTRICT','HOMEPAGE','LICENSE','DESCRIPTION','KEYWORDS','INHERITED','IUSE','CDEPEND','PDEPEND']

> 2. For later speedups, is anybody looking into using actual database
> formats for the data? (GDBM/BDB/NDBM/etc.)

The Python internal DB is cached in memory and is quite fast. For certain
tasks a DB would be faster, but most of the time the Python DB seems to
kick ass, unless you do a description search.

In the past I wrote a bash script that generated an XML file out of
/var/db/pkg. If you can import that 4 MB file into a DB, you could run
some queries, time them, and see whether things get faster.

-- 
Yannick Koehler

--
gentoo-dev@gentoo.org mailing list
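
To illustrate the format described in the reply above (one DB field per line, named by the auxdbkeys list), here is a minimal Python sketch. It is not Portage's actual code: the helper name parse_metadata_entry and all sample values are hypothetical, and it assumes that line N of an entry holds the value for the Nth key in auxdbkeys. Only the key list itself is taken from the message.

```python
# Field names copied from the auxdbkeys list quoted in the message.
AUXDBKEYS = ['DEPEND', 'RDEPEND', 'SLOT', 'SRC_URI', 'RESTRICT', 'HOMEPAGE',
             'LICENSE', 'DESCRIPTION', 'KEYWORDS', 'INHERITED', 'IUSE',
             'CDEPEND', 'PDEPEND']


def parse_metadata_entry(text):
    """Map each line of a metadata entry to a field name (hypothetical helper).

    Assumption: line N holds the value for AUXDBKEYS[N]. Missing trailing
    lines become empty strings; extra lines are ignored.
    """
    lines = text.splitlines()
    # Pad short entries so every key is present in the result.
    lines += [''] * (len(AUXDBKEYS) - len(lines))
    return {key: value.strip() for key, value in zip(AUXDBKEYS, lines)}


# Made-up sample entry, for illustration only.
sample_values = [
    "dev-lang/python",                    # DEPEND
    "dev-lang/python",                    # RDEPEND
    "0",                                  # SLOT
    "http://example.org/foo-1.0.tar.gz",  # SRC_URI
    "",                                   # RESTRICT
    "http://example.org/",                # HOMEPAGE
    "GPL-2",                              # LICENSE
    "An example package",                 # DESCRIPTION
    "x86",                                # KEYWORDS
    "",                                   # INHERITED
    "doc",                                # IUSE
]
entry = parse_metadata_entry("\n".join(sample_values))
print(entry['SLOT'], entry['LICENSE'])
```

A dictionary keyed by field name makes a lookup such as entry['DESCRIPTION'] straightforward, which is the kind of access a description search would need for every package.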