From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from pigeon.gentoo.org ([208.92.234.80] helo=lists.gentoo.org) by finch.gentoo.org with esmtp (Exim 4.60) (envelope-from ) id 1S8sQt-0002Zf-5L for garchives@archives.gentoo.org; Sat, 17 Mar 2012 12:08:28 +0000 Received: from pigeon.gentoo.org (localhost [127.0.0.1]) by pigeon.gentoo.org (Postfix) with SMTP id BDD8CE0B88 for ; Sat, 17 Mar 2012 12:08:26 +0000 (UTC) Received: from mail-iy0-f181.google.com (mail-iy0-f181.google.com [209.85.210.181]) by pigeon.gentoo.org (Postfix) with ESMTP id 05F97E09AB for ; Sat, 17 Mar 2012 10:50:06 +0000 (UTC) Received: by iaoo28 with SMTP id o28so8205306iao.40 for ; Sat, 17 Mar 2012 03:50:06 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; bh=v1Nsy2vS5dpQ21u7lK8AYsUug6fF5UmDzwD94ZV7Uig=; b=EkR451w+VsB6NOpcLk3dQEyGx5NJuyuSaMcE3dWN3y311uP8GKGTqB0o59zqYly7So QuVfHIJfIHmq9UsmkOl6pAFQMmW9oDGuTQ6uuYk6X3FMRxLGNaxQhxjtFKzapn5mwE3B OVOpXo0nJlzt7mg26g/QVXBpsHEMLVbu+z3FbOrWGmyda/uj/qGdaiW6ceqswHfmwFiZ w7yY5F2cQsAsJkrkyuTNq7NO+PDLzSz0AGRVYnJYcMgDgyWJDHOaEbf+n8QL0EBGZGtw qfLq2zHLlFmXxxlt3wW+0+PJdyzxKazrifBxIg5OPBEZFrAt3xz9H3R/xEiYXxmtTWGw Xu1g== Received: by 10.50.106.200 with SMTP id gw8mr2021405igb.10.1331981406512; Sat, 17 Mar 2012 03:50:06 -0700 (PDT) Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Id: Gentoo Linux mail X-BeenThere: gentoo-soc@lists.gentoo.org Reply-to: gentoo-soc@lists.gentoo.org MIME-Version: 1.0 Received: by 10.50.179.70 with HTTP; Sat, 17 Mar 2012 03:49:46 -0700 (PDT) In-Reply-To: References: From: Yanbo Wang Date: Sat, 17 Mar 2012 18:49:46 +0800 Message-ID: Subject: Re: [gentoo-soc] Package statistics reporting tool To: gentoo-soc@lists.gentoo.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Archives-Salt: 358817eb-2cf6-485d-a8de-37138e1b19b6 X-Archives-Hash: 0394e438c2955c84fa890716721646f0 Thanks for your reply. 2012/3/17 Rafael Goncalves Martins : > Hi, > > On Fri, Mar 16, 2012 at 12:59 PM, Corentin Chary > wrote: >> Hi, >> >> Ccing Rafael. >> >> On Thu, Mar 15, 2012 at 2:44 PM, Yanbo Wang wrote: >>> Hi everyone, >>> >>> I'm a post graduate student from China. I'm interested in the idea abou= t >>> Package statistics reporting tool and I'd like to take part in this yea= r's GSoc >>> as a student. >>> >>> I have read the code written by Vikraman and deployed the server on my = gentoo. >>> The server gathered a lot of information about environment and >>> packages installed. >>> I could get the statistic through web interface or terminal.That's >>> very great, I really want >>> to contribute to the project. >>> Please tell me this year's focus, and I have some ideas to share. >>> 1. Upload more information such as developers' email, whether the packa= ges >>> installed is stable or not, =C2=A0the crash info. >>> 2. Some statistics like popularity(recently installed packages, least >>> popular packages) >>> or the relationship between packages could be done in a cronjob. >>> 3. Integrate with bugzilla. Developers can easily get the bugs of >>> packages through >>> statistic web interface or terminal, like PyBugz [1]. These will help >>> developers find bugs >>> of their packages easier. > > @Yanbo, thanks for your interest. Some of your ideas are great, but > one of the things that we should keep in mind for this project is > privacy. We should make sure that the user is aware of what data he is > sending for our servers, and disable it if needed. The code should be > able to handle this properly. I think that we shouldn't install a > cronjob automatically, for example. > Of course, privacy is an important part of this project. Currently, users can decide what data they want to sending by payload.cfg file and it will be easily extended if we add more features. The client generates a UUID for the host on installation and the server uses UUID to identify the host and packages installed. Maybe my expression was not clear, a cronjob could be installed on the server, not on the host, because the extraction of some interesting statistics, like relationship between packages, will be time-consuming and we should preprocess them with a cronjob. These statistics may be used for recommending related packages or finding bugs between packages. >>> About me: >>> I=E2=80=99ve had 3 years=E2=80=99 experience in java development (Java = Web and Swing), 2 years' >>> in python development (Django) and also be fimilar with c/c++. =C2=A0I= =E2=80=99ve been using >>> gentoo for over 1 year and I really want to contribute a little to >>> gentoo community. >>> >>> [1] http://www.liquidx.net/pybugz/ >> >> I took a quick look at last year code, and I think the project would >> be easier to extend if it was using django. This would avoid writing >> custom SQL queries, etc... And you can probably steal some code from >> euscan to do that. >> Note that I'm not the mentor for this project and it is just an idea. > > @Corentin: During last year's gsoc I talked quickly with Vikraman and > Alec about the web framework being used. I agree that web.py isn't the > better tool for the job, I think I suggested flask that time, don't > recall correctly. > Django is okey for me and I can learn something related from euscan. Thanks for Corentin's advice. > This topic should be well discussed, because any change of framework > will spend a considerable amount of time, considering that we just > have 3 months. > >> I think this years goals should be: >> - stabilization of the client, and integration with the base system, >> documentation during installation, etc... >> - adding more features to the web interface, like what we can find =C2= =A0on >> http://popcon.debian.org/ (charts, sortable tables, overlays usage, >> etc..) >> - integration with other gentoo webservices (how ?) Like PyBugz [1], developers can get bugs through terminal when they query packages, rather than Bugzilla website. In addition, we can add bugs' links to Bugzilla website in packages statistic page. >> - putting the project in production of gentoo-infra >> > > @Corentin: I tend to agree with these goals. Thanks for the answer. > > @Yanbo, your ideas are good, but keep security and privacy in mind. I > think that these are the key points of the success of this project. > Also, automatic bug reports are cool, but we need to stabilize the > basic features first. > Yes, thanks for yours and Corentin's suggestion, stabilization and adding more features to the web interface are most import to complete. There has been one package named abrt [2], and this tool helps users easily report bugs of packages to Bugzilla. My idea about bug report is reporting package name with some comment to the server with terminal like current gentoostats-send script when the package crashs. On the server side, reports are analysed offline. We can extract the usefull things and help developers find bugs. For example, most of users build one package with the same CFLAG have the same problem or install one package with another package together have the same crash info. > Best regards, > > -- > Rafael Goncalves Martins > Gentoo Linux developer > http://rafaelmartins.eng.br/ > [1] http://packages.gentoo.org/package/www-client/pybugz [2] http://packages.gentoo.org/package/app-admin/abrt --=20 Thanks, Yanbo Wang