public inbox for gentoo-soc@lists.gentoo.org
 help / color / mirror / Atom feed
* [gentoo-soc] Package statistics reporting tool
@ 2011-03-29 13:24 Vikraman
  2011-03-29 13:53 ` Donnie Berkholz
  0 siblings, 1 reply; 10+ messages in thread
From: Vikraman @ 2011-03-29 13:24 UTC (permalink / raw
  To: gentoo-soc

I've been doing research on this project for sometime, and would really 
like my ideas to be reviewed by the community before submitting my application.

* User runs a script as a cronjob (or a portage hook ?), which collects 
stats on installed packages, and uploads to the online database. Surprisingly 
easy to code using the gentoolkit API.

* Webapp generates statistics on the packages (top packages, versions, 
popularity, packages from overlays). Thanks to ideas from an old thread on 
the forums. Devs, please suggest some more useful ideas/stats too.

* Dev/user runs a cli script to read stats or a gui program to display 
graphs.

I'm looking forward to coding the project in Python, gui using PyGTK, 
database using MySQL. For the web service implementation, should I use 
SOAP (soappy, which I'll need to learn) or something like web.py (I've 
used it earlier) ?

It was also suggested earlier to allow a dev to trigger a popup feedback 
from the user. I think that would require the user ro run a daemon, which 
will listen for queries.

Please suggest what details to include and what not to include in the 
application.

--
vh4x0r



^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [gentoo-soc] Package statistics reporting tool
  2011-03-29 13:24 [gentoo-soc] Package statistics reporting tool Vikraman
@ 2011-03-29 13:53 ` Donnie Berkholz
  2011-03-29 14:20   ` Brian Dolbec
  0 siblings, 1 reply; 10+ messages in thread
From: Donnie Berkholz @ 2011-03-29 13:53 UTC (permalink / raw
  To: gentoo-soc

[-- Attachment #1: Type: text/plain, Size: 2945 bytes --]

On 18:54 Tue 29 Mar     , Vikraman wrote:
> I've been doing research on this project for sometime, and would
> really like my ideas to be reviewed by the community before
> submitting my application.
> 
> * User runs a script as a cronjob (or a portage hook ?), which
> collects stats on installed packages, and uploads to the online
> database. Surprisingly easy to code using the gentoolkit API.

Sure, one of those two options sounds good. The choice can be easily 
changed later, but I prefer the idea of a hook that runs after the 
package manager completes installing a package set.

People on laptops may not run cron jobs regularly, or at all.

> * Webapp generates statistics on the packages (top packages,
> versions, popularity, packages from overlays). Thanks to ideas from
> an old thread on the forums. Devs, please suggest some more useful
> ideas/stats too.

For Gentoo users:
- Search by package name
- Search by packages often installed with this one (recommendations)
- Add ratings. Maybe just via the GUI frontend for now.
- Add user reputations and profiles. How many ratings, etc. Provide the 
  option of logging in as user or anonymously for package uploads.


Tools for Gentoo developers:
- Least popular packages
- Search by maintainer/herd
- Integrate with Bugzilla by showing # of bugs & linking to the list
- Integrate w/ BZ by showing # of tinderbox-related bugs

For example, a useful search would be ≤20 users, package assigned to 
maintainer-needed or maintainer wanted, and ≥1 open bugs.

> * Dev/user runs a cli script to read stats or a gui program to
> display graphs.
>
> I'm looking forward to coding the project in Python, gui using
> PyGTK, database using MySQL. For the web service implementation,
> should I use SOAP (soappy, which I'll need to learn) or something
> like web.py (I've used it earlier) ?

What is the role of the GUI? Please describe the use cases and justify 
why it should be a desktop app rather than a webapp.

Consider REST for the web service rather than SOAP. There's a nice book 
called "RESTful Web Services" you might like to take a look at.

> It was also suggested earlier to allow a dev to trigger a popup
> feedback from the user. I think that would require the user ro run a
> daemon, which will listen for queries.

It doesn't necessarily have to be instant. Upon the next installation, 
when uploading new stats, the client could check for messages.

> Please suggest what details to include and what not to include in
> the application.

Many resources exist on this topic:

- http://www.gentoo.org/proj/en/userrel/soc/applying.xml
- http://www.booki.cc/gsocstudentguide/_v/1.0/writing-a-proposal/
- http://google-opensource.blogspot.com/2011/03/dos-and-donts-of-google-summer-of-code.html

-- 
Thanks,
Donnie

Donnie Berkholz
Admin, Summer of Code
Gentoo Linux
Blog: http://dberkholz.com

[-- Attachment #2: Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [gentoo-soc] Package statistics reporting tool
  2011-03-29 13:53 ` Donnie Berkholz
@ 2011-03-29 14:20   ` Brian Dolbec
  2011-03-29 19:01     ` Yanbo Wang
  0 siblings, 1 reply; 10+ messages in thread
From: Brian Dolbec @ 2011-03-29 14:20 UTC (permalink / raw
  To: gentoo-soc

[-- Attachment #1: Type: text/plain, Size: 3496 bytes --]

On Tue, 2011-03-29 at 08:53 -0500, Donnie Berkholz wrote:
> On 18:54 Tue 29 Mar     , Vikraman wrote:
> > I've been doing research on this project for sometime, and would
> > really like my ideas to be reviewed by the community before
> > submitting my application.
> > 
> > * User runs a script as a cronjob (or a portage hook ?), which
> > collects stats on installed packages, and uploads to the online
> > database. Surprisingly easy to code using the gentoolkit API.

:) Thanks, anything else needed in there, just ask.

> 
> Sure, one of those two options sounds good. The choice can be easily 
> changed later, but I prefer the idea of a hook that runs after the 
> package manager completes installing a package set.
> 
> People on laptops may not run cron jobs regularly, or at all.

 good point.

> 
> > * Webapp generates statistics on the packages (top packages,
> > versions, popularity, packages from overlays). Thanks to ideas from
> > an old thread on the forums. Devs, please suggest some more useful
> > ideas/stats too.
> 
> For Gentoo users:
> - Search by package name
> - Search by packages often installed with this one (recommendations)
> - Add ratings. Maybe just via the GUI frontend for now.
> - Add user reputations and profiles. How many ratings, etc. Provide the 
>   option of logging in as user or anonymously for package uploads.
> 
> 
> Tools for Gentoo developers:
> - Least popular packages
> - Search by maintainer/herd
> - Integrate with Bugzilla by showing # of bugs & linking to the list
> - Integrate w/ BZ by showing # of tinderbox-related bugs
> 
> For example, a useful search would be ≤20 users, package assigned to 
> maintainer-needed or maintainer wanted, and ≥1 open bugs.
> 
> > * Dev/user runs a cli script to read stats or a gui program to
> > display graphs.
> >
> > I'm looking forward to coding the project in Python, gui using
> > PyGTK, database using MySQL. For the web service implementation,
> > should I use SOAP (soappy, which I'll need to learn) or something
> > like web.py (I've used it earlier) ?
> 
> What is the role of the GUI? Please describe the use cases and justify 
> why it should be a desktop app rather than a webapp.

As much of a gui person as I am, I too think a web browser would be
preferred connecting for this. It could also be expanded as the need or
new query type comes up and then becomes available to all without the
need to upgrade.

> 
> Consider REST for the web service rather than SOAP. There's a nice book 
> called "RESTful Web Services" you might like to take a look at.
> 
> > It was also suggested earlier to allow a dev to trigger a popup
> > feedback from the user. I think that would require the user ro run a
> > daemon, which will listen for queries.
> 
> It doesn't necessarily have to be instant. Upon the next installation, 
> when uploading new stats, the client could check for messages.

Yes, this was what I was trying to suggest at the time, perhaps I could
have chosen better words :/

> 
> > Please suggest what details to include and what not to include in
> > the application.
> 
> Many resources exist on this topic:
> 
> - http://www.gentoo.org/proj/en/userrel/soc/applying.xml
> - http://www.booki.cc/gsocstudentguide/_v/1.0/writing-a-proposal/
> - http://google-opensource.blogspot.com/2011/03/dos-and-donts-of-google-summer-of-code.html
> 


-- 
Brian Dolbec <brian.dolbec@gmail.com>

[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 490 bytes --]

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [gentoo-soc] Package statistics reporting tool
  2011-03-29 14:20   ` Brian Dolbec
@ 2011-03-29 19:01     ` Yanbo Wang
  2011-03-29 19:11       ` Vikraman Choudhury
  0 siblings, 1 reply; 10+ messages in thread
From: Yanbo Wang @ 2011-03-29 19:01 UTC (permalink / raw
  To: gentoo-soc

[-- Attachment #1: Type: text/plain, Size: 4423 bytes --]

Hi all, I'm a student from China. I'm interested in this idea and have 
different ideas to share.

于 2011-3-29 22:20, Brian Dolbec 写道:
> On Tue, 2011-03-29 at 08:53 -0500, Donnie Berkholz wrote:
>> On 18:54 Tue 29 Mar     , Vikraman wrote:
>>> I've been doing research on this project for sometime, and would
>>> really like my ideas to be reviewed by the community before
>>> submitting my application.
>>>
>>> * User runs a script as a cronjob (or a portage hook ?), which
>>> collects stats on installed packages, and uploads to the online
>>> database. Surprisingly easy to code using the gentoolkit API.
> :) Thanks, anything else needed in there, just ask.
>
>> Sure, one of those two options sounds good. The choice can be easily
>> changed later, but I prefer the idea of a hook that runs after the
>> package manager completes installing a package set.
>>
>> People on laptops may not run cron jobs regularly, or at all.
>   good point.
>
>>> * Webapp generates statistics on the packages (top packages,
>>> versions, popularity, packages from overlays). Thanks to ideas from
>>> an old thread on the forums. Devs, please suggest some more useful
>>> ideas/stats too.
>> For Gentoo users:
>> - Search by package name
>> - Search by packages often installed with this one (recommendations)
>> - Add ratings. Maybe just via the GUI frontend for now.
>> - Add user reputations and profiles. How many ratings, etc. Provide the
>>    option of logging in as user or anonymously for package uploads.
>>
>>
I think the statistics about relationship between packages is more 
important, because if the packages installed together run well and 
appear together frequently, we can recommend one package to the user 
when the other package is installed. and if the packages installed 
together usually crash, there may be some problems between them.
So there can be more types of deep statistics that provide more useful 
information.
>> Tools for Gentoo developers:
>> - Least popular packages
>> - Search by maintainer/herd
>> - Integrate with Bugzilla by showing # of bugs&  linking to the list
>> - Integrate w/ BZ by showing # of tinderbox-related bugs
>>
>> For example, a useful search would be ≤20 users, package assigned to
>> maintainer-needed or maintainer wanted, and ≥1 open bugs.
>>
>>> * Dev/user runs a cli script to read stats or a gui program to
>>> display graphs.
>>>
>>> I'm looking forward to coding the project in Python, gui using
>>> PyGTK, database using MySQL. For the web service implementation,
>>> should I use SOAP (soappy, which I'll need to learn) or something
>>> like web.py (I've used it earlier) ?
>> What is the role of the GUI? Please describe the use cases and justify
>> why it should be a desktop app rather than a webapp.
> As much of a gui person as I am, I too think a web browser would be
> preferred connecting for this. It could also be expanded as the need or
> new query type comes up and then becomes available to all without the
> need to upgrade.
>
>> Consider REST for the web service rather than SOAP. There's a nice book
>> called "RESTful Web Services" you might like to take a look at.
>>
>>> It was also suggested earlier to allow a dev to trigger a popup
>>> feedback from the user. I think that would require the user ro run a
>>> daemon, which will listen for queries.
>> It doesn't necessarily have to be instant. Upon the next installation,
>> when uploading new stats, the client could check for messages.
> Yes, this was what I was trying to suggest at the time, perhaps I could
> have chosen better words :/
>
>>> Please suggest what details to include and what not to include in
>>> the application.
>> Many resources exist on this topic:
>>
>> - http://www.gentoo.org/proj/en/userrel/soc/applying.xml
>> - http://www.booki.cc/gsocstudentguide/_v/1.0/writing-a-proposal/
>> - http://google-opensource.blogspot.com/2011/03/dos-and-donts-of-google-summer-of-code.html
>>
>
The informations about installed packages are regularly uploaded to the 
online database, but they are usually reduplicate because most of them 
may be same. I think each host can be identified by a unique random id 
that is used to track submissions issued by the same host. Security is 
very important for this project, the host id should be kept secret.

-- 
Thanks,
Yanbo Wang


[-- Attachment #2: Type: text/html, Size: 7288 bytes --]

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [gentoo-soc] Package statistics reporting tool
  2011-03-29 19:01     ` Yanbo Wang
@ 2011-03-29 19:11       ` Vikraman Choudhury
  2011-04-08 15:18         ` Vikraman Choudhury
  0 siblings, 1 reply; 10+ messages in thread
From: Vikraman Choudhury @ 2011-03-29 19:11 UTC (permalink / raw
  To: gentoo-soc

[-- Attachment #1: Type: text/plain, Size: 4672 bytes --]

On Tue, Mar 29, 2011 at 7:01 PM, Yanbo Wang <wybb86@gmail.com> wrote:

>  Hi all, I'm a student from China. I'm interested in this idea and have
> different ideas to share.
>
> 于 2011-3-29 22:20, Brian Dolbec 写道:
>
> On Tue, 2011-03-29 at 08:53 -0500, Donnie Berkholz wrote:
>
>  On 18:54 Tue 29 Mar     , Vikraman wrote:
>
>  I've been doing research on this project for sometime, and would
> really like my ideas to be reviewed by the community before
> submitting my application.
>
> * User runs a script as a cronjob (or a portage hook ?), which
> collects stats on installed packages, and uploads to the online
> database. Surprisingly easy to code using the gentoolkit API.
>
>  :) Thanks, anything else needed in there, just ask.
>
>
>  Sure, one of those two options sounds good. The choice can be easily
> changed later, but I prefer the idea of a hook that runs after the
> package manager completes installing a package set.
>
> People on laptops may not run cron jobs regularly, or at all.
>
>   good point.
>
>
>   * Webapp generates statistics on the packages (top packages,
> versions, popularity, packages from overlays). Thanks to ideas from
> an old thread on the forums. Devs, please suggest some more useful
> ideas/stats too.
>
>  For Gentoo users:
> - Search by package name
> - Search by packages often installed with this one (recommendations)
> - Add ratings. Maybe just via the GUI frontend for now.
> - Add user reputations and profiles. How many ratings, etc. Provide the
>   option of logging in as user or anonymously for package uploads.
>
>
>
>  I think the statistics about relationship between packages is more
> important, because if the packages installed together run well and appear
> together frequently, we can recommend one package to the user when the
> other package is installed. and if the packages installed together usually
> crash, there may be some problems between them.
> So there can be more types of deep statistics that provide more useful
> information.
>
>  Tools for Gentoo developers:
> - Least popular packages
> - Search by maintainer/herd
> - Integrate with Bugzilla by showing # of bugs & linking to the list
> - Integrate w/ BZ by showing # of tinderbox-related bugs
>
> For example, a useful search would be ≤20 users, package assigned to
> maintainer-needed or maintainer wanted, and ≥1 open bugs.
>
>
>  * Dev/user runs a cli script to read stats or a gui program to
> display graphs.
>
> I'm looking forward to coding the project in Python, gui using
> PyGTK, database using MySQL. For the web service implementation,
> should I use SOAP (soappy, which I'll need to learn) or something
> like web.py (I've used it earlier) ?
>
>  What is the role of the GUI? Please describe the use cases and justify
> why it should be a desktop app rather than a webapp.
>
>  As much of a gui person as I am, I too think a web browser would be
> preferred connecting for this. It could also be expanded as the need or
> new query type comes up and then becomes available to all without the
> need to upgrade.
>
>
>  Consider REST for the web service rather than SOAP. There's a nice book
> called "RESTful Web Services" you might like to take a look at.
>
>
>  It was also suggested earlier to allow a dev to trigger a popup
> feedback from the user. I think that would require the user ro run a
> daemon, which will listen for queries.
>
>  It doesn't necessarily have to be instant. Upon the next installation,
> when uploading new stats, the client could check for messages.
>
>  Yes, this was what I was trying to suggest at the time, perhaps I could
> have chosen better words :/
>
>
>   Please suggest what details to include and what not to include in
> the application.
>
>  Many resources exist on this topic:
>
> - http://www.gentoo.org/proj/en/userrel/soc/applying.xml
> - http://www.booki.cc/gsocstudentguide/_v/1.0/writing-a-proposal/
> - http://google-opensource.blogspot.com/2011/03/dos-and-donts-of-google-summer-of-code.html
>
>   The informations about installed packages are regularly uploaded to the
> online database, but they are usually reduplicate because most of them may
> be same. I think each host can be identified by a unique random id that is
> used to track submissions issued by the same host. Security is very
> important for this project, the host id should be kept secret.
>
> --
> Thanks,
> Yanbo Wang
>
>  Yes, smolt handles that by generating a uuid from
/proc/sys/kernel/random/uuid for the first time and saving it in
/etc/smolt/hw-uuid

-- 
vh4x0r

[-- Attachment #2: Type: text/html, Size: 7016 bytes --]

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [gentoo-soc] Package statistics reporting tool
  2011-03-29 19:11       ` Vikraman Choudhury
@ 2011-04-08 15:18         ` Vikraman Choudhury
  0 siblings, 0 replies; 10+ messages in thread
From: Vikraman Choudhury @ 2011-04-08 15:18 UTC (permalink / raw
  To: gentoo-soc

[-- Attachment #1: Type: text/plain, Size: 318 bytes --]

Hi, I've finally finished my application and submitted it. Please take your
time and suggest improvements and/or additions to it. I had to do this last
minute, since I've been seriously ill for the past few days (I still am).

[1]
http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/vh4x0r/1

--
vh4x0r

[-- Attachment #2: Type: text/html, Size: 549 bytes --]

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [gentoo-soc] Package statistics reporting tool
@ 2012-03-15 13:44 Yanbo Wang
  2012-03-16 15:59 ` Corentin Chary
  0 siblings, 1 reply; 10+ messages in thread
From: Yanbo Wang @ 2012-03-15 13:44 UTC (permalink / raw
  To: gentoo-soc

Hi everyone,

I'm a post graduate student from China. I'm interested in the idea about
Package statistics reporting tool and I'd like to take part in this year's GSoc
as a student.

I have read the code written by Vikraman and deployed the server on my gentoo.
The server gathered a lot of information about environment and
packages installed.
I could get the statistic through web interface or terminal.That's
very great, I really want
to contribute to the project.

Please tell me this year's focus, and I have some ideas to share.
1. Upload more information such as developers' email, whether the packages
installed is stable or not,  the crash info.
2. Some statistics like popularity(recently installed packages, least
popular packages)
or the relationship between packages could be done in a cronjob.
3. Integrate with bugzilla. Developers can easily get the bugs of
packages through
statistic web interface or terminal, like PyBugz [1]. These will help
developers find bugs
of their packages easier.

About me:
I’ve had 3 years’ experience in java development (Java Web and Swing), 2 years'
in python development (Django) and also be fimilar with c/c++.  I’ve been using
gentoo for over 1 year and I really want to contribute a little to
gentoo community.

[1] http://www.liquidx.net/pybugz/
-- 
Thanks,
Yanbo Wang



^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [gentoo-soc] Package statistics reporting tool
  2012-03-15 13:44 Yanbo Wang
@ 2012-03-16 15:59 ` Corentin Chary
  2012-03-16 23:37   ` Rafael Goncalves Martins
  0 siblings, 1 reply; 10+ messages in thread
From: Corentin Chary @ 2012-03-16 15:59 UTC (permalink / raw
  To: gentoo-soc; +Cc: rafaelmartins

Hi,

Ccing Rafael.

On Thu, Mar 15, 2012 at 2:44 PM, Yanbo Wang <wybb86@gmail.com> wrote:
> Hi everyone,
>
> I'm a post graduate student from China. I'm interested in the idea about
> Package statistics reporting tool and I'd like to take part in this year's GSoc
> as a student.
>
> I have read the code written by Vikraman and deployed the server on my gentoo.
> The server gathered a lot of information about environment and
> packages installed.
> I could get the statistic through web interface or terminal.That's
> very great, I really want
> to contribute to the project.
>
> Please tell me this year's focus, and I have some ideas to share.
> 1. Upload more information such as developers' email, whether the packages
> installed is stable or not,  the crash info.
> 2. Some statistics like popularity(recently installed packages, least
> popular packages)
> or the relationship between packages could be done in a cronjob.
> 3. Integrate with bugzilla. Developers can easily get the bugs of
> packages through
> statistic web interface or terminal, like PyBugz [1]. These will help
> developers find bugs
> of their packages easier.
>
> About me:
> I’ve had 3 years’ experience in java development (Java Web and Swing), 2 years'
> in python development (Django) and also be fimilar with c/c++.  I’ve been using
> gentoo for over 1 year and I really want to contribute a little to
> gentoo community.
>
> [1] http://www.liquidx.net/pybugz/

I took a quick look at last year code, and I think the project would
be easier to extend if it was using django. This would avoid writing
custom SQL queries, etc... And you can probably steal some code from
euscan to do that.
Note that I'm not the mentor for this project and it is just an idea.

I think this years goals should be:
- stabilization of the client, and integration with the base system,
documentation during installation, etc...
- adding more features to the web interface, like what we can find  on
http://popcon.debian.org/ (charts, sortable tables, overlays usage,
etc..)
- integration with other gentoo webservices (how ?)
- putting the project in production of gentoo-infra


-- 
Corentin Chary
http://xf.iksaif.net



^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [gentoo-soc] Package statistics reporting tool
  2012-03-16 15:59 ` Corentin Chary
@ 2012-03-16 23:37   ` Rafael Goncalves Martins
  2012-03-17 10:49     ` Yanbo Wang
  0 siblings, 1 reply; 10+ messages in thread
From: Rafael Goncalves Martins @ 2012-03-16 23:37 UTC (permalink / raw
  To: Corentin Chary; +Cc: gentoo-soc

Hi,

On Fri, Mar 16, 2012 at 12:59 PM, Corentin Chary
<corentin.chary@gmail.com> wrote:
> Hi,
>
> Ccing Rafael.
>
> On Thu, Mar 15, 2012 at 2:44 PM, Yanbo Wang <wybb86@gmail.com> wrote:
>> Hi everyone,
>>
>> I'm a post graduate student from China. I'm interested in the idea about
>> Package statistics reporting tool and I'd like to take part in this year's GSoc
>> as a student.
>>
>> I have read the code written by Vikraman and deployed the server on my gentoo.
>> The server gathered a lot of information about environment and
>> packages installed.
>> I could get the statistic through web interface or terminal.That's
>> very great, I really want
>> to contribute to the project.
>> Please tell me this year's focus, and I have some ideas to share.
>> 1. Upload more information such as developers' email, whether the packages
>> installed is stable or not,  the crash info.
>> 2. Some statistics like popularity(recently installed packages, least
>> popular packages)
>> or the relationship between packages could be done in a cronjob.
>> 3. Integrate with bugzilla. Developers can easily get the bugs of
>> packages through
>> statistic web interface or terminal, like PyBugz [1]. These will help
>> developers find bugs
>> of their packages easier.

@Yanbo, thanks for your interest. Some of your ideas are great, but
one of the things that we should keep in mind for this project is
privacy. We should make sure that the user is aware of what data he is
sending for our servers, and disable it if needed. The code should be
able to handle this properly. I think that we shouldn't install a
cronjob automatically, for example.

>> About me:
>> I’ve had 3 years’ experience in java development (Java Web and Swing), 2 years'
>> in python development (Django) and also be fimilar with c/c++.  I’ve been using
>> gentoo for over 1 year and I really want to contribute a little to
>> gentoo community.
>>
>> [1] http://www.liquidx.net/pybugz/
>
> I took a quick look at last year code, and I think the project would
> be easier to extend if it was using django. This would avoid writing
> custom SQL queries, etc... And you can probably steal some code from
> euscan to do that.
> Note that I'm not the mentor for this project and it is just an idea.

@Corentin: During last year's gsoc I talked quickly with Vikraman and
Alec about the web framework being used. I agree that web.py isn't the
better tool for the job, I think I suggested flask that time, don't
recall correctly.

This topic should be well discussed, because any change of framework
will spend a considerable amount of time, considering that we just
have 3 months.

> I think this years goals should be:
> - stabilization of the client, and integration with the base system,
> documentation during installation, etc...
> - adding more features to the web interface, like what we can find  on
> http://popcon.debian.org/ (charts, sortable tables, overlays usage,
> etc..)
> - integration with other gentoo webservices (how ?)
> - putting the project in production of gentoo-infra
>

@Corentin: I tend to agree with these goals. Thanks for the answer.

@Yanbo, your ideas are good, but keep security and privacy in mind. I
think that these are the key points of the success of this project.
Also, automatic bug reports are cool, but we need to stabilize the
basic features first.

Best regards,

-- 
Rafael Goncalves Martins
Gentoo Linux developer
http://rafaelmartins.eng.br/



^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [gentoo-soc] Package statistics reporting tool
  2012-03-16 23:37   ` Rafael Goncalves Martins
@ 2012-03-17 10:49     ` Yanbo Wang
  0 siblings, 0 replies; 10+ messages in thread
From: Yanbo Wang @ 2012-03-17 10:49 UTC (permalink / raw
  To: gentoo-soc

Thanks for your reply.

2012/3/17 Rafael Goncalves Martins <rafaelmartins@gentoo.org>:
> Hi,
>
> On Fri, Mar 16, 2012 at 12:59 PM, Corentin Chary
> <corentin.chary@gmail.com> wrote:
>> Hi,
>>
>> Ccing Rafael.
>>
>> On Thu, Mar 15, 2012 at 2:44 PM, Yanbo Wang <wybb86@gmail.com> wrote:
>>> Hi everyone,
>>>
>>> I'm a post graduate student from China. I'm interested in the idea about
>>> Package statistics reporting tool and I'd like to take part in this year's GSoc
>>> as a student.
>>>
>>> I have read the code written by Vikraman and deployed the server on my gentoo.
>>> The server gathered a lot of information about environment and
>>> packages installed.
>>> I could get the statistic through web interface or terminal.That's
>>> very great, I really want
>>> to contribute to the project.
>>> Please tell me this year's focus, and I have some ideas to share.
>>> 1. Upload more information such as developers' email, whether the packages
>>> installed is stable or not,  the crash info.
>>> 2. Some statistics like popularity(recently installed packages, least
>>> popular packages)
>>> or the relationship between packages could be done in a cronjob.
>>> 3. Integrate with bugzilla. Developers can easily get the bugs of
>>> packages through
>>> statistic web interface or terminal, like PyBugz [1]. These will help
>>> developers find bugs
>>> of their packages easier.
>
> @Yanbo, thanks for your interest. Some of your ideas are great, but
> one of the things that we should keep in mind for this project is
> privacy. We should make sure that the user is aware of what data he is
> sending for our servers, and disable it if needed. The code should be
> able to handle this properly. I think that we shouldn't install a
> cronjob automatically, for example.
>

Of course, privacy is an important part of this project. Currently,
users can decide what data they want to sending  by payload.cfg file
and it will be easily extended if we add more features. The client
generates a UUID for the host on installation and the server uses UUID
to identify the host and packages installed.
Maybe my expression was not clear, a cronjob could be installed on the
server, not on the host, because the extraction of some interesting
statistics, like relationship between packages, will be time-consuming
and we should preprocess them with a cronjob. These statistics may be
used for recommending related packages or finding bugs between
packages.

>>> About me:
>>> I’ve had 3 years’ experience in java development (Java Web and Swing), 2 years'
>>> in python development (Django) and also be fimilar with c/c++.  I’ve been using
>>> gentoo for over 1 year and I really want to contribute a little to
>>> gentoo community.
>>>
>>> [1] http://www.liquidx.net/pybugz/
>>
>> I took a quick look at last year code, and I think the project would
>> be easier to extend if it was using django. This would avoid writing
>> custom SQL queries, etc... And you can probably steal some code from
>> euscan to do that.
>> Note that I'm not the mentor for this project and it is just an idea.
>
> @Corentin: During last year's gsoc I talked quickly with Vikraman and
> Alec about the web framework being used. I agree that web.py isn't the
> better tool for the job, I think I suggested flask that time, don't
> recall correctly.
>

Django is okey for me and I can learn something related from euscan.
Thanks for Corentin's advice.

> This topic should be well discussed, because any change of framework
> will spend a considerable amount of time, considering that we just
> have 3 months.
>
>> I think this years goals should be:
>> - stabilization of the client, and integration with the base system,
>> documentation during installation, etc...
>> - adding more features to the web interface, like what we can find  on
>> http://popcon.debian.org/ (charts, sortable tables, overlays usage,
>> etc..)
>> - integration with other gentoo webservices (how ?)

Like PyBugz [1], developers can get bugs through terminal when they
query packages, rather than Bugzilla website. In addition, we can add
bugs' links to Bugzilla website in packages statistic page.

>> - putting the project in production of gentoo-infra
>>
>
> @Corentin: I tend to agree with these goals. Thanks for the answer.
>
> @Yanbo, your ideas are good, but keep security and privacy in mind. I
> think that these are the key points of the success of this project.
> Also, automatic bug reports are cool, but we need to stabilize the
> basic features first.
>

Yes, thanks for yours and Corentin's suggestion, stabilization and
adding more features to the web interface are most import to complete.
There has been one package named abrt [2], and this tool helps users
easily report bugs of packages to Bugzilla. My idea about bug report
is reporting package name with some comment to the server with
terminal like current gentoostats-send script when the package crashs.
On the server side, reports are analysed offline. We can extract the
usefull things and help developers find bugs. For example, most of
users build one package with the same CFLAG have the same problem or
install one package with another package together have the same crash
info.

> Best regards,
>
> --
> Rafael Goncalves Martins
> Gentoo Linux developer
> http://rafaelmartins.eng.br/
>

[1] http://packages.gentoo.org/package/www-client/pybugz
[2] http://packages.gentoo.org/package/app-admin/abrt

-- 
Thanks,
Yanbo Wang



^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2012-03-17 12:08 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-03-29 13:24 [gentoo-soc] Package statistics reporting tool Vikraman
2011-03-29 13:53 ` Donnie Berkholz
2011-03-29 14:20   ` Brian Dolbec
2011-03-29 19:01     ` Yanbo Wang
2011-03-29 19:11       ` Vikraman Choudhury
2011-04-08 15:18         ` Vikraman Choudhury
  -- strict thread matches above, loose matches on Subject: below --
2012-03-15 13:44 Yanbo Wang
2012-03-16 15:59 ` Corentin Chary
2012-03-16 23:37   ` Rafael Goncalves Martins
2012-03-17 10:49     ` Yanbo Wang

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox