* [gentoo-soc] Package statistics reporting tool
@ 2011-03-29 13:24 Vikraman
2011-03-29 13:53 ` Donnie Berkholz
0 siblings, 1 reply; 10+ messages in thread
From: Vikraman @ 2011-03-29 13:24 UTC (permalink / raw
To: gentoo-soc
I've been doing research on this project for sometime, and would really
like my ideas to be reviewed by the community before submitting my application.
* User runs a script as a cronjob (or a portage hook ?), which collects
stats on installed packages, and uploads to the online database. Surprisingly
easy to code using the gentoolkit API.
* Webapp generates statistics on the packages (top packages, versions,
popularity, packages from overlays). Thanks to ideas from an old thread on
the forums. Devs, please suggest some more useful ideas/stats too.
* Dev/user runs a cli script to read stats or a gui program to display
graphs.
I'm looking forward to coding the project in Python, gui using PyGTK,
database using MySQL. For the web service implementation, should I use
SOAP (soappy, which I'll need to learn) or something like web.py (I've
used it earlier) ?
It was also suggested earlier to allow a dev to trigger a popup feedback
from the user. I think that would require the user ro run a daemon, which
will listen for queries.
Please suggest what details to include and what not to include in the
application.
--
vh4x0r
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-soc] Package statistics reporting tool
2011-03-29 13:24 [gentoo-soc] Package statistics reporting tool Vikraman
@ 2011-03-29 13:53 ` Donnie Berkholz
2011-03-29 14:20 ` Brian Dolbec
0 siblings, 1 reply; 10+ messages in thread
From: Donnie Berkholz @ 2011-03-29 13:53 UTC (permalink / raw
To: gentoo-soc
[-- Attachment #1: Type: text/plain, Size: 2945 bytes --]
On 18:54 Tue 29 Mar , Vikraman wrote:
> I've been doing research on this project for sometime, and would
> really like my ideas to be reviewed by the community before
> submitting my application.
>
> * User runs a script as a cronjob (or a portage hook ?), which
> collects stats on installed packages, and uploads to the online
> database. Surprisingly easy to code using the gentoolkit API.
Sure, one of those two options sounds good. The choice can be easily
changed later, but I prefer the idea of a hook that runs after the
package manager completes installing a package set.
People on laptops may not run cron jobs regularly, or at all.
> * Webapp generates statistics on the packages (top packages,
> versions, popularity, packages from overlays). Thanks to ideas from
> an old thread on the forums. Devs, please suggest some more useful
> ideas/stats too.
For Gentoo users:
- Search by package name
- Search by packages often installed with this one (recommendations)
- Add ratings. Maybe just via the GUI frontend for now.
- Add user reputations and profiles. How many ratings, etc. Provide the
option of logging in as user or anonymously for package uploads.
Tools for Gentoo developers:
- Least popular packages
- Search by maintainer/herd
- Integrate with Bugzilla by showing # of bugs & linking to the list
- Integrate w/ BZ by showing # of tinderbox-related bugs
For example, a useful search would be ≤20 users, package assigned to
maintainer-needed or maintainer wanted, and ≥1 open bugs.
> * Dev/user runs a cli script to read stats or a gui program to
> display graphs.
>
> I'm looking forward to coding the project in Python, gui using
> PyGTK, database using MySQL. For the web service implementation,
> should I use SOAP (soappy, which I'll need to learn) or something
> like web.py (I've used it earlier) ?
What is the role of the GUI? Please describe the use cases and justify
why it should be a desktop app rather than a webapp.
Consider REST for the web service rather than SOAP. There's a nice book
called "RESTful Web Services" you might like to take a look at.
> It was also suggested earlier to allow a dev to trigger a popup
> feedback from the user. I think that would require the user ro run a
> daemon, which will listen for queries.
It doesn't necessarily have to be instant. Upon the next installation,
when uploading new stats, the client could check for messages.
> Please suggest what details to include and what not to include in
> the application.
Many resources exist on this topic:
- http://www.gentoo.org/proj/en/userrel/soc/applying.xml
- http://www.booki.cc/gsocstudentguide/_v/1.0/writing-a-proposal/
- http://google-opensource.blogspot.com/2011/03/dos-and-donts-of-google-summer-of-code.html
--
Thanks,
Donnie
Donnie Berkholz
Admin, Summer of Code
Gentoo Linux
Blog: http://dberkholz.com
[-- Attachment #2: Type: application/pgp-signature, Size: 198 bytes --]
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-soc] Package statistics reporting tool
2011-03-29 13:53 ` Donnie Berkholz
@ 2011-03-29 14:20 ` Brian Dolbec
2011-03-29 19:01 ` Yanbo Wang
0 siblings, 1 reply; 10+ messages in thread
From: Brian Dolbec @ 2011-03-29 14:20 UTC (permalink / raw
To: gentoo-soc
[-- Attachment #1: Type: text/plain, Size: 3496 bytes --]
On Tue, 2011-03-29 at 08:53 -0500, Donnie Berkholz wrote:
> On 18:54 Tue 29 Mar , Vikraman wrote:
> > I've been doing research on this project for sometime, and would
> > really like my ideas to be reviewed by the community before
> > submitting my application.
> >
> > * User runs a script as a cronjob (or a portage hook ?), which
> > collects stats on installed packages, and uploads to the online
> > database. Surprisingly easy to code using the gentoolkit API.
:) Thanks, anything else needed in there, just ask.
>
> Sure, one of those two options sounds good. The choice can be easily
> changed later, but I prefer the idea of a hook that runs after the
> package manager completes installing a package set.
>
> People on laptops may not run cron jobs regularly, or at all.
good point.
>
> > * Webapp generates statistics on the packages (top packages,
> > versions, popularity, packages from overlays). Thanks to ideas from
> > an old thread on the forums. Devs, please suggest some more useful
> > ideas/stats too.
>
> For Gentoo users:
> - Search by package name
> - Search by packages often installed with this one (recommendations)
> - Add ratings. Maybe just via the GUI frontend for now.
> - Add user reputations and profiles. How many ratings, etc. Provide the
> option of logging in as user or anonymously for package uploads.
>
>
> Tools for Gentoo developers:
> - Least popular packages
> - Search by maintainer/herd
> - Integrate with Bugzilla by showing # of bugs & linking to the list
> - Integrate w/ BZ by showing # of tinderbox-related bugs
>
> For example, a useful search would be ≤20 users, package assigned to
> maintainer-needed or maintainer wanted, and ≥1 open bugs.
>
> > * Dev/user runs a cli script to read stats or a gui program to
> > display graphs.
> >
> > I'm looking forward to coding the project in Python, gui using
> > PyGTK, database using MySQL. For the web service implementation,
> > should I use SOAP (soappy, which I'll need to learn) or something
> > like web.py (I've used it earlier) ?
>
> What is the role of the GUI? Please describe the use cases and justify
> why it should be a desktop app rather than a webapp.
As much of a gui person as I am, I too think a web browser would be
preferred connecting for this. It could also be expanded as the need or
new query type comes up and then becomes available to all without the
need to upgrade.
>
> Consider REST for the web service rather than SOAP. There's a nice book
> called "RESTful Web Services" you might like to take a look at.
>
> > It was also suggested earlier to allow a dev to trigger a popup
> > feedback from the user. I think that would require the user ro run a
> > daemon, which will listen for queries.
>
> It doesn't necessarily have to be instant. Upon the next installation,
> when uploading new stats, the client could check for messages.
Yes, this was what I was trying to suggest at the time, perhaps I could
have chosen better words :/
>
> > Please suggest what details to include and what not to include in
> > the application.
>
> Many resources exist on this topic:
>
> - http://www.gentoo.org/proj/en/userrel/soc/applying.xml
> - http://www.booki.cc/gsocstudentguide/_v/1.0/writing-a-proposal/
> - http://google-opensource.blogspot.com/2011/03/dos-and-donts-of-google-summer-of-code.html
>
--
Brian Dolbec <brian.dolbec@gmail.com>
[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 490 bytes --]
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-soc] Package statistics reporting tool
2011-03-29 14:20 ` Brian Dolbec
@ 2011-03-29 19:01 ` Yanbo Wang
2011-03-29 19:11 ` Vikraman Choudhury
0 siblings, 1 reply; 10+ messages in thread
From: Yanbo Wang @ 2011-03-29 19:01 UTC (permalink / raw
To: gentoo-soc
[-- Attachment #1: Type: text/plain, Size: 4423 bytes --]
Hi all, I'm a student from China. I'm interested in this idea and have
different ideas to share.
于 2011-3-29 22:20, Brian Dolbec 写道:
> On Tue, 2011-03-29 at 08:53 -0500, Donnie Berkholz wrote:
>> On 18:54 Tue 29 Mar , Vikraman wrote:
>>> I've been doing research on this project for sometime, and would
>>> really like my ideas to be reviewed by the community before
>>> submitting my application.
>>>
>>> * User runs a script as a cronjob (or a portage hook ?), which
>>> collects stats on installed packages, and uploads to the online
>>> database. Surprisingly easy to code using the gentoolkit API.
> :) Thanks, anything else needed in there, just ask.
>
>> Sure, one of those two options sounds good. The choice can be easily
>> changed later, but I prefer the idea of a hook that runs after the
>> package manager completes installing a package set.
>>
>> People on laptops may not run cron jobs regularly, or at all.
> good point.
>
>>> * Webapp generates statistics on the packages (top packages,
>>> versions, popularity, packages from overlays). Thanks to ideas from
>>> an old thread on the forums. Devs, please suggest some more useful
>>> ideas/stats too.
>> For Gentoo users:
>> - Search by package name
>> - Search by packages often installed with this one (recommendations)
>> - Add ratings. Maybe just via the GUI frontend for now.
>> - Add user reputations and profiles. How many ratings, etc. Provide the
>> option of logging in as user or anonymously for package uploads.
>>
>>
I think the statistics about relationship between packages is more
important, because if the packages installed together run well and
appear together frequently, we can recommend one package to the user
when the other package is installed. and if the packages installed
together usually crash, there may be some problems between them.
So there can be more types of deep statistics that provide more useful
information.
>> Tools for Gentoo developers:
>> - Least popular packages
>> - Search by maintainer/herd
>> - Integrate with Bugzilla by showing # of bugs& linking to the list
>> - Integrate w/ BZ by showing # of tinderbox-related bugs
>>
>> For example, a useful search would be ≤20 users, package assigned to
>> maintainer-needed or maintainer wanted, and ≥1 open bugs.
>>
>>> * Dev/user runs a cli script to read stats or a gui program to
>>> display graphs.
>>>
>>> I'm looking forward to coding the project in Python, gui using
>>> PyGTK, database using MySQL. For the web service implementation,
>>> should I use SOAP (soappy, which I'll need to learn) or something
>>> like web.py (I've used it earlier) ?
>> What is the role of the GUI? Please describe the use cases and justify
>> why it should be a desktop app rather than a webapp.
> As much of a gui person as I am, I too think a web browser would be
> preferred connecting for this. It could also be expanded as the need or
> new query type comes up and then becomes available to all without the
> need to upgrade.
>
>> Consider REST for the web service rather than SOAP. There's a nice book
>> called "RESTful Web Services" you might like to take a look at.
>>
>>> It was also suggested earlier to allow a dev to trigger a popup
>>> feedback from the user. I think that would require the user ro run a
>>> daemon, which will listen for queries.
>> It doesn't necessarily have to be instant. Upon the next installation,
>> when uploading new stats, the client could check for messages.
> Yes, this was what I was trying to suggest at the time, perhaps I could
> have chosen better words :/
>
>>> Please suggest what details to include and what not to include in
>>> the application.
>> Many resources exist on this topic:
>>
>> - http://www.gentoo.org/proj/en/userrel/soc/applying.xml
>> - http://www.booki.cc/gsocstudentguide/_v/1.0/writing-a-proposal/
>> - http://google-opensource.blogspot.com/2011/03/dos-and-donts-of-google-summer-of-code.html
>>
>
The informations about installed packages are regularly uploaded to the
online database, but they are usually reduplicate because most of them
may be same. I think each host can be identified by a unique random id
that is used to track submissions issued by the same host. Security is
very important for this project, the host id should be kept secret.
--
Thanks,
Yanbo Wang
[-- Attachment #2: Type: text/html, Size: 7288 bytes --]
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-soc] Package statistics reporting tool
2011-03-29 19:01 ` Yanbo Wang
@ 2011-03-29 19:11 ` Vikraman Choudhury
2011-04-08 15:18 ` Vikraman Choudhury
0 siblings, 1 reply; 10+ messages in thread
From: Vikraman Choudhury @ 2011-03-29 19:11 UTC (permalink / raw
To: gentoo-soc
[-- Attachment #1: Type: text/plain, Size: 4672 bytes --]
On Tue, Mar 29, 2011 at 7:01 PM, Yanbo Wang <wybb86@gmail.com> wrote:
> Hi all, I'm a student from China. I'm interested in this idea and have
> different ideas to share.
>
> 于 2011-3-29 22:20, Brian Dolbec 写道:
>
> On Tue, 2011-03-29 at 08:53 -0500, Donnie Berkholz wrote:
>
> On 18:54 Tue 29 Mar , Vikraman wrote:
>
> I've been doing research on this project for sometime, and would
> really like my ideas to be reviewed by the community before
> submitting my application.
>
> * User runs a script as a cronjob (or a portage hook ?), which
> collects stats on installed packages, and uploads to the online
> database. Surprisingly easy to code using the gentoolkit API.
>
> :) Thanks, anything else needed in there, just ask.
>
>
> Sure, one of those two options sounds good. The choice can be easily
> changed later, but I prefer the idea of a hook that runs after the
> package manager completes installing a package set.
>
> People on laptops may not run cron jobs regularly, or at all.
>
> good point.
>
>
> * Webapp generates statistics on the packages (top packages,
> versions, popularity, packages from overlays). Thanks to ideas from
> an old thread on the forums. Devs, please suggest some more useful
> ideas/stats too.
>
> For Gentoo users:
> - Search by package name
> - Search by packages often installed with this one (recommendations)
> - Add ratings. Maybe just via the GUI frontend for now.
> - Add user reputations and profiles. How many ratings, etc. Provide the
> option of logging in as user or anonymously for package uploads.
>
>
>
> I think the statistics about relationship between packages is more
> important, because if the packages installed together run well and appear
> together frequently, we can recommend one package to the user when the
> other package is installed. and if the packages installed together usually
> crash, there may be some problems between them.
> So there can be more types of deep statistics that provide more useful
> information.
>
> Tools for Gentoo developers:
> - Least popular packages
> - Search by maintainer/herd
> - Integrate with Bugzilla by showing # of bugs & linking to the list
> - Integrate w/ BZ by showing # of tinderbox-related bugs
>
> For example, a useful search would be ≤20 users, package assigned to
> maintainer-needed or maintainer wanted, and ≥1 open bugs.
>
>
> * Dev/user runs a cli script to read stats or a gui program to
> display graphs.
>
> I'm looking forward to coding the project in Python, gui using
> PyGTK, database using MySQL. For the web service implementation,
> should I use SOAP (soappy, which I'll need to learn) or something
> like web.py (I've used it earlier) ?
>
> What is the role of the GUI? Please describe the use cases and justify
> why it should be a desktop app rather than a webapp.
>
> As much of a gui person as I am, I too think a web browser would be
> preferred connecting for this. It could also be expanded as the need or
> new query type comes up and then becomes available to all without the
> need to upgrade.
>
>
> Consider REST for the web service rather than SOAP. There's a nice book
> called "RESTful Web Services" you might like to take a look at.
>
>
> It was also suggested earlier to allow a dev to trigger a popup
> feedback from the user. I think that would require the user ro run a
> daemon, which will listen for queries.
>
> It doesn't necessarily have to be instant. Upon the next installation,
> when uploading new stats, the client could check for messages.
>
> Yes, this was what I was trying to suggest at the time, perhaps I could
> have chosen better words :/
>
>
> Please suggest what details to include and what not to include in
> the application.
>
> Many resources exist on this topic:
>
> - http://www.gentoo.org/proj/en/userrel/soc/applying.xml
> - http://www.booki.cc/gsocstudentguide/_v/1.0/writing-a-proposal/
> - http://google-opensource.blogspot.com/2011/03/dos-and-donts-of-google-summer-of-code.html
>
> The informations about installed packages are regularly uploaded to the
> online database, but they are usually reduplicate because most of them may
> be same. I think each host can be identified by a unique random id that is
> used to track submissions issued by the same host. Security is very
> important for this project, the host id should be kept secret.
>
> --
> Thanks,
> Yanbo Wang
>
> Yes, smolt handles that by generating a uuid from
/proc/sys/kernel/random/uuid for the first time and saving it in
/etc/smolt/hw-uuid
--
vh4x0r
[-- Attachment #2: Type: text/html, Size: 7016 bytes --]
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-soc] Package statistics reporting tool
2011-03-29 19:11 ` Vikraman Choudhury
@ 2011-04-08 15:18 ` Vikraman Choudhury
0 siblings, 0 replies; 10+ messages in thread
From: Vikraman Choudhury @ 2011-04-08 15:18 UTC (permalink / raw
To: gentoo-soc
[-- Attachment #1: Type: text/plain, Size: 318 bytes --]
Hi, I've finally finished my application and submitted it. Please take your
time and suggest improvements and/or additions to it. I had to do this last
minute, since I've been seriously ill for the past few days (I still am).
[1]
http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/vh4x0r/1
--
vh4x0r
[-- Attachment #2: Type: text/html, Size: 549 bytes --]
^ permalink raw reply [flat|nested] 10+ messages in thread
* [gentoo-soc] Package statistics reporting tool
@ 2012-03-15 13:44 Yanbo Wang
2012-03-16 15:59 ` Corentin Chary
0 siblings, 1 reply; 10+ messages in thread
From: Yanbo Wang @ 2012-03-15 13:44 UTC (permalink / raw
To: gentoo-soc
Hi everyone,
I'm a post graduate student from China. I'm interested in the idea about
Package statistics reporting tool and I'd like to take part in this year's GSoc
as a student.
I have read the code written by Vikraman and deployed the server on my gentoo.
The server gathered a lot of information about environment and
packages installed.
I could get the statistic through web interface or terminal.That's
very great, I really want
to contribute to the project.
Please tell me this year's focus, and I have some ideas to share.
1. Upload more information such as developers' email, whether the packages
installed is stable or not, the crash info.
2. Some statistics like popularity(recently installed packages, least
popular packages)
or the relationship between packages could be done in a cronjob.
3. Integrate with bugzilla. Developers can easily get the bugs of
packages through
statistic web interface or terminal, like PyBugz [1]. These will help
developers find bugs
of their packages easier.
About me:
I’ve had 3 years’ experience in java development (Java Web and Swing), 2 years'
in python development (Django) and also be fimilar with c/c++. I’ve been using
gentoo for over 1 year and I really want to contribute a little to
gentoo community.
[1] http://www.liquidx.net/pybugz/
--
Thanks,
Yanbo Wang
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-soc] Package statistics reporting tool
2012-03-15 13:44 Yanbo Wang
@ 2012-03-16 15:59 ` Corentin Chary
2012-03-16 23:37 ` Rafael Goncalves Martins
0 siblings, 1 reply; 10+ messages in thread
From: Corentin Chary @ 2012-03-16 15:59 UTC (permalink / raw
To: gentoo-soc; +Cc: rafaelmartins
Hi,
Ccing Rafael.
On Thu, Mar 15, 2012 at 2:44 PM, Yanbo Wang <wybb86@gmail.com> wrote:
> Hi everyone,
>
> I'm a post graduate student from China. I'm interested in the idea about
> Package statistics reporting tool and I'd like to take part in this year's GSoc
> as a student.
>
> I have read the code written by Vikraman and deployed the server on my gentoo.
> The server gathered a lot of information about environment and
> packages installed.
> I could get the statistic through web interface or terminal.That's
> very great, I really want
> to contribute to the project.
>
> Please tell me this year's focus, and I have some ideas to share.
> 1. Upload more information such as developers' email, whether the packages
> installed is stable or not, the crash info.
> 2. Some statistics like popularity(recently installed packages, least
> popular packages)
> or the relationship between packages could be done in a cronjob.
> 3. Integrate with bugzilla. Developers can easily get the bugs of
> packages through
> statistic web interface or terminal, like PyBugz [1]. These will help
> developers find bugs
> of their packages easier.
>
> About me:
> I’ve had 3 years’ experience in java development (Java Web and Swing), 2 years'
> in python development (Django) and also be fimilar with c/c++. I’ve been using
> gentoo for over 1 year and I really want to contribute a little to
> gentoo community.
>
> [1] http://www.liquidx.net/pybugz/
I took a quick look at last year code, and I think the project would
be easier to extend if it was using django. This would avoid writing
custom SQL queries, etc... And you can probably steal some code from
euscan to do that.
Note that I'm not the mentor for this project and it is just an idea.
I think this years goals should be:
- stabilization of the client, and integration with the base system,
documentation during installation, etc...
- adding more features to the web interface, like what we can find on
http://popcon.debian.org/ (charts, sortable tables, overlays usage,
etc..)
- integration with other gentoo webservices (how ?)
- putting the project in production of gentoo-infra
--
Corentin Chary
http://xf.iksaif.net
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-soc] Package statistics reporting tool
2012-03-16 15:59 ` Corentin Chary
@ 2012-03-16 23:37 ` Rafael Goncalves Martins
2012-03-17 10:49 ` Yanbo Wang
0 siblings, 1 reply; 10+ messages in thread
From: Rafael Goncalves Martins @ 2012-03-16 23:37 UTC (permalink / raw
To: Corentin Chary; +Cc: gentoo-soc
Hi,
On Fri, Mar 16, 2012 at 12:59 PM, Corentin Chary
<corentin.chary@gmail.com> wrote:
> Hi,
>
> Ccing Rafael.
>
> On Thu, Mar 15, 2012 at 2:44 PM, Yanbo Wang <wybb86@gmail.com> wrote:
>> Hi everyone,
>>
>> I'm a post graduate student from China. I'm interested in the idea about
>> Package statistics reporting tool and I'd like to take part in this year's GSoc
>> as a student.
>>
>> I have read the code written by Vikraman and deployed the server on my gentoo.
>> The server gathered a lot of information about environment and
>> packages installed.
>> I could get the statistic through web interface or terminal.That's
>> very great, I really want
>> to contribute to the project.
>> Please tell me this year's focus, and I have some ideas to share.
>> 1. Upload more information such as developers' email, whether the packages
>> installed is stable or not, the crash info.
>> 2. Some statistics like popularity(recently installed packages, least
>> popular packages)
>> or the relationship between packages could be done in a cronjob.
>> 3. Integrate with bugzilla. Developers can easily get the bugs of
>> packages through
>> statistic web interface or terminal, like PyBugz [1]. These will help
>> developers find bugs
>> of their packages easier.
@Yanbo, thanks for your interest. Some of your ideas are great, but
one of the things that we should keep in mind for this project is
privacy. We should make sure that the user is aware of what data he is
sending for our servers, and disable it if needed. The code should be
able to handle this properly. I think that we shouldn't install a
cronjob automatically, for example.
>> About me:
>> I’ve had 3 years’ experience in java development (Java Web and Swing), 2 years'
>> in python development (Django) and also be fimilar with c/c++. I’ve been using
>> gentoo for over 1 year and I really want to contribute a little to
>> gentoo community.
>>
>> [1] http://www.liquidx.net/pybugz/
>
> I took a quick look at last year code, and I think the project would
> be easier to extend if it was using django. This would avoid writing
> custom SQL queries, etc... And you can probably steal some code from
> euscan to do that.
> Note that I'm not the mentor for this project and it is just an idea.
@Corentin: During last year's gsoc I talked quickly with Vikraman and
Alec about the web framework being used. I agree that web.py isn't the
better tool for the job, I think I suggested flask that time, don't
recall correctly.
This topic should be well discussed, because any change of framework
will spend a considerable amount of time, considering that we just
have 3 months.
> I think this years goals should be:
> - stabilization of the client, and integration with the base system,
> documentation during installation, etc...
> - adding more features to the web interface, like what we can find on
> http://popcon.debian.org/ (charts, sortable tables, overlays usage,
> etc..)
> - integration with other gentoo webservices (how ?)
> - putting the project in production of gentoo-infra
>
@Corentin: I tend to agree with these goals. Thanks for the answer.
@Yanbo, your ideas are good, but keep security and privacy in mind. I
think that these are the key points of the success of this project.
Also, automatic bug reports are cool, but we need to stabilize the
basic features first.
Best regards,
--
Rafael Goncalves Martins
Gentoo Linux developer
http://rafaelmartins.eng.br/
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-soc] Package statistics reporting tool
2012-03-16 23:37 ` Rafael Goncalves Martins
@ 2012-03-17 10:49 ` Yanbo Wang
0 siblings, 0 replies; 10+ messages in thread
From: Yanbo Wang @ 2012-03-17 10:49 UTC (permalink / raw
To: gentoo-soc
Thanks for your reply.
2012/3/17 Rafael Goncalves Martins <rafaelmartins@gentoo.org>:
> Hi,
>
> On Fri, Mar 16, 2012 at 12:59 PM, Corentin Chary
> <corentin.chary@gmail.com> wrote:
>> Hi,
>>
>> Ccing Rafael.
>>
>> On Thu, Mar 15, 2012 at 2:44 PM, Yanbo Wang <wybb86@gmail.com> wrote:
>>> Hi everyone,
>>>
>>> I'm a post graduate student from China. I'm interested in the idea about
>>> Package statistics reporting tool and I'd like to take part in this year's GSoc
>>> as a student.
>>>
>>> I have read the code written by Vikraman and deployed the server on my gentoo.
>>> The server gathered a lot of information about environment and
>>> packages installed.
>>> I could get the statistic through web interface or terminal.That's
>>> very great, I really want
>>> to contribute to the project.
>>> Please tell me this year's focus, and I have some ideas to share.
>>> 1. Upload more information such as developers' email, whether the packages
>>> installed is stable or not, the crash info.
>>> 2. Some statistics like popularity(recently installed packages, least
>>> popular packages)
>>> or the relationship between packages could be done in a cronjob.
>>> 3. Integrate with bugzilla. Developers can easily get the bugs of
>>> packages through
>>> statistic web interface or terminal, like PyBugz [1]. These will help
>>> developers find bugs
>>> of their packages easier.
>
> @Yanbo, thanks for your interest. Some of your ideas are great, but
> one of the things that we should keep in mind for this project is
> privacy. We should make sure that the user is aware of what data he is
> sending for our servers, and disable it if needed. The code should be
> able to handle this properly. I think that we shouldn't install a
> cronjob automatically, for example.
>
Of course, privacy is an important part of this project. Currently,
users can decide what data they want to sending by payload.cfg file
and it will be easily extended if we add more features. The client
generates a UUID for the host on installation and the server uses UUID
to identify the host and packages installed.
Maybe my expression was not clear, a cronjob could be installed on the
server, not on the host, because the extraction of some interesting
statistics, like relationship between packages, will be time-consuming
and we should preprocess them with a cronjob. These statistics may be
used for recommending related packages or finding bugs between
packages.
>>> About me:
>>> I’ve had 3 years’ experience in java development (Java Web and Swing), 2 years'
>>> in python development (Django) and also be fimilar with c/c++. I’ve been using
>>> gentoo for over 1 year and I really want to contribute a little to
>>> gentoo community.
>>>
>>> [1] http://www.liquidx.net/pybugz/
>>
>> I took a quick look at last year code, and I think the project would
>> be easier to extend if it was using django. This would avoid writing
>> custom SQL queries, etc... And you can probably steal some code from
>> euscan to do that.
>> Note that I'm not the mentor for this project and it is just an idea.
>
> @Corentin: During last year's gsoc I talked quickly with Vikraman and
> Alec about the web framework being used. I agree that web.py isn't the
> better tool for the job, I think I suggested flask that time, don't
> recall correctly.
>
Django is okey for me and I can learn something related from euscan.
Thanks for Corentin's advice.
> This topic should be well discussed, because any change of framework
> will spend a considerable amount of time, considering that we just
> have 3 months.
>
>> I think this years goals should be:
>> - stabilization of the client, and integration with the base system,
>> documentation during installation, etc...
>> - adding more features to the web interface, like what we can find on
>> http://popcon.debian.org/ (charts, sortable tables, overlays usage,
>> etc..)
>> - integration with other gentoo webservices (how ?)
Like PyBugz [1], developers can get bugs through terminal when they
query packages, rather than Bugzilla website. In addition, we can add
bugs' links to Bugzilla website in packages statistic page.
>> - putting the project in production of gentoo-infra
>>
>
> @Corentin: I tend to agree with these goals. Thanks for the answer.
>
> @Yanbo, your ideas are good, but keep security and privacy in mind. I
> think that these are the key points of the success of this project.
> Also, automatic bug reports are cool, but we need to stabilize the
> basic features first.
>
Yes, thanks for yours and Corentin's suggestion, stabilization and
adding more features to the web interface are most import to complete.
There has been one package named abrt [2], and this tool helps users
easily report bugs of packages to Bugzilla. My idea about bug report
is reporting package name with some comment to the server with
terminal like current gentoostats-send script when the package crashs.
On the server side, reports are analysed offline. We can extract the
usefull things and help developers find bugs. For example, most of
users build one package with the same CFLAG have the same problem or
install one package with another package together have the same crash
info.
> Best regards,
>
> --
> Rafael Goncalves Martins
> Gentoo Linux developer
> http://rafaelmartins.eng.br/
>
[1] http://packages.gentoo.org/package/www-client/pybugz
[2] http://packages.gentoo.org/package/app-admin/abrt
--
Thanks,
Yanbo Wang
^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2012-03-17 12:08 UTC | newest]
Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-03-29 13:24 [gentoo-soc] Package statistics reporting tool Vikraman
2011-03-29 13:53 ` Donnie Berkholz
2011-03-29 14:20 ` Brian Dolbec
2011-03-29 19:01 ` Yanbo Wang
2011-03-29 19:11 ` Vikraman Choudhury
2011-04-08 15:18 ` Vikraman Choudhury
-- strict thread matches above, loose matches on Subject: below --
2012-03-15 13:44 Yanbo Wang
2012-03-16 15:59 ` Corentin Chary
2012-03-16 23:37 ` Rafael Goncalves Martins
2012-03-17 10:49 ` Yanbo Wang
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox