public inbox for gentoo-dev@lists.gentoo.org
From: malc <mlashley@gmail.com>
To: gentoo-dev@lists.gentoo.org
Subject: Re: [gentoo-dev] Re: [gentoo-project] Portage repo usage survey and change evaluation
Date: Wed, 2 Mar 2016 19:48:44 +0000
Message-ID: <CAPkQJpR8jbLhpAJGB=zGVoqDJHoLu1yYjhN5asqsJTT=WydJQA@mail.gmail.com>
In-Reply-To: <22231.11642.809779.509501@a1i15.kph.uni-mainz.de>

I still fail to understand the bikeshedding here - you really don't
need a git checkout to get something akin to a changelog. Use the
github API directly...

The following 1-liner could be trivially productised (maybe even parse
$PWD to set the path argument...)

curl -s 'https://api.github.com/repos/gentoo/gentoo/commits?path=app-admin/eselect' \
  | perl -MJSON -e 'foreach $i (@{decode_json(join("", <STDIN>))}) {
      print "$i->{commit}->{author}->{name} - ",
            "$i->{commit}->{author}->{date}\n\n $i->{commit}->{message}\n"; }'

Yeah - it's not quite as pretty as our current Changelog, but date,
author/committer, commit-msg etc. are all there and you can filter by
path just the same as you would with native git log...
You could parse the local $PORTDIR/metadata/timestamp* and add an
'until' param to the URL to exclude commits newer than the point the
user has rsync'd up to...
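
A rough sketch of how the two ideas above (deriving the path argument from
the working directory, and adding an 'until' param from the local timestamp)
might fit together when building the URL. Everything here is illustrative:
the function names are hypothetical, and the code assumes the timestamp file
holds a single RFC 2822 style line such as "Wed, 02 Mar 2016 19:00:01 +0000",
which may not match every timestamp* file in a given tree.

```python
from datetime import timezone
from email.utils import parsedate_to_datetime
from pathlib import PurePosixPath
from urllib.parse import urlencode

# Commits endpoint used in the one-liner above.
API = "https://api.github.com/repos/gentoo/gentoo/commits"


def package_path(cwd, portdir):
    """Return 'category/package' for a directory inside the tree (hypothetical helper)."""
    parts = PurePosixPath(cwd).relative_to(portdir).parts
    if len(parts) < 2:
        raise ValueError(f"{cwd} is not inside a package directory")
    return "/".join(parts[:2])


def until_param(timestamp_text):
    """Convert a timestamp line to ISO 8601 UTC.

    Assumes an RFC 2822 style date with an explicit UTC offset.
    """
    when = parsedate_to_datetime(timestamp_text.strip())
    return when.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


def commits_url(cwd, portdir, timestamp_text):
    """Build the commits URL filtered by package path and sync time."""
    query = {
        "path": package_path(cwd, portdir),
        "until": until_param(timestamp_text),
    }
    # Keep '/' and ':' unescaped so the URL stays readable.
    return f"{API}?{urlencode(query, safe='/:')}"
```

For example, commits_url("/usr/portage/app-admin/eselect", "/usr/portage",
"Wed, 02 Mar 2016 19:00:01 +0000") would yield the same URL as in the
one-liner with "&until=2016-03-02T19:00:01Z" appended; the result can then be
fetched and formatted exactly as before.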

Cheers,
malc.


On Wed, Mar 2, 2016 at 6:14 PM, Ulrich Mueller <ulm@gentoo.org> wrote:
>>>>>> On Wed, 2 Mar 2016, Ian Stakenvicius wrote:
>
>> On 02/03/16 03:50 AM, Ulrich Mueller wrote:
>>> How is it possible that we have 52 MiB of ChangeLog entries
>>> generated in the 0.5 years since the git conversion, whereas we had
>>> only a total of 103 MiB in the 13.5 years since ChangeLogs were
>>> introduced in 2002? Certainly our commit rate hasn't increased by
>>> more than an order of magnitude in the last half year?
>
>> The content of a changelog entry from git is a lot bigger than it
>> was just from echangelog, isn't it?
>
> Not by a factor of ten.
>
> I've investigated a bit, and the main problem seems to be that for git
> commits that extend over several directories, the commit message is
> duplicated into many ChangeLog entries.
>
> For example, the message of the initial commit 56bd759 appears in some
> 18000 files, which accounts for 25 MiB. Then there is commit eaaface
> and its revert 1bfb585, again appearing in almost all ChangeLog files
> in the tree. These account for another 9 MiB. Last example, commit
> 8849b09, another 2 MiB.
>
> So about 70% of the size is caused by these 4 tree-wide commits alone.
> However, there are many more examples of duplication on a smaller
> scale.
>
> Ulrich

