From: malc <mlashley@gmail.com>
To: gentoo-dev@lists.gentoo.org
Subject: Re: [gentoo-dev] Re: [gentoo-project] Portage repo usage survey and change evaluation
Date: Wed, 2 Mar 2016 19:48:44 +0000
Message-ID: <CAPkQJpR8jbLhpAJGB=zGVoqDJHoLu1yYjhN5asqsJTT=WydJQA@mail.gmail.com>
In-Reply-To: <22231.11642.809779.509501@a1i15.kph.uni-mainz.de>
I still fail to understand the bikeshedding here - you really don't
need a git checkout to get something akin to a changelog. Use the
GitHub API directly...
The following one-liner could be trivially productised (maybe even parse
$PWD to set the path argument...)
curl -s 'https://api.github.com/repos/gentoo/gentoo/commits?path=app-admin/eselect' \
  | perl -MJSON -e 'foreach my $i (@{decode_json(join("", <STDIN>))})
    { print "$i->{commit}->{author}->{name} - $i->{commit}->{author}->{date}\n\n $i->{commit}->{message}\n"; }'
Yeah - it's not quite as pretty as our current ChangeLog, but date,
author/committer, commit-msg etc. are all there and you can filter by
path just the same as you would with native git log...
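For what it's worth, a productised version might look roughly like the sketch below, written in Python rather than Perl. It is only an illustration: the function names are made up for this example, and it assumes the current directory sits two levels below the tree root (category/package), which is how the path argument is guessed from $PWD.

```python
import json
import os
import urllib.parse
import urllib.request

API = "https://api.github.com/repos/gentoo/gentoo/commits"


def package_path(cwd):
    """Guess 'category/package' from the last two components of cwd (assumption)."""
    parts = os.path.normpath(cwd).split(os.sep)
    return "/".join(parts[-2:])


def commits_url(path):
    """Build the commits URL filtered by path, as in the one-liner."""
    return API + "?" + urllib.parse.urlencode({"path": path}, safe="/")


def format_commits(commits):
    """Render decoded API output as 'author - date', blank line, message."""
    out = []
    for c in commits:
        author = c["commit"]["author"]
        out.append(f"{author['name']} - {author['date']}\n\n {c['commit']['message']}\n")
    return "".join(out)


def changelog(cwd=None):
    """Fetch and format the log for the package in cwd (needs network access)."""
    url = commits_url(package_path(cwd or os.getcwd()))
    with urllib.request.urlopen(url) as resp:
        return format_commits(json.load(resp))
```

Running print(changelog()) from inside a package directory would then print the same author, date and message fields as the one-liner.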
You could parse the local $PORTDIR/metadata/timestamp* and add an
'until' param to the URL to filter out commits newer than the point a
user has rsync'd up to...
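A minimal sketch of that timestamp idea, in Python. It assumes metadata/timestamp.chk holds a single RFC 2822 style date line (e.g. "Wed, 02 Mar 2016 19:30:01 +0000") and that the API wants 'until' as an ISO 8601 timestamp; the function names are hypothetical.

```python
import os
import urllib.parse
from datetime import timezone
from email.utils import parsedate_to_datetime

API = "https://api.github.com/repos/gentoo/gentoo/commits"


def until_from_timestamp(text):
    """Convert an RFC 2822 style date line to ISO 8601 in UTC."""
    dt = parsedate_to_datetime(text.strip())
    return dt.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


def commits_url_until(path, timestamp_text):
    """Build the commits URL, limited to commits up to the rsync timestamp."""
    params = {"path": path, "until": until_from_timestamp(timestamp_text)}
    return API + "?" + urllib.parse.urlencode(params, safe="/:")


def read_timestamp(portdir):
    """Read the timestamp line from the local tree (file name is an assumption)."""
    with open(os.path.join(portdir, "metadata", "timestamp.chk")) as fh:
        return fh.readline()
```

Something like commits_url_until("app-admin/eselect", read_timestamp(portdir)), with portdir set to the local tree location, would then give a URL that stops at the user's last sync.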
Cheers,
malc.
On Wed, Mar 2, 2016 at 6:14 PM, Ulrich Mueller <ulm@gentoo.org> wrote:
>>>>>> On Wed, 2 Mar 2016, Ian Stakenvicius wrote:
>
>> On 02/03/16 03:50 AM, Ulrich Mueller wrote:
>>> How is it possible that we have 52 MiB of ChangeLog entries
>>> generated in the 0.5 years since the git conversion, whereas we had
>>> only a total of 103 MiB in the 13.5 years since ChangeLogs were
>>> introduced in 2002? Certainly our commit rate hasn't increased by
>>> more than an order of magnitude in the last half year?
>
>> The content of a changelog entry from git is a lot bigger than it
>> was just from echangelog, isn't it?
>
> Not by a factor of ten.
>
> I've investigated a bit, and the main problem seems to be that for git
> commits that extend over several directories, the commit message is
> duplicated into many ChangeLog entries.
>
> For example, the message of the initial commit 56bd759 appears in some
> 18000 files, which accounts for 25 MiB. Then there is commit eaaface
> and its revert 1bfb585, again appearing in almost all ChangeLog files
> in the tree. These account for another 9 MiB. Last example, commit
> 8849b09, another 2 MiB.
>
> So about 70% of the size is caused by these 4 tree-wide commits alone.
> However, there are many more examples of duplication on a smaller
> scale.
>
> Ulrich