* [gentoo-dev] rsync mirrorstats page (generation and parsing) @ 2009-07-05 0:44 Sebastian Pipping 2009-07-05 0:52 ` [gentoo-dev] mirrorselect: request for extraction of class MirrorParser Sebastian Pipping 0 siblings, 1 reply; 14+ messages in thread From: Sebastian Pipping @ 2009-07-05 0:44 UTC (permalink / raw To: Gentoo Dev Hello! When collecting information on the SYNC variable for my Summer of Code gentoo stats project I'd like to check if the URL in SYNC is publically known or some private/secret rsync mirror. The page behind http://mirrorstats.gentoo.org/rsync/ seems to show that publicly known list. As parsing the current page format is not trivial I'd like to know - if there is code parsing that page anywhere already - if the script generating that page could be extended to produce an XML equivalent of the provided information. Integration of such a parser into app-portage/mirrorselect could be interesting, too. Sebastian ^ permalink raw reply [flat|nested] 14+ messages in thread
* [gentoo-dev] mirrorselect: request for extraction of class MirrorParser @ 2009-07-05 0:52 ` Sebastian Pipping 2009-07-05 1:48 ` [gentoo-dev] rsync mirrorstats page (generation and parsing) Robin H. Johnson 2009-07-05 7:57 ` [gentoo-dev] mirrorselect: request for extraction of class MirrorParser Zac Medico 0 siblings, 2 replies; 14+ messages in thread From: Sebastian Pipping @ 2009-07-05 0:52 UTC (permalink / raw To: Gentoo Dev Hello! app-portage/mirrorselect is a single file Python program. It contains a class MirrorParser that parses mirrors.xml from the Gentoo website. I would like to use that code (unmodified) for my GSoC project. My request is to extract an extra file for that class from mirrorselect so the Gentoo part of smolt can depend on mirrorselect in the near future instead of shipping a "dependency fork" forever. mirrorselect's author seems to be Colin Kingsley (tercel@g.o.) whom I cannot find in the list of current gentoo devs. Please let me know if you can help/guide/.. with this. Sebastian ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] rsync mirrorstats page (generation and parsing) 2009-07-05 0:52 ` [gentoo-dev] mirrorselect: request for extraction of class MirrorParser Sebastian Pipping @ 2009-07-05 1:48 ` Robin H. Johnson 2009-07-05 13:52 ` Sebastian Pipping 2009-07-06 6:37 ` Alec Warner 2009-07-05 7:57 ` [gentoo-dev] mirrorselect: request for extraction of class MirrorParser Zac Medico 1 sibling, 2 replies; 14+ messages in thread From: Robin H. Johnson @ 2009-07-05 1:48 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 1941 bytes --] On Sun, Jul 05, 2009 at 02:44:07AM +0200, Sebastian Pipping wrote: > When collecting information on the SYNC variable for my Summer of Code > gentoo stats project I'd like to check if the URL in SYNC is publically > known or some private/secret rsync mirror. The page behind > http://mirrorstats.gentoo.org/rsync/ Mirrorstats is known to be out of date, because somebody needs to sit down and integrate it with the datasources, so manual updates aren't needed. Even better, would be hooking it into bouncer2 for the sentry output. It needs somebody to update it and hook at into the SOURCE of this data: http://www.gentoo.org/main/en/mirrors3.xml But wait, you say, that page is distfiles mirrors? Mirror-admin have a common data source, non-published as it contains private contact details for each administrator. From that data source, mirrors3 and rsync mirrors gets updated. mirrors.xml - old page, only used by mirrorselect now, manually updated. mirrors3.xml - new page, generated from internal dataset. mirrors2.xml - not a real page (See http://www.gentoo.org/main/en/mirrors2.xml?passthru=1 and the magic <mirrorlist> element. Relatedly, the original author of mirrorselect retired from Gentoo several years ago. The tools-portage team maintain it now, so you should co-operate with them. It would be nice if they implemented the mirrors3 usage too, I think mirror-admin asked them more than a year ago, but I can't find the bug. In the meantime, for your original question: > is the URL in SYNC public or private Simply check by matching against gentoo.org$ in the hostname part of the field. P.S. Please report empty SYNC variables too ;-). These turn up when users/devs have their tree coming from a VCS instead of rsync. -- Robin Hugh Johnson Gentoo Linux Developer & Infra Guy E-Mail : robbat2@gentoo.org GnuPG FP : 11AC BA4F 4778 E3F6 E4ED F38E B27B 944E 3488 4E85 [-- Attachment #2: Type: application/pgp-signature, Size: 330 bytes --] ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] rsync mirrorstats page (generation and parsing) 2009-07-05 1:48 ` [gentoo-dev] rsync mirrorstats page (generation and parsing) Robin H. Johnson @ 2009-07-05 13:52 ` Sebastian Pipping 2009-07-06 6:37 ` Alec Warner 1 sibling, 0 replies; 14+ messages in thread From: Sebastian Pipping @ 2009-07-05 13:52 UTC (permalink / raw To: gentoo-dev; +Cc: Zac Medico, mirror-admin Robin H. Johnson wrote: > On Sun, Jul 05, 2009 at 02:44:07AM +0200, Sebastian Pipping wrote: >> When collecting information on the SYNC variable for my Summer of Code >> gentoo stats project I'd like to check if the URL in SYNC is publically >> known or some private/secret rsync mirror. The page behind >> http://mirrorstats.gentoo.org/rsync/ > Mirrorstats is known to be out of date, because somebody needs to sit > down and integrate it with the datasources, so manual updates aren't > needed. Even better, would be hooking it into bouncer2 for the sentry > output. What are these datasources? What kind of integration are you thinking of? > It needs somebody to update it and hook at into the SOURCE of this data: > http://www.gentoo.org/main/en/mirrors3.xml > > But wait, you say, that page is distfiles mirrors? Mirror-admin have a > common data source, non-published as it contains private contact details > for each administrator. From that data source, mirrors3 and rsync > mirrors gets updated. I see. > mirrors.xml - old page, only used by mirrorselect now, manually updated. > mirrors3.xml - new page, generated from internal dataset. > mirrors2.xml - not a real page (See > http://www.gentoo.org/main/en/mirrors2.xml?passthru=1 and the magic > <mirrorlist> element. Compared to [..]/mirrors.xml?passthru=1 it seems to me that on mirror3 [..]/mirrors3.xml?passthru=1 passthru= is working in the opposite direction: 1 turns style sheets on 0 turns them off (default) The one for mirrors3 makes less sense to me. Is this inconsistency intended? > Relatedly, the original author of mirrorselect retired from Gentoo > several years ago. The tools-portage team maintain it now, so you should > co-operate with them. It would be nice if they implemented the mirrors3 > usage too, I think mirror-admin asked them more than a year ago, but I > can't find the bug. I agree that would be a good idea and another reason to touch mirrorselect. Does it have a source repo somewhere?, not seen any. > In the meantime, for your original question: >> is the URL in SYNC public or private > Simply check by matching against gentoo.org$ in the hostname part of the > field. Good idea, now implemented: http://git.goodpoint.de/?p=smolt-gentoo.git;a=commitdiff;h=aeb14433e7c29a6045fb702775a3455ebb61aa1d > P.S. Please report empty SYNC variables too ;-). These turn up when > users/devs have their tree coming from a VCS instead of rsync. Good point. Now also implemented, same commit as above. Sebastian ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] rsync mirrorstats page (generation and parsing) 2009-07-05 1:48 ` [gentoo-dev] rsync mirrorstats page (generation and parsing) Robin H. Johnson 2009-07-05 13:52 ` Sebastian Pipping @ 2009-07-06 6:37 ` Alec Warner 2009-07-06 7:28 ` Zac Medico 1 sibling, 1 reply; 14+ messages in thread From: Alec Warner @ 2009-07-06 6:37 UTC (permalink / raw To: gentoo-dev; +Cc: tools-portage On Sat, Jul 4, 2009 at 6:48 PM, Robin H. Johnson<robbat2@gentoo.org> wrote: > On Sun, Jul 05, 2009 at 02:44:07AM +0200, Sebastian Pipping wrote: >> When collecting information on the SYNC variable for my Summer of Code >> gentoo stats project I'd like to check if the URL in SYNC is publically >> known or some private/secret rsync mirror. The page behind >> http://mirrorstats.gentoo.org/rsync/ > Mirrorstats is known to be out of date, because somebody needs to sit > down and integrate it with the datasources, so manual updates aren't > needed. Even better, would be hooking it into bouncer2 for the sentry > output. > > It needs somebody to update it and hook at into the SOURCE of this data: > http://www.gentoo.org/main/en/mirrors3.xml > > But wait, you say, that page is distfiles mirrors? Mirror-admin have a > common data source, non-published as it contains private contact details > for each administrator. From that data source, mirrors3 and rsync > mirrors gets updated. > > mirrors.xml - old page, only used by mirrorselect now, manually updated. > mirrors3.xml - new page, generated from internal dataset. > mirrors2.xml - not a real page (See > http://www.gentoo.org/main/en/mirrors2.xml?passthru=1 and the magic > <mirrorlist> element. > > Relatedly, the original author of mirrorselect retired from Gentoo > several years ago. The tools-portage team maintain it now, so you should > co-operate with them. It would be nice if they implemented the mirrors3 > usage too, I think mirror-admin asked them more than a year ago, but I > can't find the bug. +cc tools-portage shit, I think I was the last one to touch that thing ;p Where is mirrorselect hiding these days, a private git repo? -A > > In the meantime, for your original question: >> is the URL in SYNC public or private > Simply check by matching against gentoo.org$ in the hostname part of the > field. > > P.S. Please report empty SYNC variables too ;-). These turn up when > users/devs have their tree coming from a VCS instead of rsync. > > -- > Robin Hugh Johnson > Gentoo Linux Developer & Infra Guy > E-Mail : robbat2@gentoo.org > GnuPG FP : 11AC BA4F 4778 E3F6 E4ED F38E B27B 944E 3488 4E85 > ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] rsync mirrorstats page (generation and parsing) 2009-07-06 6:37 ` Alec Warner @ 2009-07-06 7:28 ` Zac Medico 2009-07-06 7:31 ` Robin H. Johnson 2009-07-07 22:09 ` Sebastian Pipping 0 siblings, 2 replies; 14+ messages in thread From: Zac Medico @ 2009-07-06 7:28 UTC (permalink / raw To: Alec Warner, gentoo-dev; +Cc: tools-portage Alec Warner wrote: > Where is mirrorselect hiding these days, a private git repo? Yeah, here's the history since I started maintaining it: http://dev.gentoo.org/~zmedico/projects/mirrorselect.git/ -- Thanks, Zac ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] rsync mirrorstats page (generation and parsing) 2009-07-06 7:28 ` Zac Medico @ 2009-07-06 7:31 ` Robin H. Johnson 2009-07-07 23:32 ` Sebastian Pipping 2009-07-07 22:09 ` Sebastian Pipping 1 sibling, 1 reply; 14+ messages in thread From: Robin H. Johnson @ 2009-07-06 7:31 UTC (permalink / raw To: gentoo-dev; +Cc: Alec Warner, tools-portage [-- Attachment #1: Type: text/plain, Size: 567 bytes --] On Mon, Jul 06, 2009 at 12:28:59AM -0700, Zac Medico wrote: > Alec Warner wrote: > > Where is mirrorselect hiding these days, a private git repo? > Yeah, here's the history since I started maintaining it: > http://dev.gentoo.org/~zmedico/projects/mirrorselect.git/ I'll try to suck that down soon and build up a larger history with old tarballs, and then push it somewhere useful. Thanks for it. -- Robin Hugh Johnson Gentoo Linux Developer & Infra Guy E-Mail : robbat2@gentoo.org GnuPG FP : 11AC BA4F 4778 E3F6 E4ED F38E B27B 944E 3488 4E85 [-- Attachment #2: Type: application/pgp-signature, Size: 330 bytes --] ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] rsync mirrorstats page (generation and parsing) 2009-07-06 7:31 ` Robin H. Johnson @ 2009-07-07 23:32 ` Sebastian Pipping 0 siblings, 0 replies; 14+ messages in thread From: Sebastian Pipping @ 2009-07-07 23:32 UTC (permalink / raw To: gentoo-dev; +Cc: Alec Warner, tools-portage Robin H. Johnson wrote: > I'll try to suck that down soon and build up a larger history with old > tarballs, and then push it somewhere useful. To re-build mirrorselect's complete history we'd need the original tarballs for each line starting with "[ ] " below. Please let us now if you have some of these files on some harddisc of yours. Here's the list [F] 0.1 [F] 0.1-r1 [F] 0.1-r2 [F] 0.2 [F] 0.2-r1 [F] 0.3 [ ] 0.4 [ ] 0.5 [ ] 0.6 [ ] 0.7 [ ] 0.7-r1 [ ] 0.8 [ ] 0.81 [ ] 0.82 [R] 0.82-r1 [R] 0.82-r2 [R] 0.82-r3 [R] 0.83 [R] 0.84 [ ] 0.85 [ ] 0.86 [R] 0.87 [R] 0.89 [ ] 1.0 [ ] 1.0.1 [ ] 1.0.2 [ ] 1.0.3 [ ] 1.0.4_rc2 [ ] 1.1 [ ] 1.1.1 [ ] 1.1.2 [ ] 1.1.3 [ ] 1.1.4 [ ] 1.1.5 [R] 1.1.6 [R] 1.1.7 [R] 1.2 [R] 1.3 [R] 1.4 [R] 1.4.1 [R] 1.4.2 F = files dir in http://sources.gentoo.org/viewcvs.py/gentoo-x86/app-admin/mirrorselect/?hideattic=0 R = http://dev.gentoo.org/~robbat2/mirrorselect-archive/ Sebastian ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] rsync mirrorstats page (generation and parsing) 2009-07-06 7:28 ` Zac Medico 2009-07-06 7:31 ` Robin H. Johnson @ 2009-07-07 22:09 ` Sebastian Pipping 2009-07-08 0:09 ` Robin H. Johnson 1 sibling, 1 reply; 14+ messages in thread From: Sebastian Pipping @ 2009-07-07 22:09 UTC (permalink / raw To: gentoo-dev; +Cc: Alec Warner, tools-portage Zac Medico wrote: > Yeah, here's the history since I started maintaining it: > > http://dev.gentoo.org/~zmedico/projects/mirrorselect.git/ I've been adding mirror3.xml support to the above today. Repo over here: http://git.goodpoint.de/?p=mirrorselect.git;a=summary Sebastian ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] rsync mirrorstats page (generation and parsing) 2009-07-07 22:09 ` Sebastian Pipping @ 2009-07-08 0:09 ` Robin H. Johnson 2009-07-18 12:46 ` Sebastian Pipping 0 siblings, 1 reply; 14+ messages in thread From: Robin H. Johnson @ 2009-07-08 0:09 UTC (permalink / raw To: gentoo-dev; +Cc: Alec Warner, tools-portage [-- Attachment #1: Type: text/plain, Size: 767 bytes --] On Wed, Jul 08, 2009 at 12:09:21AM +0200, Sebastian Pipping wrote: > Zac Medico wrote: > > Yeah, here's the history since I started maintaining it: > > http://dev.gentoo.org/~zmedico/projects/mirrorselect.git/ > I've been adding mirror3.xml support to the above today. > Repo over here: > http://git.goodpoint.de/?p=mirrorselect.git;a=summary Nice, thanks for the start. The mirrors3 format was designed to explicitly make it clear: - Geographic structure: region -> country -> mirror - Separate protocols - IPv4, IPv6: mirrors may have either or both. - Easy separation of partial mirrors. -- Robin Hugh Johnson Gentoo Linux Developer & Infra Guy E-Mail : robbat2@gentoo.org GnuPG FP : 11AC BA4F 4778 E3F6 E4ED F38E B27B 944E 3488 4E85 [-- Attachment #2: Type: application/pgp-signature, Size: 330 bytes --] ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] rsync mirrorstats page (generation and parsing) 2009-07-08 0:09 ` Robin H. Johnson @ 2009-07-18 12:46 ` Sebastian Pipping 2009-07-19 20:10 ` Zac Medico 0 siblings, 1 reply; 14+ messages in thread From: Sebastian Pipping @ 2009-07-18 12:46 UTC (permalink / raw To: gentoo-dev; +Cc: Alec Warner, tools-portage Hello! So what's needed to get a new mirrorselect release out? Sebastian ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] rsync mirrorstats page (generation and parsing) 2009-07-18 12:46 ` Sebastian Pipping @ 2009-07-19 20:10 ` Zac Medico 2009-07-19 20:20 ` Sebastian Pipping 0 siblings, 1 reply; 14+ messages in thread From: Zac Medico @ 2009-07-19 20:10 UTC (permalink / raw To: Sebastian Pipping; +Cc: gentoo-dev, Alec Warner, tools-portage Sebastian Pipping wrote: > Hello! > > > So what's needed to get a new mirrorselect release out? Are all of your changes here? git://git.goodpoint.de/mirrorselect.git Now we just need to create an ebuild to install it, and put it in the tree. You can file a bug for that and assign it to tools-portage. > > > > Sebastian -- Thanks, Zac ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] rsync mirrorstats page (generation and parsing) 2009-07-19 20:10 ` Zac Medico @ 2009-07-19 20:20 ` Sebastian Pipping 0 siblings, 0 replies; 14+ messages in thread From: Sebastian Pipping @ 2009-07-19 20:20 UTC (permalink / raw To: Zac Medico; +Cc: gentoo-dev, Alec Warner, tools-portage Zac Medico wrote: >> So what's needed to get a new mirrorselect release out? > > Are all of your changes here? > > git://git.goodpoint.de/mirrorselect.git Yes. > Now we just need to create an ebuild to install it, and put it in > the tree. You can file a bug for that and assign it to tools-portage. Done. https://bugs.gentoo.org/show_bug.cgi?id=278351 Sebastian ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [gentoo-dev] mirrorselect: request for extraction of class MirrorParser 2009-07-05 0:52 ` [gentoo-dev] mirrorselect: request for extraction of class MirrorParser Sebastian Pipping 2009-07-05 1:48 ` [gentoo-dev] rsync mirrorstats page (generation and parsing) Robin H. Johnson @ 2009-07-05 7:57 ` Zac Medico 1 sibling, 0 replies; 14+ messages in thread From: Zac Medico @ 2009-07-05 7:57 UTC (permalink / raw To: gentoo-dev Sebastian Pipping wrote: > Hello! > > > app-portage/mirrorselect is a single file Python program. > It contains a class MirrorParser that parses mirrors.xml from the Gentoo > website. I would like to use that code (unmodified) for my GSoC > project. > > My request is to extract an extra file for that class from > mirrorselect so the Gentoo part of smolt can depend on mirrorselect > in the near future instead of shipping a "dependency fork" forever. > > mirrorselect's author seems to be Colin Kingsley (tercel@g.o.) whom > I cannot find in the list of current gentoo devs. > > Please let me know if you can help/guide/.. with this. > > > > Sebastian It's only about 30 lines of code, so I wouldn't worry too much about this "dependency fork" thing. Maybe if it was larger then I would. So, I suggest that you simply copy it and include a comment about its origin. -- Thanks, Zac ^ permalink raw reply [flat|nested] 14+ messages in thread
end of thread, other threads:[~2009-07-19 20:20 UTC | newest] Thread overview: 14+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2009-07-05 0:44 [gentoo-dev] rsync mirrorstats page (generation and parsing) Sebastian Pipping 2009-07-05 0:52 ` [gentoo-dev] mirrorselect: request for extraction of class MirrorParser Sebastian Pipping 2009-07-05 1:48 ` [gentoo-dev] rsync mirrorstats page (generation and parsing) Robin H. Johnson 2009-07-05 13:52 ` Sebastian Pipping 2009-07-06 6:37 ` Alec Warner 2009-07-06 7:28 ` Zac Medico 2009-07-06 7:31 ` Robin H. Johnson 2009-07-07 23:32 ` Sebastian Pipping 2009-07-07 22:09 ` Sebastian Pipping 2009-07-08 0:09 ` Robin H. Johnson 2009-07-18 12:46 ` Sebastian Pipping 2009-07-19 20:10 ` Zac Medico 2009-07-19 20:20 ` Sebastian Pipping 2009-07-05 7:57 ` [gentoo-dev] mirrorselect: request for extraction of class MirrorParser Zac Medico
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox