public inbox for gentoo-portage-dev@lists.gentoo.org
 help / color / mirror / Atom feed
* [gentoo-portage-dev] changelog encoding
@ 2004-10-08  4:19 Brian
  2004-10-08 12:31 ` Marius Mauch
  2004-10-08 17:34 ` George Shapovalov
  0 siblings, 2 replies; 9+ messages in thread
From: Brian @ 2004-10-08  4:19 UTC (permalink / raw
  To: gentoo-portage-dev

What is the official encoding method(s) for the changelogs.   It has
been reported that porthole often fails getting the changelogs due to
the encoding.  Currently it is assuming ascii.   Many are reported to be
iso-8859-1.
-- 
Brian <dol-sen@telus.net>


--
gentoo-portage-dev@gentoo.org mailing list


^ permalink raw reply	[flat|nested] 9+ messages in thread
* Fw: [gentoo-portage-dev] changelog encoding
@ 2004-10-08 15:29 Marius Mauch
  2004-10-08 16:26 ` Luke-Jr
  0 siblings, 1 reply; 9+ messages in thread
From: Marius Mauch @ 2004-10-08 15:29 UTC (permalink / raw
  To: gentoo-portage-dev

[-- Attachment #1: Type: text/plain, Size: 1861 bytes --]



Begin forwarded message:

Date: Fri, 8 Oct 2004 13:56:17 +0100
From: Ciaran McCreesh <ciaranm@gentoo.org>
To: Marius Mauch <genone@gentoo.org>
Cc: gentoo-portage-dev@lists.gentoo.org
Subject: Re: [gentoo-portage-dev] changelog encoding


[ not sure if I can post to gentoo-portage-dev, please forward on if
not... ]

On Fri, 8 Oct 2004 14:31:52 +0200 Marius Mauch <genone@gentoo.org>
wrote:
| On 10/07/04  Brian wrote:
| > What is the official encoding method(s) for the changelogs.   It has
| > been reported that porthole often fails getting the changelogs due
| > to the encoding.  Currently it is assuming ascii.   Many are
| > reported to be iso-8859-1.
| 
| I don't think we have an official encoding, but I think ciaranm knows
| a bit more about that issue.

Yup. We *need* to have an official encoding. Reason being, at least one
developer has a non-(ASCII as in characters 0..126 only) character in
their name. Said encoding should also apply to ebuilds, but not to
files/ entries (I could give the lengthy explanation if anyone really
wants to know, but basically certain things would break).

I've been whinging about this on and off for about a year now, and every
time it's been dismissed as irrelevant :)

If we're going to standardise on an encoding, it's got to be UTF-8.
iso-8859-1 is not sufficient to represent every developer (and potential
patch contributor)'s name correctly. UTF-16 and plain old four byte
unicode aren't compatible with our existing files (in UTF-8, characters
1 to 126 are the same as in regular ASCII). Yes, UTF-8 kinda sucks in
terms of space when encoding japanese or russian characters, but since
these will be a rare occurance it's not really a problem.

-- 
Ciaran McCreesh : Gentoo Developer (Sparc, MIPS, Vim, Fluxbox)
Mail            : ciaranm at gentoo.org
Web             : http://dev.gentoo.org/~ciaranm



[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2004-10-09 17:38 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2004-10-08  4:19 [gentoo-portage-dev] changelog encoding Brian
2004-10-08 12:31 ` Marius Mauch
2004-10-08 17:34 ` George Shapovalov
  -- strict thread matches above, loose matches on Subject: below --
2004-10-08 15:29 Fw: " Marius Mauch
2004-10-08 16:26 ` Luke-Jr
2004-10-08 16:40   ` Ciaran McCreesh
2004-10-09  1:05     ` Ed Grimm
2004-10-09 13:53       ` Ciaran McCreesh
2004-10-09 15:24         ` Brian
2004-10-09 17:10           ` Ciaran McCreesh
2004-10-09 17:38             ` Marius Mauch

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox