* [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) @ 2003-08-13 23:34 Mike Frysinger 2003-08-13 23:38 ` Heinrich Wendel 0 siblings, 1 reply; 25+ messages in thread From: Mike Frysinger @ 2003-08-13 23:34 UTC (permalink / raw To: gentoo-dev http://bugs.gentoo.org/show_bug.cgi?id=26561 <summary> At the moment the relevant USE flags seem to be: cjk - Adds support for Multi-byte character languages (Chinese, Japanese, Korean) nls - Adds Native Language Support (using gettext - GNU locale utilities) If I want to enable unicode support or similar for an application (say, vim) only cjk seems applicable at the moment, and is a tad misleading (there are hundreds of other languages in unicode!). </summary> please post your input ... personally i'd prefer if everyone just learned ASCII compliant languages ;) -mike -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-08-13 23:34 [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) Mike Frysinger @ 2003-08-13 23:38 ` Heinrich Wendel 2003-08-13 23:40 ` Heinrich Wendel 0 siblings, 1 reply; 25+ messages in thread From: Heinrich Wendel @ 2003-08-13 23:38 UTC (permalink / raw To: gentoo-dev On Thursday 14 August 2003 01:34, Mike Frysinger wrote: > http://bugs.gentoo.org/show_bug.cgi?id=26561 > <summary> > At the moment the relevant USE flags seem to be: > cjk - Adds support for Multi-byte character languages (Chinese, Japanese, > Korean) > > nls - Adds Native Language Support (using gettext - GNU locale utilities) > > If I want to enable unicode support or similar for an application (say, > vim) only cjk seems applicable at the moment, and is a tad misleading > (there are hundreds of other languages in unicode!). > </summary> > > please post your input ... personally i'd prefer if everyone just learned > ASCII compliant languages ;) hehe ;) /me votes for unicode use-flag > -mike mfg, Heinrich :) > > -- > gentoo-dev@gentoo.org mailing list -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-08-13 23:38 ` Heinrich Wendel @ 2003-08-13 23:40 ` Heinrich Wendel 2003-10-30 11:44 ` Mamoru KOMACHI 0 siblings, 1 reply; 25+ messages in thread From: Heinrich Wendel @ 2003-08-13 23:40 UTC (permalink / raw To: gentoo-dev On Thursday 14 August 2003 01:38, Heinrich Wendel wrote: > On Thursday 14 August 2003 01:34, Mike Frysinger wrote: > > http://bugs.gentoo.org/show_bug.cgi?id=26561 > > <summary> > > At the moment the relevant USE flags seem to be: > > cjk - Adds support for Multi-byte character languages (Chinese, > > Japanese, Korean) > > > > nls - Adds Native Language Support (using gettext - GNU locale > > utilities) > > > > If I want to enable unicode support or similar for an application (say, > > vim) only cjk seems applicable at the moment, and is a tad misleading > > (there are hundreds of other languages in unicode!). > > </summary> > > > > please post your input ... personally i'd prefer if everyone just learned > > ASCII compliant languages ;) > > hehe ;) > > /me votes for unicode use-flag should also be activated in all profiles by default. > > > -mike > > mfg, Heinrich :) > > > -- > > gentoo-dev@gentoo.org mailing list > > -- > gentoo-dev@gentoo.org mailing list -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-08-13 23:40 ` Heinrich Wendel @ 2003-10-30 11:44 ` Mamoru KOMACHI 2003-10-30 12:26 ` Paul de Vrieze ` (3 more replies) 0 siblings, 4 replies; 25+ messages in thread From: Mamoru KOMACHI @ 2003-10-30 11:44 UTC (permalink / raw To: gentoo-dev Hi all, At Thu, 14 Aug 2003 01:40:40 +0200, Heinrich Wendel wrote: > On Thursday 14 August 2003 01:38, Heinrich Wendel wrote: > > On Thursday 14 August 2003 01:34, Mike Frysinger wrote: > > > http://bugs.gentoo.org/show_bug.cgi?id=26561 > > > <summary> > > > If I want to enable unicode support or similar for an application (say, > > > vim) only cjk seems applicable at the moment, and is a tad misleading > > > (there are hundreds of other languages in unicode!). > > > </summary> > > /me votes for unicode use-flag > > should also be activated in all profiles by default. Any objection if I add unicode USE flag (and make it default in all profiles) ? We cannot simply convert cjk USE flag to unicode, but in some cases unicode USE flag seems appropriate and we definitely need it. (currently dev-python/wxPython and x11-libs/wxGTK have unicode local USE flag and others use cjk or nls for unicode support) If nobody objects (and hopefully many approve ;) ) I'll create it this weekend. -- Mamoru KOMACHI <usata@gentoo.org> http://www.gentoo.org/~usata/ -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 11:44 ` Mamoru KOMACHI @ 2003-10-30 12:26 ` Paul de Vrieze 2003-10-30 12:41 ` Spider 2003-10-30 12:40 ` Bartosch Pixa ` (2 subsequent siblings) 3 siblings, 1 reply; 25+ messages in thread From: Paul de Vrieze @ 2003-10-30 12:26 UTC (permalink / raw To: gentoo-dev -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On Thursday 30 October 2003 12:44, Mamoru KOMACHI wrote: > > Any objection if I add unicode USE flag (and make it default in all > profiles) ? We cannot simply convert cjk USE flag to unicode, but in > some cases unicode USE flag seems appropriate and we definitely need > it. (currently dev-python/wxPython and x11-libs/wxGTK have unicode > local USE flag and others use cjk or nls for unicode support) > > If nobody objects (and hopefully many approve ;) ) I'll create it this > weekend. That seems like a good idea. A thing a bit unrelated is that it might be a good idea to create some documentation on how to setup gentoo for foreign languages. Paul - -- Paul de Vrieze Gentoo Developer Mail: pauldv@gentoo.org Homepage: http://www.devrieze.net -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.3 (GNU/Linux) iD8DBQE/oQN5bKx5DBjWFdsRAj4BAJ9JnywETKx8XphaJ2vvGGxo+L36CACfdO0k ZOBIBNvR3eKjWeiFbGA+yT4= =Aq26 -----END PGP SIGNATURE----- -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 12:26 ` Paul de Vrieze @ 2003-10-30 12:41 ` Spider 2003-10-30 13:26 ` Paul de Vrieze 2003-10-30 14:09 ` Mamoru KOMACHI 0 siblings, 2 replies; 25+ messages in thread From: Spider @ 2003-10-30 12:41 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 1126 bytes --] begin quote On Thu, 30 Oct 2003 13:26:17 +0100 Paul de Vrieze <pauldv@gentoo.org> wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > On Thursday 30 October 2003 12:44, Mamoru KOMACHI wrote: > > > > Any objection if I add unicode USE flag (and make it default in all > > profiles) ? We cannot simply convert cjk USE flag to unicode, but in > > some cases unicode USE flag seems appropriate and we definitely need > > it. (currently dev-python/wxPython and x11-libs/wxGTK have unicode > > local USE flag and others use cjk or nls for unicode support) > > > > If nobody objects (and hopefully many approve ;) ) I'll create it > > this weekend. unicode, or utf-8 ? There ought to be some magic to convert locales to utf-8 as well if this is enabled. > A thing a bit unrelated is that it might be a good idea to create some > documentation on how to setup gentoo for foreign languages. like this : http://www.gentoo.org/doc/en/guide-localization.xml //Spider -- begin .signature This is a .signature virus! Please copy me into your .signature! See Microsoft KB Article Q265230 for more information. end [-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 12:41 ` Spider @ 2003-10-30 13:26 ` Paul de Vrieze 2003-10-30 13:43 ` Spider 2003-10-30 14:09 ` Mamoru KOMACHI 1 sibling, 1 reply; 25+ messages in thread From: Paul de Vrieze @ 2003-10-30 13:26 UTC (permalink / raw To: gentoo-dev -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On Thursday 30 October 2003 13:41, Spider wrote: > begin quote > On Thu, 30 Oct 2003 13:26:17 +0100 > > like this : > http://www.gentoo.org/doc/en/guide-localization.xml > Yes, (it looked to me like a document on translating the guides). I would also like sections on things like input methods etc. Paul - -- Paul de Vrieze Gentoo Developer Mail: pauldv@gentoo.org Homepage: http://www.devrieze.net -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.3 (GNU/Linux) iD8DBQE/oRGhbKx5DBjWFdsRAkxwAJ0ROxNPnFnTDvhGYLvn52B51Wk3PQCguwtY UF8+XZ3zEOZRZNniPHgKWmY= =8fn5 -----END PGP SIGNATURE----- -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 13:26 ` Paul de Vrieze @ 2003-10-30 13:43 ` Spider 2003-10-31 13:44 ` Svyatogor 0 siblings, 1 reply; 25+ messages in thread From: Spider @ 2003-10-30 13:43 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 539 bytes --] begin quote On Thu, 30 Oct 2003 14:26:42 +0100 Paul de Vrieze <pauldv@gentoo.org> wrote: > > http://www.gentoo.org/doc/en/guide-localization.xml > > Yes, (it looked to me like a document on translating the guides). I > would also like sections on things like input methods etc. True, that would be nice addition to it. Anyone here who has experience with that sort of thing? //Spider -- begin .signature This is a .signature virus! Please copy me into your .signature! See Microsoft KB Article Q265230 for more information. end [-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 13:43 ` Spider @ 2003-10-31 13:44 ` Svyatogor 0 siblings, 0 replies; 25+ messages in thread From: Svyatogor @ 2003-10-31 13:44 UTC (permalink / raw To: gentoo-dev On Thu, 30 Oct 2003 14:43:59 +0100 Spider <spider@gentoo.org> wrote: > begin quote > On Thu, 30 Oct 2003 14:26:42 +0100 > Paul de Vrieze <pauldv@gentoo.org> wrote: > > > > http://www.gentoo.org/doc/en/guide-localization.xml > > > > Yes, (it looked to me like a document on translating the guides). I > > would also like sections on things like input methods etc. > > > True, that would be nice addition to it. > > Anyone here who has experience with that sort of thing? I am currently working on a document explaning how to set up Unicode locale convert whatever you need to UTF-8 and configure various apps. I anyone could write a section on input methods, (got no idea about how this stuff works) I'll be happy to include it. -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 12:41 ` Spider 2003-10-30 13:26 ` Paul de Vrieze @ 2003-10-30 14:09 ` Mamoru KOMACHI 2003-10-30 16:18 ` Alastair Tse 1 sibling, 1 reply; 25+ messages in thread From: Mamoru KOMACHI @ 2003-10-30 14:09 UTC (permalink / raw To: gentoo-dev At Thu, 30 Oct 2003 13:41:51 +0100, Spider wrote: > unicode, or utf-8 ? I personally vote for unicode, but if it is going to be enabled by default (i.e. Gentoo support UTF-8 immediately after installation), utf8 seems adequate rather than unicode. I just don't want to add utf7, utf16, utf32 USE flags for each encoding but have unicode USE flag which will support Unicode encodings (if any). > There ought to be some magic to convert locales to utf-8 as well if > this is enabled. Even if the USE flag is enabled by default, it doesn't take any effect unless each ebuild actually uses the USE flag. It may break wxGTK related ebuilds as darkspecter suggested (thanks for the comment) and I'm thinking that it isn't time to make the USE flag default yet (liquidx is working on getting UTF-8 down on Gentoo system). Anyhow, creating unicode/utf8 USE flag and making it default are two different matters and we can discuss them separately. -- Mamoru KOMACHI <usata@gentoo.org> http://www.gentoo.org/~usata/ -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 14:09 ` Mamoru KOMACHI @ 2003-10-30 16:18 ` Alastair Tse 2003-10-30 23:58 ` Spider 0 siblings, 1 reply; 25+ messages in thread From: Alastair Tse @ 2003-10-30 16:18 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 1636 bytes --] On Thu, 2003-10-30 at 14:09, Mamoru KOMACHI wrote: > > unicode, or utf-8 ? > > I personally vote for unicode, but if it is going to be enabled by > default (i.e. Gentoo support UTF-8 immediately after installation), > utf8 seems adequate rather than unicode. I just don't want to add > utf7, utf16, utf32 USE flags for each encoding but have unicode USE > flag which will support Unicode encodings (if any). Exactly. "unicode" should denote adding support for UCS-* or UTF-* encodings and internal representations. > > There ought to be some magic to convert locales to utf-8 as well if > > this is enabled. > I'm thinking that it isn't time to make the USE flag default yet > (liquidx is working on getting UTF-8 down on Gentoo system). Anyhow, > creating unicode/utf8 USE flag and making it default are two different > matters and we can discuss them separately. From my point of view, "unicode" USE flag encompasses UTF-8 support. I think they are the essentially the same problem. Think of it this way, treating them as separate issues would be like treating iso2022-jp as a separate issue to Japanese support. Anyhow, the generation of UTF-8 locales can be accomplished by a patch to glibc which could be applied depending on the unicode USE flag. I've got one such patch around, I'll submit a bug to track it in bugzilla soon. Cheers, -- Alastair 'liquidx' Tse >> Gentoo Developer >> http://www.liquidx.net/ | http://dev.gentoo.org/~liquidx/ >> GPG Key : http://dev.gentoo.org/~liquidx/liquidx_gentoo_org.asc >> FingerPrint : 579A 9B0E 43E8 0E40 EE93 BB1C 38CE 1C7B 3907 14F6 [-- Attachment #2: This is a digitally signed message part --] [-- Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 16:18 ` Alastair Tse @ 2003-10-30 23:58 ` Spider 0 siblings, 0 replies; 25+ messages in thread From: Spider @ 2003-10-30 23:58 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 1971 bytes --] begin quote Combined reply below. > > > unicode, or utf-8 ? > > > > I personally vote for unicode, but if it is going to be enabled by > > default (i.e. Gentoo support UTF-8 immediately after installation), > > utf8 seems adequate rather than unicode. I just don't want to add > > utf7, utf16, utf32 USE flags for each encoding but have unicode USE > > flag which will support Unicode encodings (if any). > > Exactly. "unicode" should denote adding support for UCS-* or UTF-* > encodings and internal representations. I agree here, just wanted to get a statement out that it should encompass both, since theres a few interesting aspects of UTF8 locales and filenames. (Namely, gtk 1.2 doesn't like utf8 filenames , which makes me want to deprecate it. I've been looking for a solution and for all that I can see no -easy- one was avaiable. ) > > > > There ought to be some magic to convert locales to utf-8 as well > > > if this is enabled. > > > I'm thinking that it isn't time to make the USE flag default yet > > (liquidx is working on getting UTF-8 down on Gentoo system). Anyhow, > > creating unicode/utf8 USE flag and making it default are two > > different> matters and we can discuss them separately. > <cut> > Anyhow, the generation of UTF-8 locales can be accomplished by a patch > to glibc which could be applied depending on the unicode USE flag. > I've got one such patch around, I'll submit a bug to track it in > bugzilla soon. Yeah, I haven't spoken for making it Default, even if I could go radical and demand it default and that we remove all legacy software that doesn't cut it.... *G* (No, I don't intend to ;) However, as for the locale genereation patch, that would be very welcome, and i think that alone would make a "unicode" useflag valid. I'm all for this. //Spider -- begin .signature This is a .signature virus! Please copy me into your .signature! See Microsoft KB Article Q265230 for more information. end [-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 11:44 ` Mamoru KOMACHI 2003-10-30 12:26 ` Paul de Vrieze @ 2003-10-30 12:40 ` Bartosch Pixa 2003-10-30 16:08 ` Alastair Tse 2003-10-31 13:43 ` Svyatogor 3 siblings, 0 replies; 25+ messages in thread From: Bartosch Pixa @ 2003-10-30 12:40 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 654 bytes --] On Thu, 2003-10-30 at 12:44, Mamoru KOMACHI wrote: > Any objection if I add unicode USE flag (and make it default in all > profiles) ? We cannot simply convert cjk USE flag to unicode, but in > some cases unicode USE flag seems appropriate and we definitely need > it. (currently dev-python/wxPython and x11-libs/wxGTK have unicode > local USE flag and others use cjk or nls for unicode support) > > If nobody objects (and hopefully many approve ;) ) I'll create it this > weekend. wxGTK with unicode enabled breaks most apps that link to it. regards -- Bartosch Pixa Gentoo Linux Developer http://www.gentoo.org [-- Attachment #2: This is a digitally signed message part --] [-- Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 11:44 ` Mamoru KOMACHI 2003-10-30 12:26 ` Paul de Vrieze 2003-10-30 12:40 ` Bartosch Pixa @ 2003-10-30 16:08 ` Alastair Tse 2003-11-09 4:53 ` Mart Raudsepp 2003-10-31 13:43 ` Svyatogor 3 siblings, 1 reply; 25+ messages in thread From: Alastair Tse @ 2003-10-30 16:08 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 835 bytes --] On Thu, 2003-10-30 at 11:44, Mamoru KOMACHI wrote: > > > /me votes for unicode use-flag > > > > should also be activated in all profiles by default. > > Any objection if I add unicode USE flag (and make it default in all > profiles) ? We cannot simply convert cjk USE flag to unicode, but in wxGTK is currently using a local unicode useflag to enable unicode support. However, this is experimental so I rather not have unicode default in all profiles just yet. By compiling with the unicode useflag, you'll end up with certain wxGTK apps failing to merge. Cheers, -- Alastair 'liquidx' Tse >> Gentoo Developer >> http://www.liquidx.net/ | http://dev.gentoo.org/~liquidx/ >> GPG Key : http://dev.gentoo.org/~liquidx/liquidx_gentoo_org.asc >> FingerPrint : 579A 9B0E 43E8 0E40 EE93 BB1C 38CE 1C7B 3907 14F6 [-- Attachment #2: This is a digitally signed message part --] [-- Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 16:08 ` Alastair Tse @ 2003-11-09 4:53 ` Mart Raudsepp 2003-11-09 11:33 ` Alastair Tse 0 siblings, 1 reply; 25+ messages in thread From: Mart Raudsepp @ 2003-11-09 4:53 UTC (permalink / raw To: gentoo-dev Alastair Tse kirjutas N, 30.10.2003 kell 18:08: > On Thu, 2003-10-30 at 11:44, Mamoru KOMACHI wrote: > > > > /me votes for unicode use-flag > > > > > > should also be activated in all profiles by default. > > > > Any objection if I add unicode USE flag (and make it default in all > > profiles) ? We cannot simply convert cjk USE flag to unicode, but in > > wxGTK is currently using a local unicode useflag to enable unicode > support. However, this is experimental so I rather not have unicode > default in all profiles just yet. By compiling with the unicode useflag, > you'll end up with certain wxGTK apps failing to merge. Sorry for a really delayed answer. To my knowledge unicode support in wxGTK isn't experimental but rather suggested but not enforced. The reason why certain wxGTK apps fail to merge is that they don't comform with the unicode wxWindows prerequisites. Quote from wxWindows docs: "The important conclusion is that if you use wxChar instead of char, avoid using C style strings and use wxString instead and don't forget to enclose all string literals inside wxT() macro, your program automatically becomes (almost) Unicode compliant!" Those certain apps fail to follow the above. And to be honest, my own wxWindows projects fail to follow it aswell, but that will be fixed soon ;) Oh, and instead of wxT() one should use _() to automagically add i18n support aswell. So my conclusion is that we still should make USE=unicode be default (put into the profiles), but ebuilds that depend upon wxGTK should be checked if the wxWindows app is unicode aware or not. If it isn't then it should simply remove the unicode flag in the ebuild. Personally I would add the unicode flag to make.conf and wxGTK apps that don't compile emerge with USE=-unicode, also I'd prod the app devs to make it follow wxWindows unicode prerequisites that aren't hard to follow (personal experience). -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-11-09 4:53 ` Mart Raudsepp @ 2003-11-09 11:33 ` Alastair Tse 0 siblings, 0 replies; 25+ messages in thread From: Alastair Tse @ 2003-11-09 11:33 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 1200 bytes --] > > wxGTK is currently using a local unicode useflag to enable unicode > > support. However, this is experimental so I rather not have unicode > > default in all profiles just yet. By compiling with the unicode useflag, > > you'll end up with certain wxGTK apps failing to merge. > > Sorry for a really delayed answer. > To my knowledge unicode support in wxGTK isn't experimental but rather > suggested but not enforced. Yes, I mean experimental is in for Gentoo, not for wxGTK. I've only enabled that option a couple of weeks ago because some people wanted to test wxGTK with unicode support. > The reason why certain wxGTK apps fail to merge is that they don't > comform with the unicode wxWindows prerequisites. Yes, if you would like to submit patches against those programs that don't conform to those requirements, that would be most appreciated by the developers. Right now, because at least half of the applications that use wxGTK don't support unicode, there is no way I'm going to enable that as default for wxGTK until that is fixed. Cheers, -- Alastair 'liquidx' Tse >> Gentoo Developer >> http://www.liquidx.net/ | http://dev.gentoo.org/~liquidx/ [-- Attachment #2: This is a digitally signed message part --] [-- Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-30 11:44 ` Mamoru KOMACHI ` (2 preceding siblings ...) 2003-10-30 16:08 ` Alastair Tse @ 2003-10-31 13:43 ` Svyatogor 2003-10-31 14:24 ` Mamoru KOMACHI 3 siblings, 1 reply; 25+ messages in thread From: Svyatogor @ 2003-10-31 13:43 UTC (permalink / raw To: gentoo-dev On Thu, 30 Oct 2003 20:44:33 +0900 Mamoru KOMACHI <usata@gentoo.org> wrote: > Any objection if I add unicode USE flag (and make it default in all > profiles) ? We cannot simply convert cjk USE flag to unicode, but in > some cases unicode USE flag seems appropriate and we definitely need > it. (currently dev-python/wxPython and x11-libs/wxGTK have unicode > local USE flag and others use cjk or nls for unicode support) > > If nobody objects (and hopefully many approve ;) ) I'll create it this > weekend. I'd love to see this flag! However this should be *off* by default cause there are some applications (like MC for example) which have troubles understanding unicode. -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-31 13:43 ` Svyatogor @ 2003-10-31 14:24 ` Mamoru KOMACHI 2003-10-31 15:05 ` Spider 0 siblings, 1 reply; 25+ messages in thread From: Mamoru KOMACHI @ 2003-10-31 14:24 UTC (permalink / raw To: gentoo-dev Hi all, thanks for commenting on this issue ;) At Thu, 30 Oct 2003 16:08:52 +0000, Alastair Tse wrote: > wxGTK is currently using a local unicode useflag to enable unicode > support. However, this is experimental so I rather not have unicode > default in all profiles just yet. By compiling with the unicode useflag, > you'll end up with certain wxGTK apps failing to merge. At Fri, 31 Oct 2003 15:43:05 +0200, Svyatogor wrote: > I'd love to see this flag! However this should be *off* by default > cause there are some applications (like MC for example) which have > troubles understanding unicode. So what I will do is: 1. Create global USE flag named "unicode" which enables Unicode(including UTF-7/UTF-8/UTF-16 and so on) support. 2. Don't make it default in any profiles. (We will consider it later when those applications settle down.) Everybody happy with this? I hope it will be a good starting point for integrating Unicoded support into Gentoo Linux. -- Mamoru KOMACHI <usata@gentoo.org> http://www.gentoo.org/~usata/ -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-31 14:24 ` Mamoru KOMACHI @ 2003-10-31 15:05 ` Spider 2003-11-01 12:06 ` Mamoru KOMACHI 2003-11-01 19:10 ` Martin Schlemmer 0 siblings, 2 replies; 25+ messages in thread From: Spider @ 2003-10-31 15:05 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 948 bytes --] begin quote On Fri, 31 Oct 2003 23:24:52 +0900 Mamoru KOMACHI <usata@gentoo.org> wrote: > > So what I will do is: > > 1. Create global USE flag named "unicode" which enables > Unicode(including UTF-7/UTF-8/UTF-16 and so on) support. > 2. Don't make it default in any profiles. (We will consider it later > when those applications settle down.) > > Everybody happy with this? I hope it will be a good starting point for > integrating Unicoded support into Gentoo Linux. > Sounds great. proposed addition to baselayout: if [ `use unicode` ] ; then cat >> ${D}/etc/rc.conf << EOF # This setting enables a default UTF-8 locale for your system. # please look inside /usr/share/locale for more examples LANG="en_US.UTF-8" EOF fi This is rough, but something like it perhaps? //Spider -- begin .signature This is a .signature virus! Please copy me into your .signature! See Microsoft KB Article Q265230 for more information. end [-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-31 15:05 ` Spider @ 2003-11-01 12:06 ` Mamoru KOMACHI 2003-11-01 13:44 ` Spider 2003-11-01 19:10 ` Martin Schlemmer 1 sibling, 1 reply; 25+ messages in thread From: Mamoru KOMACHI @ 2003-11-01 12:06 UTC (permalink / raw To: gentoo-dev Hi, At Fri, 31 Oct 2003 16:05:30 +0100, Spider wrote: > > 1. Create global USE flag named "unicode" which enables > > Unicode(including UTF-7/UTF-8/UTF-16 and so on) support. > > 2. Don't make it default in any profiles. (We will consider it later > > when those applications settle down.) > Sounds great. I've just added unicode USE flag :) > proposed addition to baselayout: > > if [ `use unicode` ] ; > then > cat >> ${D}/etc/rc.conf << EOF > # This setting enables a default UTF-8 locale for your system. > # please look inside /usr/share/locale for more examples > LANG="en_US.UTF-8" > > EOF > fi > > This is rough, but something like it perhaps? If http://bugs.gentoo.org/show_bug.cgi?id=9988 "setting LANG variable" is implemented we may think about it, but I don't think the system should set LANG variable (at least) until then. -- Mamoru KOMACHI <usata@gentoo.org> http://www.gentoo.org/~usata/ -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-11-01 12:06 ` Mamoru KOMACHI @ 2003-11-01 13:44 ` Spider 0 siblings, 0 replies; 25+ messages in thread From: Spider @ 2003-11-01 13:44 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 645 bytes --] begin quote On Sat, 01 Nov 2003 21:06:56 +0900 Mamoru KOMACHI <usata@gentoo.org> wrote: > > I've just added unicode USE flag :) Good. > If http://bugs.gentoo.org/show_bug.cgi?id=9988 "setting LANG variable" > is implemented we may think about it, but I don't think the system > should set LANG variable (at least) until then. > Well, ok. Except that if you use utf-8 enabled locales and the like, you really want to have LANG set to a utf-8 locale, rather than a "normal" one. //Spider -- begin .signature This is a .signature virus! Please copy me into your .signature! See Microsoft KB Article Q265230 for more information. end [-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-10-31 15:05 ` Spider 2003-11-01 12:06 ` Mamoru KOMACHI @ 2003-11-01 19:10 ` Martin Schlemmer 2003-11-05 13:50 ` Spider 1 sibling, 1 reply; 25+ messages in thread From: Martin Schlemmer @ 2003-11-01 19:10 UTC (permalink / raw To: Spider; +Cc: Gentoo-Dev [-- Attachment #1: Type: text/plain, Size: 1390 bytes --] On Fri, 2003-10-31 at 17:05, Spider wrote: > begin quote > On Fri, 31 Oct 2003 23:24:52 +0900 > Mamoru KOMACHI <usata@gentoo.org> wrote: > > > > > So what I will do is: > > > > 1. Create global USE flag named "unicode" which enables > > Unicode(including UTF-7/UTF-8/UTF-16 and so on) support. > > 2. Don't make it default in any profiles. (We will consider it later > > when those applications settle down.) > > > > Everybody happy with this? I hope it will be a good starting point for > > integrating Unicoded support into Gentoo Linux. > > > > Sounds great. > > > proposed addition to baselayout: > > if [ `use unicode` ] ; > then > cat >> ${D}/etc/rc.conf << EOF > # This setting enables a default UTF-8 locale for your system. > # please look inside /usr/share/locale for more examples > LANG="en_US.UTF-8" > > EOF > fi > > This is rough, but something like it perhaps? I do not see why. If you enable utf8 support, it does not mean you want it in console. The user should specify his LANG himself, as well as the fact that it should rather be: ELANG="en_US.UTF-8" and then get it to propagate only to the users, as there are still a lot of things breaking during build with LANG, et el set. Thanks, -- Martin Schlemmer Gentoo Linux Developer, Desktop/System Team Developer Cape Town, South Africa [-- Attachment #2: This is a digitally signed message part --] [-- Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-11-01 19:10 ` Martin Schlemmer @ 2003-11-05 13:50 ` Spider 2003-11-05 22:11 ` Stanislav Brabec 0 siblings, 1 reply; 25+ messages in thread From: Spider @ 2003-11-05 13:50 UTC (permalink / raw To: gentoo-dev [-- Attachment #1: Type: text/plain, Size: 845 bytes --] begin quote On Sat, 01 Nov 2003 21:10:50 +0200 Martin Schlemmer <azarah@gentoo.org> wrote: > I do not see why. If you enable utf8 support, it does not > mean you want it in console. The user should specify his > LANG himself, as well as the fact that it should rather be: > > ELANG="en_US.UTF-8" > > and then get it to propagate only to the users, as there are > still a lot of things breaking during build with LANG, et el > set. Ahh, thats very true. Except that I'd still want it documented. Yep, I'm having a drive for UTF-8 right now, I've ran into places where I need filenames in UTF-8 and most things have started to break otherwise. unpleasant compability problems. //Spider -- begin .signature This is a .signature virus! Please copy me into your .signature! See Microsoft KB Article Q265230 for more information. end [-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) 2003-11-05 13:50 ` Spider @ 2003-11-05 22:11 ` Stanislav Brabec [not found] ` <20031106081925.GA2446@core.realtime.sk> 0 siblings, 1 reply; 25+ messages in thread From: Stanislav Brabec @ 2003-11-05 22:11 UTC (permalink / raw To: Spider; +Cc: gentoo-dev V St, 05. 11. 2003 v 14:50, Spider píše: > Ahh, thats very true. Except that I'd still want it documented. Yep, > I'm having a drive for UTF-8 right now, I've ran into places where I > need filenames in UTF-8 and most things have started to break otherwise. > unpleasant compability problems. These problems are currently discussed in glib bugzilla: http://bugzilla.gnome.org/show_bug.cgi?id=114068 And there are my ideas, what needs to be done for seamless UTF-8 support: 0) baselayout: Add "official" way to set system default locale: /etc/rc.conf or /etc/env.d/{number}locale (or similar). 1) glibc: Generate UTF-8 locales for all languages, if unicode USE flag is on. Either as patch, or ex-post: for i in ...... ; do localedef -i $i -f UTF-8 -u charids.894 $i.UTF-8 done 2) Enable UTF-8 locales in GDM: Two alternatives: # Create UTF-8 for all locales. sed 's:\([^ ]*\)\([ ][ ]*\)\(.*\)UTF-8,\(.*\):\1 \2\4,\3UTF-8\ \1(UTF-8)\2\3UTF-8,\4:g' <config/locale.alias | sed 's:^\([^ ][^ ][^ ][^ ][^ ][^ ][^ ][^ ]\)*(UTF-8):& :' >config/locale.alias~ mv config/locale.alias~ config/locale.alias # Prefer non UTF-8 for all locales. sed 's:\([^ ]*\)\([ ][ ]*\)\(.*\)UTF-8,\(.*\):\1 \2\4,\3UTF-8:g' <config/locale.alias | sed 's:^\([^ ][^ ][^ ][^ ][^ ][^ ][^ ][^ ]\)*(UTF-8):& :' >config/locale.alias~ mv config/locale.alias~ config/locale.alias 3) Grab Redhat patches for most problematic applications (mc, slang, maybe ncurses). Setting language mini HOWTO: LANG LANG sets overall NLS support. Use LANG in form cs_CZ, cs_CZ.UTF-8 or cs_CZ.UTF-8@variant. Avoid cs (X does not like it), cs_CZ.utf8 (again - X doesn't like it) and czech (X and some old apps does not like it - very old gtk). Test: run LANG={mysetup} xcalc. No warning must appear. Note: localedef --list shows cs_CZ.utf8. It's because glibc ignores all '-' characters and converts charset to lowercase. LANGUAGE LANGUAGE can be optionally set to list of languages for searching messages, e. g. cs_CZ:sk_SK. Applications translated to cs will use cs, applications translated to sk but not cs will use sk. Warning: Some desktop managers with ability to set session language (GDM) does not properly set/reset this variable and it is inherited to all locale setups. It can cause strange setups. LC_{category} - Set it for special purposes, if you want to use different locales for messages, ctypes, collating etc. LC_ALL - no need to set in environment. It nearly duplicates LANG. Nice to use in scripts, not environment. -- Stanislav Brabec http://www.penguin.cz/~utx, ICQ 116020046 -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
[parent not found: <20031106081925.GA2446@core.realtime.sk>]
* Re: [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) [not found] ` <20031106081925.GA2446@core.realtime.sk> @ 2003-11-06 22:57 ` Stanislav Brabec 0 siblings, 0 replies; 25+ messages in thread From: Stanislav Brabec @ 2003-11-06 22:57 UTC (permalink / raw To: gabor; +Cc: gentoo-dev gabor wrote: > could you explain to me what's the difference between LC_ALL and LANG? > > until now i'm using export LC_ALL=en_US.UTF-8 in /etc/profile... > should i change it to LANG=en_US.UTF-8? If you don't use other LC_{category}, there is no difference. But if you use it with these variables, there it is: LANG is fallback, overridden by all other LC_* variables. LC_ALL is enforcement, and overrides LANG and all LC_{category} variables. It means: if you define LC_ALL, sessing LC_{category} has no effect. There is an example: utx$ ( export LANG=C LC_NUMERIC=C LC_ALL=C ; printf "%'5.3f\\n" 1 ) 1.000 utx$ ( export LANG=C LC_NUMERIC=C LC_ALL=cs_CZ ; printf "%'5.3f\\n" 1 ) 1,000 utx$ ( export LANG=cs_CZ LC_NUMERIC=C LC_ALL=C ; printf "%'5.3f\\n" 1 ) 1.000 utx$ ( export LANG=C LC_NUMERIC=cs_CZ LC_ALL=C ; printf "%'5.3f\\n" 1 ) 1.000 utx$ ( export LANG=C LC_NUMERIC=cs_CZ ; printf "%'5.3f\\n" 1 ) 1,000 Note: Script programmers do often a mistake, using following construction: LC_TIME=C date It works incorrectly, if you have LC_ALL set to any language. -- Stanislav Brabec http://www.penguin.cz/~utx, ICQ 116020046 -- gentoo-dev@gentoo.org mailing list ^ permalink raw reply [flat|nested] 25+ messages in thread
end of thread, other threads:[~2003-11-09 11:34 UTC | newest] Thread overview: 25+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2003-08-13 23:34 [gentoo-dev] better handling of multibyte characters (nls/cjk/unicode) Mike Frysinger 2003-08-13 23:38 ` Heinrich Wendel 2003-08-13 23:40 ` Heinrich Wendel 2003-10-30 11:44 ` Mamoru KOMACHI 2003-10-30 12:26 ` Paul de Vrieze 2003-10-30 12:41 ` Spider 2003-10-30 13:26 ` Paul de Vrieze 2003-10-30 13:43 ` Spider 2003-10-31 13:44 ` Svyatogor 2003-10-30 14:09 ` Mamoru KOMACHI 2003-10-30 16:18 ` Alastair Tse 2003-10-30 23:58 ` Spider 2003-10-30 12:40 ` Bartosch Pixa 2003-10-30 16:08 ` Alastair Tse 2003-11-09 4:53 ` Mart Raudsepp 2003-11-09 11:33 ` Alastair Tse 2003-10-31 13:43 ` Svyatogor 2003-10-31 14:24 ` Mamoru KOMACHI 2003-10-31 15:05 ` Spider 2003-11-01 12:06 ` Mamoru KOMACHI 2003-11-01 13:44 ` Spider 2003-11-01 19:10 ` Martin Schlemmer 2003-11-05 13:50 ` Spider 2003-11-05 22:11 ` Stanislav Brabec [not found] ` <20031106081925.GA2446@core.realtime.sk> 2003-11-06 22:57 ` Stanislav Brabec
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox