From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from pigeon.gentoo.org ([208.92.234.80] helo=lists.gentoo.org) by finch.gentoo.org with esmtp (Exim 4.60) (envelope-from <gentoo-user+bounces-104733-garchives=archives.gentoo.org@lists.gentoo.org>) id 1NGHhT-0006Zx-BU for garchives@archives.gentoo.org; Thu, 03 Dec 2009 19:50:52 +0000 Received: from pigeon.gentoo.org (localhost [127.0.0.1]) by pigeon.gentoo.org (Postfix) with SMTP id 819D5E07F7; Thu, 3 Dec 2009 19:50:19 +0000 (UTC) Received: from mail.gmx.net (mail.gmx.net [213.165.64.20]) by pigeon.gentoo.org (Postfix) with SMTP id 04E51E07F7 for <gentoo-user@lists.gentoo.org>; Thu, 3 Dec 2009 19:50:18 +0000 (UTC) Received: (qmail invoked by alias); 03 Dec 2009 19:50:17 -0000 Received: from koln-4db401f3.pool.mediaWays.net (EHLO localhost) [77.180.1.243] by mail.gmx.net (mp068) with SMTP; 03 Dec 2009 20:50:17 +0100 X-Authenticated: #3423037 X-Provags-ID: V01U2FsdGVkX1+hUAMrbOeh6Bf+ThN9Wawf2JbA7rq4b/cswRUZ+5 0FgjNUwZD9XQ8M Date: Thu, 3 Dec 2009 20:50:08 +0100 From: Renat Golubchyk <ragermany@gmx.net> To: gentoo-user@lists.gentoo.org Subject: Re: [gentoo-user] [OT] Need advice from people who use non-ascii all day long Message-ID: <20091203205008.5584fa37@gmx.net> In-Reply-To: <20091203192003.GA1702@crowfix.com> References: <20091203192003.GA1702@crowfix.com> X-Mailer: Claws Mail 3.7.3 (GTK+ 2.18.3; x86_64-pc-linux-gnu) Precedence: bulk List-Post: <mailto:gentoo-user@lists.gentoo.org> List-Help: <mailto:gentoo-user+help@lists.gentoo.org> List-Unsubscribe: <mailto:gentoo-user+unsubscribe@lists.gentoo.org> List-Subscribe: <mailto:gentoo-user+subscribe@lists.gentoo.org> List-Id: Gentoo Linux mail <gentoo-user.gentoo.org> X-BeenThere: gentoo-user@lists.gentoo.org Reply-to: gentoo-user@lists.gentoo.org Mime-Version: 1.0 Content-Type: multipart/signed; micalg=PGP-SHA1; boundary="Sig_/YeXwvlA/Lu56axUDe9e8O1B"; protocol="application/pgp-signature" X-Y-GMX-Trusted: 0 X-FuHaFi: 0.58 X-Archives-Salt: 580ba90b-3558-4612-ab1a-9840b74630c7 X-Archives-Hash: b85a4c29c59b1d83bf0a4a5c56a781e0 --Sig_/YeXwvlA/Lu56axUDe9e8O1B Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi! On Thu, 3 Dec 2009 11:20:03 -0800 felix@crowfix.com wrote: > In Germany is a district "Busingen", with an umlauted 'u'. Is it > reasonable to consider it the same word whether with or without the > unlauted u? No. For many words it would be ok, but not for all. For example, "drucken" means "to print", "dr=FCcken" (with an umlaut) means "to press". In German you can exchange an umlaut with the combination "base letter + e", i.e. =FC --> ue, =F6 --> oe, and =DF --> ss. There are words with the combination "oe" that is in that particular case does not mean "=F6". So it's not straight forward, especially with names. Those may have a rather odd spelling for historical reasons. > Or put another way, I don't know much about German, French, Spanish, > etc keyboards. Do your keyboards have any of the extra keys, all of > them? Are German keyboards and French and Spanish keyboards as > restricted to their own languages as US keyboards are? If you have to > hit two or three keys to keep the umlauts, accents, and tildes, do you > get lazy sometimes and type the base character by itself? Is it even > considered the base character, or is it considered lazy and sloppy, > much as I get complaints about typing "thru" because "through" is too > much trouble? German keyboards have keys for all umlauts and '=DF'. You can google for pictures of different keyboard layouts. > I need something the equivalent of the C function strcasecmp() which > not only ignores case, but all other differences without distinction, > whatever they may be. I'd suggest you use a unicode library. BTW, what about cyrillic letters or other alphabets? Those may have nothing to do with ASCII. Or is your project restricted to latin letters? Cheers, Renat --=20 Probleme kann man niemals mit derselben Denkweise loesen, durch die sie entstanden sind. (Einstein) --Sig_/YeXwvlA/Lu56axUDe9e8O1B Content-Type: application/pgp-signature; name=signature.asc Content-Disposition: attachment; filename=signature.asc -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.13 (GNU/Linux) iEYEARECAAYFAksYFncACgkQRZZXkGjHI/0dAACdHcRcoQfN1rgAG/xUbyQ1IJdq OJUAoLg17SCJT+QASK6PeP6/mixhf129 =bhKt -----END PGP SIGNATURE----- --Sig_/YeXwvlA/Lu56axUDe9e8O1B--