X11R7.5 and C.UTF-8

Thomas Dickey dickey@his.com
Thu Dec 3 09:55:00 GMT 2009


On Thu, 3 Dec 2009, Andy Koppe wrote:

> 2009/12/3 Linda Walsh:
>> C.UTF_8 doesn't exist.
...
>> You can't have "C" and "UTF-8", because C means no encoding (default).
>> UTF-8 IS an encoding, so they are mutually exclusive.
>
> From http://www.opengroup.org/onlinepubs/009695399/basedefs/xbd_chap07.html,
> §7.2:
>
> "The tables in Locale Definition describe the characteristics and
> behavior of the POSIX locale for data consisting entirely of
> characters from the portable character set and the control character
> set. For other characters, the behavior is unspecified."
>
> This means that characters 0..127 have to be treated as ASCII, but
> beyond that an implementation can do what it wants. And on Cygwin 1.7,
> plain "C" actually does imply UTF-8, which happily is
> backward-compatible with ASCII.

That's an interpretation that so far hasn't been blessed by the standards
people.  Any discussion of this topic should mention that, as a caveat.

ymmv

-- 
Thomas E. Dickey
http://invisible-island.net
ftp://invisible-island.net
-------------- next part --------------
--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://x.cygwin.com/docs/
FAQ:                   http://x.cygwin.com/docs/faq/


More information about the Cygwin-xfree mailing list