This is the mail archive of the
java@gcc.gnu.org
mailing list for the Java project.
Re: iconv on Solaris
- To: Mark Mitchell <mark at codesourcery dot com>
- Subject: Re: iconv on Solaris
- From: Jeff Sturm <jsturm at one-point dot com>
- Date: Mon, 24 Sep 2001 08:47:26 -0400 (EDT)
- cc: java at gcc dot gnu dot org
On Mon, 24 Sep 2001, Jeff Sturm wrote:
> Apparently Solaris 8 iconv supplies the missing converters but breaks
> somewhere else.
Or not. I just checked more carefully. 0xFEFF is the Byte Order Mark.
>From unicode.org:
Q: What is a BOM?
A: The special characteristics of U+FEFF ZERO WIDTH NON-BREAKING SPACE
have been provided for use by higher level protocols as a signature at the
beginning of certain data streams (primarily unmarked plaintext files).
(When used as a signature, it is referred to as the BOM character, for
Byte Order Mark.) Under those protocols, the BOM may be mandatory in those
Unicode data streams.
In the absence of such protocols and when not at the beginning of a text
stream, U+FEFF is given its normal interpretation, as ZERO WIDTH
NON-BREAKING SPACE, and is part of the content of the file or string. [MD]
Jeff