This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.
Re: Windows Unicode and GCC
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Mike Hearn wrote:
> On Mon, 24 Apr 2006 15:27:07 -0400, Nicolas De Rico wrote:
>> I would like to compile files created on Windows and encoded in
>> "Unicode" (UTF-8 or UTF-16). Microsoft puts a little header at the
>> beginning of files to indicate that they are UTF-16, UTF-8, etc. I
>> believe that this header is standard unicode btw, not an extension!
>
> Are you thinking of the byte order mark (BOM)? If so then this is a quirk
> of UTF-16 and is a Windows thing that many apps can't handle correctly ...
> UTF-8 should not have any headers at all and GCC should handle them fine.
> Try using some text editor to check it really is UTF-8.
Windows Notepad still inserts a BOM (0xEF 0xBB 0xBF) at
the beginning of files encoded with UTF-8. See:
http://www.microsoft.com/globaldev/getwr/steps/wrg_unicode.mspx
http://en.wikipedia.org/wiki/Byte_Order_Mark
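
For illustration, a tool reading such a file could check for those three
bytes and skip them before handing the buffer on. The following is only a
minimal sketch: utf8_bom_length is a hypothetical helper name, not
something taken from GCC or cpplib, and it assumes the start of the file
has already been read into memory.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (not part of GCC): returns the number of leading
   bytes to skip if buf starts with the UTF-8 BOM (0xEF 0xBB 0xBF),
   otherwise 0.  len is the number of valid bytes in buf. */
static size_t utf8_bom_length(const unsigned char *buf, size_t len)
{
    static const unsigned char bom[3] = { 0xEF, 0xBB, 0xBF };

    if (len >= sizeof bom && memcmp(buf, bom, sizeof bom) == 0)
        return sizeof bom;
    return 0;
}
```

A caller would advance its read pointer by the returned count, so a file
without a BOM is passed through unchanged.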
Ranjit.
- --
Ranjit Mathew Email: rmathew AT gmail DOT com
Bangalore, INDIA. Web: http://rmathew.com/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.2.2 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iD8DBQFETfs0Yb1hx2wRS48RAkvmAKCae/o9vD3doaDKD1VPOSUlSlhRjACdGqv0
nD0cMiSvZLu9TfmIf/BUuIU=
=lZaM
-----END PGP SIGNATURE-----