This is the mail archive of the
libstdc++@gcc.gnu.org
mailing list for the libstdc++ project.
RE: wchar_t on Solaris 8
- From: "Lev Assinovsky" <LAssinovsky at algorithm dot aelita dot com>
- To: "L. Nicoara" <nicoara at roguewave dot com>
- Cc: "Benjamin Kosnik" <bkoz at redhat dot com>,<cxwang_wt at hotmail dot com>,<libstdc++ at gcc dot gnu dot org>
- Date: Thu, 17 Apr 2003 12:01:09 +0400
- Subject: RE: wchar_t on Solaris 8
----
Lev Assinovsky
Aelita Software Corporation
O&S Core Division, Programmer
ICQ# 165072909
> -----Original Message-----
> From: L. Nicoara [mailto:nicoara at roguewave dot com]
> Sent: Wednesday, April 16, 2003 8:58 PM
> To: Lev Assinovsky
> Cc: Benjamin Kosnik; cxwang_wt at hotmail dot com; libstdc++ at gcc dot gnu dot org
> Subject: Re: wchar_t on Solaris 8
>
>
> Lev,
>
> Solaris having UCS as internal encoding is big news for me.
> Is this for
> all locales or for just some selected few?
It's for Unicode, which set just a width of integer.
What's unicode representation for a particular letter
of any given locale - I have no idea.
>
> 8 bytes encoding is even bigger news - I don't remember ever hearing
> about a standard that uses 8 bytes for character encoding.
Yes, wchar_t is int on 64 bit architecture and is long with 32 bit arc.
I was wrong.
>
> Also you say that Sparc architecture uses LE and Intel BE. Isn't it
> viceversa?
You are right! Is was my mistyping.
>
> Can you point me to some documentation?
>
> Thanks,
> Liviu
>
> Lev Assinovsky wrote:
> > Solaris Sparc: 4/8 bytes UCS-4LE
> > Solaris Intel: 4 bytes UCS-4BE
> >
> > ----
> > Lev Assinovsky
> > Aelita Software Corporation
> > O&S Core Division, Programmer
> > ICQ# 165072909
> >
> >
> >> -----Original Message-----
> >> From: Benjamin Kosnik [mailto:bkoz at redhat dot com]
> >> Sent: Tuesday, April 15, 2003 9:03 PM
> >> To: L. Nicoara
> >> Cc: cxwang_wt at hotmail dot com; libstdc++ at gcc dot gnu dot org
> >> Subject: Re: wchar_t on Solaris 8
> >>
> >>
> >>
> >> >Also, you may be confused about external vs. internal
> representation
> >> >(encoding) of characters. EUC is an external encoding,
> whereas [for
> >> >glibc] UCS4 is the internal encoding [for all locales
> regardless the
> >> >external encoding].
> >>
> >> See my previous note about linux/glibc.
> >>
> >> And for Solaris? I thought wchar_t was a 2 byte type with internal
> >> UCS-2BE encoding?
> >>
> >> I'll put this in the docs since this seems to come up regularly.
> >>
> >> best,
> >> benjamin
> >>
> >
>
>