How about basic_string<UTF-8> ?

Shiv Shankar Ramakrishnan
Tue Jun 1 12:43:00 GMT 1999

How about having a char_traits<> specialisation for UTF-8 chars?! I am
not really sure but I guess Linux uses the UTF-8 version of Unicode for
its internationalisation. This would be a great help to do development
with strings for UTF-8 data. As it is the whole LDAP (Light weight
directory access protocol RFC 2251) world uses UTF-8 for its data.
All of us would love to have a UTF-8 version of string. If somebody
gives me some pointers I am willing to do this. BTW does the char_traits
spec have a req for the char to be an integral no of bytes? I think not.
In that case I don't think it would be a lot of work to implement a
UTF-8 specialisation.

More information about the Libstdc++ mailing list