Boost logo

Boost :

From: Steinar Bang (sb_at_[hidden])
Date: 1999-06-21 02:45:10

One argument I've seen used against wstring, is that it isn't
Unicode, ie. that it hasn't any Unicode functionality built in.

I think would be the ideal repository for some "standard"
Unicode code_cvt facets.

One problem is that we would have to deal with both UCS-2 and UCS-4,
because I think the standard would allow both 16 bit and 32 bit
wchar_t. As I understand it, the Unicode standard recommends UTF-16
for internal representation (this is what the Java string uses), but
basic_string<> is not well suited to variable length characters.

Some code_cvt facets I could think of would be UCS-2 or UCS-4 to UTF-8
(and back) and UCS-2/UCS-4 to ISO-8859-x.

I'm leaving for vacation at the end of this week, so I won't have any
time to look into this for a while. But the idea has been buzzing
around in my head for a while so I'm sending this message as a way of
clearing my head, and maybe start some discussion.

- Steinar

------------------------------------------------------------------------ home: - Simplifying group communications

Boost list run by bdawes at, gregod at, cpdaniel at, john at