Boost logo

Boost :

From: Steinar Bang (sb_at_[hidden])
Date: 1999-06-21 02:45:10


One argument I've seen used against wstring, is that it isn't
Unicode, ie. that it hasn't any Unicode functionality built in.

I think boost.org would be the ideal repository for some "standard"
Unicode code_cvt facets.

One problem is that we would have to deal with both UCS-2 and UCS-4,
because I think the standard would allow both 16 bit and 32 bit
wchar_t. As I understand it, the Unicode standard recommends UTF-16
        http://www.terena.nl/projects/multiling/unicode/wg2n1035.html
for internal representation (this is what the Java string uses), but
basic_string<> is not well suited to variable length characters.

Some code_cvt facets I could think of would be UCS-2 or UCS-4 to UTF-8
(and back) and UCS-2/UCS-4 to ISO-8859-x.

I'm leaving for vacation at the end of this week, so I won't have any
time to look into this for a while. But the idea has been buzzing
around in my head for a while so I'm sending this message as a way of
clearing my head, and maybe start some discussion.

- Steinar

------------------------------------------------------------------------

eGroups.com home: http://www.egroups.com/group/boost
http://www.egroups.com - Simplifying group communications


Boost list run by bdawes at acm.org, gregod at cs.rpi.edu, cpdaniel at pacbell.net, john at johnmaddock.co.uk