Boost logo

Boost :

Subject: Re: [boost] [General] Always treat std::strings as UTF-8
From: John B. Turpish (jbturp_at_[hidden])
Date: 2011-01-14 18:33:02


On Fri, Jan 14, 2011 at 5:52 PM, Robert Kawulak
<robert.kawulak_at_[hidden]> wrote:
> IIUC you can't assume a fixed size for a character even with UTF-32. In UTF-32 only _codepoints_ have fixed size, yet one character
> may be composed of several codepoints, e.g. a latin letter followed by a diacritical mark, making up one character
> (http://en.wikipedia.org/wiki/Combining_character).
>
> Best regards,
> Robert

I stand corrected. This sort of the thing is the reason I start with
disclaimers like, "I'm not an expert, so take this with a grain of
salt."

Anyhow, thanks for the info.


Boost list run by bdawes at acm.org, gregod at cs.rpi.edu, cpdaniel at pacbell.net, john at johnmaddock.co.uk