Boost logo

Boost :

Subject: Re: [boost] [locale] Review results for Boost.Locale library
From: Mathias Gaunard (mathias.gaunard_at_[hidden])
Date: 2011-04-26 11:18:55


On 26/04/2011 14:27, Artyom wrote:

> The problem even if the source is UTF-8 with BOM "שלום" would
> be encoded according to locale's 8bit codepage like 1255 or 936
> and not UTF-8 string (codepage 65001).
>
> It is rather stupid, but this is how MSVC works or understands
> the place of UTF-8 in this world.
>
> Unicode and Visual Studio is just broken...

That's not broken, this is the expected behaviour. The execution
character set is necessarily ANSI with that compiler, and the compiler
performs source character set to execution character set as expected.

To be able to input UTF-8 in string literals, you should use unicode
string literals (C++0x only) or wide string literals (but then you end
up with UTF-16).


Boost list run by bdawes at acm.org, gregod at cs.rpi.edu, cpdaniel at pacbell.net, john at johnmaddock.co.uk