Boost logo

Boost Users :

Subject: Re: [Boost-users] Boost's direction regarding UTF8 -> UTF32 and UTF32 -> UTF8
From: Cory Nelson (phrosty_at_[hidden])
Date: 2010-06-24 09:58:40


On Thu, Jun 24, 2010 at 5:17 AM, Rodrigo Madera
<rodrigo.madera_at_[hidden]> wrote:
>> There are John Maddock's iterator adapters, those of the Boost.Unicode
>> library under development that are similar to them, and there is also
>> Boost.Locale, another library under development that is a frontend to ICU,
>> that provides function to do conversions (but it works with memory buffers).
>
> I see that Maddock's adapters does the job, but does it provide the
> reliability of ICU? For example, when it comes to round-trip conversions?

Each code point only has a single valid representation in any of the
UTF encodings, so anything but perfect round-trip transcoding would be
a bug. Overlong encodings are invalid, and normalization forms are a
separate issue outside of UTF transcoding.

-- 
Cory Nelson
http://int64.org

Boost-users list run by williamkempf at hotmail.com, kalb at libertysoft.com, bjorn.karlsson at readsoft.com, gregod at cs.rpi.edu, wekempf at cox.net