Boost Users :

Date view	Thread view	Subject view	Author view

From: John Maddock (john_at_[hidden])
Date: 2007-10-01 04:42:09

Next message: gast128: "[Boost-users] string_algo no replace_if 2"
Previous message: Cory Nelson: "[Boost-users] Passing extra compiler arguments with bjam"
In reply to: Anjaly: "Re: [Boost-users] u32regex_search crashes"
Next in thread: Anjaly: "Re: [Boost-users] u32regex_search crashes"
Reply: Anjaly: "Re: [Boost-users] u32regex_search crashes"

Anjaly wrote:
> In the regex document it was said that the size of data type of the
> variable passed to the make_u32regex that determines character
> encoding (utf8,utf16 or utf32) .

*For construction of the regex object*.

The search algorithms operate independently on any of UTF8/16/32.

> I passed wchar_t (which i think size
> is 4) so that the buffer encoding is considered as utf8 by
> u32regex_search irrespectively. Actually i am trying to do a utf8
> search.

Except the data file you sent *was not valid UTF8* !

It looks like it's probably UTF16LE, it's up to you in that case to decode
the byte order mark and read the text into something that Boost.Regex can
handle (for example platform-native UTF16). ICU should have some file IO
routines for doing that kind of thing: for example for loading a file into a
UnicodeString type.

HTH, John.

Next message: gast128: "[Boost-users] string_algo no replace_if 2"
Previous message: Cory Nelson: "[Boost-users] Passing extra compiler arguments with bjam"
In reply to: Anjaly: "Re: [Boost-users] u32regex_search crashes"
Next in thread: Anjaly: "Re: [Boost-users] u32regex_search crashes"
Reply: Anjaly: "Re: [Boost-users] u32regex_search crashes"

Date view	Thread view	Subject view	Author view

Boost-users list run by williamkempf at hotmail.com, kalb at libertysoft.com, bjorn.karlsson at readsoft.com, gregod at cs.rpi.edu, wekempf at cox.net