|
Boost Users : |
Subject: Re: [Boost-users] spurious string
From: OvermindDL1 (overminddl1_at_[hidden])
Date: 2009-10-01 04:21:48
On Thu, Oct 1, 2009 at 2:08 AM, Surya Kiran Gullapalli
<suryakiran.gullapalli_at_[hidden]> wrote:
> Hello,
> The html page is located at
> "http://photography.nationalgeographic.com/photography/photo-of-the-day/northern-spotted-owl-pod.html"
>
> btw, when googling around for xpressive the search results pointed to
> http://lists.boost.org/boost-users/2008/08/39761.php, which says xpressive
> is not directly usable with utf-8. I did not find any examples of xpressive
> with utf-8 strings.
>
> does boost::regex with icu have answer to my question ? (i'm going over it
> now)
Xpressive does not have direct support for UTF8, but it does work fine
with character strings, which is all your search and replace would
require.
Looking at the webpage now...
Ah, yep, that is not a ' symbol or a ` symbol, it is one of those
'specia' Microsoft Word and such things special forward tick symbols,
which yes, encodes as <bh:e2><bh:80><bh:99> (bh mean binary format,
hex, copied from the hex program I opened the page with). They are
annoying as all freaking heck, but yes, regex or xpressive would work
fine (and xpressive would work faster in static mode).
That is not even the correct placement of a forward tick, someone
screwed up there anyway.
Boost-users list run by williamkempf at hotmail.com, kalb at libertysoft.com, bjorn.karlsson at readsoft.com, gregod at cs.rpi.edu, wekempf at cox.net