Hello,
The html page is located at "http://photography.nationalgeographic.com/photography/photo-of-the-day/northern-spotted-owl-pod.html"

btw, when googling around for xpressive the search results pointed to http://lists.boost.org/boost-users/2008/08/39761.php, which says xpressive is not directly usable with utf-8. I did not find any examples of xpressive with utf-8 strings.

does boost::regex with icu have answer to my question ? (i'm going over it now)

Surya

On Thu, Oct 1, 2009 at 11:02 AM, OvermindDL1 <overminddl1@gmail.com> wrote:
On Wed, Sep 30, 2009 at 10:54 PM, Surya Kiran Gullapalli
<suryakiran.gullapalli@gmail.com> wrote:
> Hello all,
>
> I'm reading an html file and parsing data from it. I've encountered a
> spurious string (researcher’s). Notepad++ shows that string as
> researcher's .
>
> I'm reading the file using std::fstream class. Is there any way i can read
> the string and get the spurious characters replaced using boost string/regex
> algorithms ?

It is not spurious, it is probably a UTF-8 file or something.  Can you
attach it, if so then we can confirm that.

But yes, you can do such a replacement with Boost.Regex, although I
would recommend doing it with Boost.Xpressive instead (it even has an
example doing such a replace in the docs, and if you use the static
version it will run faster).
_______________________________________________
Boost-users mailing list
Boost-users@lists.boost.org
http://lists.boost.org/mailman/listinfo.cgi/boost-users