Hello,
The html page is located at "http://photography.nationalgeographic.com/photography/photo-of-the-day/northern-spotted-owl-pod.html"
btw, when googling around for xpressive the search results pointed to http://lists.boost.org/boost-users/2008/08/39761.php, which says xpressive is not directly usable with utf-8. I did not find any examples of xpressive with utf-8 strings.
does boost::regex with icu have answer to my question ? (i'm going over it now)
Surya
It is not spurious, it is probably a UTF-8 file or something. Can youOn Wed, Sep 30, 2009 at 10:54 PM, Surya Kiran Gullapalli
<suryakiran.gullapalli@gmail.com> wrote:
> Hello all,
>
> I'm reading an html file and parsing data from it. I've encountered a
> spurious string (researcher’s). Notepad++ shows that string as
> researcher's .
>
> I'm reading the file using std::fstream class. Is there any way i can read
> the string and get the spurious characters replaced using boost string/regex
> algorithms ?
attach it, if so then we can confirm that.
But yes, you can do such a replacement with Boost.Regex, although I
would recommend doing it with Boost.Xpressive instead (it even has an
example doing such a replace in the docs, and if you use the static
version it will run faster).
_______________________________________________
Boost-users mailing list
Boost-users@lists.boost.org
http://lists.boost.org/mailman/listinfo.cgi/boost-users