Boost logo

Boost :

From: Miro Jurisic (macdev_at_[hidden])
Date: 2005-04-05 04:22:11


In article <e094f9eb05040500266a200bf8_at_[hidden]>,
 Rogier van Dalen <rogiervd_at_[hidden]> wrote:

> On Apr 4, 2005 10:45 PM, Miro Jurisic <macdev_at_[hidden]> wrote:
> > In article <e094f9eb050404061051334a6b_at_[hidden]>,
> > Rogier van Dalen <rogiervd_at_[hidden]> wrote:
> > > If two strings are normalised and in the same encoding, you can find one
> > > in the other by comparing bytes, yes.
> >
> > No, you can't. Consider searching for "e" in a decomposed representation of
> > e-acute -- you should find nothing, but you will get a false positive
> > instead.
>
> Whoops, I'd forgotten about that for a moment. But, if I'm not mistaken
> again, you never get false negatives. False positives can be skipped quite
> easily after basic_string::find has found one.

I wouldn't call anything that requires knowledge of Unicode composition rules
"quite easy". :-)

meeroh


Boost list run by bdawes at acm.org, gregod at cs.rpi.edu, cpdaniel at pacbell.net, john at johnmaddock.co.uk