I was looking around to see if Boost has support for tokenizing UTF-8 strings. I don't see any mention of UTF-8 or Unicode in the Boost.Tokenizer documentation. If not Boost.Tokenizer … any other ideas ? I am also looking into ICU.

Thanks.

- Roshan