From: Rogier van Dalen (rogiervd_at_[hidden])
Date: 2005-03-18 16:41:51
On Thu, 17 Mar 2005 17:52:25 +0100, Erik Wien <wien_at_[hidden]> wrote:
> What exactly do you mean by the term "character"? Abstract characters?
I really need to remember the correct terminology. What I mean is the
thing "a user thinks of as a character", that is, a "grapheme cluster",
of which the Unicode Standard says:
"[T]here is a core concept of "characters that should be kept
together" that can be defined for the Unicode Standard in a
language-independent way. This core concept is known as a grapheme
cluster, and it consists of any combining character sequence that
contains only nonspacing combining marks, or any sequence of
characters that constitutes a Hangul syllable (possibly followed by
one or more nonspacing marks)."
I believe this is what a Unicode library should use as its basic unit.
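To make the distinction concrete, here is a rough sketch of how a sequence of code points might be grouped into grapheme clusters. It is not a proposal for the library interface: the names is_nonspacing_mark and grapheme_clusters are made up for illustration, and the mark test is a deliberate simplification that only recognises U+0300..U+036F (the Combining Diacritical Marks block). A real implementation would consult the Unicode character database and would also handle sequences of conjoining Hangul jamo, which this sketch ignores.

```cpp
#include <string>
#include <vector>

// ASSUMPTION (illustration only): treat just U+0300..U+036F as
// nonspacing combining marks. The real set is much larger and comes
// from the Unicode character database.
bool is_nonspacing_mark(char32_t c) {
    return c >= 0x0300 && c <= 0x036F;
}

// Hypothetical helper: split a sequence of code points into clusters,
// each consisting of one base code point followed by any nonspacing
// marks that attach to it. A mark at the very start of the text has no
// base, so it begins a cluster of its own.
std::vector<std::u32string> grapheme_clusters(const std::u32string& text) {
    std::vector<std::u32string> clusters;
    for (char32_t c : text) {
        if (clusters.empty() || !is_nonspacing_mark(c)) {
            clusters.push_back(std::u32string(1, c));  // start a new cluster
        } else {
            clusters.back().push_back(c);              // attach mark to its base
        }
    }
    return clusters;
}
```

For the three code points 'e', U+0301 (combining acute accent), 'a', this yields two clusters: the accented "e" and the plain "a". That is the count a user would expect, and it is the unit I would like iteration over a string to use.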
Sorry for any confusion caused,
Rogier
Boost list run by bdawes at acm.org, gregod at cs.rpi.edu, cpdaniel at pacbell.net, john at johnmaddock.co.uk