Unicode Version: 5.0.0
Date: 2006-06-13, 23:23:42 GMT
This page illustrates the application of the boundary specifications. The first chart shows where breaks would appear between different sample characters or strings. The sample characters are chosen mechanically to represent the different properties used by the specification. Where properties used in the rules have 'overlaps', the samples are given 'composed' names. For example, SentenceBreak uses GCLF_Sep: Sep is the SentenceBreak property, but it overlaps with the GraphemeClusterBreak property LF.
Other | CR | LF | Control | Extend | L | V | T | LV | LVT | |
---|---|---|---|---|---|---|---|---|---|---|
Other | ÷ | ÷ | ÷ | ÷ | × | ÷ | ÷ | ÷ | ÷ | ÷ |
CR | ÷ | ÷ | × | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ |
LF | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ |
Control | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ |
Extend | ÷ | ÷ | ÷ | ÷ | × | ÷ | ÷ | ÷ | ÷ | ÷ |
L | ÷ | ÷ | ÷ | ÷ | × | × | × | ÷ | × | × |
V | ÷ | ÷ | ÷ | ÷ | × | ÷ | × | × | ÷ | ÷ |
T | ÷ | ÷ | ÷ | ÷ | × | ÷ | ÷ | × | ÷ | ÷ |
LV | ÷ | ÷ | ÷ | ÷ | × | ÷ | × | × | ÷ | ÷ |
LVT | ÷ | ÷ | ÷ | ÷ | × | ÷ | ÷ | × | ÷ | ÷ |
Due to the way they have been mechanically processed for generation, the following rules do not match the UAX rules precisely. In particular:
For the original rules, see the UAX.