Unicode.String.Segment (Unicode String v1.7.0)

Implements the compilation of the Unicode segment rules.

Summary

Functions

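ancestors(locale_name)

Returns a list of the ancestor locales of a given locale.

compile_rule(rule, variables, regex_options \\ [])

Compiles a segment rule in the context of a list of variables.

compile_rules(rules, variables, regex_options)

evaluate_rules(string, rules)

Evaluates a list of rules against a given string.

expand_variables(variables, additional_variables)

is_id_continue(char)

Identifies if a codepoint is a valid identifier character.

is_id_start(char)

Identifies if a codepoint is a valid start of an identifier.

known_segmentation_locales()

Returns a list of the known locales that have segmentation data.

rules(locale, segment_type, additional_variables \\ [])

Returns the rules as defined by CLDR for a given locale and break type.

rules!(locale, segment_type, additional_variables \\ [])

Returns the rules as defined by CLDR for a given locale and break type, raising on error.

suppressions(locale, segment_type)

Returns a list of the suppressions for a given locale and segment type.

suppressions!(locale, segment_type)

Returns a list of the suppressions for a given locale and segment type, raising on error.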

Functions

ancestors(locale_name)

Returns a list of the ancestor locales of a given locale.

The list includes the given locale.
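A minimal sketch of how the ancestor chain might be inspected. The module `AncestorsSketch`, its `describe/1` helper, and the locale value `"en-AU"` are illustrative and not part of the library. The return shape is an assumption: the description above says a list is returned, so the sketch accepts a bare list, also accepts an `{:ok, list}` tuple, and falls through to a catch-all for anything else (for example an error result for an unknown locale). Depending on the library version, the locale may need to be an atom rather than a string.

```elixir
defmodule AncestorsSketch do
  alias Unicode.String.Segment

  # Hypothetical helper, not part of Unicode.String.Segment.
  # Formats whatever ancestors/1 returns without assuming one exact shape.
  def describe(locale_name) do
    case Segment.ancestors(locale_name) do
      {:ok, chain} when is_list(chain) -> "ancestors: " <> inspect(chain)
      chain when is_list(chain) -> "ancestors: " <> inspect(chain)
      other -> "no ancestor chain: " <> inspect(other)
    end
  end
end

# Illustrative locale name (an assumption, not taken from the documentation).
IO.puts(AncestorsSketch.describe("en-AU"))
```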

compile_rule(rule, variables, regex_options \\ [])

Compiles a segment rule in the context of a list of variables.

The compiled rule can then be inserted into a rule set.

compile_rules(rules, variables, regex_options)

evaluate_rules(string, rules)

Evaluates a list of rules against a given string.

expand_variables(variables, additional_variables)

is_id_continue(char)

(macro)

Identifies if a codepoint is a valid identifier character.

is_id_start(char)

(macro)

Identifies if a codepoint is a valid start of an identifier.
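Because `is_id_start/1` and `is_id_continue/1` are macros, they have to be imported or required before use. The sketch below assumes both are guard-safe and take an integer codepoint. The module `IdentifierSketch` and its `identifier?/1` helper are hypothetical names introduced only for illustration.

```elixir
defmodule IdentifierSketch do
  # Assumption: both macros can be used in guard clauses on integer codepoints.
  import Unicode.String.Segment, only: [is_id_start: 1, is_id_continue: 1]

  # Hypothetical helper: true when the string is one identifier-start
  # codepoint followed by zero or more identifier-continue codepoints.
  def identifier?(<<first::utf8, rest::binary>>) when is_id_start(first) do
    continue?(rest)
  end

  # Empty strings, invalid UTF-8, and strings with a non-start first codepoint.
  def identifier?(_other), do: false

  defp continue?(<<>>), do: true

  defp continue?(<<char::utf8, rest::binary>>) when is_id_continue(char) do
    continue?(rest)
  end

  defp continue?(_other), do: false
end
```

Under the Unicode identifier definitions, a string such as `"abc1"` would be expected to pass this check, while `"1abc"` would be expected to fail because a decimal digit may continue an identifier but not start one.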

known_segmentation_locales()

Returns a list of the known locales that have segmentation data.

rules(locale, segment_type, additional_variables \\ [])

Returns the rules as defined by CLDR for a given locale and break type (the segment_type argument).

rules!(locale, segment_type, additional_variables \\ [])

Returns the rules as defined by CLDR for a given locale and break type (the segment_type argument), raising on error.
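The difference between the two variants can be sketched as follows. This assumes the common Elixir convention that the non-raising `rules/2` returns `{:ok, rules}` or `{:error, reason}`, which the documentation above implies but does not state. The module `RulesSketch`, the helper `fetch/2`, and the argument values `"en"` and `:word_break` are assumptions for illustration; the accepted locale and segment type values may differ between library versions.

```elixir
defmodule RulesSketch do
  alias Unicode.String.Segment

  # Hypothetical helper. Assumes rules/2 follows the {:ok, _} / {:error, _}
  # convention; any other result is reported as unexpected rather than guessed at.
  def fetch(locale, segment_type) do
    case Segment.rules(locale, segment_type) do
      {:ok, rules} -> {:ok, rules}
      {:error, reason} -> {:error, reason}
      other -> {:error, {:unexpected_result, other}}
    end
  end
end

# Illustrative arguments only (assumptions, not documented values).
IO.inspect(RulesSketch.fetch("en", :word_break), label: "rules")
```

Callers that would rather let a failure propagate as an exception can call `rules!/2` directly and skip the pattern match.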

suppressions(locale, segment_type)

Returns a list of the suppressions for a given locale and segment type.

suppressions!(locale, segment_type)

Returns a list of the suppressions for a given locale and segment type, raising on error.
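As a rough sketch, the raising variant can be wrapped so that a failed lookup degrades to an empty list. The module `SuppressionsSketch`, the helper `list_or_empty/2`, and the argument values `"en"` and `:sentence_break` are hypothetical; which locale and segment type values are accepted is an assumption.

```elixir
defmodule SuppressionsSketch do
  alias Unicode.String.Segment

  # Hypothetical helper: returns the suppressions for the locale and segment
  # type, or an empty list if the lookup raises. The broad rescue keeps the
  # sketch short; real code would normally rescue a specific exception.
  def list_or_empty(locale, segment_type) do
    Segment.suppressions!(locale, segment_type)
  rescue
    _exception -> []
  end
end

# Illustrative arguments only (assumptions, not documented values).
IO.inspect(SuppressionsSketch.list_or_empty("en", :sentence_break), label: "suppressions")
```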