Unicode.Guards (Unicode function guards for Elixir v1.0.1)

Defines a set of guards that can be used with Elixir functions.

Each guard operates on a UTF8 codepoint since the permitted operators in a guard clause are restricted to simple comparisons that do not include string comparators.

The data that underpins these guards is generated from the Unicode character database and therefore includes a broad range of scripts well beyond the basic ASCII definitions.

Summary

Functions

is_blank(codepoint)

Guards whether a UTF8 codepoint is a space character.

is_currency_symbol(codepoint)

Guards whether a UTF8 codepoint is a currency symbol character.

is_digit(codepoint)

Guards whether a UTF8 codepoint is a digit character.

is_graph(codepoint)

Guards whether a UTF8 codepoint is a graphic character.

is_lower(codepoint)

Guards whether a UTF8 codepoint is a lower case character.

is_print(codepoint)

Guargs where a UTF8 codepoint is a printing character.

is_printable(codepoint)

Guards whether a UTF8 codepoint is a printable.

is_quote_mark(codepoint)

Guards whether a UTF8 codepoint is a unicode quote symbol character.

is_quote_mark_ambidextrous(codepoint)

Guards whether a UTF8 codepoint is a unicode quote symbol that can be used either left or right.

is_quote_mark_double(codepoint)

Guards whether a UTF8 codepoint is a unicode quote symbol that is considered a single quote.

is_quote_mark_left(codepoint)

Guards whether a UTF8 codepoint is a unicode left quote symbol character.

is_quote_mark_right(codepoint)

Guards whether a UTF8 codepoint is a unicode right quote symbol character.

is_quote_mark_single(codepoint)

Guards whether a UTF8 codepoint is a unicode quote symbol that is considered a single quote.

is_separator(codepoint)

Guards whether a UTF8 codepoint is a unicode separator symbol character.

is_upper(codepoint)

Guards whether a UTF8 codepoint is an upper case character.

is_whitespace(codepoint)

Guards whether a UTF8 codepoint is a whitespace symbol character.

Functions

is_blank(codepoint)

(macro)

Guards whether a UTF8 codepoint is a space character.

This is the Unicode definition of those characters that horizontal space (space, non-breaking space etc) as well as the horizontal tab character.

is_currency_symbol(codepoint)

(macro)

Guards whether a UTF8 codepoint is a currency symbol character.

is_digit(codepoint)

(macro)

Guards whether a UTF8 codepoint is a digit character.

This guard will match any digit character from any Unicode script, not only the ASCII decimal digits.

is_graph(codepoint)

(macro)

Guards whether a UTF8 codepoint is a graphic character.

This is the Unicode definition of those characters that are visible. It is defined as those characters that are non-space, non-control and non-surrogate.

is_lower(codepoint)

(macro)

Guards whether a UTF8 codepoint is a lower case character.

The match is for any UTF8 character that is defined in Unicode to be an lower case character in any script.

is_print(codepoint)

(macro)

Guargs where a UTF8 codepoint is a printing character.

This is the Unicode definition which is a combination of the [:graph:] set and the [:blank:] set.

is_printable(codepoint)

(macro)

Guards whether a UTF8 codepoint is a printable.

The definition of what is printable is the same as used by String.printable?/1

is_quote_mark(codepoint)

(macro)

Guards whether a UTF8 codepoint is a unicode quote symbol character.

This includes the Unicode set Pi, Pf and other characters documents in this Wikpedia article.

See also Unicode.Category.QuoteMarks.

is_quote_mark_ambidextrous(codepoint)

(macro)

Guards whether a UTF8 codepoint is a unicode quote symbol that can be used either left or right.

See also Unicode.Category.QuoteMarks.

is_quote_mark_double(codepoint)

(macro)

Guards whether a UTF8 codepoint is a unicode quote symbol that is considered a single quote.

See also Unicode.Category.QuoteMarks.

is_quote_mark_left(codepoint)

(macro)

Guards whether a UTF8 codepoint is a unicode left quote symbol character.

See also Unicode.Category.QuoteMarks.

is_quote_mark_right(codepoint)

(macro)

Guards whether a UTF8 codepoint is a unicode right quote symbol character.

See also Unicode.Category.QuoteMarks.

is_quote_mark_single(codepoint)

(macro)

Guards whether a UTF8 codepoint is a unicode quote symbol that is considered a single quote.

See also Unicode.Category.QuoteMarks.

is_separator(codepoint)

(macro)

Guards whether a UTF8 codepoint is a unicode separator symbol character.

This includes the Unicode set Zs.

is_upper(codepoint)

(macro)

Guards whether a UTF8 codepoint is an upper case character.

The match is for any UTF8 character that is defined in Unicode to be an upper case character in any script.

is_whitespace(codepoint)

(macro)

Guards whether a UTF8 codepoint is a whitespace symbol character.

This includes the Unicode set Zs plus the characters in the range 0x9-0xd which incudes tab, newline and carriage return.