ExFuzzywuzzy (ex_fuzzywuzzy v0.3.0)

ex_fuzzywuzzy is a fuzzy string matching library that uses a customizable measure to calculate a distance ratio.

Choose the ratio function that best fits your needs, pass it the two strings to be matched and, if needed, options that override the configured ones.

Available methods are:

Simple ratio
Quick ratio
Partial ratio
Token sort ratio
Partial token sort ratio
Token set ratio
Partial token set ratio
Best score ratio

Available options are:

Similarity function (Levenshtein and Jaro-Winkler provided in library)
Case sensitivity of the match
Decimal precision of output score

Here are some examples.

Simple ratio

iex> ExFuzzywuzzy.ratio("this is a test", "this is a test!")
96.55

Quick ratio

iex> ExFuzzywuzzy.quick_ratio("this is a test", "this is a test!")
100.0

Partial ratio

iex> ExFuzzywuzzy.partial_ratio("this is a test", "this is a test!")
100.0

Best score ratio

iex> ExFuzzywuzzy.best_score_ratio("this is a test", "this is a test!")
{:quick, 100.0}

Summary

Types

full_match_method()

Ratio methods available that match the full string

fuzzywuzzy_option()

Configurable runtime option types

fuzzywuzzy_options()

Configurable runtime options for ratio

match_method()

All ratio methods available

partial_match_method()

Ratio methods available that work on the best matching substring

ratio_calculator()

Ratio calculator-like signature

Functions

best_score_ratio(left, right, partial \\ false, options \\ [])

Calculates the ratio between the strings using various methods, returning the best score and algorithm

partial_ratio(left, right, options \\ [])

Calculates the partial ratio between two strings, that is the ratio between the best matching m-length substrings

partial_token_set_ratio(left, right, options \\ [])

Like token set ratio, but a partial ratio - instead of a full one - is applied

partial_token_sort_ratio(left, right, options \\ [])

Like token sort ratio, but a partial ratio - instead of a standard one - is applied

process(_, _, _)

Processes a list of strings, finding the best match for a reference string. Not implemented yet

quick_ratio(left, right, options \\ [])

Like standard ratio, but ignores any non-alphanumeric character

ratio(left, right, options \\ [])

Calculates the standard ratio between two strings as a percentage. It delegates the calculation to the chosen measure and standardizes the produced output

token_set_ratio(left, right, options \\ [])

Calculates the token set ratio between two strings, that is the ratio calculated after tokenizing each string, splitting the tokens into two sets (one with the fully matching tokens, one with the other tokens), then sorting by set membership and alphabetically

token_sort_ratio(left, right, options \\ [])

Calculates the token sort ratio between two strings, that is the ratio calculated after tokenizing each string and sorting its tokens alphabetically

weighted_ratio(_, _, _)

Weighted ratio. Not implemented yet

Types

full_match_method()

@type full_match_method() :: :standard | :quick | :token_sort | :token_set

Ratio methods available that match the full string

fuzzywuzzy_option()

@type fuzzywuzzy_option() ::
  {:similarity_fn, ratio_calculator()}
  | {:case_sensitive, boolean()}
  | {:precision, non_neg_integer()}

Configurable runtime option types

fuzzywuzzy_options()

@type fuzzywuzzy_options() :: [fuzzywuzzy_option()]

Configurable runtime options for ratio
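
A minimal sketch of how an options list of this type could be built and passed. The option names come from the `fuzzywuzzy_option()` type above; the values chosen here are arbitrary, and the call itself is guarded so the snippet also runs where the library is not loaded.

```elixir
# Keyword syntax is shorthand for the list of two-element tuples
# described by fuzzywuzzy_options().
options = [case_sensitive: false, precision: 1]

# Both spellings denote the same list.
IO.inspect(options == [{:case_sensitive, false}, {:precision, 1}])

# Only call the library when it is actually available in this session.
if Code.ensure_loaded?(ExFuzzywuzzy) do
  IO.inspect(ExFuzzywuzzy.ratio("This Is A Test", "this is a test!", options))
end
```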

match_method()

@type match_method() :: full_match_method() | partial_match_method()

All ratio methods available

partial_match_method()

@type partial_match_method() :: :partial | :partial_token_sort | :partial_token_set

Ratio methods available that work on the best matching substring

ratio_calculator()

@type ratio_calculator() :: (String.t(), String.t() -> float())

Ratio calculator-like signature
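
Any two-argument function that takes strings and returns a float fits this signature. The sketch below wraps `String.jaro_distance/2` from the Elixir standard library as a stand-in. `MySimilarity` is a hypothetical module name, and the value range the library expects from a custom function is not stated here, so treat the 0.0 to 1.0 range as an assumption.

```elixir
defmodule MySimilarity do
  # Hypothetical custom measure with the ratio_calculator() shape:
  # (String.t(), String.t() -> float()).
  # Assumption: a score between 0.0 and 1.0 is acceptable to the library.
  @spec jaro(String.t(), String.t()) :: float()
  def jaro(left, right), do: String.jaro_distance(left, right)
end

IO.inspect(MySimilarity.jaro("fuzzy", "wuzzy"))

# Guarded so the snippet runs even when the library is not loaded.
if Code.ensure_loaded?(ExFuzzywuzzy) do
  IO.inspect(ExFuzzywuzzy.ratio("fuzzy", "wuzzy", similarity_fn: &MySimilarity.jaro/2))
end
```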

Functions

best_score_ratio(left, right, partial \\ false, options \\ [])

@spec best_score_ratio(String.t(), String.t(), boolean(), fuzzywuzzy_options()) ::
  {match_method(), float()}

Calculates the ratio between the strings using various methods, returning the best score and algorithm
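
The selection step can be pictured as taking the maximum over per-method scores. The sketch below is illustrative only: `BestScoreSketch` is a hypothetical module, the scores are the ones documented on this page for `ratio/3` and `quick_ratio/3` on the same pair of strings, and how the library breaks ties is not covered.

```elixir
defmodule BestScoreSketch do
  # Picks the {method, score} pair with the highest score.
  # Enum.max_by/2 keeps the first entry on ties; that tie rule is a
  # property of this sketch, not a documented library behavior.
  def pick(scores) do
    Enum.max_by(scores, fn {_method, score} -> score end)
  end
end

IO.inspect(BestScoreSketch.pick(standard: 96.55, quick: 100.0))
```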

partial_ratio(left, right, options \\ [])

@spec partial_ratio(String.t(), String.t(), fuzzywuzzy_options()) :: float()

Calculates the partial ratio between two strings, that is the ratio between the best matching m-length substrings

iex> partial_ratio("this is a test", "this is a test!")
100.0

iex> partial_ratio("yankees", "new york yankees")
100.0
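
One way to picture "best matching m-length substrings" is a sliding window over the longer string. The sketch below is an illustration under assumptions, not the library's implementation: it takes m to be the length of the shorter string and scores each window with `String.jaro_distance/2` from the standard library. `PartialRatioSketch` is a hypothetical module name.

```elixir
defmodule PartialRatioSketch do
  # Returns the window of the longer string that is most similar
  # to the shorter string.
  def best_window(left, right) do
    {shorter, longer} =
      if String.length(left) <= String.length(right),
        do: {left, right},
        else: {right, left}

    m = String.length(shorter)

    if m == 0 do
      ""
    else
      longer
      |> String.graphemes()
      # Every contiguous run of m graphemes, dropping short tails.
      |> Enum.chunk_every(m, 1, :discard)
      |> Enum.map(&Enum.join/1)
      |> Enum.max_by(&String.jaro_distance(shorter, &1))
    end
  end
end

IO.inspect(PartialRatioSketch.best_window("yankees", "new york yankees"))
```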

partial_token_set_ratio(left, right, options \\ [])

@spec partial_token_set_ratio(String.t(), String.t(), fuzzywuzzy_options()) :: float()

Like token set ratio, but a partial ratio - instead of a full one - is applied

iex> partial_token_set_ratio("grizzly was a bear", "a grizzly inside a box")
100.0

iex> partial_token_set_ratio("grizzly was a bear", "be what you wear")
43.75

partial_token_sort_ratio(left, right, options \\ [])

@spec partial_token_sort_ratio(String.t(), String.t(), fuzzywuzzy_options()) ::
  float()

Like token sort ratio, but a partial ratio - instead of a standard one - is applied

iex> partial_token_sort_ratio("fuzzy wuzzy was a bear", "wuzzy fuzzy was a bear")
100.0

iex> partial_token_sort_ratio("fuzzy was a bear", "fuzzy fuzzy was a bear")
81.25

process(_, _, _)

@spec process(String.t(), [String.t()], fuzzywuzzy_options()) :: String.t()

Processes a list of strings, finding the best match for a reference string. Not implemented yet

quick_ratio(left, right, options \\ [])

@spec quick_ratio(String.t(), String.t(), fuzzywuzzy_options()) :: float()

Like standard ratio, but ignores any non-alphanumeric character

iex> quick_ratio("this is a test", "this is a test!")
100.0
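
A rough sketch of the preprocessing this description implies. `QuickRatioSketch` is a hypothetical module, and keeping whitespace (so that word boundaries survive) is an assumption of this sketch; the library may treat whitespace differently.

```elixir
defmodule QuickRatioSketch do
  # Drops every character that is neither alphanumeric nor whitespace.
  def strip(string) do
    String.replace(string, ~r/[^[:alnum:]\s]/u, "")
  end
end

# After stripping, the two inputs from the example above are identical.
IO.inspect(QuickRatioSketch.strip("this is a test!") == QuickRatioSketch.strip("this is a test"))
```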

ratio(left, right, options \\ [])

@spec ratio(String.t(), String.t(), fuzzywuzzy_options()) :: float()

Calculates the standard ratio between two strings as a percentage. It delegates the calculation to the chosen measure and standardizes the produced output

iex> ratio("this is a test", "this is a test!")
96.55
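
The standardization step could look roughly like the following, assuming the measure returns a similarity between 0.0 and 1.0 and that the `:precision` option is the number of decimal places. Both are assumptions of this sketch, and `RatioSketch` is a hypothetical module.

```elixir
defmodule RatioSketch do
  # Turns a raw similarity (assumed to be in 0.0..1.0) into a percentage
  # rounded to the requested number of decimal places.
  def standardize(similarity, precision) do
    Float.round(similarity * 100, precision)
  end
end

IO.inspect(RatioSketch.standardize(0.87654, 2))
IO.inspect(RatioSketch.standardize(0.87654, 0))
```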

token_set_ratio(left, right, options \\ [])

@spec token_set_ratio(String.t(), String.t(), fuzzywuzzy_options()) :: float()

Calculates the token set ratio between two strings, that is the ratio calculated after tokenizing each string, splitting the tokens into two sets (one with the fully matching tokens, one with the other tokens), then sorting by set membership and alphabetically

iex> token_set_ratio("fuzzy was a bear", "fuzzy fuzzy was a bear")
100.0

iex> token_set_ratio("fuzzy was a bear", "muzzy wuzzy was a bear")
78.95
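
The rearrangement described above can be sketched with `MapSet`. This is an illustration under assumptions, not the library's code: tokens are split on whitespace, duplicates collapse because sets are used, and the shared tokens are placed first. `TokenSetSketch` is a hypothetical module.

```elixir
defmodule TokenSetSketch do
  # Rebuilds both strings as "shared tokens, then remaining tokens",
  # each group sorted alphabetically.
  def arrange(left, right) do
    left_tokens = left |> String.split() |> MapSet.new()
    right_tokens = right |> String.split() |> MapSet.new()
    common = MapSet.intersection(left_tokens, right_tokens)

    {
      join(common, MapSet.difference(left_tokens, common)),
      join(common, MapSet.difference(right_tokens, common))
    }
  end

  defp join(common, others) do
    (Enum.sort(common) ++ Enum.sort(others)) |> Enum.join(" ")
  end
end

IO.inspect(TokenSetSketch.arrange("fuzzy was a bear", "fuzzy fuzzy was a bear"))
IO.inspect(TokenSetSketch.arrange("fuzzy was a bear", "muzzy wuzzy was a bear"))
```

In the first call both rearranged strings come out identical, which is consistent with the 100.0 score in the example above.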

token_sort_ratio(left, right, options \\ [])

@spec token_sort_ratio(String.t(), String.t(), fuzzywuzzy_options()) :: float()

Calculates the token sort ratio between two strings, that is the ratio calculated after tokenizing each string and sorting its tokens alphabetically

iex> token_sort_ratio("fuzzy wuzzy was a bear", "wuzzy fuzzy was a bear")
100.0

iex> token_sort_ratio("fuzzy muzzy was a bear", "wuzzy fuzzy was a bear")
77.27
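
The tokenize-and-sort step can be sketched in a few lines of standard-library Elixir. `TokenSortSketch` is a hypothetical module, and splitting on whitespace is an assumption about how tokens are delimited.

```elixir
defmodule TokenSortSketch do
  # Splits on whitespace, sorts the tokens alphabetically, rejoins them.
  def normalize(string) do
    string
    |> String.split()
    |> Enum.sort()
    |> Enum.join(" ")
  end
end

left = TokenSortSketch.normalize("fuzzy wuzzy was a bear")
right = TokenSortSketch.normalize("wuzzy fuzzy was a bear")

# Word order no longer matters once both strings are normalized.
IO.inspect(left == right)
```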

weighted_ratio(_, _, _)

@spec weighted_ratio(String.t(), String.t(), fuzzywuzzy_options()) :: float()

Weighted ratio. Not implemented yet