Dmp.Diff (diff_match_patch v0.2.0)

Compare two blocks of plain text and efficiently return a list of differences.

Summary

Types

difflist()

A list of diff operations, representing the difference between two text versions.

expiry()

first_pass_acc()

half_match_result()

The result of a successful Diff.half_match/3 call.

op()

A diff's operation type. The operation :nil is used internally to indicate a nil value for the diff.

options()

t()

The diff tuple, consisting of two elements: the operation and the associated text.

Functions

bisect(text1, text2, deadline)

Find the "middle snake" of a diff, split the problem in two and return the recursively constructed diff.

bisect_split(text1, text2, x, y, deadline)

Given the location of the "middle snake", split the diff in two parts and recurse.

chars_to_lines(diffs, line_array)

Rehydrate the text in a diff from a string of line hashes to real lines of text.

cleanup_efficiency(diffs, diff_edit_cost)

Reduce the number of edits in a diff by eliminating operationally trivial equalities.

cleanup_merge(diffs)

Reorder and merge like edit sections in a diff, merging equalities.

cleanup_semantic(diffs)

Reduce the number of edits in a diff by eliminating semantically trivial equalities.

cleanup_semantic_lossless(diffs)

Look for single edits in a diff that are surrounded on both sides by equalities which can be shifted sideways to align the edit to a word boundary.

combine_previous_inequalities(diffs, text, count_delete, count_insert, text_delete, text_insert)

common_overlap(text1, text2)

Determine if the suffix of one string is the prefix of another.

common_prefix(text1, text2)

Determine the common prefix of two strings.

common_suffix(text1, text2)

Determine the common suffix of two strings.

compute(text1, text2, check_lines, deadline)

Find the differences between two texts.

factor_out_prefixes(diffs, text_delete, text_insert)

factor_out_suffixes(diffs, text, text_delete, text_insert)

from_delta(text1, delta)

Given the original text1, and an encoded string which describes the operations required to transform text1 into text2, compute the full diff.

half_match(text1, text2, deadline)

Do the two texts share a substring which is at least half the length of the longer text?

levenshtein(diffs)

Compute the Levenshtein distance of a diff--the number of inserted, deleted or substituted characters.

line_mode(text1, text2, deadline)

Do a quick line-level diff on both strings, then rediff the parts for greater accuracy.

lines_to_chars(text1, text2)

Split two texts into a list of strings.

main(text1, text2, check_lines \\ true, opts \\ [])

Find the differences between two texts.

main_(text1, text2, check_lines, opts)

Skips validation of options. Used internally by Patch.apply.

pretty_html(diffs)

Generate a pretty HTML report from a difflist.

semantic_score(one, two)

Given two strings, compute a score representing whether the internal boundary falls on logical boundaries.

sorted_half_match(hm, arg2)

text1(diffs)

Compute and return the source text of a diff (all equalities and deletions).

text2(diffs)

Compute and return the destination text of a diff (all equalities and insertions).

to_delta(diffs)

Crush a diff into an encoded string which describes the operations required to transform text1 into text2.

undiff(arg1)

Returns the diff tuple, or a "nil" pseudo-diff (with op :nil and empty text).

x_index(diffs, loc)

Given loc, a location in text1, compute and return the equivalent location in text2.

Types

difflist()

@type difflist() :: [t()]

A list of diff operations, representing the difference between two text versions.

A "difflist" is an Elixir list of "diff" tuples. Here is an example difflist:

[{:delete, "Hello"}, {:insert, "Goodbye"}, {:equal, " world."}]

which means: delete "Hello", add "Goodbye" and keep " world."

expiry()

@type expiry() :: :never | non_neg_integer()

first_pass_acc()

@type first_pass_acc() ::
  {non_neg_integer(), non_neg_integer(), String.t(), String.t()}

half_match_result()

@type half_match_result() ::
  {String.t(), String.t(), String.t(), String.t(), String.t()}

The result of a successful Diff.half_match/3 call.

A tuple of five strings:

the prefix of text1
the suffix of text1
the prefix of text2
the suffix of text2
the common middle
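
As a minimal sketch of how the five strings fit together, the hypothetical helper below (not part of Dmp.Diff) rebuilds both texts from such a tuple. It assumes the common middle sits between each text's prefix and suffix.

```elixir
defmodule HalfMatchSketch do
  # Hypothetical helper for illustration only.
  # Tuple layout: {text1 prefix, text1 suffix, text2 prefix, text2 suffix, common middle}.
  def rebuild({prefix1, suffix1, prefix2, suffix2, common}) do
    {prefix1 <> common <> suffix1, prefix2 <> common <> suffix2}
  end
end

# "345678" is shared by both texts and is more than half of the longer one.
HalfMatchSketch.rebuild({"12", "90", "a", "z", "345678"})
# => {"1234567890", "a345678z"}
```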

op()

@type op() :: :delete | :insert | :equal | nil

A diff's operation type. The operation :nil is used internally to indicate a nil value for the diff.

options()

@type options() :: Dmp.Options.t()

t()

@type t() :: {op(), String.t()}

The diff tuple, consisting of two elements: the operation and the associated text.

Functions

bisect(text1, text2, deadline)

@spec bisect(String.t(), String.t(), expiry()) :: difflist()

Find the "middle snake" of a diff, split the problem in two and return the recursively constructed diff.

See: An O(ND) Difference Algorithm and Its Variations (Myers, 1986)

text1 - Old string to be diffed.
text2 - New string to be diffed.
deadline - Unix timestamp (in milliseconds) at which to bail if not yet complete.

Returns a difflist.

bisect_split(text1, text2, x, y, deadline)

@spec bisect_split(
  String.t(),
  String.t(),
  non_neg_integer(),
  non_neg_integer(),
  non_neg_integer()
) :: difflist()

Given the location of the "middle snake", split the diff in two parts and recurse.

text1 - Old string to be diffed.
text2 - New string to be diffed.
x - Index of split point in text1.
y - Index of split point in text2.
deadline - Unix timestamp (in milliseconds) at which to bail if not yet complete.

Returns a difflist.

chars_to_lines(diffs, line_array)

@spec chars_to_lines(difflist(), [String.t()]) :: difflist()

Rehydrate the text in a diff from a string of line hashes to real lines of text.

diffs - A difflist.
line_array - A list of unique strings.

Returns the rehydrated difflist.

cleanup_efficiency(diffs, diff_edit_cost)

@spec cleanup_efficiency(difflist(), non_neg_integer()) :: difflist()

Reduce the number of edits in a diff by eliminating operationally trivial equalities.

diffs - A difflist.
diff_edit_cost - Cost of an empty edit operation in terms of edit characters.

Returns the updated difflist.

cleanup_merge(diffs)

@spec cleanup_merge(difflist()) :: difflist()

Reorder and merge like edit sections in a diff, merging equalities.

Any edit section can move as long as it doesn't cross an equality.

Returns the updated difflist.
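
The sketch below illustrates only the simplest part of this idea: joining neighbouring tuples that share an operation and dropping empty ones. MergeSketch and merge_adjacent/1 are hypothetical names. The sketch does not reorder edits or factor out common prefixes and suffixes, so it should not be read as the behaviour of cleanup_merge/1 itself.

```elixir
defmodule MergeSketch do
  # Hypothetical, simplified illustration; not the Dmp.Diff implementation.
  def merge_adjacent(diffs) do
    diffs
    # Empty tuples carry no information, so drop them first.
    |> Enum.reject(fn {_op, text} -> text == "" end)
    # Group consecutive tuples that share the same operation.
    |> Enum.chunk_by(fn {op, _text} -> op end)
    # Collapse each group into a single tuple.
    |> Enum.map(fn [{op, _text} | _rest] = chunk ->
      {op, Enum.map_join(chunk, fn {_op, text} -> text end)}
    end)
  end
end

MergeSketch.merge_adjacent([{:delete, "a"}, {:delete, "b"}, {:insert, "x"}, {:equal, ""}, {:insert, "y"}])
# => [{:delete, "ab"}, {:insert, "xy"}]
```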

cleanup_semantic(diffs)

@spec cleanup_semantic(difflist()) :: difflist()

Reduce the number of edits in a diff by eliminating semantically trivial equalities.

Returns the updated difflist.

cleanup_semantic_lossless(diffs)

@spec cleanup_semantic_lossless(difflist()) :: difflist()

Look for single edits in a diff that are surrounded on both sides by equalities which can be shifted sideways to align the edit to a word boundary.

Example: The c<ins>at c</ins>ame. becomes The <ins>cat </ins>came.

Returns the updated difflist.

combine_previous_inequalities(diffs, text, count_delete, count_insert, text_delete, text_insert)

common_overlap(text1, text2)

@spec common_overlap(String.t(), String.t()) :: non_neg_integer()

Determine if the suffix of one string is the prefix of another.

text1 - First string.
text2 - Second string.

Returns the number of characters common to the end of the first string and the start of the second string.
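
A naive way to compute this value is to try the longest candidate first and shrink until the end of the first string matches the start of the second. The module below is a hypothetical sketch of that approach (it assumes Elixir 1.12 or later for the stepped range) and makes no claim about how Dmp.Diff does it.

```elixir
defmodule OverlapSketch do
  # Hypothetical, naive illustration; not the Dmp.Diff implementation.
  def common_overlap(text1, text2) do
    max_len = min(String.length(text1), String.length(text2))

    # Walk from the longest possible overlap down to 1; default to 0.
    Enum.find(max_len..1//-1, 0, fn n ->
      String.ends_with?(text1, String.slice(text2, 0, n))
    end)
  end
end

OverlapSketch.common_overlap("123456xxx", "xxxabcd")
# => 3
```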

common_prefix(text1, text2)

@spec common_prefix(String.t(), String.t()) :: {String.t(), String.t(), String.t()}

Determine the common prefix of two strings.

text1 - First string.
text2 - Second string.

Returns a tuple {prefix, rest1, rest2}, where

prefix - The common prefix.
rest1 - text1 with the prefix removed.
rest2 - text2 with the prefix removed.
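
To illustrate the shape of the returned tuple, here is a minimal sketch that walks both strings grapheme by grapheme. PrefixSketch is a hypothetical module written for this example and may differ from the library's approach.

```elixir
defmodule PrefixSketch do
  # Hypothetical illustration; not the Dmp.Diff implementation.
  def common_prefix(text1, text2) do
    split(String.graphemes(text1), String.graphemes(text2), [])
  end

  # Both lists start with the same grapheme: move it to the prefix.
  defp split([g | rest1], [g | rest2], acc), do: split(rest1, rest2, [g | acc])

  # First mismatch, or one list is exhausted: build the result tuple.
  defp split(rest1, rest2, acc) do
    {acc |> Enum.reverse() |> Enum.join(), Enum.join(rest1), Enum.join(rest2)}
  end
end

PrefixSketch.common_prefix("1234abcdef", "1234xyz")
# => {"1234", "abcdef", "xyz"}
```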

common_suffix(text1, text2)

@spec common_suffix(String.t(), String.t()) :: {String.t(), String.t(), String.t()}

Determine the common suffix of two strings.

text1 - First string.
text2 - Second string.

Returns a tuple {suffix, rest1, rest2}, where

suffix - The common suffix.
rest1 - text1 with the suffix removed.
rest2 - text2 with the suffix removed.
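
One way to picture the suffix case is to reverse both strings and reuse a prefix scan. The hypothetical SuffixSketch module below does exactly that. It is a sketch under that assumption, not the library's code.

```elixir
defmodule SuffixSketch do
  # Hypothetical illustration; not the Dmp.Diff implementation.
  def common_suffix(text1, text2) do
    reversed1 = text1 |> String.graphemes() |> Enum.reverse()
    reversed2 = text2 |> String.graphemes() |> Enum.reverse()
    split(reversed1, reversed2, [])
  end

  # Matching graphemes are prepended, so acc ends up in forward order.
  defp split([g | rest1], [g | rest2], acc), do: split(rest1, rest2, [g | acc])

  # The remainders are still reversed and must be flipped back.
  defp split(rest1, rest2, acc) do
    {Enum.join(acc), rest1 |> Enum.reverse() |> Enum.join(), rest2 |> Enum.reverse() |> Enum.join()}
  end
end

SuffixSketch.common_suffix("abcdef1234", "xyz1234")
# => {"1234", "abcdef", "xyz"}
```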

compute(text1, text2, check_lines, deadline)

@spec compute(String.t(), String.t(), boolean(), expiry()) :: difflist()

Find the differences between two texts.

text1 - Old string to be diffed.
text2 - New string to be diffed.
check_lines - Speedup flag. If false, then don't run a line-level diff first to identify the changed areas. If true, then run a faster, slightly less optimal diff.
deadline - Unix timestamp in milliseconds when the diff should be complete by.

Assumes that the texts do not have any common prefix or suffix.

Returns a difflist.

factor_out_prefixes(diffs, text_delete, text_insert)

factor_out_suffixes(diffs, text, text_delete, text_insert)

from_delta(text1, delta)

@spec from_delta(String.t(), String.t()) :: nil | difflist()

Given the original text1, and an encoded string which describes the operations required to transform text1 into text2, compute the full diff.

text1 - Source string for the diff.
delta - Encoded delta text.

Returns a difflist.

Raises an ArgumentError if the encoded delta has invalid contents for the given text.
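
The following sketch shows one way such a delta could be replayed against text1, assuming the tab-separated "=N", "-N" and "+text" tokens produced by to_delta/1. FromDeltaSketch is hypothetical, and using URI.decode/1 for the %xx unescaping is an assumption. The real function may validate its input differently.

```elixir
defmodule FromDeltaSketch do
  # Hypothetical illustration; not the Dmp.Diff implementation.
  def from_delta(text1, delta) do
    {diffs, rest} =
      delta
      |> String.split("\t", trim: true)
      |> Enum.reduce({[], text1}, &apply_token/2)

    if rest != "", do: raise(ArgumentError, "delta is shorter than the source text")
    Enum.reverse(diffs)
  end

  # Insertions carry their own text; URI.decode/1 is an assumed unescaper.
  defp apply_token("+" <> encoded, {diffs, rest}) do
    {[{:insert, URI.decode(encoded)} | diffs], rest}
  end

  # Equalities and deletions only carry a count and consume the source text.
  defp apply_token("=" <> count, acc), do: consume(:equal, count, acc)
  defp apply_token("-" <> count, acc), do: consume(:delete, count, acc)

  defp apply_token(token, _acc) do
    raise ArgumentError, "invalid delta token: #{inspect(token)}"
  end

  defp consume(op, count, {diffs, rest}) do
    n = String.to_integer(count)
    {text, rest} = String.split_at(rest, max(n, 0))

    if n < 0 or String.length(text) < n do
      raise ArgumentError, "delta does not fit the source text"
    end

    {[{op, text} | diffs], rest}
  end
end

FromDeltaSketch.from_delta("abcde", "=3\t-2\t+ing")
# => [{:equal, "abc"}, {:delete, "de"}, {:insert, "ing"}]
```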

half_match(text1, text2, deadline)

@spec half_match(String.t(), String.t(), non_neg_integer()) ::
  nil | half_match_result()

Do the two texts share a substring which is at least half the length of the longer text?

text1 - First string.
text2 - Second string.
deadline - Unix timestamp (in milliseconds) at which to bail if not yet complete.

This speedup can produce non-minimal diffs.

Returns a half_match_result 5-tuple, or nil if there was no match. Returns nil if deadline is zero (no time limit specified).

levenshtein(diffs)

@spec levenshtein(difflist()) :: non_neg_integer()

Compute the Levenshtein distance of a diff--the number of inserted, deleted or substituted characters.
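
As a sketch of how this count can be derived from a difflist, the hypothetical module below treats each run of insertions and deletions between equalities as one block and charges the larger of the two lengths, so that an overlapping insert and delete count as substitutions. It assumes only :insert, :delete and :equal operations appear.

```elixir
defmodule LevenshteinSketch do
  # Hypothetical illustration; not the Dmp.Diff implementation.
  def levenshtein(diffs) do
    {total, inserted, deleted} =
      Enum.reduce(diffs, {0, 0, 0}, fn
        {:insert, text}, {total, ins, del} -> {total, ins + String.length(text), del}
        {:delete, text}, {total, ins, del} -> {total, ins, del + String.length(text)}
        # An equality closes the current block of edits.
        {:equal, _text}, {total, ins, del} -> {total + max(ins, del), 0, 0}
      end)

    # Account for a trailing block that was not followed by an equality.
    total + max(inserted, deleted)
  end
end

LevenshteinSketch.levenshtein([{:delete, "abc"}, {:insert, "1234"}, {:equal, "xyz"}])
# => 4
```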

line_mode(text1, text2, deadline)

@spec line_mode(String.t(), String.t(), expiry()) :: difflist()

Do a quick line-level diff on both strings, then rediff the parts for greater accuracy.

text1 - Old string to be diffed.
text2 - New string to be diffed.
deadline - Unix timestamp (in milliseconds) when the diff should be complete by.

This speedup can produce non-minimal diffs.

Returns a difflist.

lines_to_chars(text1, text2)

@spec lines_to_chars(String.t(), String.t()) :: {String.t(), String.t(), [String.t()]}

Split two texts into a list of strings.

Reduce the texts to a string of hashes where each Unicode character represents one line.

text1 - First string.
text2 - Second string.

Returns a tuple containing the encoded text1, the encoded text2 and the list of unique strings. The zeroth element of the list of unique strings is intentionally blank.
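
The encoding can be pictured with the following sketch, which gives each distinct line an index and emits that index as a single code point. LineEncodingSketch and its decode/2 helper are hypothetical. The sketch ignores the code point ranges that are not valid in a string, so it is only suitable for small inputs.

```elixir
defmodule LineEncodingSketch do
  # Hypothetical illustration; not the Dmp.Diff implementation.
  def lines_to_chars(text1, text2) do
    # Index 0 is reserved for a blank entry, so real lines start at 1.
    {chars1, lines, index} = encode(text1, [""], %{})
    {chars2, lines, _index} = encode(text2, lines, index)
    {chars1, chars2, Enum.reverse(lines)}
  end

  # Map an encoded string back to text using the list of unique lines.
  def decode(chars, lines) do
    chars
    |> String.to_charlist()
    |> Enum.map_join(&Enum.at(lines, &1))
  end

  # `lines` is kept newest-first while encoding; `index` maps line => code.
  defp encode(text, lines, index) do
    {codes, lines, index} =
      text
      |> split_lines()
      |> Enum.reduce({[], lines, index}, fn line, {codes, lines, index} ->
        case Map.fetch(index, line) do
          {:ok, code} ->
            {[code | codes], lines, index}

          :error ->
            code = length(lines)
            {[code | codes], [line | lines], Map.put(index, line, code)}
        end
      end)

    {codes |> Enum.reverse() |> List.to_string(), lines, index}
  end

  # Split after each "\n", keeping the newline attached to its line.
  defp split_lines(text) do
    ~r/[^\n]*\n|[^\n]+/ |> Regex.scan(text) |> List.flatten()
  end
end

{chars1, _chars2, lines} =
  LineEncodingSketch.lines_to_chars("alpha\nbeta\nalpha\n", "beta\nalpha\nbeta\n")

# chars1 is <<1, 2, 1>> and lines is ["", "alpha\n", "beta\n"].
LineEncodingSketch.decode(chars1, lines)
# => "alpha\nbeta\nalpha\n"
```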

main(text1, text2, check_lines \\ true, opts \\ [])

@spec main(String.t(), String.t(), boolean(), options()) :: difflist()

Find the differences between two texts.

text1 - Old string to be diffed.
text2 - New string to be diffed.
check_lines - Speedup flag. If false, then don't run a line-level diff first to identify the changed areas. If true, then run a faster, slightly less optimal diff.
opts - An options keyword list, [] to use the default options.

Most of the time check_lines is wanted, so it defaults to true.

Returns a difflist.

main_(text1, text2, check_lines, opts)

@spec main_(String.t(), String.t(), boolean(), options()) :: difflist()

Skips validation of options. Used internally by Patch.apply.

pretty_html(diffs)

@spec pretty_html(difflist()) :: String.t()

Generate a pretty HTML report from a difflist.

semantic_score(one, two)

@spec semantic_score(String.t(), String.t()) :: non_neg_integer()

Given two strings, compute a score representing whether the internal boundary falls on logical boundaries.

Scores range from 6 (best) to 0 (worst).

one - First string.
two - Second string.

Scores are:

6 if one or two is an empty string.
5 if a blank line ends in one or a blank line starts in two.
4 if one ends, or two starts, with a newline.
3 if one ends in punctuation and two starts with white space.
2 if one ends, or two starts, with white space.
1 if one ends, or two starts, with a non-alphanumeric.
0 otherwise

Examples

iex> Diff.semantic_score("two is empty string", "")
6

iex> Diff.semantic_score("one ends in blank line\n\n", "two")
5

iex> Diff.semantic_score("one ends in new line\n", "two")
4

iex> Diff.semantic_score("one sentence.", " space before two")
3

iex> Diff.semantic_score("one sentence.", "no space before two")
1

iex> Diff.semantic_score("one ends with white space ", "two")
2

iex> Diff.semantic_score("one ends in 'punctuation'", "two")
1

iex> Diff.semantic_score("one ends in middle of word", "two")
0

sorted_half_match(hm, arg2)

text1(diffs)

@spec text1(difflist()) :: String.t()

Compute and return the source text of a diff (all equalities and deletions).

text2(diffs)

@spec text2(difflist()) :: String.t()

Compute and return the destination text of a diff (all equalities and insertions).
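
The relationship between a difflist and its two texts can be sketched as follows. DiffTextSketch is a hypothetical module: the source side skips insertions and the destination side skips deletions.

```elixir
defmodule DiffTextSketch do
  # Hypothetical illustration; not the Dmp.Diff implementation.
  def source(diffs), do: collect(diffs, :insert)
  def destination(diffs), do: collect(diffs, :delete)

  # Join the text of every tuple whose operation is not `skip`.
  defp collect(diffs, skip) do
    diffs
    |> Enum.reject(fn {op, _text} -> op == skip end)
    |> Enum.map_join(fn {_op, text} -> text end)
  end
end

diffs = [{:delete, "Hello"}, {:insert, "Goodbye"}, {:equal, " world."}]
DiffTextSketch.source(diffs)
# => "Hello world."
DiffTextSketch.destination(diffs)
# => "Goodbye world."
```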

to_delta(diffs)

@spec to_delta(difflist()) :: String.t()

Crush a diff into an encoded string which describes the operations required to transform text1 into text2.

For example, "=3\t-2\t+ing" means keep 3 chars, delete 2 chars, insert "ing".

Operations are tab-separated. Inserted text is escaped using %xx notation.
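
A minimal sketch of this encoding is shown below. DeltaSketch is a hypothetical module, and URI.encode/1 is used as a stand-in for the %xx escaping. The set of characters the library leaves unescaped may differ.

```elixir
defmodule DeltaSketch do
  # Hypothetical illustration; not the Dmp.Diff implementation.
  def to_delta(diffs) do
    Enum.map_join(diffs, "\t", fn
      # Equalities and deletions only record how many characters they span.
      {:equal, text} -> "=#{String.length(text)}"
      {:delete, text} -> "-#{String.length(text)}"
      # Insertions must carry their text; URI.encode/1 is an assumed escaper.
      {:insert, text} -> "+" <> URI.encode(text)
    end)
  end
end

DeltaSketch.to_delta([{:equal, "abc"}, {:delete, "de"}, {:insert, "ing"}])
# => "=3\t-2\t+ing"
```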

Examples

iex> [{:equal, "abc"}, {:delete, "de"}, {:insert, "ing"}] |> Diff.to_delta()
"=3\t-2\t+ing"

undiff(arg1)

@spec undiff(nil | t()) :: t()

Returns the diff tuple, or a "nil" pseudo-diff (with op :nil and empty text).

x_index(diffs, loc)

@spec x_index(difflist(), non_neg_integer()) :: non_neg_integer()

Given loc, a location in text1, compute and return the equivalent location in text2.

diffs - a difflist.
loc - Location within text1.

Returns location within text2.
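
The translation can be sketched by walking the difflist while tracking how many characters of each text have been consumed, and stopping at the first tuple that passes loc. XIndexSketch below is a hypothetical module written for illustration. It assumes that a location falling inside a deletion maps to the point in text2 where that deletion starts.

```elixir
defmodule XIndexSketch do
  # Hypothetical illustration; not the Dmp.Diff implementation.
  def x_index(diffs, loc) do
    result =
      Enum.reduce_while(diffs, {0, 0}, fn {op, text}, {chars1, chars2} ->
        len = String.length(text)
        # Insertions do not advance text1; deletions do not advance text2.
        next1 = if op == :insert, do: chars1, else: chars1 + len
        next2 = if op == :delete, do: chars2, else: chars2 + len

        cond do
          next1 <= loc -> {:cont, {next1, next2}}
          # loc falls inside a deletion: use where the deletion started.
          op == :delete -> {:halt, chars2}
          # loc falls inside an equality: keep the same offset into it.
          true -> {:halt, chars2 + (loc - chars1)}
        end
      end)

    case result do
      # The whole diff was consumed without passing loc.
      {chars1, chars2} -> chars2 + (loc - chars1)
      index -> index
    end
  end
end

diffs = [{:equal, "The "}, {:insert, "big "}, {:equal, "cat"}]
XIndexSketch.x_index(diffs, 4)
# => 8
```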

Examples

iex> Diff.main("The cat", "The big cat") |> Diff.x_index(1)
1

iex> Diff.main("The cat", "The big cat") |> Diff.x_index(4)
8
