Evision.DNN.TextRecognitionModel (Evision v0.1.21)

Summary

Types

t()

Type that represents an Evision.DNN.TextRecognitionModel struct.

Functions

Get the decoding method.

Get the vocabulary for recognition.

Given the input frame, create an input blob, run the net, and return the output blobs.

Given the input frame, create an input blob, run the net, and return the output blobs.

Given the input frame, create an input blob, run the net, and return the recognition result.

Given the input frame, create an input blob, run the net, and return the recognition result.

Set the decoding method options for the "CTC-prefix-beam-search" decoder.

Set the decoding method options for the "CTC-prefix-beam-search" decoder.

Set the decoding method for translating the network output into a string.

Set the crop flag for the frame.

Set the mean value for the frame.

Set preprocessing parameters for the frame.

Set preprocessing parameters for the frame.

Set the scalefactor value for the frame.

Set the input size for the frame.

Set the swapRB flag for the frame.

Set the vocabulary for recognition.

Variant 1:

Create a text recognition model from a network represented in one of the supported formats. Call setDecodeType() and setVocabulary() after construction to initialize the decoding method.

Create a text recognition model from a network represented in one of the supported formats. Call setDecodeType() and setVocabulary() after construction to initialize the decoding method.

Types

@type t() :: %Evision.DNN.TextRecognitionModel{ref: reference()}

Type that represents an Evision.DNN.TextRecognitionModel struct.

  • ref: reference()

    The underlying Erlang resource variable.

Functions

@spec getDecodeType(t()) :: binary() | {:error, String.t()}

Get the decoding method.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()
Return

The decoding method.

Python prototype (for reference only):

getDecodeType() -> retval
@spec getVocabulary(t()) :: [binary()] | {:error, String.t()}

Get the vocabulary for recognition.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()
Return
  • retval: std::vector<std::string>

The associated vocabulary.

Python prototype (for reference only):

getVocabulary() -> retval
@spec predict(t(), Evision.Mat.maybe_mat_in()) ::
  [Evision.Mat.t()] | {:error, String.t()}

Given the input frame, create an input blob, run the net, and return the output blobs.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()
  • frame: Evision.Mat
Return
  • outs: [Evision.Mat].

    Allocated output blobs, which will store results of the computation.

Python prototype (for reference only):

predict(frame[, outs]) -> outs
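If you need the raw network output (for example, to feed a custom decoder), predict/2 returns the output blobs directly. A minimal sketch, assuming `model` is an already-initialized Evision.DNN.TextRecognitionModel and `frame` a loaded image:

```elixir
# `outs` is a list of Evision.Mat output blobs; for CTC-style
# recognizers the first blob holds the per-timestep class scores.
outs = Evision.DNN.TextRecognitionModel.predict(model, frame)
[scores | _] = outs
```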
predict(self, frame, opts)
@spec predict(t(), Evision.Mat.maybe_mat_in(), [{atom(), term()}, ...] | nil) ::
  [Evision.Mat.t()] | {:error, String.t()}

Given the input frame, create an input blob, run the net, and return the output blobs.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()
  • frame: Evision.Mat
Return
  • outs: [Evision.Mat].

    Allocated output blobs, which will store results of the computation.

Python prototype (for reference only):

predict(frame[, outs]) -> outs
@spec recognize(t(), Evision.Mat.maybe_mat_in()) :: binary() | {:error, String.t()}

Given the input frame, create an input blob, run the net, and return the recognition result.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()

  • frame: Evision.Mat.

    The input image

Return

The text recognition result.

Python prototype (for reference only):

recognize(frame) -> retval
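Assuming `model` has already been constructed and initialized via setDecodeType/2 and setVocabulary/2, recognizing a single cropped word image is one call (the image path is a placeholder):

```elixir
frame = Evision.imread("word.png")
text = Evision.DNN.TextRecognitionModel.recognize(model, frame)
# `text` is a binary, or {:error, reason} on failure
```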
recognize(self, frame, roiRects)
@spec recognize(t(), Evision.Mat.maybe_mat_in(), [Evision.Mat.maybe_mat_in()]) ::
  [binary()] | {:error, String.t()}

Given the input frame, create an input blob, run the net, and return the recognition result.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()

  • frame: Evision.Mat.

    The input image

  • roiRects: [Evision.Mat].

List of text detection regions of interest (cv::Rect, CV_32SC4). The ROIs are cropped and used as the network inputs.

Return
  • results: [string].

    A set of text recognition results.

Python prototype (for reference only):

recognize(frame, roiRects) -> results
setDecodeOptsCTCPrefixBeamSearch(self, beamSize)
@spec setDecodeOptsCTCPrefixBeamSearch(t(), integer()) :: t() | {:error, String.t()}

Set the decoding method options for the "CTC-prefix-beam-search" decoder.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()

  • beamSize: int.

    Beam size for search

Keyword Arguments
  • vocPruneSize: int.

Parameter to optimize large-vocabulary search: only the top vocPruneSize tokens are kept at each search step; vocPruneSize <= 0 disables this pruning.

Return

Python prototype (for reference only):

setDecodeOptsCTCPrefixBeamSearch(beamSize[, vocPruneSize]) -> retval
setDecodeOptsCTCPrefixBeamSearch(self, beamSize, opts)
@spec setDecodeOptsCTCPrefixBeamSearch(t(), integer(), [{atom(), term()}, ...] | nil) ::
  t() | {:error, String.t()}

Set the decoding method options for the "CTC-prefix-beam-search" decoder.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()

  • beamSize: int.

    Beam size for search

Keyword Arguments
  • vocPruneSize: int.

Parameter to optimize large-vocabulary search: only the top vocPruneSize tokens are kept at each search step; vocPruneSize <= 0 disables this pruning.

Return

Python prototype (for reference only):

setDecodeOptsCTCPrefixBeamSearch(beamSize[, vocPruneSize]) -> retval
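When the prefix-beam-search decoder is selected, its options are typically tuned immediately afterwards. A sketch with illustrative values (not recommendations):

```elixir
model =
  model
  |> Evision.DNN.TextRecognitionModel.setDecodeType("CTC-prefix-beam-search")
  |> Evision.DNN.TextRecognitionModel.setDecodeOptsCTCPrefixBeamSearch(10, vocPruneSize: 0)
# beamSize: 10; vocPruneSize: 0 disables vocabulary pruning
```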
setDecodeType(self, decodeType)
@spec setDecodeType(t(), binary()) :: t() | {:error, String.t()}

Set the decoding method for translating the network output into a string.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()
  • decodeType: string.

    The decoding method for translating the network output into a string. Currently supported types:
    • "CTC-greedy": greedy decoding for the output of CTC-based methods
    • "CTC-prefix-beam-search": prefix beam search decoding for the output of CTC-based methods
Return

Python prototype (for reference only):

setDecodeType(decodeType) -> retval
setInputCrop(self, crop)
@spec setInputCrop(t(), boolean()) :: Evision.DNN.Model.t() | {:error, String.t()}

Set the crop flag for the frame.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()

  • crop: bool.

Flag indicating whether the image will be cropped after resizing.

Return

Python prototype (for reference only):

setInputCrop(crop) -> retval
setInputMean(self, mean)
@spec setInputMean(
  t(),
  {number()}
  | {number(), number()}
  | {number(), number(), number()}
  | {number(), number(), number(), number()}
) :: Evision.DNN.Model.t() | {:error, String.t()}

Set the mean value for the frame.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()

  • mean: Scalar.

    Scalar with mean values which are subtracted from channels.

Return

Python prototype (for reference only):

setInputMean(mean) -> retval
@spec setInputParams(t()) :: :ok | {:error, String.t()}

Set preprocessing parameters for the frame.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()
Keyword Arguments
  • scale: double.

    Multiplier for frame values.

  • size: Size.

    New input size.

  • mean: Scalar.

    Scalar with mean values which are subtracted from channels.

  • swapRB: bool.

    Flag indicating that the first and last channels should be swapped.

  • crop: bool.

    Flag indicating whether the image will be cropped after resizing. The resulting blob is computed as blob(n, c, y, x) = scale * (resize(frame(y, x, c)) - mean(c)).

Python prototype (for reference only):

setInputParams([, scale[, size[, mean[, swapRB[, crop]]]]]) -> None
setInputParams(self, opts)
@spec setInputParams(t(), [{atom(), term()}, ...] | nil) :: :ok | {:error, String.t()}

Set preprocessing parameters for the frame.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()
Keyword Arguments
  • scale: double.

    Multiplier for frame values.

  • size: Size.

    New input size.

  • mean: Scalar.

    Scalar with mean values which are subtracted from channels.

  • swapRB: bool.

    Flag indicating that the first and last channels should be swapped.

  • crop: bool.

    Flag indicating whether the image will be cropped after resizing. The resulting blob is computed as blob(n, c, y, x) = scale * (resize(frame(y, x, c)) - mean(c)).

Python prototype (for reference only):

setInputParams([, scale[, size[, mean[, swapRB[, crop]]]]]) -> None
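Preprocessing for a typical CRNN-style recognizer can be configured in one call. The values below follow common OpenCV text-recognition samples; treat them as assumptions to verify against your own model:

```elixir
Evision.DNN.TextRecognitionModel.setInputParams(model,
  scale: 1.0 / 127.5,
  size: {100, 32},              # input width and height expected by the network
  mean: {127.5, 127.5, 127.5},
  swapRB: true,                 # network expects RGB; OpenCV loads BGR
  crop: false
)
```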
setInputScale(self, scale)
@spec setInputScale(t(), number()) :: Evision.DNN.Model.t() | {:error, String.t()}

Set the scalefactor value for the frame.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()

  • scale: double.

    Multiplier for frame values.

Return

Python prototype (for reference only):

setInputScale(scale) -> retval
setInputSize(self, size)
@spec setInputSize(
  t(),
  {number(), number()}
) :: Evision.DNN.Model.t() | {:error, String.t()}

Set the input size for the frame.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()

  • size: Size.

    New input size.

Return

Note: if the new size contains a non-positive dimension, the frame size is not changed.

Python prototype (for reference only):

setInputSize(size) -> retval
setInputSize(self, width, height)
@spec setInputSize(t(), integer(), integer()) ::
  Evision.DNN.Model.t() | {:error, String.t()}

setInputSize

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()

  • width: int.

    New input width.

  • height: int.

    New input height.

Return

Has overloading in C++

Python prototype (for reference only):

setInputSize(width, height) -> retval
setInputSwapRB(self, swapRB)
@spec setInputSwapRB(t(), boolean()) :: Evision.DNN.Model.t() | {:error, String.t()}

Set the swapRB flag for the frame.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()

  • swapRB: bool.

Flag indicating that the first and last channels should be swapped.

Return

Python prototype (for reference only):

setInputSwapRB(swapRB) -> retval
setPreferableBackend(self, backendId)
@spec setPreferableBackend(t(), integer()) ::
  Evision.DNN.Model.t() | {:error, String.t()}

setPreferableBackend

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()
  • backendId: dnn_Backend
Return

Python prototype (for reference only):

setPreferableBackend(backendId) -> retval
setPreferableTarget(self, targetId)
@spec setPreferableTarget(t(), integer()) ::
  Evision.DNN.Model.t() | {:error, String.t()}

setPreferableTarget

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()
  • targetId: dnn_Target
Return

Python prototype (for reference only):

setPreferableTarget(targetId) -> retval
setVocabulary(self, vocabulary)
@spec setVocabulary(t(), [binary()]) :: t() | {:error, String.t()}

Set the vocabulary for recognition.

Positional Arguments
  • self: Evision.DNN.TextRecognitionModel.t()

  • vocabulary: [string].

    the associated vocabulary of the network.

Return

Python prototype (for reference only):

setVocabulary(vocabulary) -> retval
textRecognitionModel(model)
@spec textRecognitionModel(binary()) :: t() | {:error, String.t()}
@spec textRecognitionModel(Evision.DNN.Net.t()) :: t() | {:error, String.t()}

Variant 1:

Create a text recognition model from a network represented in one of the supported formats. Call setDecodeType() and setVocabulary() after construction to initialize the decoding method.

Positional Arguments
  • model: string.

Binary file containing the trained weights.

Keyword Arguments
  • config: string.

Text file containing the network configuration.

Return

Python prototype (for reference only):

TextRecognitionModel(model[, config]) -> <dnn_TextRecognitionModel object>

Variant 2:

Create a text recognition model from a deep learning network. Call setDecodeType() and setVocabulary() after construction to initialize the decoding method.

Positional Arguments
  • network: Evision.DNN.Net.t()
Return

Python prototype (for reference only):

TextRecognitionModel(network) -> <dnn_TextRecognitionModel object>
textRecognitionModel(model, opts)
@spec textRecognitionModel(binary(), [{atom(), term()}, ...] | nil) ::
  t() | {:error, String.t()}

Create a text recognition model from a network represented in one of the supported formats. Call setDecodeType() and setVocabulary() after construction to initialize the decoding method.

Positional Arguments
  • model: string.

Binary file containing the trained weights.

Keyword Arguments
  • config: string.

Text file containing the network configuration.

Return

Python prototype (for reference only):

TextRecognitionModel(model[, config]) -> <dnn_TextRecognitionModel object>
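Putting the pieces together, an end-to-end sketch (the weights file, vocabulary, and image path are placeholders for your own assets):

```elixir
vocabulary = String.graphemes("0123456789abcdefghijklmnopqrstuvwxyz")

result =
  Evision.DNN.TextRecognitionModel.textRecognitionModel("crnn.onnx")
  |> Evision.DNN.TextRecognitionModel.setDecodeType("CTC-greedy")
  |> Evision.DNN.TextRecognitionModel.setVocabulary(vocabulary)
  |> Evision.DNN.TextRecognitionModel.recognize(Evision.imread("word.png"))
```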