Evision.DNN.TextDetectionModelDB (Evision v0.2.9)
Summary
Functions
- detect/2
- detectTextRectangles/2: Performs detection.
- enableWinograd/2
- getBinaryThreshold/1
- getMaxCandidates/1
- getPolygonThreshold/1
- getUnclipRatio/1
- predict/2, predict/3: Given the input frame, create an input blob, run the net, and return the output blobs.
- setBinaryThreshold/2
- setInputCrop/2: Set flag crop for frame.
- setInputMean/2: Set mean value for frame.
- setInputParams/1, setInputParams/2: Set preprocessing parameters for frame.
- setInputScale/2: Set scalefactor value for frame.
- setInputSize/2, setInputSize/3: Set input size for frame.
- setInputSwapRB/2: Set flag swapRB for frame.
- setMaxCandidates/2
- setOutputNames/2: Set output names for frame.
- setPolygonThreshold/2
- setPreferableBackend/2
- setPreferableTarget/2
- setUnclipRatio/2
- textDetectionModelDB/1, textDetectionModelDB/2: Create a text detection model from a network represented in one of the supported formats; the order of the model and config arguments does not matter.
Types
@type t() :: %Evision.DNN.TextDetectionModelDB{ref: reference()}
Type that represents a DNN.TextDetectionModelDB struct.

- ref: reference(). The underlying Erlang resource variable.
Functions
@spec detect(t(), Evision.Mat.maybe_mat_in()) :: [[{number(), number()}]] | {:error, String.t()}
detect
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
- frame:
Evision.Mat
Return
- detections:
[[Point]]
Has overloading in C++
Python prototype (for reference only):
detect(frame) -> detections
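As a sketch of consuming the documented [[Point]] result in plain Elixir (the detection values below are invented for illustration), each returned quadrilateral can be reduced to an axis-aligned bounding box:

```elixir
# Hypothetical detect/2 result: one quadrilateral, four {x, y} corner points.
detections = [[{10, 20}, {110, 20}, {110, 60}, {10, 60}]]

# Reduce each polygon to an axis-aligned {x, y, w, h} bounding box.
boxes =
  Enum.map(detections, fn points ->
    {xs, ys} = Enum.unzip(points)
    {x0, x1} = Enum.min_max(xs)
    {y0, y1} = Enum.min_max(ys)
    {x0, y0, x1 - x0, y1 - y0}
  end)
# boxes == [{10, 20, 100, 40}]
```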
@spec detectTextRectangles(t(), Evision.Mat.maybe_mat_in()) :: {[{{number(), number()}, {number(), number()}, number()}], [number()]} | {:error, String.t()}
Performs detection
Positional Arguments
- self: Evision.DNN.TextDetectionModelDB.t()
- frame: Evision.Mat. The input image.

Return
- detections: [{centre={x, y}, size={s1, s2}, angle}]. Array with detections' RotatedRect results.
- confidences: [float]. Array with detection confidences.
Given the input @p frame, prepare network input, run network inference, post-process network output and return result detections. Each result is rotated rectangle. Note: Result may be inaccurate in case of strong perspective transformations.
Python prototype (for reference only):
detectTextRectangles(frame) -> detections, confidences
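A small sketch of pairing the two return values in Elixir (sample values are invented): keep only the rotated rectangles whose confidence clears a threshold.

```elixir
# Hypothetical detectTextRectangles/2 results.
detections = [
  {{50.0, 30.0}, {80.0, 20.0}, 0.0},
  {{200.0, 90.0}, {40.0, 15.0}, 12.5}
]
confidences = [0.92, 0.31]

# Zip rectangles with their confidences and filter by a threshold.
kept =
  Enum.zip(detections, confidences)
  |> Enum.filter(fn {_rect, conf} -> conf >= 0.5 end)
  |> Enum.map(fn {rect, _conf} -> rect end)
# kept == [{{50.0, 30.0}, {80.0, 20.0}, 0.0}]
```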
@spec enableWinograd(t(), boolean()) :: Evision.DNN.Model.t() | {:error, String.t()}
enableWinograd
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
- useWinograd:
bool
Return
- retval:
Evision.DNN.Model.t()
Python prototype (for reference only):
enableWinograd(useWinograd) -> retval
@spec getBinaryThreshold(Keyword.t()) :: any() | {:error, String.t()}
@spec getBinaryThreshold(t()) :: number() | {:error, String.t()}
getBinaryThreshold
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
Return
- retval:
float
Python prototype (for reference only):
getBinaryThreshold() -> retval
@spec getMaxCandidates(Keyword.t()) :: any() | {:error, String.t()}
@spec getMaxCandidates(t()) :: integer() | {:error, String.t()}
getMaxCandidates
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
Return
- retval:
integer()
Python prototype (for reference only):
getMaxCandidates() -> retval
@spec getPolygonThreshold(Keyword.t()) :: any() | {:error, String.t()}
@spec getPolygonThreshold(t()) :: number() | {:error, String.t()}
getPolygonThreshold
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
Return
- retval:
float
Python prototype (for reference only):
getPolygonThreshold() -> retval
@spec getUnclipRatio(Keyword.t()) :: any() | {:error, String.t()}
@spec getUnclipRatio(t()) :: number() | {:error, String.t()}
getUnclipRatio
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
Return
- retval:
double
Python prototype (for reference only):
getUnclipRatio() -> retval
@spec predict(t(), Evision.Mat.maybe_mat_in()) :: [Evision.Mat.t()] | {:error, String.t()}
Given the input frame, create an input blob, run the net, and return the output blobs.
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
- frame:
Evision.Mat
Return
- outs: [Evision.Mat]. Allocated output blobs, which will store results of the computation.
Python prototype (for reference only):
predict(frame[, outs]) -> outs
@spec predict(t(), Evision.Mat.maybe_mat_in(), [{atom(), term()}, ...] | nil) :: [Evision.Mat.t()] | {:error, String.t()}
Given the input frame, create an input blob, run the net, and return the output blobs.
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
- frame:
Evision.Mat
Return
- outs: [Evision.Mat]. Allocated output blobs, which will store results of the computation.
Python prototype (for reference only):
predict(frame[, outs]) -> outs
setBinaryThreshold
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
- binaryThreshold:
float
Return
- retval:
Evision.DNN.TextDetectionModelDB.t()
Python prototype (for reference only):
setBinaryThreshold(binaryThreshold) -> retval
@spec setInputCrop(t(), boolean()) :: Evision.DNN.Model.t() | {:error, String.t()}
Set flag crop for frame.
Positional Arguments
- self: Evision.DNN.TextDetectionModelDB.t()
- crop: bool. Flag which indicates whether the image will be cropped after resize or not.
Return
- retval:
Evision.DNN.Model.t()
Python prototype (for reference only):
setInputCrop(crop) -> retval
@spec setInputMean(t(), Evision.scalar()) :: Evision.DNN.Model.t() | {:error, String.t()}
Set mean value for frame.
Positional Arguments
- self: Evision.DNN.TextDetectionModelDB.t()
- mean: Evision.scalar(). Scalar with mean values which are subtracted from channels.
Return
- retval:
Evision.DNN.Model.t()
Python prototype (for reference only):
setInputMean(mean) -> retval
@spec setInputParams(Keyword.t()) :: any() | {:error, String.t()}
@spec setInputParams(t()) :: Evision.DNN.Model.t() | {:error, String.t()}
Set preprocessing parameters for frame.
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
Keyword Arguments
- scale: double. Multiplier for frame values.
- size: Size. New input size.
- mean: Evision.scalar(). Scalar with mean values which are subtracted from channels.
- swapRB: bool. Flag which indicates whether to swap the first and last channels.
- crop: bool. Flag which indicates whether the image will be cropped after resize or not.

blob(n, c, y, x) = scale * (resize(frame(y, x, c)) - mean(c))
Python prototype (for reference only):
setInputParams([, scale[, size[, mean[, swapRB[, crop]]]]]) -> None
@spec setInputParams( t(), [crop: term(), mean: term(), scale: term(), size: term(), swapRB: term()] | nil ) :: Evision.DNN.Model.t() | {:error, String.t()}
Set preprocessing parameters for frame.
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
Keyword Arguments
- scale: double. Multiplier for frame values.
- size: Size. New input size.
- mean: Evision.scalar(). Scalar with mean values which are subtracted from channels.
- swapRB: bool. Flag which indicates whether to swap the first and last channels.
- crop: bool. Flag which indicates whether the image will be cropped after resize or not.

blob(n, c, y, x) = scale * (resize(frame(y, x, c)) - mean(c))
Python prototype (for reference only):
setInputParams([, scale[, size[, mean[, swapRB[, crop]]]]]) -> None
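The preprocessing formula above can be checked on a single channel value; the scale and mean below are illustrative placeholders, not model defaults:

```elixir
# blob(n, c, y, x) = scale * (resize(frame(y, x, c)) - mean(c))
scale = 1.0 / 255.0   # illustrative multiplier
mean = 127.5          # illustrative per-channel mean value
pixel = 200.0         # one channel value of the resized frame

blob_value = scale * (pixel - mean)
# blob_value == 72.5 / 255.0 (about 0.2843)
```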
@spec setInputScale(t(), Evision.scalar()) :: Evision.DNN.Model.t() | {:error, String.t()}
Set scalefactor value for frame.
Positional Arguments
- self: Evision.DNN.TextDetectionModelDB.t()
- scale: Evision.scalar(). Multiplier for frame values.
Return
- retval:
Evision.DNN.Model.t()
Python prototype (for reference only):
setInputScale(scale) -> retval
@spec setInputSize( t(), {number(), number()} ) :: Evision.DNN.Model.t() | {:error, String.t()}
Set input size for frame.
Positional Arguments
- self: Evision.DNN.TextDetectionModelDB.t()
- size: Size. New input size.
Return
- retval:
Evision.DNN.Model.t()
Note: if the shape of the new blob is less than 0, the frame size is not changed.
Python prototype (for reference only):
setInputSize(size) -> retval
@spec setInputSize(t(), integer(), integer()) :: Evision.DNN.Model.t() | {:error, String.t()}
setInputSize
Positional Arguments
- self: Evision.DNN.TextDetectionModelDB.t()
- width: integer(). New input width.
- height: integer(). New input height.
Return
- retval:
Evision.DNN.Model.t()
Has overloading in C++
Python prototype (for reference only):
setInputSize(width, height) -> retval
@spec setInputSwapRB(t(), boolean()) :: Evision.DNN.Model.t() | {:error, String.t()}
Set flag swapRB for frame.
Positional Arguments
- self: Evision.DNN.TextDetectionModelDB.t()
- swapRB: bool. Flag which indicates whether to swap the first and last channels.
Return
- retval:
Evision.DNN.Model.t()
Python prototype (for reference only):
setInputSwapRB(swapRB) -> retval
setMaxCandidates
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
- maxCandidates:
integer()
Return
- retval:
Evision.DNN.TextDetectionModelDB.t()
Python prototype (for reference only):
setMaxCandidates(maxCandidates) -> retval
@spec setOutputNames(t(), [binary()]) :: Evision.DNN.Model.t() | {:error, String.t()}
Set output names for frame.
Positional Arguments
- self: Evision.DNN.TextDetectionModelDB.t()
- outNames: [String]. Names for output layers.
Return
- retval:
Evision.DNN.Model.t()
Python prototype (for reference only):
setOutputNames(outNames) -> retval
setPolygonThreshold
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
- polygonThreshold:
float
Return
- retval:
Evision.DNN.TextDetectionModelDB.t()
Python prototype (for reference only):
setPolygonThreshold(polygonThreshold) -> retval
@spec setPreferableBackend(t(), Evision.DNN.Backend.enum()) :: Evision.DNN.Model.t() | {:error, String.t()}
setPreferableBackend
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
- backendId:
dnn_Backend
Return
- retval:
Evision.DNN.Model.t()
Python prototype (for reference only):
setPreferableBackend(backendId) -> retval
@spec setPreferableTarget(t(), Evision.DNN.Target.enum()) :: Evision.DNN.Model.t() | {:error, String.t()}
setPreferableTarget
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
- targetId:
dnn_Target
Return
- retval:
Evision.DNN.Model.t()
Python prototype (for reference only):
setPreferableTarget(targetId) -> retval
setUnclipRatio
Positional Arguments
- self:
Evision.DNN.TextDetectionModelDB.t()
- unclipRatio:
double
Return
- retval:
Evision.DNN.TextDetectionModelDB.t()
Python prototype (for reference only):
setUnclipRatio(unclipRatio) -> retval
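For intuition on the unclip ratio: in DB post-processing, each shrunk text polygon is expanded by an offset distance derived from the ratio r, following the expansion formula from the DB paper, offset = area * r / perimeter. A numeric sketch with made-up values:

```elixir
# Offset distance for a 100 x 40 rectangle with unclip ratio 1.5,
# per the DB paper's expansion formula: D = A * r / L.
area = 100.0 * 40.0
perimeter = 2 * (100.0 + 40.0)
r = 1.5
offset = area * r / perimeter
# offset == 6000.0 / 280.0 (about 21.43 pixels)
```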
@spec textDetectionModelDB(Keyword.t()) :: any() | {:error, String.t()}
@spec textDetectionModelDB(binary()) :: t() | {:error, String.t()}
@spec textDetectionModelDB(Evision.DNN.Net.t()) :: t() | {:error, String.t()}
Variant 1:
Create a text detection model from a network represented in one of the supported formats. The order of the model and config arguments does not matter.
Positional Arguments
- model: string. Binary file containing trained weights.

Keyword Arguments
- config: string. Text file containing the network configuration.
Return
- self:
Evision.DNN.TextDetectionModelDB.t()
Python prototype (for reference only):
TextDetectionModel_DB(model[, config]) -> <dnn_TextDetectionModel_DB object>
Variant 2:
Create text detection algorithm from deep learning network.
Positional Arguments
- network: Evision.DNN.Net.t(). Net object.
Return
- self:
Evision.DNN.TextDetectionModelDB.t()
Python prototype (for reference only):
TextDetectionModel_DB(network) -> <dnn_TextDetectionModel_DB object>
Create a text detection model from a network represented in one of the supported formats. The order of the model and config arguments does not matter.
Positional Arguments
- model: string. Binary file containing trained weights.

Keyword Arguments
- config: string. Text file containing the network configuration.
Return
- self:
Evision.DNN.TextDetectionModelDB.t()
Python prototype (for reference only):
TextDetectionModel_DB(model[, config]) -> <dnn_TextDetectionModel_DB object>