Text.Language.Classifier.NaiveBayesian (Text v0.2.0)

A language detection model that uses n-gram frequencies.

Conceptually, it multiplies the frequencies of the detected n-grams. Because the frequencies are stored as log(frequency), the classifier adds the stored values instead: log(a) + log(b) = log(a * b), so summing the logs is equivalent to multiplying the underlying frequencies.
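The equivalence between summing logs and multiplying frequencies can be checked with a minimal sketch. The frequency values below are hypothetical and are not taken from the library's data.

```elixir
# Hypothetical relative frequencies for three n-grams (illustrative only).
frequencies = [0.02, 0.005, 0.01]

# Multiply the raw frequencies, then take the natural log of the product.
log_of_product =
  frequencies
  |> Enum.reduce(1.0, &(&1 * &2))
  |> :math.log()

# Sum the log(frequency) values instead, as a log-based store would allow.
sum_of_logs =
  frequencies
  |> Enum.map(&:math.log/1)
  |> Enum.sum()

# The two results agree up to floating-point rounding.
equal_within_tolerance = abs(log_of_product - sum_of_logs) < 1.0e-9
```

Working in log space also avoids floating-point underflow, which would otherwise occur when many small frequencies are multiplied together.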

Summary

Functions

order_scores(scores)

Returns the {language, score} tuples in the correct order for this classifier.

score_one_language(language, text_ngrams, vocabulary)

Sums the log frequencies of the n-grams.

Functions

order_scores(scores)

Returns the {language, score} tuples in the correct order for this classifier.

score_one_language(language, text_ngrams, vocabulary)

Sums the log frequencies of the n-grams.

A strong negative weighting is applied if the n-gram is not contained in the given vocabulary.
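The scoring and ordering described above might look roughly like the following sketch. It is not the library's implementation: the module name `NaiveBayesSketch`, the penalty value, the shape of `vocabulary` (a plain map of n-gram to log frequency), the `{language, score}` return value, and the descending sort are all assumptions made for illustration.

```elixir
defmodule NaiveBayesSketch do
  # Assumed penalty for n-grams missing from the vocabulary. The actual
  # weighting used by the library is not stated in this documentation.
  @missing_penalty -20.0

  # `vocabulary` is assumed to be a map of n-gram => log(frequency).
  def score_one_language(language, text_ngrams, vocabulary) do
    score =
      text_ngrams
      |> Enum.map(&Map.get(vocabulary, &1, @missing_penalty))
      |> Enum.sum()

    {language, score}
  end

  # Assumed ordering: highest (least negative) score first.
  def order_scores(scores) do
    Enum.sort_by(scores, fn {_language, score} -> score end, :desc)
  end
end

# Hypothetical vocabularies and input n-grams.
english = %{"th" => :math.log(0.03), "he" => :math.log(0.025)}
french = %{"le" => :math.log(0.03), "he" => :math.log(0.004)}
ngrams = ["th", "he"]

ranked =
  [{"en", english}, {"fr", french}]
  |> Enum.map(fn {language, vocabulary} ->
    NaiveBayesSketch.score_one_language(language, ngrams, vocabulary)
  end)
  |> NaiveBayesSketch.order_scores()
```

In this sketch the French vocabulary lacks "th", so its score absorbs the penalty and English ranks first. A fixed penalty keeps the score finite, whereas treating a missing n-gram as frequency zero would give a log of negative infinity.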