ExAwabi (ExAwabi v0.2.0) View Source

Elixir wrapper for Awabi, the morphological analyzer using MeCab dictionary.

Link to this section Summary

Functions

Tokenize the string.

Tokenize the string with N best matches.

Link to this section Functions

Specs

tokenize(binary()) :: [binary()]

Tokenize the string.

Examples

iex> ExAwabi.tokenize("すもももももももものうち")
[
  {"すもも", "名詞,一般,*,*,*,*,すもも,スモモ,スモモ"},
  {"も", "助詞,係助詞,*,*,*,*,も,モ,モ"},
  {"もも", "名詞,一般,*,*,*,*,もも,モモ,モモ"},
  {"も", "助詞,係助詞,*,*,*,*,も,モ,モ"},
  {"もも", "名詞,一般,*,*,*,*,もも,モモ,モモ"},
  {"の", "助詞,連体化,*,*,*,*,の,ノ,ノ"},
  {"うち", "名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ"}
]

Specs

tokenize_n_best(binary(), integer()) :: [[binary()]]

Tokenize the string with N best matches.

Examples

iex> ExAwabi.tokenize_n_best("すもももももももものうち", 3)
[
  [
    {"すもも", "名詞,一般,*,*,*,*,すもも,スモモ,スモモ"},
    {"も", "助詞,係助詞,*,*,*,*,も,モ,モ"},
    {"もも", "名詞,一般,*,*,*,*,もも,モモ,モモ"},
    {"も", "助詞,係助詞,*,*,*,*,も,モ,モ"},
    {"もも", "名詞,一般,*,*,*,*,もも,モモ,モモ"},
    {"の", "助詞,連体化,*,*,*,*,の,ノ,ノ"},
    {"うち", "名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ"}
  ],
  [
    {"すもも", "名詞,一般,*,*,*,*,すもも,スモモ,スモモ"},
    {"も", "助詞,係助詞,*,*,*,*,も,モ,モ"},
    {"もも", "名詞,一般,*,*,*,*,もも,モモ,モモ"},
    {"もも", "名詞,一般,*,*,*,*,もも,モモ,モモ"},
    {"も", "助詞,係助詞,*,*,*,*,も,モ,モ"},
    {"の", "助詞,連体化,*,*,*,*,の,ノ,ノ"},
    {"うち", "名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ"}
  ],
  [
    {"すもも", "名詞,一般,*,*,*,*,すもも,スモモ,スモモ"},
    {"もも", "名詞,一般,*,*,*,*,もも,モモ,モモ"},
    {"も", "助詞,係助詞,*,*,*,*,も,モ,モ"},
    {"もも", "名詞,一般,*,*,*,*,もも,モモ,モモ"},
    {"も", "助詞,係助詞,*,*,*,*,も,モ,モ"},
    {"の", "助詞,連体化,*,*,*,*,の,ノ,ノ"},
    {"うち", "名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ"}
  ]
]