readability v0.0.1 Readability

Summary

Functions

Normalize and Parse to html tree(tuple or list)) from binary html

return raw html binary from html tree tuple

return only text binary from html tree tuple

Types

html_tree :: tuple | list
options :: list

Functions

content(raw_html, opts \\ [retry_length: 250, min_text_length: 25, remove_unlikely_candidates: true, weight_classes: true, clean_conditionally: true, remove_empty_nodes: true, min_image_width: 130, min_image_height: 80, ignore_image_format: [], blacklist: nil, whitelist: nil])

Specs

content(binary, options) :: binary

Using a variety of metrics (content score, classname, element types), find the content that is most likely to be the stuff a user wants to read

default_options()
parse(raw_html)

Specs

parse(binary) :: html_tree

Normalize and Parse to html tree(tuple or list)) from binary html

raw_html(html_tree)

Specs

raw_html(html_tree) :: binary
raw_html(html_tree) :: binary

return raw html binary from html tree tuple

readabl_text(html_tree)

return only text binary from html tree tuple

regexes()
title(html)