API Reference readability2 v0.7.0

modules

Modules

Readability library for extracting & curating articles.

Build article for readability

AuthorFinder extracts authors

Candidate can be article

Clean html tree for prepare candidates. It transforms misused tags and removes unlikely candidates.

The builing and finding candidates engine It traverses the HTML tree searching, removing, socring nodes

Helpers for parsing, updating, removing html tree

Clean an element of all tags of type "tag" if they look fishy. "Fishy" is an algorithm based on content length, classnames, link density, number of images & embeds, etc.

The TitleFinder engine traverses HTML tree searching for finding title.