API Reference readability2 v0.7.0
modules
Modules
Readability library for extracting & curating articles.
Build article for readability
AuthorFinder extracts authors
Candidate can be article
Clean html tree for prepare candidates. It transforms misused tags and removes unlikely candidates.
Score html tree
The builing and finding candidates engine It traverses the HTML tree searching, removing, socring nodes
Helpers for parsing, updating, removing html tree
Clean an element of all tags of type "tag" if they look fishy. "Fishy" is an algorithm based on content length, classnames, link density, number of images & embeds, etc.
The TitleFinder engine traverses HTML tree searching for finding title.