treewalker_scraper behaviour (treewalker v0.4.1)

Scrapes a page for useful data and links.

Types

-type content() :: treewalker_page:content().
-type options() :: term().
-type page_data() :: binary().
-type url() :: treewalker_page:url().

Callbacks

-callback scrap(Url :: url(), PageData :: page_data(), ScrappingOptions :: options()) ->
         {ok, Scrapped :: content()} | {error, Reason :: term()}.
-callback scrap_links(Url :: url(), PageData :: page_data(), ScrappingOptions :: options()) ->
               {ok, Links :: [url()]} | {error, Reason :: term()}.
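A callback module implementing this behaviour might look like the minimal sketch below. The module name `example_scraper`, the regular expressions, and the `no_title` error reason are hypothetical. The sketch also assumes that `treewalker_page:content()` and `treewalker_page:url()` accept binaries, which this page does not confirm.

```erlang
-module(example_scraper).
-behaviour(treewalker_scraper).

-export([scrap/3, scrap_links/3]).

%% Hypothetical example: use the text of the first <title> element as the
%% scraped content. Assumes content() may be a binary.
scrap(_Url, PageData, _ScrappingOptions) ->
    Pattern = <<"<title>([^<]*)</title>">>,
    case re:run(PageData, Pattern, [caseless, {capture, all_but_first, binary}]) of
        {match, [Title]} -> {ok, Title};
        nomatch -> {error, no_title}
    end.

%% Hypothetical example: collect every double-quoted href value as a link.
%% Links are returned exactly as written in the page (relative links are
%% not resolved against Url). Assumes url() may be a binary.
scrap_links(_Url, PageData, _ScrappingOptions) ->
    Pattern = <<"href=\"([^\"]+)\"">>,
    case re:run(PageData, Pattern, [global, {capture, all_but_first, binary}]) of
        {match, Matches} -> {ok, [Link || [Link] <- Matches]};
        nomatch -> {ok, []}
    end.
```

Both functions ignore the options argument here; a real scraper would likely use it for site-specific settings. A regular expression is used only to keep the sketch dependency-free, and an HTML parser would be more robust on real pages.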

Functions

scrap(Url, PageData, ScrappingOptions)

-spec scrap(url(), page_data(), options()) -> {ok, content()}.

scrap_links(Url, PageData, ScrappingOptions)

-spec scrap_links(url(), page_data(), options()) -> {ok, [url()]} | {error, term()}.