.get_site {textpress} | R Documentation |
Get Site Content and Extract HTML Elements
Description
This function attempts to retrieve the HTML content of a URL, extract specific
HTML elements (e.g., paragraphs, headings), and extract publication date information
using the extract_date
function.
Usage
.get_site(x)
Arguments
x |
A URL to extract content and publication date from. |
Value
A data frame with columns for the URL, HTML element types, text content, extracted date, and date source.
[Package textpress version 1.0.0 Index]