Feature request: Combined read_html/write_html function #193

richierocks · 2017-10-12T19:56:33Z

If you are scraping a website, it is good practise to download the page once and store a copy, so you don't have to keep re-downloading it.

That means that it is easy to end up with lots of code like

page <- read_html(url)
write_html(page, "somefile.html")

It seems like it would be useful to have a single function (save_html() perhaps) to do both operations.

The dumbest implementation is something like this:

save_html <- function(x, file) {
  page <- read_html(x)
  message("Writing to ", force(file))
  write_html(page, file)
  invisible(page)
}

It needs a bit of fleshing out to deal with ... args to read_html() and write_html(), to it would be nice to have a default filename generated from the URL.

The text was updated successfully, but these errors were encountered:

jimhester closed this as completed in c051e16 Jan 4, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: Combined read_html/write_html function #193

Feature request: Combined read_html/write_html function #193

Feature request: Combined read_html/write_html function #193

Feature request: Combined read_html/write_html function #193

Comments