Rcrawler() and LinkExtractor() do not collect 'external urls' from HTML>footer · Issue #69 · salimk/Rcrawler · GitHub

8000 Rcrawler() and LinkExtractor() do not collect 'external urls' from HTML>footer · Issue #69 · salimk/Rcrawler · GitHub

More Web Proxy on the site http://driver.im/

Rcrawler() and LinkExtractor() do not collect 'external urls' from HTML>footer #69

Open

Open

Rcrawler() and LinkExtractor() do not collect 'external urls' from HTML>footer#69

Sometimes when using LinkExtractor function doesnot collect external URLS from the footer of a webpage while the HTML is visible. Bug or wrong use?

For instance:
urls<-LinkExtractor("https://www.partou.nl", ExternalLInks = TRUE, Useragent = "Chrome/41.0.2228.0"))

or

urls<-LinkExtractor("https://www.kinderopvangoosterhout.nl", ExternalLInks = TRUE, Useragent = "Chrome/41.0.2228.0"))

ps. awesome package, makes scraping and parsing so much easier :)

Metadata

Assignees

No one assigned

Labels

No labels

No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

0