8000 Rcrawler() and LinkExtractor() do not collect 'external urls' from HTML>footer · Issue #69 · salimk/Rcrawler · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
Rcrawler() and LinkExtractor() do not collect 'external urls' from HTML>footer #69
Open
@graskaas2014

Description

@graskaas2014

Sometimes when using LinkExtractor function doesnot collect external URLS from the footer of a webpage while the HTML is visible. Bug or wrong use?

For instance:
urls<-LinkExtractor("https://www.partou.nl", ExternalLInks = TRUE, Useragent = "Chrome/41.0.2228.0"))

or

urls<-LinkExtractor("https://www.kinderopvangoosterhout.nl", ExternalLInks = TRUE, Useragent = "Chrome/41.0.2228.0"))

ps. awesome package, makes scraping and parsing so much easier :)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0