TAUSBV/WebCrawler: A web crawler based on Selenium to capture maximum text using browser automation and JavaScript.

Selenium-Based WebCrawler

Installation

mvn package

How to run

java -cp target/selenium-webcrawler-1.0-SNAPSHOT-jar-with-dependencies.jar net.taus.webcrawler.Crawler -c crawler.properties

Configuration

See the comments in the crawler.properties file for details.
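As a rough illustration of how a `-c crawler.properties` argument could be consumed, here is a minimal sketch using only the standard `java.util.Properties` API. It is not taken from the project's source: the class `ConfigSketch`, its methods, and the keys `start.url` and `max.depth` are hypothetical, and the real keys are the ones documented in crawler.properties.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

// Hypothetical helper, not part of the WebCrawler code base.
class ConfigSketch {

    // Returns the value that follows "-c", or null if the flag is absent.
    static String configPath(String[] args) {
        for (int i = 0; i < args.length - 1; i++) {
            if ("-c".equals(args[i])) {
                return args[i + 1];
            }
        }
        return null;
    }

    // Parses .properties text from any Reader (a FileReader in real use).
    static Properties load(Reader reader) {
        Properties props = new Properties();
        try {
            props.load(reader);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return props;
    }

    public static void main(String[] args) {
        String path = configPath(new String[] {"-c", "crawler.properties"});
        System.out.println("config file: " + path);

        // Assumed keys, for illustration only; lines starting with '#' are comments.
        String sample = "# where to begin\nstart.url=https://example.com\nmax.depth=2\n";
        Properties props = load(new StringReader(sample));
        int depth = Integer.parseInt(props.getProperty("max.depth", "1"));
        System.out.println(props.getProperty("start.url") + " depth=" + depth);
    }
}
```

Reading from a `Reader` rather than a file path keeps the sketch testable without touching the file system; in practice a `FileReader` opened on the path returned by `configPath` would be passed in.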
