8000 GitHub - mikuyves/scrapy-proxynova: Use scrapy with a list of proxies generated from proxynova.com
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

mikuyves/scrapy-proxynova

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

scrapy-proxynova

Use scrapy with a list of proxies generated from proxynova.com

The first run will generate the list of proxies from http://proxynova.com and store it in the cache.

It will individually check each proxy to see if they work and remove the ones that timed out or cannot connect to.

Example:

./run_example.sh

To regenerate the proxy list, run: python proxies.py

In settings.py add the following line: DOWNLOADER_MIDDLEWARES = { 'scrapy_proxynova.middleware.HttpProxyMiddleware': 543 }

Options

Set these options in the settings.py.

  • PROXY_SERVER_LIST_CACHE_FILE — a file to store proxies list. Default: proxies.txt.
  • PROXY_BYPASS_PERCENT — probability for a connection to use a direct connection and not use a proxy

About

Use scrapy with a list of proxies generated from proxynova.com

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.0%
  • Shell 2.0%
0