joaodaher/ideb-crawler

IDEB Escola: Crawler

Crawler for http://idebescola.inep.gov.br/ideb/consulta-publica

[✓] Search using UF and city

[✓] Trigger AJAX accordions on school page

[✓] Save the fully loaded page as HTML

[𝗫] Parse school page data

[𝗫] Store school data

[𝗫] API

[𝗫] Automatically download the Selenium driver

Usage

  • clone the repository (git clone)
  • run pip install -r requirements.txt
  • edit crawler.py and set the desired UF (state) and city
  • run crawler.py
  • check the ./sources folder for the saved HTML files
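
The last step, saving a loaded page into ./sources, could be sketched roughly as below. This is an illustration under assumptions, not the code in crawler.py: the helper names slugify and save_page, the school argument, and the <uf>_<city>_<school>.html file naming are all hypothetical. The only Selenium feature relied on is the WebDriver's page_source attribute, so any object exposing that attribute works.

```python
import re
import unicodedata
from pathlib import Path


def slugify(text: str) -> str:
    """Turn free text into a lowercase ASCII slug that is safe in a file name."""
    # Decompose accented characters, then drop the non-ASCII combining marks.
    ascii_text = (
        unicodedata.normalize("NFKD", text)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    # Collapse every run of non-alphanumeric characters into a single hyphen.
    return re.sub(r"[^a-z0-9]+", "-", ascii_text.lower()).strip("-")


def save_page(driver, uf: str, city: str, school: str, out_dir: str = "sources") -> Path:
    """Write driver.page_source to <out_dir>/<uf>_<city>_<school>.html.

    The naming scheme is an assumption made for this sketch.
    """
    folder = Path(out_dir)
    folder.mkdir(parents=True, exist_ok=True)  # create ./sources on first use
    target = folder / f"{slugify(uf)}_{slugify(city)}_{slugify(school)}.html"
    target.write_text(driver.page_source, encoding="utf-8")
    return target
```

With a real Selenium driver this would be called once the accordions have finished loading, for example save_page(driver, "SP", "São Paulo", "Escola 1"). Slugging the names keeps accents and spaces out of the file names, which makes the output folder easier to handle from scripts.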

Tools

  • Python 3
  • Selenium WebDriver: by default, Firefox on macOS.
