Crawler for http://idebescola.inep.gov.br/ideb/consulta-publica
[✓] Search using UF and city
[✓] Trigger AJAX accordions on school page
[✓] Save fully loaded page into HTML
[𝗫] Parse school page data
[𝗫] Store school data
[𝗫] API
[𝗫] Dynamically auto-download Selenium agent
- git clone
- run
pip install -r requirements.txt
- edit
crawler.py
and add the desired UF and City - run
- check
./sources
folder for outputted HTML files
- Python 3
- Selenium webdriver: by default, MacOS firefox.