This repository contains a reproducible pipeline for processing SISVAN microdata on nutritional status monitoring in Brazil (2008–2023) (Sistema de Vigilância Alimentar e Nutricional, n.d.). The main goal is to provide an open and reliable workflow for processing these data, supporting research and informed public policy decisions.
You can access the pipeline here.
Warning
This pipeline is still under development and may not be fully functional.
This warning will be removed once the pipeline is complete.
The processed data are available in both csv
and rds
formats via a dedicated repository on the Open Science Framework (OSF), accessible here. A metadata file is included alongside the validated data. You can also access these files directly from R using the osfr
package.
A backup copy of the raw data is also available in OSF. You can access it here.
The pipeline is fully reproducible. It was developed using the Quarto publishing system and the R programming language. To ensure consistent results, the renv
package is used to manage and restore the R environment.
After installing the three dependencies mentioned above, follow these steps to reproduce the analyses:
- Clone this repository to your local machine.
- Open the project in your preferred IDE.
- Restore the R environment by running
renv::restore()
in the R console. This will install all required software dependencies. - Open
index.qmd
and run the code as described in the report.
We use the vroom
R package together with the AWK programming language to efficiently handle large datasets and mitigate memory issues. This approach allows the pipeline to run locally on most machines, though we recommend a minimum of 12 GB of RAM for optimal performance. Alternatively, the pipeline can also be executed on cloud platforms such as Google Colab or RStudio Cloud.
To cite this work, please use the following format:
Vartanian, D., Schettino, J. P. J., & Carvalho, A. M. (2025). A reproducible pipeline for processing SISVAN microdata on nutritional status monitoring in Brazil (2008-2023) [Report]. Sustentarea Research and Extension Group at the University of São Paulo. https://sustentarea.github.io/sisvan-nutritional-status
A BibTeX entry for LaTeX users is
@techreport{vartanian2025,
title = {A reproducible pipeline for processing SISVAN microdata on nutritional status monitoring in Brazil (2008-2023)},
author = {{Daniel Vartanian} and {João Pedro Junqueira Schettino} and {Aline Martins de Carvalho}},
year = {2025},
address = {São Paulo},
institution = {Sustentarea Research and Extension Group at the University of São Paulo},
langid = {en},
url = {https://sustentarea.github.io/sisvan-nutritional-status}
}
The code in this repository is licensed under the MIT License, while the documents are available under the Creative Commons Attribution 4.0 International License.
|
This work is part of the Sustentarea Research and Extension Group project: Global syndemic: The impact of anthropogenic climate change on the health and nutrition of children under five years old attended by Brazil's public health system (SUS). |
|
This work was supported by the Conselho Nacional de Desenvolvimento Científico e Tecnológico - Brazil (CNPq). |