GitHub - paidiver/paidiverpy: Create pipelines for preprocessing image data for biodiversity analysis.

Paidiverpy is a Python package designed to create pipelines for preprocessing image data for biodiversity analysis.

Note: This package is still in active development, and frequent updates and changes are expected. The API and features may evolve as we continue improving it.

Documentation

The official documentation is hosted on ReadTheDocs.org: https://paidiverpy.readthedocs.io/

Note: Comprehensive documentation is under construction.

Installation

To install paidiverpy, run:

pip install paidiverpy

Build from Source

You can install paidiverpy locally or on a notebook server such as JASMIN or the NOC Data Science Platform (DSP). The following steps are applicable to both environments, but steps 2 and 3 are required if you are using a notebook server.

Clone the repository:

# ssh
git clone git@github.com:paidiver/paidiverpy.git

# https
# git clone https://github.com/paidiver/paidiverpy.git

cd paidiverpy

(Optional) Create a Python virtual environment to manage dependencies separately from other projects. For example, using conda:
```
conda env create -f environment.yml
conda activate Paidiverpy
```
Install the paidiverpy package:

Finally, you can install the paidiverpy package:
```
pip install -e .
```

Usage

You can run your preprocessing pipeline using Paidiverpy in several ways, typically requiring just one to three lines of code:

Python Package

Install the package and utilize it in your Python scripts.

# Import the Pipeline class
from paidiverpy.pipeline import Pipeline

# Instantiate the Pipeline class with the configuration file path
# Please refer to the documentation for the configuration file format
pipeline = Pipeline(config_file_path="../examples/config_files/config_simple2.yml")

# Run the pipeline
pipeline.run()

# You can export the output images to the specified output directory
pipeline.save_images(image_format="png")

Command-Line Arguments

Pipelines can be executed via command-line arguments. For example:

paidiverpy -c examples/config_files/config_simple.yml

This runs the pipeline according to the configuration file, saving output images to the directory defined in the output_path.

Gallery

Together with the documentation, you can explore various use cases through sample notebooks in the examples/example_notebooks directory:

Example Data

If you'd like to manually download example data for testing, you can use the following command:

from paidiverpy.utils.data import PaidiverpyData
PaidiverpyData().load(DATASET_NAME)

Available datasets:

plankton_csv: Plankton dataset with CSV file metadata
benthic_csv: Benthic dataset with CSV file metadata
benthic_ifdo: Benthic dataset with IFDO metadata
nef_raw: Sample images in Nef format (raw images) with CSV file metadata
benthic_raw_images: Benthic dataset in raw format with CSV file metadata

Example data will be automatically downloaded when running the example notebooks.

Note: Please check the documentation for more information about Paidiverpy: https://paidiverpy.readthedocs.io/

Contributing to paidiverpy

Want to support or improve paidiverpy? Check out our contribution guide to learn how to get started.

Acknowledgements

This project was supported by the UK Natural Environment Research Council (NERC) through the Tools for automating image analysis for biodiversity monitoring (AIAB) Funding Opportunity, reference code UKRI052.

Name		Name	Last commit message	Last commit date
Latest commit History 471 Commits
.githooks		.githooks
.github/workflows		.github/workflows
conda_recipes		conda_recipes
docs		docs
examples		examples
src		src
tests		tests
.bumpversion.toml		.bumpversion.toml
.codecov.yml		.codecov.yml
.coveragerc		.coveragerc
.gitignore		.gitignore
.lycheeignore		.lycheeignore
.readthedocs.yaml		.readthedocs.yaml
.zenodo.json		.zenodo.json
CHANGELOG.md		CHANGELOG.md
CITATION.cff		CITATION.cff
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.dev.md		README.dev.md
README.md		README.md
README.rst		README.rst
cov.xml		cov.xml
environment.yml		environment.yml
generate_conda_meta.py		generate_conda_meta.py
junit.xml		junit.xml
project_setup.md		project_setup.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Documentation

Installation

Build from Source

Usage

Python Package

Command-Line Arguments

Gallery

Example Data

Contributing to paidiverpy

Acknowledgements

About

Uh oh!

Releases 5

Packages

Uh oh!

Uh oh!

Contributors 4

Uh oh!

Languages

License

paidiver/paidiverpy

Folders and files

Latest commit

History

Repository files navigation

Documentation

Installation

Build from Source

Usage

Python Package

Command-Line Arguments

Gallery

Example Data

Contributing to paidiverpy

Acknowledgements

About

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Uh oh!

Contributors 4

Uh oh!

Languages

Packages