An extension that helps to perform bulk operations over CKAN entities, such as datasets, resources, organizations and groups.
The extension provides an IBulk
interface that allows you to register custom entity managers or override the default ones.
By default the extension provides entity managers for datasets, resources, organizations and groups. You might have some custom
entities on your CKAN portal, or some specific logic for the existing ones.
Here's how to register a custom entity manager:
import ckan.plugins as p
from ckanext.bulk.interfaces import IBulk
from ckanext.bulk.entity_managers import base, DatasetEntityManager
class CustomDatasetEntityManager(DatasetEntityManager):
"""Custom implementation of the DatasetEntityManager."""
entity_type = "dataset"
@classmethod
def get_fields(cls) -> list[base.FieldItem]:
# Override to add custom fields
fields = super().get_fields()
fields.extend([
base.FieldItem(value="custom_field", text="Custom Field")
])
return fields
@classmethod
def search_entities_by_filters(
cls, filters: list[base.FilterItem], global_operator: str = const.GLOBAL_AND
) -> list[dict[str, Any]]:
# Implement custom search logic
return super().search_entities_by_filters(filters, global_operator)
class CustomBulk(p.SingletonPlugin):
p.implements(IBulk, inherit=True)
def register_entity_manager(
self, entity_managers: dict[str, type[base.EntityManager]]
):
# Register your custom entity manager
entity_managers[CustomDatasetEntityManager.entity_type] = CustomDatasetEntityManager
return entity_managers
The interface allows you to:
- Create custom entity managers by extending base managers (
DatasetEntityManager
,GroupEntityManager
, etc.) or creating your own. - Override default entity managers with your own implementations
Your entity manager should implement these key methods:
get_fields()
: Returns available fields for the entitysearch_entities_by_filters()
: Implements search logicupdate_entity()
: Handles entity updatesdelete_entity()
: Handles entity deletion
The extension provides a way to export data to CSV.
You can implement the IBulk
interface and override the prepare_csv_data()
method to prepare the data for export.
Important
Ensure, that your extension is loaded before the bulk
extension, as we call the first implementation of the method, to allow plugins to override the default implementation.
The method should return a list of dictionaries, the keys will be used as a header in the CSV file.
Data for search results and logs are stored in redis, and expire after 1 hour. This method will be called before the data is stored in redis, so we can redu 7207 ce the amount of data to store in redis.
The export_type
parameter can be "result"
or "logs"
.
Example of how to override the prepare_csv_data()
method:
class CustomBulk(p.SingletonPlugin):
p.implements(IBulk, inherit=True)
def prepare_csv_data(
self,
data: list[dict[str, Any]],
export_type: str,
entity_type: str | None = None,
) -> list[dict[str, Any]]:
# Implement custom logic to prepare the data for export
return data
Compatibility with core CKAN versions:
CKAN version | Compatible? |
---|---|
2.9 | no |
2.10 | yes |
2.11 | yes |
master | not tested |