8000 [data-1994] Remove parquet support from datamart plugin by alicia-koh · Pull Request #201 · OHDSI/d2e · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

[data-1994] Remove parquet support from datamart plugin #201

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 13 commits into from
Jun 4, 2025

Conversation

alicia-koh
Copy link
Collaborator

Merge Checklist

Please cross check this list if additions / modifications needs to be done on top of your core changes and tick them off. Reviewer can as well glance through and help the developer if something is missed out.

  • Automated Tests (Jasmine integration tests, Unit tests, and/or Performance tests)
  • Updated Manual tests / Demo Config
  • Documentation (Application guide, Admin guide, Markdown, Readme and/or Wiki)
  • Verified that local development environment is working with latest changes (integrated with latest develop branch)
  • following best practices in code review doc

@alicia-koh alicia-koh marked this pull request as ready for review May 27, 2025 07:43
@Copilot Copilot AI review requested due to automatic review settings May 27, 2025 07:43
Copy link
Contributor
@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR removes Parquet snapshot functionality from the Datamart plugin, simplifying the flow to only support schema snapshots and cleaning up related utilities.

  • Drop CREATE_PARQUET_SNAPSHOT enum and all Parquet-related branches
  • Remove Minio DAO, upload helper, and Pandas/time imports
  • Simplify copy_schema signature and datamart_plugin match logic

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

File Description
flows/base/datamart_plugin/types.py Removed CREATE_PARQUET_SNAPSHOT action
flows/base/datamart_plugin/flow.py Removed Parquet imports, branches, and helper function; updated match and copy_schema signature
flows/_shared_flow_utils/dao/MinioDao.py Deleted entire file, dropping Parquet upload support
Comments suppressed due to low confidence (3)

flows/base/datamart_plugin/flow.py:18

  • Add a default case _: branch to the match on options.flow_action_type to explicitly handle unsupported action types and fail fast.
match options.flow_action_type:

flows/base/datamart_plugin/flow.py:87

  • [nitpick] Consider adding explicit type annotations for copy_schema parameters and return value to improve readability and editor support.
@task(log_prints=True)

flows/base/datamart_plugin/flow.py:20

  • Add or update unit/integration tests to verify that only CREATE_SNAPSHOT is supported and that using the removed CREATE_PARQUET_SNAPSHOT raises an appropriate error.
case DatamartFlowAction.CREATE_SNAPSHOT:

Signed-off-by: alicia-koh <110385419+alicia-koh@users.noreply.github.com>
@alicia-koh alicia-koh enabled auto-merge (squash) June 3, 2025 02:46
@alicia-koh alicia-koh merged commit 079f840 into develop Jun 4, 2025
53 checks passed
@alicia-koh alicia-koh deleted the alicia-koh/data-1994 branch June 4, 2025 02:44
csafreen pushed a commit that referenced this pull request Jun 4, 2025
* Remove parquet support from datamart plugin
* Remove minio in other plugins requirements.txt
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants
0