-
Notifications
You must be signed in to change notification settings - Fork 1
[data-1994] Remove parquet support from datamart plugin #201
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull Request Overview
This PR removes Parquet snapshot functionality from the Datamart plugin, simplifying the flow to only support schema snapshots and cleaning up related utilities.
- Drop
CREATE_PARQUET_SNAPSHOT
enum and all Parquet-related branches - Remove Minio DAO, upload helper, and Pandas/time imports
- Simplify
copy_schema
signature anddatamart_plugin
match logic
Reviewed Changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.
File | Description |
---|---|
flows/base/datamart_plugin/types.py | Removed CREATE_PARQUET_SNAPSHOT action |
flows/base/datamart_plugin/flow.py | Removed Parquet imports, branches, and helper function; updated match and copy_schema signature |
flows/_shared_flow_utils/dao/MinioDao.py | Deleted entire file, dropping Parquet upload support |
Comments suppressed due to low confidence (3)
flows/base/datamart_plugin/flow.py:18
- Add a default
case _:
branch to the match onoptions.flow_action_type
to explicitly handle unsupported action types and fail fast.
match options.flow_action_type:
flows/base/datamart_plugin/flow.py:87
- [nitpick] Consider adding explicit type annotations for
copy_schema
parameters and return value to improve readability and editor support.
@task(log_prints=True)
flows/base/datamart_plugin/flow.py:20
- Add or update unit/integration tests to verify that only
CREATE_SNAPSHOT
is supported and that using the removedCREATE_PARQUET_SNAPSHOT
raises an appropriate error.
case DatamartFlowAction.CREATE_SNAPSHOT:
…nto alicia-koh/data-1994
Signed-off-by: alicia-koh <110385419+alicia-koh@users.noreply.github.com>
* Remove parquet support from datamart plugin * Remove minio in other plugins requirements.txt
Merge Checklist
Please cross check this list if additions / modifications needs to be done on top of your core changes and tick them off. Reviewer can as well glance through and help the developer if something is missed out.
develop
branch)