Why only S3? #127
-
Hi Dan. This project looks great. I read the blog https://blog.danthegoodman.com/icedb-v2 , and somehow felt that I could relate with every single word written over there. So I am about to try Icedb . But one question that came to mind (as someone who googled for parquet database and stumble upon this 5 minutes before ) is Why only S3? Can we run this locally ? |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 2 replies
-
You can run this locally using minio and docker quite easy. See https://github.com/danthegoodman1/icedb/blob/main/docker-compose.yml which is what is used for running all the examples and tests. Very simple to just run Note that this repo represents an evolution of https://blog.danthegoodman.com/icedb-v3--third-times-the-charm There's an open issue to add an "on-disk" mode but I've not needed it yet so I haven't spent time on it (although it shouldn't be very hard). As to why: Because the intention was to not need any state outside of usage-based billable infra like S3. To use local disks would suggest you might as well use something like duckdb or clickhouse directly. |
Beta Was this translation helpful? Give feedback.
-
Thank you Dan for your reply. This is great work here. |
Beta Was this translation helpful? Give feedback.
-
how do you connect to s3? Is it all based on duckdb and httpfs? You might consider using fsspec (or pyarrows own filesystem, pyarrow.fs) instead, which would easily allow to use other filesystems/storage backends (including local filesystem). Furthermore, I found, that in many scenarios, duckdb + fsspec is way faster than duckdb + https to run queries on s3. |
Beta Was this translation helpful? Give feedback.
You can run this locally using minio and docker quite easy. See https://github.com/danthegoodman1/icedb/blob/main/docker-compose.yml which is what is used for running all the examples and tests. Very simple to just run
docker compose up -d
and then run tests. Makes it easier to tear it down and reset all state as well withdocker compose down -v
.Note that this repo represents an evolution of https://blog.danthegoodman.com/icedb-v3--third-times-the-charm
There's an open issue to add an "on-disk" mode but I've not needed it yet so I haven't spent time on it (although it shouldn't be very hard).
As to why: Because the intention was to not need any state outside of usage-based billable infra…