#8183: Add S3 provider for Hetzner object storage #8551

Open · wants to merge 1 commit into master

Conversation

@spiffytech commented May 12, 2025

What is the purpose of this change?

This PR closes #8183 by adding an S3 provider for Hetzner Object Storage.

Was the change discussed in an issue or in the forum before?

#8183

Notes

Hetzner test performance

Hetzner offers only European regions for its object storage.

When I ran the integration tests from my home in North Carolina, or my VPS in Virginia, they blew past the 10-minute test timeout (especially the chunk tests).

When I used a Hetzner VPS in Finland, they ran well under the limit.

Yay, trans-Atlantic latency!

Hetzner useAlreadyExists behavior

The existing integration tests appear to assume that if a bucket is being asynchronously deleted and you request a new bucket with the same name, the provider honors the create request (cancelling the deletion?).

Hetzner doesn't do this. You get a create error (which the S3 fs code swallows), the bucket still gets deleted, and then every test errors out because it tries to use a bucket that doesn't exist.

I considered waiting until the API reported the bucket name as available for reuse, but we can't count on every provider doing that swiftly. Hetzner, at least, is slow to report that the name is free, and if the bucket were recreated in a different region, AWS is known to take over an hour.

Instead, I altered the tests to use a new bucket name each time they request a bucket. With this change, the tests pass under Hetzner and AWS. I don't know if this breaks any assumptions.
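
For concreteness, here's a minimal sketch of the naming scheme (the helper name is invented; the real test change may differ):

// Sketch: one fresh bucket name per request (helper name is invented)
package main

import (
	"fmt"
	"math/rand"
	"time"
)

// uniqueBucketName derives a never-before-used name from a prefix, so a
// create can't collide with a same-named bucket pending deletion.
func uniqueBucketName(prefix string) string {
	return fmt.Sprintf("%s-%d-%04d", prefix, time.Now().UnixNano(), rand.Intn(10000))
}

func main() {
	fmt.Println(uniqueBucketName("rclone-test"))
}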

Please let me know if this isn't the right way to handle this!

I considered extracting the fs.NewFs() + f.Mkdir() + error handling into a reusable function, but it wasn't clear that there was a unified abstraction across the tests. That pattern appeared in several places, but some tests seemed to want manual control over those steps, and others used operations.Mkdir() instead of f.Mkdir(). I tried to keep the change small and concrete until I get feedback otherwise (a sketch of the helper I considered follows).
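
For the record, that helper would have looked something like this (mustMakeFs is an invented name, and the error handling is illustrative):

// Sketch of the helper I decided not to extract (name is invented)
package fstests

import (
	"context"
	"testing"

	"github.com/rclone/rclone/fs"
)

// mustMakeFs builds an Fs for remote and creates its root directory,
// failing the test on any error. Some tests want manual control over
// these steps, which is why the extraction didn't obviously pay off.
func mustMakeFs(ctx context.Context, t *testing.T, remote string) fs.Fs {
	f, err := fs.NewFs(ctx, remote)
	if err != nil {
		t.Fatalf("NewFs(%q): %v", remote, err)
	}
	if err := f.Mkdir(ctx, ""); err != nil {
		t.Fatalf("Mkdir: %v", err)
	}
	return f
}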

[Screenshots attached: rclone-s3, rclone-hetzner]

Checklist

  • I have read the contribution guidelines.
  • I have added tests for all changes in this PR if appropriate.
  • I have added documentation for the changes if appropriate.
  • All commit messages are in house style.
  • I'm done, this Pull Request is ready for review :-)

@ncw (Member) commented May 15, 2025

The existing integration tests appear to assume that if a bucket is being asynchronously deleted and you request a new bucket with the same name, the provider honors the create request (cancelling the deletion?).

I'm not particularly happy about altering the integration tests just for one provider.

The azureblob backend has a similar problem, and what it does is check the create error (hopefully it is specific enough), then sleep and retry. Could we use that technique here too?

@spiffytech (Author) commented May 15, 2025

Very understandable.

Note that this PR creates a new S3 provider, not a separate backend like azureblob. The proposal to retry would require changing the behavior for all S3 providers, or special-casing Hetzner Object Storage. Or maybe a new quirk? Are any of these choices acceptable, or do we need to look at a different approach?

Hetzner returns BucketAlreadyExists for buckets pending deletion. s3.go's makeBucket explicitly swallows that error when the useAlreadyExists quirk is false (which it is for Hetzner). Is it appropriate for all S3 providers to stop swallowing the error and switch to retrying?

// Current code
case "BucketAlreadyExists", "BucketNameUnavailable":
    if f.opt.UseAlreadyExists.Value {
        // We can trust BucketAlreadyExists to mean not owned by us, so make it non retriable
        err = fserrors.NoRetryError(err)
    } else {
        // We can't trust BucketAlreadyExists to mean not owned by us, so ignore it
        err = nil
    }
}

I've implemented the retry logic by mimicking azureblob's makeContainer. It's not perfect: the S3 backend's pacer only does 2 retries. That's enough to get the tests passing today, but who knows how fast or slow any given delete will be.
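
Here's a self-contained sketch of the shape of that retry, borrowed from azureblob's sleep-and-retry approach (names, counts, and the sentinel error are illustrative, not the actual diff):

// Sketch: sleep-and-retry on BucketAlreadyExists (all names illustrative)
package main

import (
	"context"
	"errors"
	"fmt"
	"time"
)

// errPendingDeletion stands in for the BucketAlreadyExists error returned
// while an asynchronous delete is still releasing the bucket name.
var errPendingDeletion = errors.New("BucketAlreadyExists")

// makeBucketWithRetry keeps retrying bucket creation while the name is
// held by a pending deletion, instead of swallowing the error.
func makeBucketWithRetry(ctx context.Context, create func(context.Context) error) error {
	const (
		maxTries = 10              // more headroom than the pacer's default of 2
		sleep    = 6 * time.Second // async deletes can take a while
	)
	var err error
	for try := 0; try < maxTries; try++ {
		if err = create(ctx); err == nil || !errors.Is(err, errPendingDeletion) {
			return err
		}
		select {
		case <-ctx.Done():
			return ctx.Err()
		case <-time.After(sleep):
		}
	}
	return fmt.Errorf("bucket name still unavailable after %d tries: %w", maxTries, err)
}

func main() {
	tries := 0
	err := makeBucketWithRetry(context.Background(), func(context.Context) error {
		tries++
		if tries < 3 {
			return errPendingDeletion // simulate a delete still in flight
		}
		return nil
	})
	fmt.Println(tries, err) // prints: 3 <nil>
}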

With this change, all tests pass except one:

Test FsListRootedSubdir fails. I think it assumes CreateBucket is idempotent and relies on the current error-swallowing behavior. The test calls testPut ➜ ... PutTestContentsMetadata ➜ ... NewFs against a bucket that already exists and is not pending deletion, so the test just keeps retrying, waiting for CreateBucket to stop returning BucketAlreadyExists.

I cannot see a way for Hetzner Object Storage to handle CreateBucket idempotently. The error for a permanent bucket is the same as for one that'll disappear any second. And we can't just ask "does this bucket exist" because we can't tell whether it's pending deletion.

Next steps
I need input to proceed with this PR:

  1. How should s3.go know to apply Hetzner-specific behavior?
    • A new quirk seems the least intrusive (see the sketch after this list). Is this an appropriate use of quirks?
  2. How can we preserve CreateBucket idempotence with Hetzner?
  3. Can we have makeBucket use more retries than the S3 pacer's default?
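
To make question 1 concrete, here's roughly what a new quirk could look like, modelled on the existing useAlreadyExists pattern in s3.go's setQuirks (the field and function shapes here are simplified assumptions, not the real code):

// Sketch of a new quirk, modelled on useAlreadyExists (shapes simplified)
package main

import "fmt"

// Options is a stand-in for the s3 backend's option struct; the real one
// uses richer types for quirks, simplified to bool here.
type Options struct {
	Provider          string
	UseAlreadyExists  bool
	RetryBucketCreate bool // hypothetical quirk, named for what it does
}

// setQuirks applies provider-specific behavior, as s3.go does for its
// other quirks.
func setQuirks(opt *Options) {
	useAlreadyExists := true   // most providers report ownership reliably
	retryBucketCreate := false // most providers delete buckets synchronously
	switch opt.Provider {
	case "Hetzner":
		useAlreadyExists = false
		retryBucketCreate = true // deletes are async; retry name collisions
	}
	opt.UseAlreadyExists = useAlreadyExists
	opt.RetryBucketCreate = retryBucketCreate
}

func main() {
	opt := Options{Provider: "Hetzner"}
	setQuirks(&opt)
	fmt.Printf("%+v\n", opt)
}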

@spiffytech (Author) commented
I've pushed an update backing out the changes to the integration tests and adding a retry for CreateBucket. As noted above, this is WIP and isn't ready to merge.

@ncw (Member) commented May 16, 2025
  • How should s3.go know to apply Hetzner-specific behavior?

    • A new quirk seems the least intrusive. Is this an appropriate use of quirks?

Absolutely. That is what quirks are for. Make sure you name it for what it does, not fixHetzner.

  • How can we preserve CreateBucket idempotence with Hetzner?

Good question!

We could just decide that one test failure is ok. Does it work if you just run that test with the -run flag? If so, the integration tester would run the tests, get one failure, and retry that test.

  • Can we have makeBucket use more retries than S3 Pacer's default?

The AWS SDK retries 10 times for most things anyway, which is why we only use 2 by default.

You could just have a bit of manual logic, or make a new pacer, whichever you fancy. I think if the tests pass today it probably isn't worth worrying about too much though.

@spiffytech (Author) commented
Does it work if you just run that test with the -run flag?

Nope: because an Fs is initialized at the top of the test suite, running just FsListRootedSubdir still fails to create the bucket.
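
(For reference, the single-test invocation looked roughly like this; the remote name and exact subtest path are illustrative:)

cd backend/s3
go test -v -remote TestS3Hetzner: -run 'TestIntegration/FsMkdir/FsListRootedSubdir' -timeout 30m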

I think if the tests pass today it probably isn't worth worrying about too much though.

Sounds good. YAGNI!


I've pushed an update that adds the quirk. Given we're willing to accept one test failure, I think this is ready for review!
