-
Notifications
You must be signed in to change notification settings - Fork 9.1k
MAPREDUCE-7341. Add an intermediate manifest committer for Azure and GCS #4075
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
steveloughran
wants to merge
3
commits into
apache:trunk
from
steveloughran:mr/MAPREDUCE-7341-manifest-committer-merge
Closed
MAPREDUCE-7341. Add an intermediate manifest committer for Azure and GCS #4075
steveloughran
wants to merge
3
commits into
apache:trunk
from
steveloughran:mr/MAPREDUCE-7341-manifest-committer-merge
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
…tter * new stats names in StoreStatisticNames (for joint use with s3a committers) * improvements to IOStatistics * RateLimiting wrapper to guava RateLimiter * S3A committer Tasks moved over as TaskPool & support for RemoteIterator * JsonSerialization.load to fail fast if source does not exist + tests. Contributed by Steve Loughran Change-Id: Ia92e2ab5083ac3d8d3d713a4d9cb3e9e0278f654
This is a mapreduce/spark output committer optimized for performance and correctness on Azure ADLS Gen 2 storage (via the abfs connector) and Google Cloud Storage (via the external gcs connector library). * It is safe to use with HDFS, however it has not been optimized for that use. * It is *not* safe for use with S3, and will fail if an attempt is made to do so. Contributed by Steve Loughran Change-Id: I6f3502e79c578b9fd1a8c1485f826784b5421fca
…EDUCE-7341 Follow-on patch to MAPREDUCE-7341, adding ABFS support and tests * resilient rename * tests for job commit through the manifest committer. contains - HADOOP-17976. ABFS etag extraction inconsistent between LIST and HEAD calls - HADOOP-16204. ABFS tests to include terasort Contributed by Steve Loughran. Change-Id: I0a7d4043bdf19bcb00c033fc389730109b93b77f
🎊 +1 overall
This message was automatically generated. |
merged to 3.3 and trunk manually |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
this is the patch of #2971 spilt into three subcommits on top of trunk for merging, one per module, to be applied in order
once yetus is happy I will merge by cherrypicking each commit individually
tested: azure cardiff.