-
Notifications
You must be signed in to change notification settings - Fork 9k
HADOOP-18528. Disable readAhead/prefetching by default #5134
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
where do you test it?
what do you all think? this is us disabling the readahead so we can get on with the real fix without being in a rush and really trying to get the tests together. that should include taking the AbfsInputStream test, have it create real CSV rows (use my mkcsv code as a foundation) and then have use the mapreduce TextInputFormat code to read it back across multiple threads, just have hive and spark do |
🎊 +1 overall
This message was automatically generated. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good. Should we backport the test as well that I have added internally?
Okay, ran the test suite on trunk as well. There are no additional errors/failures after the changes, it's showing some errors on the base version.
Not sure why this is a failure and not just skipping the test in case the config isn't set.
Seeing it fail as a stand-alone test as well.
Intermittent failure, saw this in trunk once. Also had a timeout error in ITestAbfsTerasort. |
the lease one plays up on me, as does ITestSmallWriteOptimization; especially on parallel runs. i'm not worried there |
Disables block prefetching on ABFS InputStreams, by setting fs.azure.enable.readahead to false in core-default.xml and the matching java constant. This prevents HADOOP-18521. ABFS ReadBufferManager buffer sharing across concurrent HTTP requests. Once a fix for that is committed, this change can be reverted. Contributed by Mehakmeet Singh.
Disables block prefetching on ABFS InputStreams, by setting fs.azure.enable.readahead to false in core-default.xml and the matching java constant. This prevents HADOOP-18521. ABFS ReadBufferManager buffer sharing across concurrent HTTP requests. Once a fix for that is committed, this change can be reverted. Contributed by Mehakmeet Singh.
Disables block prefetching on ABFS InputStreams, by setting fs.azure.enable.readahead to false in core-default.xml and the matching java constant. This prevents HADOOP-18521. ABFS ReadBufferManager buffer sharing across concurrent HTTP requests. Once a fix for that is committed, this change can be reverted. Contributed by Mehakmeet Singh.
This reverts commit 69e50c7.
Description of PR
Setting readAhead config to false by default.
How was this patch tested?
Region:
us-west-2
mvn -Dparallel-tests=abfs -DtestsThreadCount=8 -Dscale clean verify
Seeing some timeout errors, going to test the latest ABFS trunk code to see the differences tomorrow.
For code changes:
LICENSE
,LICENSE-binary
,NOTICE-binary
files?