-
Notifications
You must be signed in to change notification settings - Fork 3k
Vectorize CompactProtocol write on aa 8000 rch64 #9605
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
This pull request was exported from Phabricator. Differential Revision: D72810122 |
77db092
to
d7f7ee8
Compare
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 8.19us 122.04K CompactProtocol_write_BigListInt 9.19us 108.81K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
This pull request was exported from Phabricator. Differential Revision: D72810122 |
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 8.19us 122.04K CompactProtocol_write_BigListInt 9.19us 108.81K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
d7f7ee8
to
50cfaaa
Compare
This pull request was exported from Phabricator. Differential Revision: D72810122 |
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 8.19us 122.04K CompactProtocol_write_BigListInt 9.19us 108.81K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
50cfaaa
to
b3c3c15
Compare
This pull request was exported from Phabricator. Differential Revision: D72810122 |
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 8.19us 122.04K CompactProtocol_write_BigListInt 9.19us 108.81K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 8.19us 122.04K CompactProtocol_write_BigListInt 9.19us 108.81K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
b3c3c15
to
2511a93
Compare
This pull request was exported from Phabricator. Differential Revision: D72810122 |
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 8.19us 122.04K CompactProtocol_write_BigListInt 9.19us 108.81K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
2511a93
to
1ac36e7
Compare
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 8.19us 122.04K CompactProtocol_write_BigListInt 9.19us 108.81K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
1ac36e7
to
eea7858
Compare
This pull request was exported from Phabricator. Differential Revision: D72810122 |
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 8.19us 122.04K CompactProtocol_write_BigListInt 9.19us 108.81K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Perf 8000 ormance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 8.19us 122.04K CompactProtocol_write_BigListInt 9.19us 108.81K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
eea7858
to
44b57d1
Compare
This pull request was exported from Phabricator. Differential Revision: D72810122 |
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 7.18us 139.31K CompactProtocol_write_BigListInt 9.19us 108.81K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
44b57d1
to
7349b9d
Compare
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
This pull request was exported from Phabricator. Differential Revision: D72810122 |
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
47f128c
to
428c0f1
Compare
This pull request was exported from Phabricator. Differential Revision: D72810122 |
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
428c0f1
to
39ad5dd
Compare
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
This pull request was exported from Phabricator. Differential Revision: D72810122 |
39ad5dd
to
80b15a6
Compare
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122 Reviewed By: vitaut
Summary: Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Reviewed By: vitaut Differential Revision: D72810122
Summary: Pull Request resolved: facebook#9605 Vectorize CompactProtocol's list writing on aarch64. Maybe new code should be placed in a different file instead. Performance gains varies by type: before: CompactProtocol_write_SmallListInt 38.38ns 26.05M CompactProtocol_write_BigListByte 18.40us 54.33K CompactProtocol_write_BigListShort 19.30us 51.82K CompactProtocol_write_BigListInt 19.96us 50.11K CompactProtocol_write_BigListBigInt 26.54us 37.68K CompactProtocol_write_BigListFloat 18.54us 53.92K CompactProtocol_write_BigListDouble 18.79us 53.22K after: CompactProtocol_write_SmallListInt 31.65ns 31.60M CompactProtocol_write_BigListByte 223.77ns 4.47M CompactProtocol_write_BigListShort 6.58us 152.07K CompactProtocol_write_BigListInt 8.26us 121.06K CompactProtocol_write_BigListBigInt 11.40us 87.73K CompactProtocol_write_BigListFloat 830.74ns 1.20M CompactProtocol_write_BigListDouble 1.55us 645.79K Differential Revision: D72810122 Reviewed By: vitaut
Summary:
Vectorize CompactProtocol's list writing on aarch64.
Maybe new code should be placed in a different file instead.
Performance gains varies by type:
before:
CompactProtocol_write_SmallListInt 38.38ns 26.05M
CompactProtocol_write_BigListByte 18.40us 54.33K
CompactProtocol_write_BigListShort 19.30us 51.82K
CompactProtocol_write_BigListInt 19.96us 50.11K
CompactProtocol_write_BigListBigInt 26.54us 37.68K
CompactProtocol_write_BigListFloat 18.54us 53.92K
CompactProtocol_write_BigListDouble 18.79us 53.22K
after:
CompactProtocol_write_SmallListInt 31.65ns 31.60M
CompactProtocol_write_BigListByte 223.77ns 4.47M
CompactProtocol_write_BigListShort 8.19us 122.04K
CompactProtocol_write_BigListInt 9.19us 108.81K
CompactProtocol_write_BigListBigInt 11.40us 87.73K
CompactProtocol_write_BigListFloat 830.74ns 1.20M
CompactProtocol_write_BigListDouble 1.55us 645.79K
Differential Revision: D72810122