-
Notifications
You must be signed in to change notification settings - Fork 504
Improvements to flat vector search #2512
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## master #2512 +/- ##
============================================
+ Coverage 67.07% 67.09% +0.01%
- Complexity 1469 1472 +3
============================================
Files 218 219 +1
Lines 12585 12628 +43
Branches 1523 1526 +3
============================================
+ Hits 8442 8473 +31
- Misses 3618 3628 +10
- Partials 525 527 +2 ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
+ Two separate codecs: AnseriniLucene99FlatVectorFormat and AnseriniLucene99ScalarQuantizedVectorsFormat. + Hooked everything up to regressions for all BEIR datasets: {cached, ONNX} x {original, int8}.
This huge PR is ready for review.
I've hooked everything up to regressions, now we have complete set of regressions for all BEIR datasets: {cached, ONNX} x {original, int8}.
We now have two separate codecs:
AnseriniLucene99FlatVectorFormat
andAnseriniLucene99ScalarQuantizedVectorsFormat
.Mostly looking for a sanity check. In process of re-running all regressions to make sure everything still works.