Commit Graph

1341 Commits

Author SHA1 Message Date
Andrew Kane
8b0443afdf Test sparsevec vacuum recall 2024-04-23 17:10:48 -07:00
Andrew Kane
b609c343b4 Moved type-specific code to separate functions 2024-04-23 16:32:10 -07:00
Andrew Kane
bbfb3f200a DRY code for sorting vector arrays [skip ci] 2024-04-23 15:59:42 -07:00
Andrew Kane
99d367edc0 Improved code [skip ci] 2024-04-23 15:53:12 -07:00
Andrew Kane
991743786a Set length for newCenters and aggCenters [skip ci] 2024-04-23 15:47:04 -07:00
Andrew Kane
60ceaea4f2 Added safety check to NormCenters [skip ci] 2024-04-23 15:43:04 -07:00
Andrew Kane
9cd789fe06 Switched to support function for normalizing centers for k-means 2024-04-23 15:39:58 -07:00
Andrew Kane
0da6213a60 Moved type lookup to support functions - #527 2024-04-23 13:02:47 -07:00
Heikki Linnakangas
d1b83991af Forbid zero values in sparsevec's binary input function (#528)
The text input function simply left out any zero values, but the
binary input function did not. That's problematic because you end up
with an "unnormalized" sparse vector, which behaves in weird ways.  At
least sparsevec_cmp_internal() expects both inputs to not contain
zeros.

The binary send function never produces such zero values, but an
external tool could. Or to test, you can use COPY TO (FORMAT BINARY),
use a hex editor to edit one of the values to be zero, and copy it
back with COPY FROM (FORMAT BINARY).
2024-04-23 09:13:53 -07:00
Andrew Kane
6c247a38d3 Updated readme [skip ci] 2024-04-22 21:56:05 -07:00
Andrew Kane
6639cde19d Updated readme [skip ci] 2024-04-22 21:32:59 -07:00
Andrew Kane
bd409f0c6a Moved HnswGetType call [skip ci] 2024-04-22 19:22:09 -07:00
Andrew Kane
1994fd003a Removed unneeded headers [skip ci] 2024-04-22 19:10:50 -07:00
Andrew Kane
bd62561a19 Added support function for l2_normalize to ivfflat 2024-04-22 19:06:06 -07:00
Andrew Kane
f14c21748b Added support function for l2_normalize [skip ci] 2024-04-22 18:36:47 -07:00
Andrew Kane
2b77005610 Removed type-specific code from ivfscan 2024-04-22 18:12:18 -07:00
Andrew Kane
e884b3aa69 Added opclasses to readme [skip ci] 2024-04-22 16:29:34 -07:00
Andrew Kane
ab71c12a28 Added comments on dispatching [skip ci] 2024-04-22 16:18:57 -07:00
Andrew Kane
1804c63e27 Added more tests for vector distance functions [skip ci] 2024-04-22 15:53:13 -07:00
Andrew Kane
4e6aa2f0c1 Added DISABLE_DISPATCH option [skip ci] 2024-04-22 15:43:07 -07:00
Andrew Kane
40e86251c3 Added VECTOR_TARGET_CLONES to VectorL1Distance [skip ci] 2024-04-22 15:15:57 -07:00
Andrew Kane
0c9ae4b187 Added CPU dispatching for L1 distance for halfvec 2024-04-22 15:02:17 -07:00
Andrew Kane
d83af48e70 Improved tests for halfvec l1_distance [skip ci] 2024-04-22 14:43:54 -07:00
Andrew Kane
b2f7dad8a7 Removed support for L1 distance and Jaccard distance from ivfflat due to non-optimal clustering 2024-04-22 14:11:29 -07:00
Andrew Kane
881fbc15ef Added L1 distance operator to docs [skip ci] 2024-04-22 13:22:28 -07:00
Andrew Kane
f9941c2992 Moved L1 distance to halfutils [skip ci] 2024-04-22 13:19:42 -07:00
Andrew Kane
f9c071a761 Improved tests for L1 distance with halfvec 2024-04-22 13:14:45 -07:00
Andrew Kane
9f4b770db3 Added support for indexing sparsevec with L1 distance [skip ci] 2024-04-22 13:08:12 -07:00
Andrew Kane
70b299a7ff Added support for indexing halfvec with L1 distance [skip ci] 2024-04-22 13:00:59 -07:00
Andrew Kane
d46d014234 Updated test name [skip ci] 2024-04-22 12:57:01 -07:00
Andrew Kane
655adc535a Added bit examples for ivfflat [skip ci] 2024-04-22 12:51:38 -07:00
Andrew Kane
47f0a5e9ac Updated test name [skip ci] 2024-04-22 12:48:22 -07:00
Andrew Kane
af9d50481d Added support for indexing L1 distance 2024-04-22 12:44:03 -07:00
Andrew Kane
6dad8460a6 Updated readme [skip ci] 2024-04-22 10:45:41 -07:00
Andrew Kane
ed4837cc4f Renamed bit files 2024-04-22 10:22:18 -07:00
Andrew Kane
3df5655b30 Removed unneeded headers [skip ci] 2024-04-19 18:09:38 -07:00
Andrew Kane
e4c4ac9648 Added more tests for sparsevec to halfvec cast [skip ci] 2024-04-19 18:05:03 -07:00
Andrew Kane
fb77671d05 Added casts between halfvec and sparsevec 2024-04-19 18:03:07 -07:00
Andrew Kane
fd4fbd238c Updated sparsevec input to support indices in any order [skip ci] 2024-04-19 16:54:19 -07:00
Andrew Kane
4e093f95be Improved consistency of CPU dispatching code 2024-04-18 14:13:12 -07:00
Andrew Kane
fb3c964ac2 Improved performance of bit distance functions - #519
Co-authored-by: Nathan Bossart <nathan@postgresql.org>
Co-authored-by: "Jonathan S. Katz" <jkatz@users.noreply.github.com>
2024-04-18 13:45:00 -07:00
Andrew Kane
0b938f8328 Improved halfvec dispatching code [skip ci] 2024-04-17 20:22:14 -07:00
Andrew Kane
6153e173f3 Updated comments [skip ci] 2024-04-17 18:57:59 -07:00
Andrew Kane
eb48b9eec4 Added comments [skip ci] 2024-04-17 18:53:58 -07:00
Andrew Kane
fb6af03361 Fixed CPU dispatching check for halfvec distance functions 2024-04-17 18:27:27 -07:00
Andrew Kane
e2d8907180 Added todo [skip ci] 2024-04-17 17:25:53 -07:00
Andrew Kane
09ea1b0d5c Improved target_clones logic [skip ci] 2024-04-17 15:05:47 -07:00
Andrew Kane
8d68f88507 Improved target_clones logic [skip ci] 2024-04-17 15:05:19 -07:00
Andrew Kane
301c8083f5 Added check for undefined behavior to CI [skip ci] 2024-04-17 02:03:46 -07:00
Andrew Kane
8b33a359ce Updated VECTOR_ARRAY_SIZE for alignment [skip ci] 2024-04-17 00:55:28 -07:00