pgvector

mirror of https://github.com/pgvector/pgvector.git synced 2026-06-30 01:31:15 +08:00

Author	SHA1	Message	Date
Heikki Linnakangas	ca3b4cd029	Remove HnswSpool It was just used to pass heap/index relations to HnswParallelScanAndInsert. I think it was copied from nbtsort.c, which is more complicated. I don't think we need a struct like this. (That said, I actually think that we should have a state object that would hold fields like 'heap', 'index', 'procinfo', 'collation' etc. Passing that object around would simplify the signatures of many functions. But that's a different story).	2024-01-19 00:26:47 -08:00
Heikki Linnakangas	d96e486274	Remove unused 'scantuplesortstates' field	2024-01-19 00:26:47 -08:00
Heikki Linnakangas	88213186a5	Remove unused argument	2024-01-19 00:26:47 -08:00
Andrew Kane	7dd9534894	Use same locking as insert	2024-01-19 00:18:29 -08:00
Andrew Kane	d801a843f4	Removed HnswPtrSetNull to avoid setting relptr_off directly	2024-01-16 17:08:13 -08:00
Andrew Kane	1458c7bb2a	Improved code [skip ci]	2024-01-16 14:03:28 -08:00
Andrew Kane	cad48d9203	Improved locking	2024-01-16 13:34:55 -08:00
Heikki Linnakangas	719b4b7436	Use LWLocks instead of SpinLocks (#410 ) Spinlocks should be held only for a few instructions, for multiple reasons: - You have to be very careful not to elog() out while holding a spinlock, because there is no mechanism to release the spinlock on error. - Waiters can waste a lot of cycles spinning if the lock is contended. I you wait on a spinlock for too long, the PostgreSQL implementation will actually PANIC, see s_lock_stuck(). The flushLock is particularly problematic. It is held in exclusive mode, which means it holds a spinlock, over the call to FlushPages(). FlushPages() performs lots of I/O so it can take a very long time (>= minutes), and can also easily error out for various reasons. allocatorLock would perhaps be OK as a spinlocks, but even that feels a bit heavy, so I converted that to an LWLock, too. entryLock is usually held for a very short time, in shared mode, so that would be fine as a spinlock. However, in the rare case that the entry point is updated, it's held for a very long time. An LWLock used in shared mode is about as fast a spinlock, that path is pretty heavily optimized. I think we have some problems with the per-element spinlocks too. In HnswUpdateNeighborPagesInMemory(), it's held over a call to HnswUpdateConnection(), but HnswUpdateConnection() can error out at least in case of an out-of-memory error (it uses lappend(), which calls palloc()). It also calls the distance function, and I don't think they are guaranteed to be ereport-free either. However, I didn't address that in this PR, it needs a bit more thinking.	2024-01-16 13:25:03 -08:00
Andrew Kane	fa0acbf62d	Fixed CI	2024-01-15 19:55:46 -08:00
Andrew Kane	1612b84069	Fixed error on Windows [skip ci]	2024-01-15 19:33:16 -08:00
Andrew Kane	2f9371516d	Leave space for other objects in shared memory	2024-01-15 19:17:50 -08:00
Andrew Kane	9d3e4e74df	Added support for in-memory parallel index builds for HNSW	2024-01-15 15:07:31 -08:00
Andrew Kane	c7d60346d8	Improved macro [skip ci]	2024-01-13 20:02:41 -08:00
Andrew Kane	597bfdc76b	Added HnswGetNeighbors macro	2024-01-13 20:00:34 -08:00
Andrew Kane	cbf3eb4fa5	Improved HNSW build and insert code	2024-01-13 10:07:42 -08:00
Andrew Kane	cacd389f6d	Improved pattern for duplicates	2024-01-12 14:30:13 -08:00
Andrew Kane	1881b857f9	Simplified code	2024-01-09 18:53:31 -08:00
Andrew Kane	108fb09d7b	Improved code [skip ci]	2024-01-08 17:54:49 -08:00
Andrew Kane	65d060ac86	Reverted FlushPages pattern for parallel builds	2024-01-08 10:45:31 -08:00
Andrew Kane	62ee33bb92	Improved locking code	2024-01-08 09:05:12 -08:00
Andrew Kane	520e274dde	Improved locking code	2024-01-07 22:34:41 -08:00
Andrew Kane	9e680884bd	Moved indtuples to HnswGraph	2024-01-07 22:23:49 -08:00
Andrew Kane	19a0e1b341	Moved graph to separate struct	2024-01-07 20:15:30 -08:00
Andrew Kane	c7fe1571ee	Improved code	2024-01-07 18:30:51 -08:00
Andrew Kane	cb4c770df2	Switched to slist for elements to reduce allocations and remove limit	2024-01-07 18:26:19 -08:00
Andrew Kane	85fdecd79b	Moved FlushPages before HnswEndParallel	2024-01-07 17:50:46 -08:00
Andrew Kane	6132428914	Improved number of parallel workers for HNSW index builds - closes #397	2024-01-05 19:46:08 -08:00
Andrew Kane	81d13bd40f	Improved code [skip ci]	2024-01-03 13:53:23 -05:00
Andrew Kane	8ee37b60a0	Improved memory estimate for HNSW index builds	2024-01-03 13:47:50 -05:00
Andrew Kane	9b73b3d1a6	Reduced memory and allocations for heap TIDs - closes #385	2024-01-03 13:41:34 -05:00
Andrew Kane	cae630784b	Improved BuildCallback [skip ci]	2023-12-30 20:55:29 -05:00
Andrew Kane	d87bcd2deb	Added comments [skip ci]	2023-12-30 18:29:01 -05:00
Andrew Kane	736576220a	Improved BuildCallback	2023-12-30 18:24:03 -05:00
Andrew Kane	a508b120c1	Added IVFFLAT_MEMORY flag to show memory usage [skip ci]	2023-12-24 09:27:09 -05:00
Andrew Kane	9a782d29f8	Use consistent style [skip ci]	2023-12-22 16:41:25 -05:00
Andrew Kane	1e422cd62b	Improved readability [skip ci]	2023-12-22 16:39:13 -05:00
Andrew Kane	569c69580a	Improved InsertTuple code - #384 Co-authored-by: Heikki Linnakangas <heikki.linnakangas@iki.fi>	2023-12-22 15:08:28 -05:00
Andrew Kane	59509c3a17	Added extra 5% to memory estimate	2023-12-22 14:04:05 -05:00
Andrew Kane	61738846af	Updated comment [skip ci]	2023-12-22 14:03:33 -05:00
Andrew Kane	e8c3bf0cef	Improved memory tracking for HNSW index builds - #384	2023-12-22 13:35:43 -05:00
Andrew Kane	50d1aed3d8	Improved memory usage logging [skip ci]	2023-12-22 13:09:11 -05:00
Andrew Kane	66e14d2434	Updated indentation [skip ci]	2023-12-22 12:59:50 -05:00
Andrew Kane	42cd4c6833	Fixed call to GenerationContextCreate for Postgres < 15	2023-12-22 12:49:07 -05:00
Andrew Kane	dcbe0b6f0d	Reduced memory usage for HNSW index builds - #384 Co-authored-by: Heikki Linnakangas <heikki.linnakangas@iki.fi>	2023-12-22 12:41:47 -05:00
Andrew Kane	f61d4087b5	Slightly improved memory estimation [skip ci]	2023-12-21 10:31:36 -05:00
Andrew Kane	57554e5b46	Added todo [skip ci]	2023-12-20 17:52:31 -05:00
Andrew Kane	6738fa0bd7	Added HNSW_MEMORY flag to show memory usage - #384 [skip ci]	2023-12-20 16:49:16 -05:00
Andrew Kane	9ab10aa674	Fixed CI	2023-12-20 16:29:13 -05:00
Andrew Kane	ec41dfa1d7	Mark meta buffer contents as dirty when not logging	2023-12-20 16:20:15 -05:00
Andrew Kane	43e0b3d9d4	Mark buffer contents as dirty when not logging	2023-12-20 16:16:25 -05:00

1 2 3 4 5 ...

440 Commits