During EQC testing it was found that snapshots are still usable even
if the bookie process crashes. This change has snapshots monitor the
bookie and close when the bookie process dies.
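
A rough sketch of the mechanism (module name and callback shape are illustrative, not the actual leveled code): the snapshot process takes a monitor on the bookie at startup, and stops itself when the 'DOWN' message arrives.

    -module(snapshot_monitor_sketch).
    -behaviour(gen_server).

    -export([start_link/1]).
    -export([init/1, handle_call/3, handle_cast/2, handle_info/2,
             terminate/2, code_change/3]).

    -record(state, {bookie_monitor :: reference()}).

    start_link(BookiePid) ->
        gen_server:start_link(?MODULE, [BookiePid], []).

    init([BookiePid]) ->
        %% Monitor the bookie so a 'DOWN' message is received if it dies
        MonRef = erlang:monitor(process, BookiePid),
        {ok, #state{bookie_monitor = MonRef}}.

    handle_info({'DOWN', MonRef, process, _Pid, _Reason},
                State = #state{bookie_monitor = MonRef}) ->
        %% The bookie has died - the snapshot is no longer usable, so close
        {stop, normal, State};
    handle_info(_Msg, State) ->
        {noreply, State}.

    handle_call(_Req, _From, State) -> {reply, ok, State}.
    handle_cast(_Msg, State) -> {noreply, State}.
    terminate(_Reason, _State) -> ok.
    code_change(_OldVsn, State, _Extra) -> {ok, State}.
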
head_only mode can be run with_lookup - but there is no L0 index created in this case.
So the L0 index wasn't returning a position list and the L0 cache wasn't being checked.
The code now checks every position in the L0 cache when a lookup is attempted in head_only mode.
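
A minimal sketch of the fallback (the cache is shown here as a list of gb_trees, whereas leveled uses its own tree structure): without an index to return a position list, each position in the cache has to be checked in turn.

    -module(l0_cache_sketch).
    -export([lookup/2]).

    %% Check every position (tree) in the L0 cache until the key is found
    -spec lookup(term(), list()) -> {value, term()} | none.
    lookup(_Key, []) ->
        none;
    lookup(Key, [Tree | RestOfCache]) ->
        case gb_trees:lookup(Key, Tree) of
            {value, Value} ->
                {value, Value};           % found at this position
            none ->
                lookup(Key, RestOfCache)  % fall through to the next position
        end.
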
Interestingly, setting max_pencillercachesize to a non-integer merely had the impact of making the penciller cache size infinite.
So a guard has been added to make sure it is an integer going forward.
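
A sketch of the guard (the default value and function name here are illustrative): a non-integer setting now falls back to the default rather than silently removing the cache size limit.

    -module(cachesize_guard_sketch).
    -export([get_max_pencillercachesize/1]).

    -define(DEFAULT_MAX_PENCILLERCACHESIZE, 28000).

    %% Only accept a positive integer for the maximum penciller cache size
    get_max_pencillercachesize(MaxCacheSize)
            when is_integer(MaxCacheSize), MaxCacheSize > 0 ->
        MaxCacheSize;
    get_max_pencillercachesize(_NotAnInteger) ->
        ?DEFAULT_MAX_PENCILLERCACHESIZE.
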
The IMM iterator should not be reused, as it has already been filtered for a query; if it is reused for a different query, incorrect and unexpected results may occur.
This reuse had been stopped by a previous commit, and this cleans up subsequently unused code.
Initial commit to add head_only mode to leveled. This allows leveled to receive batches of object changes, but where those objects exist only in the Penciller's Ledger (once they have been persisted within the Ledger).
The aim is to reduce significantly the cost of compaction. Also, the objects are not directly accessible (they can only be accessed through folds). Again this makes life easier during merging in the LSM trees (as no bloom filters have to be created).
Previously the tinybloom was used within the SST file as an extra check to remove false fetches.
However the SST already has a low FPR check in the slot_index. If the new bloom is used (which is no longer per slot, but per SST file), it can be shared with the penciller, and the penciller can then check it and avoid the message pass.
The message pass may be blocked by a 2i query or a slot fetch request for a merge. So this should make performance within the Penciller snappier.
This is as a result of taking sst_timings within a volume test - where there was an average of +100 microsecs for each level that was dropped down. Given the bloom/slot checks were < 20 microsecs, there seems to be some further delay.
The bloom is a binary of > 64 bytes - so passing it around should not require a copy.
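
A sketch of the short-cut (the bloom format and function names here are placeholders, not the real leveled API): with one bloom per SST file, the Penciller can hold a copy and test it locally before paying for the message pass to the file process.

    -module(bloom_shortcut_sketch).
    -export([maybe_fetch/4]).

    %% Placeholder membership test - the real leveled bloom has its own format
    bloom_check(Bloom, Hash) when is_binary(Bloom), is_integer(Hash) ->
        BitSize = bit_size(Bloom),
        Pos = Hash rem BitSize,
        <<_:Pos/bitstring, Bit:1, _/bitstring>> = Bloom,
        Bit =:= 1.

    maybe_fetch(Key, Hash, SSTPid, Bloom) ->
        case bloom_check(Bloom, Hash) of
            false ->
                %% Definitely not in this file - no message pass required
                not_present;
            true ->
                %% Possible hit - only now pay for the call, which may queue
                %% behind a 2i query or a slot fetch for a merge
                gen_server:call(SSTPid, {fetch, Key, Hash}, infinity)
        end.
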
Compression can be switched between LZ4 and zlib (native).
The setting to determine if compression should happen on receipt is now a macro definition in leveled_codec.
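
A sketch of the receipt-time switch (the macro and function names are assumptions; only the native zlib path is shown, with LZ4 as the alternative method): whether compression happens on receipt is decided at compile time by a macro in the codec.

    -module(compress_switch_sketch).
    -export([maybe_compress/1]).

    %% Compile-time setting: compress when the object is received, or defer
    -define(COMPRESS_ON_RECEIPT, true).

    maybe_compress(Bin) when is_binary(Bin) ->
        maybe_compress(Bin, ?COMPRESS_ON_RECEIPT).

    maybe_compress(Bin, true) ->
        zlib:compress(Bin);   % native (zlib) compression; LZ4 is the alternative
    maybe_compress(Bin, false) ->
        Bin.
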
Initially with basic tests. If the SlotIndex has been cached, we can now use the slot index as it is based on the Segment hash algorithm.
This looks like it should lead to an order of magnitude improvement in querying for keys/clocks by segment ID.
This also required a slight tweak to the penciller keyfolder. It now caches the next answer from the SSTiter, rather than restarting the iterator. When the IMMiter has many more entries than the SSTiter (as the SSTiter is being filtered but not the IMMiter) this could lead to lots of repeated folding.
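
A sketch of the pattern (this is not the real leveled_penciller keyfolder; both iterators are represented as sorted key lists): the next SST answer is carried across steps rather than re-queried each time an in-memory key wins the comparison.

    -module(keyfolder_sketch).
    -export([keyfolder/4]).

    %% Next answer from the "SST iterator" (represented here as a sorted list)
    next_sst([]) -> {none, []};
    next_sst([KV | Rest]) -> {KV, Rest}.

    keyfolder(IMMiter, SSTiter, FoldFun, Acc) ->
        fold_loop(IMMiter, SSTiter, no_cached_answer, FoldFun, Acc).

    fold_loop([], SSTiter, Cached, FoldFun, Acc) ->
        %% In-memory side exhausted - fold the rest of the SST side,
        %% starting with any cached answer
        Rest = case Cached of
                   no_cached_answer -> SSTiter;
                   KV -> [KV | SSTiter]
               end,
        lists:foldl(fun({K, V}, A) -> FoldFun(K, V, A) end, Acc, Rest);
    fold_loop([{IMMKey, IMMVal} | IMMRest] = IMM, SSTiter, Cached, FoldFun, Acc) ->
        {SSTAnswer, SSTiter0} =
            case Cached of
                no_cached_answer -> next_sst(SSTiter);
                CachedKV -> {CachedKV, SSTiter}
            end,
        case SSTAnswer of
            none ->
                %% SST side exhausted - fold the rest of the in-memory side
                lists:foldl(fun({K, V}, A) -> FoldFun(K, V, A) end, Acc, IMM);
            {SSTKey, SSTVal} when SSTKey < IMMKey ->
                %% SST key sorts first - emit it and clear the cache
                fold_loop(IMM, SSTiter0, no_cached_answer, FoldFun,
                          FoldFun(SSTKey, SSTVal, Acc));
            {SSTKey, _SSTVal} when SSTKey =:= IMMKey ->
                %% Same key on both sides - the in-memory entry is newer
                fold_loop(IMMRest, SSTiter0, no_cached_answer, FoldFun,
                          FoldFun(IMMKey, IMMVal, Acc));
            _ ->
                %% In-memory key sorts first - emit it, but keep the SST
                %% answer cached rather than restarting the SST iterator
                fold_loop(IMMRest, SSTiter0, SSTAnswer, FoldFun,
                          FoldFun(IMMKey, IMMVal, Acc))
        end.
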
Discovered a bug with search ranges in leveled_tree - this was uncovered by an intermittently failing 19.3 test.
Test case added and bug fixed. It was due to a failure to use the end_key passed, causing issues with particular manifests and full bucket ranges.
Switch from magic hash to md5 - to hopefully remove the need for some
of the artificial jumps required to get expected false positive ratios.
Also split the hash into two 16-bit integers. We assume that SegmentID
(from the perspective of AAE merkle/tictac trees) will always be at
least 16 bits. The idea is that hashes should be used in blooms and
indexes such that some advantage can be gained from just knowing the
segmentID - in particular when folding over all the keys in a bucket.
Performance testing has been difficult so far - I think due to “cloud”
mysteries.
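
A minimal sketch of the split described above (the function name and exact bit layout here are assumptions): take the md5 of the key and treat the leading 16 bits as the SegmentID, with a further 16 bits available for blooms and indexes.

    -module(segment_hash_sketch).
    -export([segment_hash/1]).

    %% Split an md5 hash into two 16-bit integers
    segment_hash(Key) when is_binary(Key) ->
        <<SegmentID:16/integer, ExtraHash:16/integer, _Rest/binary>> =
            crypto:hash(md5, Key),
        {SegmentID, ExtraHash}.
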
Introduce a dedicated module for all the different fold types. Also simplify the list of folders by deprecating those folds that should be achievable by fold_heads/fold_objects type folds but with smarter functions.
Makes sure that the fold functions also have better spec coverage, and are dialyzer checked.
As described in https://github.com/martinsumner/leveled/issues/92
Only the first fix was made.
Just to be safe - archiving means renaming to another file with a different extension. The assumption is that renamed files can be manually reaped if necessary.
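
A sketch of the archiving step (both the ".cdb" source extension and the ".bak" archive extension here are assumptions): renaming, rather than deleting, leaves the file on disk to be reaped manually.

    -module(archive_sketch).
    -export([archive_file/1]).

    %% Rename rather than delete, so the file can be manually reaped later
    archive_file(Filename) ->
        ArchiveName = filename:rootname(Filename, ".cdb") ++ ".bak",
        file:rename(Filename, ArchiveName).
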
This required a switch to change the sync strategy based on a rebar parameter.
However tests could be slow on a MacBook with OTP16 and sync - so timeouts were added in unit tests, and the ct test sync_strategy was changed to not sync for OTP16.
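
A sketch of the rebar-driven switch (the regex, macro and strategy names are assumptions): a platform_define sets a macro when building on an R16 release, and the code picks its default sync strategy from that macro at compile time.

    %% In rebar.config - define 'no_sync' when building on an R16 release
    {erl_opts, [{platform_define, "^R16", no_sync}]}.

    %% In the module - choose the default sync strategy at compile time
    -ifdef(no_sync).
    -define(DEFAULT_SYNC_STRATEGY, none).
    -else.
    -define(DEFAULT_SYNC_STRATEGY, sync).
    -endif.
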
If the clerk updates the manifest - it might not recognise changes to
the manifest made since the clerk took the manifest. So the penciller
must merge its view of the snapshots back into the updated manifest.
The fold-objects query which snaps within the fold was implemented
incorrectly - it took information from the LedgerCache at the point of
the request, not
at the point of the fold. So the LedgerCache SQN may have been
surpassed in the Penciller by the time the fold was called.
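
A sketch of the pattern behind the fix (generic, not the actual leveled fold code): rather than capturing ledger-cache state when the fold is requested, return a fun that takes the snapshot only when it is executed, so the snapshot reflects the state at fold time.

    -module(deferred_snapshot_sketch).
    -export([make_fold_runner/3]).

    %% SnapFun takes the snapshot; FoldFun folds over it with InitAcc
    make_fold_runner(SnapFun, FoldFun, InitAcc) when is_function(SnapFun, 0) ->
        fun() ->
            %% The snapshot is taken here, when the runner is executed,
            %% not earlier when the fold was requested
            Snapshot = SnapFun(),
            FoldFun(Snapshot, InitAcc)
        end.
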
Increase this to 90 minutes. The first time all the snapshots are
rebuilt it may take a long time, but they all get scheduled together -
and queued until concurrency limits allow it to be completed.
Currently the snapshot is made on initialisation, and only released
when completed (which may be after the queue). So the last couple of
snapshots were over-shooting the 1 hour.
The manifest and the logs are bloated by having the full file path for
every filename in there - given the root path is constant.
This could also cause issues if the mount point is ever changed.
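
A sketch of the change (function names are illustrative): keep only the filename in the manifest and logs, and rebuild the full path from the constant root path when it is actually needed.

    -module(manifest_path_sketch).
    -export([to_manifest_name/1, to_full_path/2]).

    %% Store only the filename in the manifest and logs ...
    to_manifest_name(FullPath) ->
        filename:basename(FullPath).

    %% ... and join it back with the root path when the file is opened
    to_full_path(RootPath, FileName) ->
        filename:join(RootPath, FileName).
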