leveled

Author	SHA1	Message	Date
Martin Sumner	c294570bce	Mas d31 nhskv16sst (#428 ) * Add performance/profiling test Add test to perf_SUITE to do performance tests and also profile different activities in leveled. This can then be used to highlight functions with unexpectedly high execution times, and prove the impact of changes. Switch between riak_ctperf and riak_fullperf to change from standard test (with profile option) to full-scale performance test * Change shape of default perfTest * Refactor SST Compare and contrast profile for guess, before and after refactor: pre ``` lists:map_1/2 313370 2.33 32379 [ 0.10] lists:foldl_1/3 956590 4.81 66992 [ 0.07] leveled_sst:'-expand_list_by_pointer/5-fun-0-'/4 925020 6.13 85318 [ 0.09] erlang:binary_to_term/1 3881 8.55 119012 [ 30.67] erlang:'++'/2 974322 11.55 160724 [ 0.16] lists:member/2 4000180 15.00 208697 [ 0.05] leveled_sst:find_pos/4 4029220 21.01 292347 [ 0.07] leveled_sst:member_check/2 4000000 21.17 294601 [ 0.07] -------------------------------------------------- -------- ------- ------- [----------] Total: 16894665 100.00% 1391759 [ 0.08] ``` post ``` lists:map_1/2 63800 0.79 6795 [ 0.11] erlang:term_to_binary/1 15726 0.81 6950 [ 0.44] lists:keyfind/3 180967 0.92 7884 [ 0.04] erlang:spawn_link/3 15717 1.08 9327 [ 0.59] leveled_sst:'-read_slots/5-fun-1-'/8 31270 1.15 9895 [ 0.32] gen:do_call/4 7881 1.31 11243 [ 1.43] leveled_penciller:find_nextkey/8 180936 2.01 17293 [ 0.10] prim_file:pread_nif/3 15717 3.89 33437 [ 2.13] leveled_sst:find_pos/4 4028940 17.85 153554 [ 0.04] erlang:binary_to_term/1 15717 51.97 447048 [ 28.44] -------------------------------------------------- ------- ------- ------ [----------] Total: 6704100 100.00% 860233 [ 0.13] ``` * Update leveled_penciller.erl * Mas d31 nhskv16sstpcl (#426) Performance updates to leveled: - Refactoring of pointer expansion when fetching from leveled_sst files to avoid expensive list concatenation. - Refactoring of leveled_ebloom to make more flexible, reduce code, and improve check time. - Refactoring of querying within leveled_sst to reduce the number of blocks that need to be de-serialised per query. - Refactoring of the leveled_penciller's query key comparator, to make use of maps and simplify the filtering. - General speed-up of frequently called functions.	2024-01-22 21:22:54 +00:00
Martin Sumner	6223b801f3	Mas d31 i410looptoclose (#421 ) * Mas i410 looptoclose (#420) * Stop waiting full SHUTDOWN_PAUSE If there is a snapshot outstanding at shutdown time, there was a wait of SHUTDOWN_PAUSE to give the snapshot time to close down. This causes an issue in kv_index_tictactree when rebuilds complete, when an exchange was in flight at the point the rebuild completed - the aae_controller will become blocked for the full shutdown pause, whilst it waits for the replaced key store to be closed. This change is to loop within the shutdown pause, so that if the snapshot supporting the exchange is closed, the paused bookie can close more quickly (unblocking the controller). Without this fix, there are intermittent issues in kv_index_tictactree's mockvnode_SUITE tests. * Address test reliability Be a bit clearer with waiting round seconds, Was intermittently failing on QR4 previously (but QR5 1s later was always OK). * Update iterator_SUITE.erl * Refine test assertion At Stage C there might be 0 files left, in which case equality with Stage D result is ok.	2023-11-10 15:04:47 +00:00
Martin Sumner	9e804924a8	Mas d31 i416 (#418 ) * Add compression controls (#417) * Add compression controls Add configuration options to allow for a compression algorithm of `none` to disable compression altogether. Also an option to change the point in the LSM tree when compression is applied. * Handle configurable defaults consistently Move them into leveled.hrl. This forces double-definitions to be resolved. There are some other constants in leveled_bookie that are relevant outside of leveled_bookie. These are all now in the non-configurable startup defaults section. * Clarify referred-to default is OTP not leveled * Update leveled_bookie.erl Handle xref issue with eunit include	2023-11-07 14:58:43 +00:00
Martin Sumner	6677f2e5c6	Push log update through to cdb/sst Using the cdb_options and sst_options records	2018-12-11 20:42:00 +00:00
Martin Sumner	510994233e	Add check that index disappears Check I0 count goes down when that index is removed	2018-12-05 15:42:21 +00:00
Martin Sumner	cf1fcaeef2	Add test of index expiry To show how this works, and prove that it does work thta way. Test may require adjusting if tested on a slow node (e.g. reduce KeyCount or increase TTL)	2018-12-05 15:18:20 +00:00
Martin Sumner	6d2d0694e3	Reverse necessary on bucket list The function should see the buckets in order, so it accumulates in such a way to reverse the order - it makes sense that the outcome should be in reverse.	2018-11-23 19:03:24 +00:00
Martin Sumner	a9aa23bc9c	Bucket list update the docs to advertise throw capability. Test it for bucket list (and fix ordering of bucket lists)	2018-11-23 18:56:30 +00:00
Martin Sumner	ef2a8c62af	Add capability to exit a head or object fold with a throw This allows for all fold functions to throw an exception to exit out of a fold with all dependencies still closed down as expected. This was previously available for key folds, which was necessary for the folds to work in Riak (as max_results in index queries depends one xiting the fold with an exception). This change now adds a ct test, and adds support for head folds, object folds (key order) and object folds (sqn order)	2018-11-23 16:00:11 +00:00
Martin Sumner	f0208e9b12	Fix issues with deprecated folders They were deprecated for a reason	2018-10-31 11:04:23 +00:00
Martin Sumner	0fb35e658f	Add support for buckets that are tuples Only {binary(), binary()} tuples	2018-09-27 09:34:40 +01:00
Martin Sumner	0772317247	Test mistake If random integer was low, total could be below threshold - so calculate total correctly. Should make value re-generate random uniform, but test is still valid without this	2018-09-25 18:32:48 +01:00
Russell Brown	3a2d4aa496	Actually run the new test DERP!	2018-09-06 16:38:49 +01:00
Russell Brown	b7bd65d11f	Provide a top level API for folds As the fold functions have been added to get_runner in an ad hoc way, naturally, given the ongoing development of levelEd to support Riak, it was difficult for a new user (in this case Quviq) to see what folds are supported, and with what arguments, and expectations. This PR is for discussion. It is one of many ways to group, spec, and document the fold functions. A test is also added for coverage of range queries.	2018-09-06 15:01:54 +01:00
Martin Sumner	50967438d3	Switch from binary_bucketlist Allow for bucket listing of non-binary buckets (integer buckets, buckets with ascii strings)	2018-09-01 10:39:23 +01:00
Martin Sumner	4bf6d3e73d	Fiddle with naming in query API Was easier in the calling applictaion to switch between using and not using a list of the Query format was consistent between those two cases.	2018-03-02 10:20:43 +00:00
Martin Sumner	861aa5a7db	Support multi-query fold Allow a single snapshot to run query over multiple ranges. Used initially to fold over multiple buckets.	2018-03-01 23:19:52 +00:00
Martin Sumner	bfaed921e6	Split code for folders - introduce runner actor Introduce a dedicated module for all the different fold types. Also simplify the list of folders by deprecating those folds that should eb achieveable by fold_heads/fold_objects type folds but with smarter functions. Makes sure that the fold functiosn also have better spec coverage, and are dialyzer checked.	2017-10-17 20:39:11 +01:00
Martin Sumner	96a548e17a	Change tests - binary keys the new code requires bucket listing to be on binary keys not just binary buckets. As this is only intended for use within Riak (where all keys are buckets are binaries), this constraint seems OK. A test needed changing to ensure it had a binary key in the bucket.	2017-05-23 15:54:11 +01:00
martinsumner	3417baa3b8	Simple test To try and pinpoint any issue with _int index (as seen in Riak integrtaion testing)	2016-12-02 17:39:28 +00:00
martinsumner	e8c1d39df9	Switch to binary format Riak object Initial change to try and test assuming that leveled received the binary format of Riak objects (and parses that for metadata).	2016-11-28 22:26:09 +00:00
martinsumner	196c807b5e	Pass through sync_strategy Allow to switch for Riak to use o_sync as the sync flag rather than sync	2016-11-25 17:41:08 +00:00
martinsumner	51dbad95c0	Change FoldBucketsFun to take just bucket FoldBucketsFun does not take keys should be a 2-arity function (Bucket, Acc).	2016-11-21 14:12:17 +00:00
martinsumner	386d40928b	Fast List Buckets Copied the technique from HanoiDB to speed up list buckets.	2016-11-20 21:21:31 +00:00
martinsumner	ec18f9ab4c	Uncomment test	2016-11-18 16:34:16 +00:00
martinsumner	6684e8e1d3	Refine query to accept fold functions Need to be able to pass external fold functions into different queries, to work as a Riak backend	2016-11-18 15:53:22 +00:00
martinsumner	ac223ced68	Add FoldKeysFun Add the capability to pass FoldKeysFun into the index_query to allow for compatability with riak backend requirements.	2016-11-18 11:53:14 +00:00
martinsumner	37c23a5b38	Shift pause out of leveled Leveled will now signal the need for a pause due to back-pressure, but not actually pause itself. The hope is that in a riak implementation this pause can be managed by the put_fsm, and so not lock the store.	2016-11-07 10:27:38 +00:00
martinsumner	4583460328	Clean API of Riak-specific Methods Clena the API of Riak specific methods, and also resolve timing issue in simple_server unit test. Previously this would end up with missing data (and a lower sequence number after start) because of the penciller_clerk timeout being relatively large in the context of this test. Now the timeout has bene reduced the L0 slot is cleared by the time of the close. To make sure an extra sleep has been added as a precaution to avoid any intermittent issues.	2016-11-07 10:11:57 +00:00
martinsumner	a251f3eab0	Speed up query count test Less individual querys to make count will speed up this taste, without changing the nature of it	2016-11-04 18:20:00 +00:00
martinsumner	171baefc0c	SFT Background Failure Let it crash approach - stop trying to catch and propgate failure of write	2016-11-04 14:31:19 +00:00
martinsumner	eeeee07081	Fold Objects - Check values test Test that summed values in fold objects before and after restart	2016-11-04 14:23:37 +00:00
martinsumner	68b17c71b3	Expand fold objects support Fold over bucket and fold over index added	2016-11-04 11:01:37 +00:00
martinsumner	e8a7888397	Experiment with new cache size algorithm Remove the jitter probability and make it a smooth function heading towards the max ache size	2016-11-03 09:19:02 +00:00
martinsumner	e7506c3c1f	Startup work - baffled Changes the stratup otpions to a prolist to make it easier to get environment variables as default. Tried application:start - and completely baffled as to how to get this to work.	2016-11-02 12:58:27 +00:00
martinsumner	a00a123817	Recovery strategy testing Test added for the "retain" recovery strategy. This strategy makes sure a full history of index changes is made so that if the Ledger is wiped out, the Ledger cna be fully rebuilt from the Journal. This exposed two journal compaction problems - The BestRun selected did not have the source files correctly sorted in order before compaction - The compaction process incorrectly dealt with the KeyDelta object left after a compaction - i.e. compacting twice the same key caused that key history to be lost. These issues have now been corrected.	2016-10-27 00:57:19 +01:00
martinsumner	4cdc6211a0	Handling 'returned' in penciller unit tests The unit tests for the Penciller couldn't cope with the returned status - and so would intermittently fail (after tightening the timeout on sft check_ready.	2016-10-26 21:03:50 +01:00
martinsumner	e9c568a8b3	Test fix-up There was a test that failed to close down a bookie and that caused some issues. The issues are double-reoslved, the close down was tidied as well as the forgotten close being added back in. There is some generla tidy around in anticipation of TTL support.	2016-10-21 21:26:28 +01:00
martinsumner	0a2053b557	Improved unit test of CRC chekcing in bloom filter Confirm the impact of bit-flipping in the bloom filter	2016-10-21 16:08:41 +01:00
martinsumner	0324edd6f6	Rotating object tests Recent fixes have been made to problems associated with rapidly changing objexts especially on re-opening of the bookie. Test of rotating objects from both an index query and a fetch perspective added to better detect such issues in the future.	2016-10-20 12:16:17 +01:00
martinsumner	7319b8f415	Redundant clauses Remove some redundant clauses, and fix up some logging	2016-10-19 20:51:30 +01:00
martinsumner	12fe1d01bd	Penciller Manifest and Locking The penciller had the concept of a manifest_lock - but it wasn't clear what the purpose of it was. The updating of the manifest has now been updated to reduce the code and make the process cleaner and more obvious. Now the committed manifest only covers non-L0 levels. A clerk can work concurrently on a manifest change whilst the Penciller is accepting a new L0 file. On startup the manifets is opened as well as any L0 file. There is a possible race condition with killing process where there may be a L0 file which is merged but undeleted - and this is believed to be inert. There is some outstanding work still. Currently the whole store is paused if a push_mem is received by the Penciller, and the writing of a L0 sft file has not been completed. The creation of a L0 file appears to take about 300ms, so if the ledger_cache fills in this period a pause will occurr (perhaps due to objects with lots of index entries). It would be preferable to pause more elegantly in this situation. Perhaps there should be a harsh timeout on the call to check the SFT complete, and catching it should cause a refused response. The next PUT will then wait, but a any queued GETs can progress.	2016-10-19 17:34:58 +01:00
martinsumner	8f29a6c40f	Complete 2i work - some refactoring The 2i work now has tests for removals as well as regex etc. Some initial refactoring work has also been tried - to try and take some tasks of the critical path of push_mem. The primary change has been to avoid putting index keys into the gb_tree, and building the KeyChanges list in parallel to the gb_tree (now known as ObjectTree) within the Ledger Cache. Some initial experiments done as to changing the ETS table in the Penciller now that it will now be used for iterating - but that has been reverted for now.	2016-10-18 19:41:33 +01:00
martinsumner	905b712764	2i query test The 2i query test added in the previous commit didn't correctly test regex queries. This has now been improved.	2016-10-18 09:42:33 +01:00
martinsumner	3e475f46e8	Support for 2i query part1 Added basic support for 2i query. This involved some refactoring of the test code to share functions between suites. There is sill a need for a Part 2 as no tests currently cover removal of index entries.	2016-10-18 01:59:18 +01:00

45 commits