leveled

Author	SHA1	Message	Date
martinsumner	0a2053b557	Improved unit test of CRC checking in bloom filter Confirm the impact of bit-flipping in the bloom filter	2016-10-21 16:08:41 +01:00
martinsumner	b2089baa1e	Correct tombstone handling Prepare SFT files for handling tombstones correctly (without expiry dates). Also some work as it can be seen from tests that some SFT files are not being cleared out correctly. Pausing before trying to clear out the files to experiment and trial the possibility that there is a timing issue.	2016-10-21 15:21:37 +01:00
martinsumner	c431bf3b0a	Broken snapshot test The test confirming that deleting sft files were held open whilst snapshots were registered was actually broken. This test has now been fixed, as well as the logic in registering snapshots which had used ledger_sqn mistakenly rather than manifest_sqn.	2016-10-21 11:38:30 +01:00
martinsumner	0324edd6f6	Rotating object tests Recent fixes have been made to problems associated with rapidly changing objects especially on re-opening of the bookie. Test of rotating objects from both an index query and a fetch perspective added to better detect such issues in the future.	2016-10-20 12:16:17 +01:00
martinsumner	7319b8f415	Redundant clauses Remove some redundant clauses, and fix up some logging	2016-10-19 20:51:30 +01:00
martinsumner	12fe1d01bd	Penciller Manifest and Locking The penciller had the concept of a manifest_lock - but it wasn't clear what the purpose of it was. The updating of the manifest has now been updated to reduce the code and make the process cleaner and more obvious. Now the committed manifest only covers non-L0 levels. A clerk can work concurrently on a manifest change whilst the Penciller is accepting a new L0 file. On startup the manifest is opened as well as any L0 file. There is a possible race condition with killing process where there may be a L0 file which is merged but undeleted - and this is believed to be inert. There is some outstanding work still. Currently the whole store is paused if a push_mem is received by the Penciller, and the writing of a L0 sft file has not been completed. The creation of a L0 file appears to take about 300ms, so if the ledger_cache fills in this period a pause will occur (perhaps due to objects with lots of index entries). It would be preferable to pause more elegantly in this situation. Perhaps there should be a harsh timeout on the call to check the SFT complete, and catching it should cause a refused response. The next PUT will then wait, but any queued GETs can progress.	2016-10-19 17:34:58 +01:00
martinsumner	f16f71ae81	Revert omnishambles performance refactoring To try and improve performance index entries had been removed from the Ledger Cache, and a shadow list of the LedgerCache (in SQN order) was kept to avoid gb_trees:to_list on push_mem. This did not go well. The issue was that ets does not deal with duplicate keys in the list when inserting (it will only insert one, but it is not clear which one). This has been reverted back out. The ETS parameters have been changed to [set, private]. It is not used as an iterator, and is no longer passed out of the process (the memtable_copy is sent instead). This also avoids the tab2list function being called.	2016-10-19 00:10:48 +01:00
martinsumner	8f29a6c40f	Complete 2i work - some refactoring The 2i work now has tests for removals as well as regex etc. Some initial refactoring work has also been tried - to try and take some tasks off the critical path of push_mem. The primary change has been to avoid putting index keys into the gb_tree, and building the KeyChanges list in parallel to the gb_tree (now known as ObjectTree) within the Ledger Cache. Some initial experiments done as to changing the ETS table in the Penciller now that it will be used for iterating - but that has been reverted for now.	2016-10-18 19:41:33 +01:00
martinsumner	905b712764	2i query test The 2i query test added in the previous commit didn't correctly test regex queries. This has now been improved.	2016-10-18 09:42:33 +01:00
martinsumner	3e475f46e8	Support for 2i query part1 Added basic support for 2i query. This involved some refactoring of the test code to share functions between suites. There is still a need for a Part 2 as no tests currently cover removal of index entries.	2016-10-18 01:59:18 +01:00
Russell Brown	59ea46120e	Fix include target	2016-10-17 14:24:32 +01:00
martinsumner	e3ce372f31	Delete Add functionality to delete keys. No tombstone reaping yet.	2016-10-16 15:41:09 +01:00
martinsumner	ed17e44f52	Improve test coverage Some additional tests following previous refactoring for abstraction, primarily to make manifest print safer and prove co-existence of Riak and non-Riak objects.	2016-10-14 22:58:01 +01:00
martinsumner	7eb5a16899	Supporting Tags - Improving abstraction between Riak and non-Riak workloads The object tag "o" which was taken from eleveldb has been extended to allow for specific functions to be triggered for different object types, in particular when extracting metadata for storing in the Ledger. There is now a riak tag (o_rkv@v1), and in theory other tags can be added and used, as long as there is an appropriate set of functions in the leveled_codec.	2016-10-14 18:43:16 +01:00
martinsumner	de54a28328	Load and Count test This test exposed two bugs: - Yet another set of off-by-one errors (really stupidly scanning the Manifest from Level 1 not Level 0) - The return of an old issue related to scanning the journal on load whereby we fail to go back to the previous file before the current SQN	2016-10-13 17:51:47 +01:00
martinsumner	938cc0fc16	Re-add tests Oops - committed with tests commented out	2016-10-12 17:35:32 +01:00
martinsumner	0a08867280	Iterator support Add iterator support, used initially only for retrieving bucket statistics. The iterator is supported by exporting a function, and when the function is called it will take a snapshot of the ledger, run the iterator and then close the snapshot. This required a number of underlying changes, in particular to get key comparison to work as "expected". The code had previously misunderstood how comparison worked between Erlang terms, and in particular did not account for tuples being compared first by size of the tuple (and not just by each element in order).	2016-10-12 17:12:49 +01:00
martinsumner	4a8a2c1555	Code reduction refactor An attempt to refactor out more complex code. The Penciller clerk and Penciller have been re-shaped so that their relationship is much simpler, and also to make sure that they shut down much more neatly when the clerk is busy to avoid crashdumps in ct tests. The CDB now has a binary_mode - so that we don't do binary_to_term twice ... although this may have made things slower ??!!? Perhaps the is_binary check now required on read is an overhead. Perhaps it is some other mystery. There is now a more efficient fetching of the size on pcl_load as well.	2016-10-08 22:15:48 +01:00
martinsumner	2055f8ed3f	Add more complex snapshot test This exposed another off-by-one error on startup. This commit also includes an unsafe change to reply early from a rolling CDB file (with lots of objects writing the hash table can take too long). This is bad, but will be resolved through a refactor of the manifest writing: essentially we deferred writing of the manifest update which was an unnecessary performance optimisation. If instead we wait on this, the process is made substantially simpler, and it is safer to perform the roll of the complete CDB journal asynchronously. If the manifest update takes too long, an append-only log may be used instead.	2016-10-07 10:04:48 +01:00
martinsumner	ad5aebe93e	Further work on system tests Another issue exposed with laziness in using an incomplete ledger when checking for presence during compaction.	2016-10-05 18:28:31 +01:00
martinsumner	d903f184fd	Add initial end-to-end common tests These tests highlighted some logical issues when scanning over databases on startup, so fixes are wrapped in here.	2016-10-05 09:54:53 +01:00
martinsumner	507428bd0b	Add initial system test Add some initial system tests. This highlighted issues: - That files deleted by compaction would be left orphaned and not closed, and would not in fact delete (now deleted by closure only) - There was an issue on startup that the first few keys in each journal would not be re-loaded into the ledger	2016-10-03 23:34:28 +01:00
martinsumner	a95d77607e	Initial work on sft files Working on the delta-encoded segment filter, plus some initial performance testing.	2016-05-31 17:21:14 +01:00
Martin Sumner	c5f50c613d	Ongoing improvements - in particular CDB now supports general erlang terms not just lists	2015-06-04 21:15:31 +01:00
Martin Sumner	647a7f44dc	Tidy-up initial files and add testing to optimise bst bloom filters	2015-05-31 23:31:31 +01:00
Martin Sumner	e2099d0c14	Initial files proving concepts WIP - nothing currently workable	2015-05-25 22:45:45 +01:00


276 commits