leveled

Author	SHA1	Message	Date
martinsumner	4e46c9735d	Log improvements Continuation of log review and conversion to using central log function. Fixup of convoluted shutdown process between Bookie, Inker and Inker's Clerk	2016-11-03 16:05:43 +00:00
martinsumner	7147ec0470	Logging - Phase 1 Abstract out logging and introduce a logbase	2016-11-02 18:14:46 +00:00
martinsumner	4cffecf2ca	Handle gen_server:cast slowness There was some unpredictable performance in tests, that was related to the amount of time it took the sft gen_server to accept a cast whihc passed the levelzero_cache. The response time looked to be broadly proportional to the size of the cache - so it appeared to be an issue with passing the large object to the process queue. To avoid this, the penciller now instructs the SFT gen_server to callback to the server for each tree in the cache in turn as it is building the list from the cache. Each of these requests should be reltaively short, and the processing in-between should space out the requests so the Pencille ris not blocked from answering queries when pompting a L0 write.	2016-10-31 01:33:33 +00:00
martinsumner	95609702bd	Penciller Memory Refactor Plugged the ne wpencille rmemory into the Penciller, and took advantage of the increased speed to simplify the callbacks involved. The outcome is much simpler code	2016-10-30 18:25:30 +00:00
martinsumner	cdb01cd24f	Quality Review Looked through test coverage and dialyzer output and attempted to fill test gaps and strip out untestable code (to let it crash).	2016-10-29 00:52:49 +01:00
martinsumner	c6ca973517	Penciller shutdown when empty Stop the penciller from writing an empty file, when shutting down and the L0 Cache is empty. Also parameter fiddle to see impact of the Penciller changes.	2016-10-27 21:40:43 +01:00
martinsumner	20cc17f916	Penciller Refactor Removed o(100) lines of code by refactoring the Penciller to no longer use ETS tables. The code is less confusing, and probably not an awful lot slower.	2016-10-27 20:56:18 +01:00
martinsumner	30f4f2edf6	Comment change on stall behaviour	2016-10-27 09:45:05 +01:00
martinsumner	4cdc6211a0	Handling 'returned' in penciller unit tests The unit tests for the Penciller couldn't cope with the returned status - and so would intermittently fail (after tightening the timeout on sft check_ready.	2016-10-26 21:03:50 +01:00
martinsumner	e9c568a8b3	Test fix-up There was a test that failed to close down a bookie and that caused some issues. The issues are double-reoslved, the close down was tidied as well as the forgotten close being added back in. There is some generla tidy around in anticipation of TTL support.	2016-10-21 21:26:28 +01:00
martinsumner	3ad9e42b61	Changed SFT shutdown to cast-based The SFT shutdown process ahs become a series of casts to-and-from between Penciller and SFT to stop the two processes syncronously making requests on each other	2016-10-21 12:18:06 +01:00
martinsumner	c431bf3b0a	Broken snapshot test The test confirming that deleting sft files wer eheld open whilst snapshots were registered was actually broken. This test has now been fixed, as well as the logic in registring snapshots which had used ledger_sqn mistakenly rather than manifest_sqn.	2016-10-21 11:38:30 +01:00
martinsumner	5c2029668d	Tombstone preperation Some initial code changes preparing for the test and implementation of tombstones and tombstone reaping	2016-10-20 16:00:08 +01:00
martinsumner	cf66431c8e	Smoother handling of back-pressure The Penciller had two problems in previous commits: - If it had a push_mem soon after a L0 file had been created, the push_mem would stall waiting for the L0 file to complete - and this count take 100-200ms - The penciller's clerk favoured L0 work, but was lazy about asking for other work in-between, so often the L1 layer was bursting over capacity and the clerk was doing nothing but merging more L0 files in (with those merges getting more and more expensive as they had to cover more and more files) There are some partial resolutions to this. There is now an aggressive timeout when checking whther the L0 file is ready on a push_mem, and if the timeout is breached the error is caught and a 'returned' message goes back to the Bookie. the Bookie doesn't now empty its cache, it carrie son filling it, but on some probability it will keep trying to push_mem on future pushes. This increases Jitter around the expensive operation and split out the L0 delay into defined chunks. The penciller's clerk is now more aggressive in asking for work. There is also some simplification of the relationship between clerk timeouts and penciller back-pressure. Also resolved is an issue of inconcistency between the loader and the on startup (replaying the transaction log) and the standard push_mem process. The loader was not correctly de-duplicating by adding first (in order) to a tree before outputting the list from the tree. Some thought will be given later as to whether non-L0 work can be safely prioritised if the merge process still keeps getting behind.	2016-10-20 02:23:45 +01:00
martinsumner	7319b8f415	Redundant clauses Remove some redundant clauses, and fix up some logging	2016-10-19 20:51:30 +01:00
martinsumner	12fe1d01bd	Penciller Manifest and Locking The penciller had the concept of a manifest_lock - but it wasn't clear what the purpose of it was. The updating of the manifest has now been updated to reduce the code and make the process cleaner and more obvious. Now the committed manifest only covers non-L0 levels. A clerk can work concurrently on a manifest change whilst the Penciller is accepting a new L0 file. On startup the manifets is opened as well as any L0 file. There is a possible race condition with killing process where there may be a L0 file which is merged but undeleted - and this is believed to be inert. There is some outstanding work still. Currently the whole store is paused if a push_mem is received by the Penciller, and the writing of a L0 sft file has not been completed. The creation of a L0 file appears to take about 300ms, so if the ledger_cache fills in this period a pause will occurr (perhaps due to objects with lots of index entries). It would be preferable to pause more elegantly in this situation. Perhaps there should be a harsh timeout on the call to check the SFT complete, and catching it should cause a refused response. The next PUT will then wait, but a any queued GETs can progress.	2016-10-19 17:34:58 +01:00
martinsumner	f16f71ae81	Revert ominshambles performance refactoring To try and improve performance index entries had been removed from the Ledger Cache, and a shadow list of the LedgerCache (in SQN order) was kept to avoid gb_trees:to_list on push_mem. This did not go well. The issue was that ets does not deal with duplicate keys in the list when inserting (it will only insert one, but it is not clear which one). This has been reverted back out. The ETS parameters have been changed to [set, private]. It is not used as an iterator, and is no longer passed out of the process (the memtable_copy is sent instead). This also avoids the tab2list function being called.	2016-10-19 00:10:48 +01:00
martinsumner	8f29a6c40f	Complete 2i work - some refactoring The 2i work now has tests for removals as well as regex etc. Some initial refactoring work has also been tried - to try and take some tasks of the critical path of push_mem. The primary change has been to avoid putting index keys into the gb_tree, and building the KeyChanges list in parallel to the gb_tree (now known as ObjectTree) within the Ledger Cache. Some initial experiments done as to changing the ETS table in the Penciller now that it will now be used for iterating - but that has been reverted for now.	2016-10-18 19:41:33 +01:00
martinsumner	3e475f46e8	Support for 2i query part1 Added basic support for 2i query. This involved some refactoring of the test code to share functions between suites. There is sill a need for a Part 2 as no tests currently cover removal of index entries.	2016-10-18 01:59:18 +01:00
martinsumner	ed17e44f52	Improve test coverage Some additional tests following previous refactoring for abstraction, primarily to make manifest print safer an dprove co-existence of Riak and non-Riak objects.	2016-10-14 22:58:01 +01:00
martinsumner	bbdac65f8d	Split out key codec Aplit out key codec, and also saner approach to key comparison (although still awkward).	2016-10-13 21:02:15 +01:00
martinsumner	de54a28328	Load and Count test This test exposed two bugs: - Yet another set of off-by-one errors (really stupidly scanning the Manifest from Level 1 not Level 0) - The return of an old issue related to scanning the journal on load whereby we fail to go back to the previous file before the current SQN	2016-10-13 17:51:47 +01:00
martinsumner	0a08867280	Iterator support Add iterator support, used initially only for retrieving bucket statistics. The iterator is supported by exporting a function, and when the function is claled it will take a snapshot of the ledger, run the iterator and hten close the snapshot. This required a numbe rof underlying changes, in particular to get key comparison to work as "expected". The code had previously misunderstood how comparison worked between Erlang terms, and in particular did not account for tuple length being compared first by size of the tuple (and not just by each element in order).	2016-10-12 17:12:49 +01:00
martinsumner	d2cc07a9eb	Doc update and clerk<->penciller changes Reviewing code to update comments revealed a weakness in the sequence of events between penciller and clerk committing a manifest change wherby an ill-timed crash could lead to files being deleted without the manifest changing. A different, and safer pattern now used between theses two actors.	2016-10-09 22:33:45 +01:00
martinsumner	4a8a2c1555	Code reduction refactor An attempt to refactor out more complex code. The Penciller clerk and Penciller have been re-shaped so that there relationship is much simpler, and also to make sure that they shut down much more neatly when the clerk is busy to avoid crashdumps in ct tests. The CDB now has a binary_mode - so that we don't do binary_to_term twice ... although this may have made things slower ??!!? Perhaps the is_binary check now required on read is an overhead. Perhaps it is some other mystery. There is now a more effiicient fetching of the size on pcl_load now as well.	2016-10-08 22:15:48 +01:00
martinsumner	2055f8ed3f	Add more complex snapshot test This exposed another off-by-one error on startup. This commit also includes an unsafe change to reply early from a rolling CDB file (with lots of objects writing the hash table can take too long). This is bad, but will be resolved through a refactor of the manifest writing: essentially we deferred writing of the manifest update which was an unnecessary performance optimisation. If instead we wait on this, the process is made substantially simpler, and it is safer to perform the roll of the complete CDB journal asynchronously. If the manifest update takes too long, an append-only log may be used instead.	2016-10-07 10:04:48 +01:00
martinsumner	f58f4d0ea5	Mini Refactor Thought about the mess, thought about swithcing to a FSM, throught about just sorting a bit of the mess instead.	2016-10-06 13:23:20 +01:00
martinsumner	ad5aebe93e	Further work on system tests Another issue exposed with laziness in the using an incomplete ledger when checking for presence during compaction.	2016-10-05 18:28:31 +01:00
martinsumner	d903f184fd	Add initial end-to-end common tests These tests highlighted some logical issues when scanning over databases on startup, so fixes are wrapped in here.	2016-10-05 09:54:53 +01:00
martinsumner	507428bd0b	Add initial system test Add some initial system tests. This highlighted issues: - That files deleted by compaction would be left orphaned and not close, and would not in fact delete (now deleted by closure only) - There was an issue on stratup that the first few keys in each journal would not be re-loaded into the ledger	2016-10-03 23:34:28 +01:00
martinsumner	e2bb09b873	Snapshot testing Work to test the checking of sequence numbers in snapshots as required by the inkers clerk to calculate the percentage of a file which is compactable	2016-09-26 10:55:08 +01:00
martinsumner	c64d67d9fb	Snapshot Work - Interim Commit Some initial work to get snapshots going. Changes required, as need to snapshot through the Bookie to ensure that there is no race between extracting the Bookie's in-memory view and the Penciller's view if a push_to_mem has occurred inbetween. A lot still outstanding, especially around Inker snapshots, and handling timeouts	2016-09-23 18:50:29 +01:00
martinsumner	d3e985ed80	Refactor Penciller Push Two aspects of pushing to the penciller have been refactored: 1 - Allow the penciller to respond before the ETS table has been updated to unlock the Bookie sooner. 2 - Change the way the copy of the memtable is stored to work more effectively with snapshots wihtout locking the Penciller any further on a snapshot or push request	2016-09-21 18:31:42 +01:00
martinsumner	aa7d235c4d	Rename clerk and CDB Speed-Up CDB did many "bitty" reads/writes when scanning or writing hash tables - change these to bult reads and writes to speed up. CDB also added capabilities to fetch positions and get keys by position to help with iclerk role.	2016-09-20 16:13:36 +01:00
martinsumner	c10eaa75cb	Dialyzer changes Some chnages to improve dialyzer pass rate	2016-09-20 10:17:24 +01:00
martinsumner	b452fbe27c	End-to-end test Changes to ensure working of first end-to-end test (with a single Key and Value)	2016-09-15 18:38:23 +01:00
martinsumner	e73a5bbf31	WIP - First draft of Bookie code First draft of untested bookie code	2016-09-15 10:53:24 +01:00
martinsumner	86666b1cb6	Scan over CDB file Make scanning over a CDB file generic rather than specific to read-in of active nursery log - open to be called as an external function to support other scanning behaviour.	2016-09-09 15:58:19 +01:00
martinsumner	edfe9e3bed	Improved testing Improve testing of Penciller to show startup and shutdown with push, merging and fetch	2016-09-08 14:21:30 +01:00
martinsumner	2a76eb364e	Inker - Initial Code An attempt to get a first inker that can build a ledger from a manifest as well as support simple get and put operations. Basic tests surround the building of manifests only at this stage - more work required for get and put.	2016-09-05 15:01:23 +01:00
martinsumner	ce0c55a2ec	Resolve issue of Remainders Two issues looked at - There shouldn't be a remainder after writing the L0 file, as this could have overlapping sequence numbers which will be missed on restart - There should be a safety-check to stop the Clerk from doing a fake push too soon after a background L0 file ahs been written (as the fake push would lock the ledger waiting for the L0 file write to finish)	2016-08-16 12:45:48 +01:00
martinsumner	4586e2340a	Startup and Shutdown Support Added support for startup and shutdown of a Ledger. As aprt of this will now start tracking the highest sequence number. This also adds a safety check on pcl_pushmem to make sure that only keys with a higher sequenc enumber are being pushed in - and hence we can happily insert into the in-memory view without checking the sequence number.	2016-08-15 16:43:39 +01:00
martinsumner	6e56b569b8	Auto-merge Allow for the clerk to merge continuously is no activity for the penciller to prompt. The penciller now must also correctly lock the manifest - to stop races between the creation of ne wL0 files and the completion of work by the clerk	2016-08-12 01:05:59 +01:00
martinsumner	c269eb3c52	Initial level merging Some basic merging in the lsm tree.	2016-08-10 13:02:08 +01:00
martinsumner	718425633a	Penciller accepting push Standardise on record definitions between modules to make easier - then add functionality to pushing to penciller as bookie would do. Some initial manual testing of this seems OK.	2016-08-09 16:09:29 +01:00
martinsumner	33f1efd576	Work on descriptions Add further descriptions of roles following name changes. Attempt to simplify manifest management in the Penciller by assuming there is only one Penciller's Clerk active - and so only one piece of work can be ongoing	2016-08-02 17:51:43 +01:00
martinsumner	2bdb5fba6c	Re-naming Naming things is hard. This change renames things based on the Bookie/Inker/Penciller terminology	2016-08-02 13:44:48 +01:00

1 2 3 4 5

247 commits