leveled

Author	SHA1	Message	Date
martinsumner	baf4ca252f	Revert "Experiment with temporary us eof ETS table" This reverts commit `2a106d0dc5`.	2016-12-13 20:24:29 +00:00
martinsumner	2a106d0dc5	Experiment with temporary us eof ETS table Rather than expensive lists:ukeymerge, try use a temporary ETS table.	2016-12-13 19:38:14 +00:00
martinsumner	c8be3bfa46	Slot hash corrected When building the hashtree the incorrect IndexLength was being used to calculate the slot - causing many queries to loop all the way round the Index	2016-12-13 17:02:45 +00:00
martinsumner	8f775a88fd	Investigate performance regression Performance has regressed following the hashtable change. Speculation that the hashtable format might not be right, and so there is more cycling around the hashtree. Logging added.	2016-12-13 14:06:19 +00:00
martinsumner	52499170c0	Tidy logging following changes Include detailed timings in a permanent log	2016-12-13 12:41:44 +00:00
Martin Sumner	cfc6a67638	Switch to ordered_set Improved performance by a combination of switching to an ordered_set (so a list can be extracted in a sane way), and building the binary from an ordered list.	2016-12-13 12:35:30 +00:00
martinsumner	aa2d19df1d	Revert back to handling list of binaries (but differently) Performance from last commit got worse not better :-( Perhaps better handling all as lists, and then building a binary at the end.	2016-12-13 03:22:40 +00:00
martinsumner	972a0ee0b9	Refactor hash table write Less looping and re-looping over list. Uses ordering to build more naturally.	2016-12-13 02:15:13 +00:00
martinsumner	52e21de298	Initial switch to using ETS No real refactor of building hashtables at this stage - just using ETS not an arrary of skiplists	2016-12-12 21:47:09 +00:00
martinsumner	8ccd02e893	Merge Tree issue The attempt to refcator the writer meant that files were never reaching the max slots - and so we were only ever stopping when the lists were exhausted. This meant that the merge tree just had a C0 and a C1 file!	2016-12-12 18:30:12 +00:00
martinsumner	ff0bf15c8f	Fix the fix	2016-12-12 18:18:37 +00:00
martinsumner	1537334fbd	Ensure fetch still works when delete_pending	2016-12-12 18:17:53 +00:00
martinsumner	cf6a1eb513	Add extra bloom check Add extra bloom check - but get the SFT process to perform not the chekc not the Penciller. This avoids complexity of negotiating the transfer of the bloom to the Penciller - but doesn't avoid the potentially unecessary message pass between processes.	2016-12-12 18:01:37 +00:00
martinsumner	1f56501499	Refactor writing SFT Files Previously the code had involved veyr high arity functions which were hard to follow. This has been simplified somewhat with the addition of a writer record to make things easier to track, as well as a general refactoring to better logically seperate the building of things.	2016-12-12 16:12:31 +00:00
martinsumner	f28c7e02bf	Remove unnecessary clause As the intention is to change the way the tiny bloom is called, the unnecessary clause of handling an undefined bloom can be removed.	2016-12-11 21:24:04 +00:00
martinsumner	86bdfdeaf0	Reverted back out the additional bloom check This is desirable to add back in going forward, but wasn't implemented in a safe or clear way. The way the bloom was or was not on the LoopState was clumsy, and it got persisted in multiple places without a CRC check. Intention to implement back in wherby it is requested on-demand by the Penciller, and then the SFT worker lifts it off disk and CRC checks it. So it is never on the SFT LoopState. Also it will be easier to control the logic over which levels have the bloom in the Penciller.	2016-12-11 21:01:10 +00:00
martinsumner	4b48ed14c6	Correct Mistyped 2 ^ 32	2016-12-11 20:38:20 +00:00
martinsumner	f96d148073	Make the merge_test a more sensible size On the verge of a timeout. Rather than keep battling with the timeout, make it do less work	2016-12-11 20:17:05 +00:00
martinsumner	5cfe9a71e1	Wrap test with non-default timeout	2016-12-11 15:25:14 +00:00
martinsumner	24a5347bec	Revert	2016-12-11 15:19:34 +00:00
martinsumner	a86686d621	Remove unnecessary reverse	2016-12-11 15:17:58 +00:00
martinsumner	1b63845050	Bring compression back to SFT It is expensive on the CPU - but it leads to a 4 x increase in the cache coverage. Try and make some small micro gains in list handling in create_block	2016-12-11 15:02:33 +00:00
martinsumner	44cee5a6e8	Experiemnt with no compression Does compression hurt CPU more than the benefit gaine din some cases?	2016-12-11 12:33:09 +00:00
martinsumner	71cf7a3a51	Setting change led to idle CPU	2016-12-11 08:37:03 +00:00
martinsumner	fb069666dc	Export module	2016-12-11 08:16:00 +00:00
martinsumner	16c704551b	Revert to original SFT build settings Leveled is always CPU bound during tests, and it is the merge in the ledger that drains the CPU hardest,	2016-12-11 07:35:23 +00:00
martinsumner	6f06c6fdeb	ETS delete Delete the objects rather than starting a new table each time	2016-12-11 07:07:30 +00:00
martinsumner	2758498fad	More Jitter! Having reduced the size of the ledger cache (again) we can now tolerate more jitter here	2016-12-11 06:54:41 +00:00
martinsumner	32ac305c67	Compaction test error Compaction tests now throwing up different corruption points	2016-12-11 06:53:25 +00:00
martinsumner	8bcb49479d	Re-introduce ETS Index Add ETS Index back in to avoid having to check each skip list in turn. Also this helps keep a lower skip list size.	2016-12-11 05:23:24 +00:00
martinsumner	f848500eff	Tinker, tinker, tinker, tinker	2016-12-11 04:53:36 +00:00
martinsumner	523716e8f2	Add tiny bloom to Penciller Manifest This is an attempt to save on unnecessary message transfers, and slightly more expensive GCS checks in the SFT file itself.	2016-12-11 04:48:50 +00:00
martinsumner	ea8f3c07a7	oops	2016-12-11 02:00:19 +00:00
martinsumner	2c7fdc74d4	Setting fiddling Try to find a happy medium	2016-12-11 01:58:25 +00:00
martinsumner	5d11bc051f	Allow for more fluctuation in L0 write time Try to alleviate existing co-ordination issue when all vnodes tend to try and write L0 files concurrently	2016-12-11 01:49:03 +00:00
martinsumner	1f38bcb328	Magic Hash vs phash2 Magic Hash broke Skip List organisation	2016-12-11 01:32:32 +00:00
martinsumner	ccc993383d	Stop second hash on fetch_head The bookie should magic_hash for fetch_head, and now passes the hash to the Penciller so second hash not required.	2016-12-11 01:21:53 +00:00
martinsumner	2d3a40e6f1	Magic Hash - and no L0 Index Move to using the DJ Bernstein Magic Hash consistently, and trying to make sure we only hash once for each operation (as the hash is more expensive than phash2). The improved lookup time for missing keys should allow for the L0 index to be removed, and hence speed up the completion time for push_mem operations. It is expected there will be a second stage of creating a tinybloom as part of the SFT creation process, and then adding that tinybloom to the manifest. This will then reduce the message passing required for a GET not in the cache or higher levels	2016-12-11 01:02:56 +00:00
martinsumner	95d5e12ce7	Switch to using ets set as index of L0 cache Hope is that this will cause less garbage collection, and also will be slightly faster. Note that snapshots don't now get an index - they get the special index 'snap'. However, the SkipLists have bloom protection, and most snapshots are iterators not fetchers.	2016-12-10 14:15:35 +00:00
martinsumner	06c58bf84b	Split out hashtree implementation Split out hashtree implementation functions in leveled_cdb to make it easier to swap this out. Currently using an array of skiplists - may be better with an ets ordered_set	2016-12-10 13:03:38 +00:00
martinsumner	c4e4cf67fe	Add bloom to loaded skiplist	2016-12-10 11:39:00 +00:00
martinsumner	626a8e63f9	Experiment converting CDB to use skiplist not gb_tree Might insertion time be faster?	2016-12-10 10:55:35 +00:00
martinsumner	a3f60e3609	OTP version shenanigans	2016-12-09 18:55:13 +00:00
martinsumner	d2bd01eaf1	Add fast fail to skiplist Add a bloom filter to the skiplist, to make it faster at returning not found. The SkipList is now encapsulated within a dict().	2016-12-09 18:30:40 +00:00
martinsumner	f0db730f07	Adjust jitter settings	2016-12-09 16:34:15 +00:00
martinsumner	82cb49638a	Attempt at performance improvement Try to add some extra jitter in to the process of L0 writes, and also make L0 writes delayed to help with bufferring	2016-12-09 14:36:03 +00:00
martinsumner	349d194a7c	Increase jitter slightly	2016-12-09 09:52:31 +00:00
martinsumner	5bdb7fd7fa	Alter Riak HEAD Change the extract of Riak metadata. In Riak-based volume tests hte writing of SFT files is tanking. Could this be the "extra" metadata. i.e. There are only current plans to look at the vclock. Sibling count is free to fetch, what if we just get these two items, will it be less CPU to extract the metadata, but also will the reduced weight reduce the downstream impact?	2016-12-08 23:38:50 +00:00
martinsumner	2f4013f430	Set Jitter correctly this time	2016-12-08 21:02:39 +00:00
martinsumner	b03ef664c8	Experiment with SFT settings	2016-12-08 20:52:17 +00:00

... 16 17 18 19 20 ...

1115 commits