leveled

Author	SHA1	Message	Date
Martin Sumner	61724cfedb	Merge branch 'master' into mas-riakaae-impl-2	2017-09-28 13:23:29 +01:00
Martin Sumner	3950942da3	Roll in fix for intermittently failing test As descibed in https://github.com/martinsumner/leveled/issues/92 Only the first fix was made. Just to eb safe - archiving means renaming to another file with a different extension. Assumption is that renamed files cna be manually reaped if necessary.	2017-09-27 23:52:49 +01:00
Martin Sumner	389694b11b	Add exportable option to tictac Idea being that sometimes you may wish to compare a tictac tree between leveled and something that doesn't understand erlang:phash or term_to_binary. So allow the magic_hash to be used instead - and perhaps an extract function that does base64 encoding or something similar.	2017-09-26 22:49:40 +01:00
Martin Sumner	dfab33e8da	Add smaller trees The "small" tree will serialise to 1.5MB - which seems large. Much smaller trees seem to be more suitable for things like recently modified aae indexes.	2017-09-25 13:07:08 +01:00
Martin Sumner	eba21f49fa	Make tests compatible with OTP 16 this required a switch to change the sync strategy based on rebar parameter. However tests could be slow on macbook with OTP16 and sync - so timeouts added in unit tests, and ct tests sync_startegy changed to not sync for OTP16.	2017-09-15 15:10:04 +01:00
Martin Sumner	869e799b41	Fix tests Obviously got totally messed up and confused when testing previous commits. Multiple tests were failing for a change which got merged in as the tests were not reflecting the required API.	2017-09-15 10:33:16 +01:00
Martin Sumner	53ddc8950b	Add tests using fold_heads Comparing the inbuilt tictac_tree fold, to using "proper" abstraction and achieving the same thing through fold_heads. The fold_heads method is slower (a lot more manipulation required in the fold) - expect it to require > 2 x CPU. However, this does give the flexibility to change the hash algorithm. This would allow for a fold over a database of AAE trees (where the hash has been pre-computed using sha) to be compared with a fold over a database of leveled backends. Also can vary whether the fold_heads checks for presence of the object in the Inker. So normally we can get the speed advantage of not checking the Journal for presence, but periodically we can.	2017-08-07 10:45:41 +01:00
Martin Sumner	dd20132892	Add test with fold_heads Build the AAE tree equally using fold_heads. This is a pre-cursor to running this within Riak. In part this leans on some of the work done to improve standard Riak AAE with leveled. When rebuilding the standard AAE store only the head is required, and so this process was switched in riak_kv_sweeper to make a fold_heads request if supported by the backend. The head response is a proxy object, which when loaded into a riak_object will allow for access to object metadata, but will use the passed function if access to object contents is requested.	2017-08-05 16:43:03 +01:00
Heinz N. Gies	38e9b0e80a	Add missing uniform/0 function	2017-08-01 11:24:12 +02:00
Heinz N. Gies	25389893cf	Add compatibility for old and new random / rand functions	2017-08-01 11:24:12 +02:00
Martin Sumner	8748fef28c	Add extra second to sleep Sleep for just one more second to resolve intermittent failure	2017-08-01 00:14:31 +01:00
Martin Sumner	65fd029ca6	typo - backlist/blacklist	2017-07-11 12:25:06 +01:00
martinsumner	80fd2615f6	Implement blacklist/whitelist Change from the all/whitelist ebhavior to the blacklist/whitelist behaviour documented in the write-up	2017-07-11 11:44:01 +01:00
martinsumner	3105656d2e	Add test descriptions and further documentation	2017-07-06 15:40:30 +01:00
martinsumner	0d72b353fe	Add test of expiry of nrt aae terms	2017-07-04 13:29:40 +01:00
martinsumner	439bf8c3b8	Add bucket whitelist test	2017-07-04 10:55:53 +01:00
Martin Sumner	1af9ac56dc	Revert passing Bucket Bad edit. Reverted	2017-07-03 19:06:41 +01:00
martinsumner	97fdd36d53	Returning bucket when bucket is all Need to know {Bucket, Key} not just Key if all buckets are being covered by nrt aae. So shoehorning this in - will also allow for proper use of FilterFun when filtering by partition.	2017-07-03 18:03:13 +01:00
martinsumner	d0a825a145	Extend test to detect keys When comparing recent changes demonstration the detection of the keys which have changed with a follow-up query	2017-07-03 10:33:34 +01:00
Martin Sumner	fd84e4f608	Test timeouts So that coverage testing will run.	2017-07-02 22:23:02 +01:00
martinsumner	52ca0e4b6c	Test expansion Detect a recent difference	2017-07-02 19:33:18 +01:00
martinsumner	da53808e2e	Extend test beyond restart Prove that recency check still works after a restart	2017-07-01 08:24:58 +01:00
martinsumner	a15c046887	Re-introduce commented tests	2017-06-30 16:31:48 +01:00
martinsumner	954995e23f	Support for recent AAE index With basic ct test. Doesn't currently prove expiry of index. Doesn't prove ability to find segments. Assumes that either "all" buckets or a special list of buckets require indexing this way. Will lead to unexpected results if the same bucket name is used across different Tags. The format of the index has been chosen so that hopeully standard index features can be used (e.g. return_terms).	2017-06-30 16:31:22 +01:00
martinsumner	8da8722b9e	Add temporary aae index Pending ct tests. The aae index should expire after limit_minutes and be on an index which is rounded to unit_minutes.	2017-06-30 10:03:36 +01:00
martinsumner	8e7aaf0ee7	Correct testutil to understand riak_extract_metadata Change, but change not reflected in tets code	2017-06-27 17:11:13 +01:00
martinsumner	f81a4bca0d	Revert "WIP - Recent Modifications" This reverts commit bc19a05d83a02d7ec03771657df85b33acc6cfee.	2017-06-27 16:25:18 +01:00
martinsumner	9fca17d56a	WIP - Recent Modifications Just some initial WIP code for this. Will revisit this again after exploring some ideas as to how to reduce the cost of the get_keys_by_segment. The overlal idea is that there are trees of recent modifications, with recent being some rolling time window made up of hourly blocks, and recency being dtermined by the last-modified date on the object metadata - which should be conistent across a cluster. So if we were at 15:30 we would get the tree for 14:00 - 15:00 and the tree for 15:00-16:00 from two different queries which cover the same partitions and then compare. Comparison may find differences, and we know what segment the difference is in - but how to then find all keys in that segment which have been modified in the period? Three ways: Do it inefficeintly and infrequently using a fold_keys and a filter (perhaps with SST files having a highest LMD in the metadata so that they can be skipped). Add a special index, where verye entry has a TTL, and the Key is {$segment, Segment, Bucket, Key} so that a normal 2i query cna be used. Align hashing for segments with hashing for penciller lookup so that a query over the actual keys cna be optimised skipping chunks of the in-memory part, and chunks of the SST file	2017-06-27 16:25:18 +01:00
Martin Sumner	e938eaa153	Add close to test	2017-06-23 16:51:28 +01:00
Martin Sumner	99131320c5	Broken test log	2017-06-23 15:20:24 +01:00
martinsumner	25a5065edd	Re-introduce test (again)	2017-06-23 14:56:32 +01:00
martinsumner	5e9e1347c7	Add test to find {term, key} that represents difference Not just detect existence of difference, but clarify what that difference that is.	2017-06-23 14:55:49 +01:00
martinsumner	2be4422e47	Re-add test	2017-06-23 12:44:52 +01:00
martinsumner	4e5c3e2f64	Fix merge Fix typo in merge, and extra validation step to unit tests to prevent it returning.	2017-06-23 12:32:37 +01:00
martinsumner	47655dc9c7	Uncomment previous test	2017-06-22 14:30:14 +01:00
martinsumner	5a012ff8a6	Add test of index comparison Compare two indexes for consistency	2017-06-22 13:54:51 +01:00
martinsumner	7cfa392b6e	Flexible TicTacTree sizes Allow tictac tree sizes to be flexible. Tested lots of different sizes. Having both level 1 and level 2 the same size seemed to be consistently quicker than trying to make either of the levels relatively wider. There's an 8% performance improvement if the SegmentCount is reduced by a quarter.	2017-06-20 10:58:13 +01:00
martinsumner	d5b4cb844f	Finding keys Progresses from a segment list to scanning for the keys in that segment	2017-06-19 18:38:55 +01:00
martinsumner	8203487a11	Expanded test ct testing of tictac trees now compares between differently partitioned stores.	2017-06-19 15:43:19 +01:00
Martin Sumner	833c7a80cb	corrected test differing object was in wrong bucket	2017-06-19 13:11:43 +01:00
martinsumner	c586b78f45	Initial code with busted ct test Initiat comparison made betwene trees externally - but ct test is bust.	2017-06-19 11:36:57 +01:00
martinsumner	f5dd154cee	Rename hashtree query Naming is now confusing now we have TicTac Trees. This query builds a list of keys and hashes not a tree - so it was misleading anyaway. Now renamed hashlist_query.	2017-06-16 12:38:59 +01:00
Martin Sumner	7642aac2cc	Change Riak object hash approach Change the riak object hash being kept in the metadata, to being a hash of the vector clock	2017-06-16 10:14:24 +01:00
martinsumner	94f3e036ea	Add journal compaction testing	2017-06-06 16:30:02 +01:00
Martin Sumner	15c52ae118	Change default compaction settings Need to allow specific settings to be passed into unit tests. Also, too much journal compaction may lead to intermittent failures on the basic_SUITE space_clear_on_delete test. think this is because there are less “deletes” to reload in on startup to trigger the cascade down and clear up?	2017-06-02 08:37:57 +01:00
Martin Sumner	96a548e17a	Change tests - binary keys the new code requires bucket listing to be on binary keys not just binary buckets. As this is only intended for use within Riak (where all keys are buckets are binaries), this constraint seems OK. A test needed changing to ensure it had a binary key in the bucket.	2017-05-23 15:54:11 +01:00
martinsumner	0d8ab0899e	Add test for is_empty Bucket listing didn't care if keys were active - now does.	2017-05-23 11:59:44 +01:00
martinsumner	a052edaea0	Add volume test information Volume testing with AAE rebuilds	2017-04-24 11:24:14 +01:00
martinsumner	fbb4879d81	Change fold_heads to do basic Journal presence check This at least checks the file is present, and the Key exists in the index of that file. If the value is corrupt it will be removed by compation, and then this will fail (unless the file is never compacted). TODO: resolve issus of files which are corrupt - but never compacted - a job for backup?	2017-04-21 15:55:03 +01:00
Martin Sumner	fa9daf8696	Correct async fold fold objects which snaps in the fold was implemented incorrectly - it took information from the LedgeCache at the point of the request, not at the point of the fold. So the LedgerCache SQN may have been surpassed in the Penciller by the time the fold was called.	2017-04-17 23:01:55 +01:00

... 2 3 4 5 6 ...

308 commits