leveled

martinsumner 9fca17d56a WIP - Recent Modifications Just some initial WIP code for this. Will revisit this again after exploring some ideas as to how to reduce the cost of the get_keys_by_segment. The overlal idea is that there are trees of recent modifications, with recent being some rolling time window made up of hourly blocks, and recency being dtermined by the last-modified date on the object metadata - which should be conistent across a cluster. So if we were at 15:30 we would get the tree for 14:00 - 15:00 and the tree for 15:00-16:00 from two different queries which cover the same partitions and then compare. Comparison may find differences, and we know what segment the difference is in - but how to then find all keys in that segment which have been modified in the period? Three ways: Do it inefficeintly and infrequently using a fold_keys and a filter (perhaps with SST files having a highest LMD in the metadata so that they can be skipped). Add a special index, where verye entry has a TTL, and the Key is {$segment, Segment, Bucket, Key} so that a normal 2i query cna be used. Align hashing for segments with hashing for penciller lookup so that a query over the actual keys cna be optimised skipping chunks of the in-memory part, and chunks of the SST file		2017-06-27 16:25:18 +01:00
..
end_to_end	WIP - Recent Modifications	2017-06-27 16:25:18 +01:00
volume	Add journal compaction testing	2017-06-06 16:30:02 +01:00
lookup_test.erl	Initial work on sft files	2016-05-31 17:21:14 +01:00
rice_test.erl	Tidy-up initial files and add testing to optimise bst bloom filters	2015-05-31 23:31:31 +01:00