leveled

Author	SHA1	Message	Date
Martin Sumner	70ebb62a61	Search the loader's mock cache .. (#354 ) ... in the correct direction - otherwise frequently updated objects may not be indexed correctly on reload.	2021-10-04 13:34:29 +01:00
Martin Sumner	ed0301e2cf	Mas i335 otp24 (#336 ) * Address OTP24 warnings, ct and eunit paths * Reorg to add OTP 24 support * Update VOLUME.md * Correct broken refs * Update README.md * CI on all main branches Co-authored-by: Ulf Wiger <ulf@wiger.net>	2021-05-25 13:41:20 +01:00
Martin Sumner	9157de680e	Change refernces to loop state records Resolve issue with OTP 22 performance https://github.com/martinsumner/leveled/issues/326 - by changing refernces to loop state. The test perf_SUITE proves the issue. OTP 22, without fixes: Fold pre-close 41209 ms post-close 688 ms OTP 22, with fixes: Fold pre-close 401 ms post-close 317 ms	2021-01-11 10:39:34 +00:00
Martin Sumner	f3f574de02	Switch to checking on get_kvrange In production scale testing, placing te check_modified call on get_kvrange not get_slots made the performance difference. It should help in get_lots as well, but unable to reliably get coverage in tests with this. So for now, will leave off until a proper test can be constructed which demonstrates any benefits.	2020-12-03 13:37:22 +00:00
Martin Sumner	a210aa6846	Promote cache when scanning When scanning over a leveled store with a helper (e.g. segment filter and last modified date range), applying the filter will speed up the query when the block index cache is available to get_slots. If it is not available, previously the leveled_sst did not then promote the cache after it had accessed the underlying blocks. Now the code does this, and also when the cache has all been added, it extracts the largest last modified date so that sst files older than the passed in date can be immediately dismissed	2020-12-02 13:29:50 +00:00
Martin Sumner	b4c79caf7a	Allow for caching of compaction scores Potentially reduce the overheads of scoring each file on every run. The change also alters the default thresholds for compaction to favour longer runs (which will tend towards greater storage efficiency).	2020-11-27 02:35:27 +00:00
Martin Sumner	312fc52832	Extend test to make it highly likely a "garbage" merge file choice is made	2020-03-31 09:33:50 +01:00
Martin Sumner	9e56bfa947	Merge branch 'master' into mas-i311-mergeselector	2020-03-30 20:07:05 +01:00
Martin Sumner	42eb5f56bc	Merge branch 'master' into mas-i311-mergeselector	2020-03-27 17:11:18 +00:00
Martin Sumner	aca945a171	Add counting of tombstones to new SST files .. and that old-style SST files cna still be created, and opened, with a return of 'not_counted'	2020-03-27 10:20:10 +00:00
Martin Sumner	50cb98ecdd	Resolve intermittent test failure the previous regex filter still allowed files with cdb in the body of the name (which can be true as filenames are guid based)	2020-03-17 17:29:59 +00:00
Martin Sumner	808a858d09	Don't score a rolling file In giving an empty file a score of 0, a race condition was exposed. A file might not be active, but might still be rolling - and then cna get scored as 0, and immediately compacted. It will then be removed from the journal manifest. Check each file is not rolling before making it a candidate for rolling.	2020-03-16 21:41:47 +00:00
Martin Sumner	dbceda876c	Issue with tag order https://github.com/martinsumner/leveled/issues/309 Resolve issue, and remove test log entries used when discovering issue.	2020-03-16 16:35:06 +00:00
Martin Sumner	6350302ea8	Uncomment test	2020-03-16 13:32:52 +00:00
Martin Sumner	9d92ca0773	Add tests for appDefined functions	2020-03-16 12:51:14 +00:00
Martin Sumner	706ba8a674	Resolve issues with passing specs around	2020-03-15 23:15:09 +00:00
Martin Sumner	694d2c39f8	Support for recalc Initial test included for running with recallc, and also transition from retain to recalc. Moves all logic for startup fold into leveled_bookie - avoid the Inker requiring any direct knowledge about implementation of the Penciller.	2020-03-15 22:14:42 +00:00
Martin Sumner	156e7b064d	Compaction, retain and recovery Change the penciller check so that it returns current/replaced/missing not just true/false. Reduce unnecessary penciller checks for non-standard keys that will always be retained - and remove redunandt code. Expand tests of retain and recover to make sure that compaction on delete is well covered. Also move the SQN number laong during initial loads - to stop aggressive loop to find starting SQN every file.	2020-03-09 15:12:48 +00:00
Martin Sumner	0966ce9929	Test improvements Improve the speed of leveled_cdb tests by disabling sync on write. Improve the strength of check of the correct behaviour when compacting with a reduced journal size.	2019-08-29 10:32:07 +01:00
Martin Sumner	8587686783	Add testing to ensure keydeltas are compacted in test	2019-07-26 21:43:00 +01:00
Martin Sumner	dab9652f6c	Add ability to control journal size by object count This helps when there are files wiht large numbers of key deltas (and hence small values), where otherwise the object count may get out of control.	2019-07-25 09:45:23 +01:00
Martin Sumner	22e732841c	Compaction of already compacted journals Ensure that journals with a large volume of key deltas do not erroneously get repeatedly compacted.	2019-07-24 18:03:22 +01:00
Martin Sumner	f8b3101a3a	Two memory management helpers Two helpers for memory management: 1 - a scan over the cdb file may lead to a lot of binary references being made. So force a GC fater the scan. 2 - the penciller files contain slots that will be frequently read - so advice the page cache to pre-load them on startup. This is in response to unexpected memory mangement issues in a potentially non-conventional setup - where the erlang VM held a lot of memory (that could be GC'd , in preference to the page cache - and consequently disk I/O and request latency were higher than expected.	2019-07-15 13:44:39 +01:00
Martin Sumner	952f088873	Memory management Extracting binary from within a binary leaves a reference to the whole of the original binary. If there are a lot of very large objects received abck toback - this can explode the amount of memory the penciller appears to hold (and gc cannot resolve this). To dereference from the larger binary, need to do a binary copy	2019-06-15 17:23:06 +01:00
Martin Sumner	876a023db1	Add database_id to options So that this can be recorded in logs	2019-06-13 14:58:32 +01:00
Martin Sumner	e360b97cfb	GC manifest files when numbers skipped Otherwise list of old files perpetually grows	2019-05-23 10:16:15 +01:00
Martin Sumner	14e1f577c9	Test default tag	2019-03-14 00:08:01 +00:00
Martin Sumner	01f0dadbb3	Add access to SQN Use book_sqn/3 or book_sqn/4 to get the SQN of an object in the store.	2019-03-13 16:21:03 +00:00
Martin Sumner	be6e23f7de	Change cache_size in sst tests Makes results more predictable (with coin toss variations)	2019-01-29 13:40:55 +00:00
Martin Sumner	e3bd83179a	Uncomment tests!	2019-01-27 23:31:44 +00:00
Martin Sumner	8f6862a10b	Test sst slot configuration change Confirm it results in many more files, if the slot count reduced. Has to handle the fact that Level 0 file has unlimited slots regardless of number of slots configured	2019-01-27 22:03:55 +00:00
Martin Sumner	e349774167	Allow clerk to be stopped during compaction scoring This will stop needless compaction work from being completed when the iclerk is sent a close at this stage.	2019-01-25 12:11:42 +00:00
Martin Sumner	0333604fd9	Change to cast in inker/iclerk interaction This allows for leveled_iclerk:clerk_stop to be a sync call, so that files will only be closed once the iclerk has stopped. This is designed ot prevent iclerk crashes during shutdowns when files it is depnding on are closed mid shutdown.	2019-01-24 21:32:54 +00:00
Martin Sumner	28d0aef5fe	Make check that compaction not ongoing before accepting new compaction Respond 'busy' if compaction is ongoing	2019-01-24 15:46:17 +00:00
Martin Sumner	c060c0e41d	Handle L0 cache being full A test thta will cause leveled to crash due to a low cache size being set - but protect against this (as well as the general scenario of the cache being full). There could be a potential case where a L0 file present (post pending) without work backlog being set. In this case we want to roll the level zero to memory, but don't accept the cache update if the L0 cache is already full.	2019-01-14 12:27:51 +00:00
Martin Sumner	672cfd4fcd	Allow for run-time changes to log_level and forced_logs Will not lead to immediate run time changes in SST or CDB logs. These log settings will only change once the new files are re-written. To completely change the log level - a restart of the store is necessary with new startup options.	2018-12-11 21:59:57 +00:00
Martin Sumner	6677f2e5c6	Push log update through to cdb/sst Using the cdb_options and sst_options records	2018-12-11 20:42:00 +00:00
Martin Sumner	f274d2a63a	Tighten acceptable duration even with cover, passes in 30s.	2018-12-10 13:23:39 +00:00
Martin Sumner	e73f48a18b	Add failing test Test fails as fetching repeated object is too slow. ```Head check took 124301 microseconds checking list of length 5000 Head check took 112286 microseconds checking list of length 5000 Head check took 1336512 microseconds checking list of length 5 2018-12-10T11:54:41.342 B0013 <0.2459.0> Long running task took 260788 microseconds with task of type pcl_head 2018-12-10T11:54:41.618 B0013 <0.2459.0> Long running task took 276508 microseconds with task of type pcl_head 2018-12-10T11:54:41.894 B0013 <0.2459.0> Long running task took 275225 microseconds with task of type pcl_head 2018-12-10T11:54:42.173 B0013 <0.2459.0> Long running task took 278836 microseconds with task of type pcl_head 2018-12-10T11:54:42.477 B0013 <0.2459.0> Long running task took 304524 microseconds with task of type pcl_head``` It taks twice as long to check for one repeated object as it does to check for 5K non-repeated objects	2018-12-10 11:58:21 +00:00
Martin Sumner	8e687ee7c8	Add user-defined functions To allow for extraction of metadata, and building of head responses - it should eb possible to dynamically and user-defined tags, and functions to treat them. If no function is defined, revert to the behaviour of the ?STD tag.	2018-12-06 21:00:59 +00:00
Martin Sumner	881b93229b	Isolate better changes needed to support changes to metadata extraction More obvious how to extend the code as it is all in one module. Also add a new field to the standard object metadata tuple that may hold in the future other object metadata base don user-defined functions.	2018-12-06 15:31:11 +00:00
Martin Sumner	510994233e	Add check that index disappears Check I0 count goes down when that index is removed	2018-12-05 15:42:21 +00:00
Martin Sumner	cf1fcaeef2	Add test of index expiry To show how this works, and prove that it does work thta way. Test may require adjusting if tested on a slow node (e.g. reduce KeyCount or increase TTL)	2018-12-05 15:18:20 +00:00
Martin Sumner	578a9f88e0	Support for log settings at startup Both log level and forced_logs. Allows for log_level to be changed at startup ad runtime. Also allow for a list of forced logs, so if log_level is set > info, individual info logs can be forced to be seen (such as to see stats logs).	2018-12-05 00:17:39 +00:00
Martin Sumner	6d2d0694e3	Reverse necessary on bucket list The function should see the buckets in order, so it accumulates in such a way to reverse the order - it makes sense that the outcome should be in reverse.	2018-11-23 19:03:24 +00:00
Martin Sumner	a9aa23bc9c	Bucket list update the docs to advertise throw capability. Test it for bucket list (and fix ordering of bucket lists)	2018-11-23 18:56:30 +00:00
Martin Sumner	ef2a8c62af	Add capability to exit a head or object fold with a throw This allows for all fold functions to throw an exception to exit out of a fold with all dependencies still closed down as expected. This was previously available for key folds, which was necessary for the folds to work in Riak (as max_results in index queries depends one xiting the fold with an exception). This change now adds a ct test, and adds support for head folds, object folds (key order) and object folds (sqn order)	2018-11-23 16:00:11 +00:00
Martin Sumner	2afb160a12	Add test - large seglist	2018-11-07 21:35:21 +00:00
Martin Sumner	e9fb893ea0	Check segment is as expected with tuplebuckets In head_only mode	2018-11-05 10:31:15 +00:00
Martin Sumner	e72a946f43	TupleBuckets in Riak objects Adds support with test for tuplebuckets in Riak keys. This exposed that there was no filter using the seglist on the in-mmemory keys. This means that if there is no filter applied in the fold_function, many false positives may emerge. This is probably not a big performance benefit (and indeed for performance it may be better to apply during the leveled_pmem:merge_trees). Some thought still required as to what is more likely to contribute to future bugs: an extra location using the hash matching found in leveled_sst, or the extra results in the query.	2018-11-05 01:21:08 +00:00

1 2 3 4 5 ...

258 commits