* feat(backends/s3): add warmup support before repacks and restores
This commit introduces basic support for transitioning pack files stored
in cold storage to hot storage on S3 and S3-compatible providers.
To prevent unexpected behavior for existing users, the feature is gated
behind new flags (sketched in the example after this list):
- `s3.enable-restore`: opt-in flag (defaults to false)
- `s3.restore-days`: number of days for the restored objects to remain
in hot storage (defaults to `7`)
- `s3.restore-timeout`: maximum time to wait for a single restoration
(defaults to `1 day`)
- `s3.restore-tier`: retrieval tier at which the restore will be
processed (defaults to `Standard`)
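For illustration only, the options might be declared on the S3 backend
configuration roughly as follows; the struct layout, tags, and help texts
below are assumptions, not the actual implementation:
```go
package s3

import "time"

// Config sketches how the new cold-storage options could sit next to the
// existing S3 settings. Whether the real implementation uses exactly these
// struct tags, types, and help texts is an assumption.
type Config struct {
	// ... existing S3 options (bucket, prefix, storage class, ...) ...

	EnableRestore  bool          `option:"enable-restore"  help:"restore objects from cold storage before using them (default: false)"`
	RestoreDays    int           `option:"restore-days"    help:"days restored objects stay in hot storage (default: 7)"`
	RestoreTimeout time.Duration `option:"restore-timeout" help:"maximum time to wait for a single restoration (default: 1 day)"`
	RestoreTier    string        `option:"restore-tier"    help:"retrieval tier for the restoration, e.g. Standard, Bulk, Expedited (default: Standard)"`
}
```
Passing them would then work like the existing extended options, e.g.
`-o s3.enable-restore=true`, similar to `s3.storage-class`.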
As restoration times can be lengthy, this implementation preemptively
restores the selected packs to avoid repeated restore delays during
downloads. This is slightly sub-optimal: we could process packs out of
order (as soon as each one has been transitioned), but that would add
too much complexity for a marginal gain in speed.
To keep things simple and to avoid resource exhaustion with large
numbers of packs, no new concurrency mechanisms or goroutines were
added; the feature simply hooks into the existing routines.
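To make the two-phase approach concrete, here is a hedged sketch of the
warmup step; `warmer`, `restoreObject`, and `isRestored` are hypothetical
names standing in for the real backend API:
```go
package warmup

import (
	"context"
	"errors"
	"fmt"
	"time"
)

// warmer is a hypothetical abstraction over the S3 restore API: one call to
// request the transition to hot storage, one to check whether it finished.
type warmer interface {
	restoreObject(ctx context.Context, key string, days int, tier string) error
	isRestored(ctx context.Context, key string) (bool, error)
}

// warmupPacks first issues all restore requests, then waits for each pack to
// become available, so the downloads that follow never stall mid-way on an
// individual restoration.
func warmupPacks(ctx context.Context, w warmer, packs []string, days int, tier string, timeout time.Duration) error {
	// Phase 1: request restoration for every affected pack up front.
	for _, key := range packs {
		if err := w.restoreObject(ctx, key, days, tier); err != nil {
			return fmt.Errorf("requesting restore of %v: %w", key, err)
		}
	}

	// Phase 2: wait until every pack has reached hot storage.
	for _, key := range packs {
		deadline := time.Now().Add(timeout) // per-pack restore timeout
		for {
			done, err := w.isRestored(ctx, key)
			if err != nil {
				return err
			}
			if done {
				break
			}
			if time.Now().After(deadline) {
				return errors.New("timed out waiting for pack restoration")
			}
			select {
			case <-ctx.Done():
				return ctx.Err()
			case <-time.After(30 * time.Second): // polling interval is illustrative
			}
		}
	}
	return nil
}
```
In line with the goal above, the sketch starts no goroutines; it only
performs two sequential passes over the affected packs.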
**Limitations:**
- Tests against the backend were not written due to the lack of cold
storage class support in MinIO. Testing was done manually on
Scaleway's S3-compatible object storage. If necessary, we could
explore testing with LocalStack or mocks, though this requires further
discussion.
- Currently, this feature only warms up packs before restores and
repacks (prune/copy), as those are the two main use cases I came across.
Support for other commands may be added in future iterations, as long
as affected packs can be calculated in advance.
- The feature is gated behind a new alpha `s3-restore` feature flag to
make it explicit that the feature is still wet behind the ears.
- There is no explicit user notification for ongoing pack restorations.
While I think it is not necessary because of the opt-in flag, showing
some notice may improve usability (but would probably require major
refactoring in the progress bar, which I didn't want to start). Another
possibility would be to add a flag that only sends the restore requests
and fails early.
See https://github.com/restic/restic/issues/3202
* ui: warn user when files are warming up from cold storage
* refactor: remove the PacksWarmer struct
Handling multiple handles directly in the backend is easier, and it may
open the door to reducing the number of requests made to the backend in
the future.
When the context used for a load operation is canceled, the result is
always an error, regardless of whether the file could be retrieved from
the backend. Do not trip the circuit breaker in this case, as that would
be a false positive.
The old behavior was problematic when trying to lock a repository: when
`Lock.checkForOtherLocks` listed multiple lock files in parallel and one
of them failed to load, all other loads were canceled. The circuit
breaker remembered this cancelation, so subsequent locking retries would
fail.
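A minimal sketch of the intended check, using an illustrative failure
cache in place of restic's actual circuit breaker:
```go
package retry

import (
	"context"
	"sync"
)

// breaker is an illustrative failure cache: it remembers files whose loads
// failed for good so that later loads can fail fast.
type breaker struct {
	mu     sync.Mutex
	failed map[string]struct{}
}

// recordFailure marks a file as broken, but only if the failure was not
// caused by the caller canceling the context: a canceled load says nothing
// about whether the file is actually retrievable.
func (b *breaker) recordFailure(ctx context.Context, file string) {
	if ctx.Err() != nil {
		// e.g. a parallel lock-file listing was aborted; do not trip the
		// breaker on this false positive.
		return
	}
	b.mu.Lock()
	defer b.mu.Unlock()
	if b.failed == nil {
		b.failed = make(map[string]struct{})
	}
	b.failed[file] = struct{}{}
}
```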
Calls to `List(ctx, ...)` are usually stopped by canceling the context
once no further entries are required by the caller. Thus, don't log the
final error if the context was canceled.
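Sketched as a standalone helper (the name is illustrative), the logging
rule amounts to:
```go
package retry

import (
	"context"
	"log"
)

// logListError reports a final error from List, except when the caller
// stopped the listing on purpose by canceling the context.
func logListError(ctx context.Context, err error) {
	if err == nil || ctx.Err() != nil {
		// A canceled context is the normal way to stop List early once
		// enough entries have been seen; it is not worth logging.
		return
	}
	log.Printf("List returned error: %v", err)
}
```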
Retries in restic try to solve two main problems:
- retry a temporarily failed operation
- tolerate temporary network interruptions
The first problem only requires a few retries, whereas the second
primarily benefits from spreading the requests over a longer duration.
Increasing the default multiplier and the initial interval works for
both cases. The first few retries only take a few seconds, while later
retries quickly reach the maximum interval of one minute. This keeps
the total number of retries issued by restic at around 21 over a
15-minute period. As the concurrency in restic is bounded, operations
waiting in retries drastically reduce the number of requests sent to a
backend, which helps to prevent overloading it.
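To make the arithmetic tangible, the following standalone calculation
shows how such a backoff schedule plays out; the concrete parameters
(2s initial interval, multiplier 1.6) are illustrative, not necessarily
the values chosen here:
```go
package main

import (
	"fmt"
	"time"
)

func main() {
	// Illustrative backoff parameters: a larger initial interval and
	// multiplier, capped at one minute per retry and 15 minutes in total
	// (randomization omitted).
	initial := 2 * time.Second
	multiplier := 1.6
	maxInterval := time.Minute
	maxElapsed := 15 * time.Minute

	interval := initial
	elapsed := time.Duration(0)
	retries := 0
	for elapsed+interval <= maxElapsed {
		elapsed += interval
		retries++
		interval = time.Duration(float64(interval) * multiplier)
		if interval > maxInterval {
			interval = maxInterval
		}
	}
	// With these numbers, the first few retries happen within seconds, the
	// rest at the one-minute cap, for roughly 20-21 retries in 15 minutes.
	fmt.Printf("%d retries in %v\n", retries, elapsed)
}
```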
Previously, if an operation failed after 15 minutes, it would never be
retried. This meant that large backend requests were less reliable than
smaller ones.
Depending on how long an operation takes to fail, the total retry
duration currently varies between 1.5 and 15 minutes. For temporarily
interrupted network connections in particular, the lower end of that
range is too short. Thus, always use a limit of 15 minutes.
If a file exhausts its retry attempts, then it is likely not accessible
the next time. Thus, immediately fail all load calls for that file to
avoid useless retries.
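A hedged sketch of this fail-fast behavior as a wrapper around a generic
load function; `loader` and `failFast` are illustrative names, not
restic's API:
```go
package retry

import (
	"context"
	"fmt"
	"io"
	"sync"
)

// loader stands in for a backend's Load operation.
type loader func(ctx context.Context, name string) (io.ReadCloser, error)

// failFast wraps a loader: once a file has exhausted its retry attempts, all
// subsequent loads of that file return an error immediately.
func failFast(load loader) loader {
	var mu sync.Mutex
	failed := make(map[string]bool)

	return func(ctx context.Context, name string) (io.ReadCloser, error) {
		mu.Lock()
		broken := failed[name]
		mu.Unlock()
		if broken {
			// The file already failed all retries; skip the backend entirely.
			return nil, fmt.Errorf("load(%v): circuit breaker open", name)
		}

		rd, err := load(ctx, name)
		if err != nil && ctx.Err() == nil {
			// Remember the exhausted file, unless the caller merely canceled.
			mu.Lock()
			failed[name] = true
			mu.Unlock()
		}
		return rd, err
	}
}
```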
The Test method was used in exactly one place: when creating a new
repository, to check whether a config file already exists.
Use a combination of Stat() and IsNotExist() instead.
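A sketch of the replacement pattern, using a minimal stand-in interface
rather than restic's real backend types (IsNotExist is approximated with
`errors.Is(err, os.ErrNotExist)`):
```go
package repository

import (
	"context"
	"errors"
	"fmt"
	"os"
)

// statBackend is a minimal stand-in for the parts of a backend used here.
type statBackend interface {
	Stat(ctx context.Context, name string) (os.FileInfo, error)
}

// configExists replaces the former Test call: stat the config file and treat
// "not found" as the only non-error negative outcome.
func configExists(ctx context.Context, be statBackend) (bool, error) {
	_, err := be.Stat(ctx, "config")
	switch {
	case err == nil:
		return true, nil
	case errors.Is(err, os.ErrNotExist):
		return false, nil
	default:
		return false, fmt.Errorf("stat config: %w", err)
	}
}
```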
The ioutil functions have been deprecated since Go 1.17 and only wrap
other library functions. Thus, call the underlying functions directly.
This commit only mechanically replaces the function calls.
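For example, the mechanical mapping looks like this (the surrounding
function is only a vehicle for the calls):
```go
package example

import (
	"io"
	"os"
)

// mechanicalReplacements shows the mapping used: each deprecated ioutil call
// is swapped for the function it wraps.
func mechanicalReplacements(path string, rd io.Reader) error {
	data, err := os.ReadFile(path) // was ioutil.ReadFile
	if err != nil {
		return err
	}
	if err := os.WriteFile(path+".bak", data, 0o600); err != nil { // was ioutil.WriteFile
		return err
	}
	if _, err := io.ReadAll(rd); err != nil { // was ioutil.ReadAll
		return err
	}
	f, err := os.CreateTemp("", "example-") // was ioutil.TempFile
	if err != nil {
		return err
	}
	return f.Close()
}
```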
The RetryBackend tests depend on the mock backend. When the Backend
interface is eventually split from the restic package, this will lead to
a dependency cycle between backend and backend/mock. Thus split the
RetryBackend into a separate package to avoid this problem.