restic

Commit Graph

Author	SHA1	Message	Date
Michael Eischer	5756c96c9f	archiver: Fix race condition resulting in files containing null IDs In some rare cases files could be created which contain null IDs (all zero) in their content list. This was caused by a race condition between growing the `Content` slice and inserting the blob IDs into it. In some cases the blob ID was written to the old slice, which a short time afterwards was replaced with a larger copy, that did not yet contain the blob ID.	2022-11-10 20:19:37 +01:00
Michael Eischer	b1d1202b1d	archiver: Check that saved file does not have null IDs in content Null IDs in the file content indicate that something went wrong. Thus fails before saving the affected file.	2022-11-08 22:57:41 +01:00
Michael Eischer	c0f34af9db	backup: hide files from status which are read completely but not saved As the FileSaver is asynchronously waiting for all blobs of a file to be stored, the number of active files is higher than the number of files from which restic is reading concurrently. Thus to not confuse users, only display files in the status from which restic is currently reading.	2022-10-30 10:29:12 +01:00
Michael Eischer	b4de902596	archiver: Asynchronously complete FutureFile After reading and chunking all data in a file, the FutureFile still has to wait until the FutureBlobs are completed. This was done synchronously which results in blocking the file saver and prevents the next file from being read. By replacing the FutureBlob with a callback, it becomes possible to complete the FutureFile asynchronously.	2022-10-30 10:29:11 +01:00
Michael Eischer	b361284f28	Merge pull request #3979 from MichaelEischer/backup-less-time-now backup: reduce calls to time.Now	2022-10-21 21:33:34 +02:00
Michael Eischer	c8c8391b21	Merge pull request #3974 from greatroar/cleanup More cleanups and a micro-optimization	2022-10-21 21:11:37 +02:00
Michael Eischer	ee7c28f5e6	backup: reduce calls to time.Now Archiver.Save queries the current time multiple times. This commit removes one of these calls as they showed up while profiling a backup of a nearly unchanged dataset containing 3 million files.	2022-10-21 20:55:01 +02:00
greatroar	22147e1e02	all: Minor cleanups if x { return true } return false => return x fmt.Sprintf("%v", x) => fmt.Sprint(x) or x.String() The fmt.Sprintf idiom is still used in the SecretString tests, where it serves security hardening.	2022-10-16 10:50:39 +02:00
Michael Eischer	964977677f	backup: Remove unused filename parameter from CompleteBlob callback	2022-10-15 15:21:17 +02:00
Michael Eischer	a3113c6097	restic: Change FindSnapshot functions to return the snapshot	2022-10-15 13:34:04 +02:00
greatroar	16849d5361	internal/archiver: Missing argument to errors.Errorf	2022-10-14 14:18:52 +02:00
Michael Eischer	1a6160d152	Merge pull request #3880 from MichaelEischer/archiver-savedir-cleanup archiver: Improve handling of "file xxx already present" error	2022-10-08 21:48:14 +02:00
Michael Eischer	5278ab51c8	archiver: Check that duplicates are only ignored if identical	2022-10-08 21:38:36 +02:00
Michael Eischer	403b01b788	backup: Only return a warning for duplicate directory entries The backup command failed if a directory contains duplicate entries. Downgrade the severity of this problem from fatal error to a warning. This allows users to still create a backup.	2022-10-08 21:38:21 +02:00
Michael Eischer	d7d7b4ab27	archiver: refactor TreeSaverTest	2022-10-08 21:29:32 +02:00
Michael Eischer	8e38c43c27	archiver: let FutureNode.Take return an error if no data is available This ensures that we cannot accidentally store an invalid node.	2022-10-08 21:28:39 +02:00
Michael Eischer	2b88cd6eab	archiver: Restructure SaveTree to work like SaveDir SaveTree did not use the TreeSaver but rather managed the tree collection and upload itself. This prevents using the parallelism offered by the TreeSaver and duplicates all related code. Using the TreeSaver can provide some speed-ups as all steps within the backup tree now rely on FutureNodes. This can be especially relevant for backups with large amounts of explicitly specified files. The main difference between SaveTree and SaveDir is, that only the former can save tree blobs in which nodes have a different name than the actual file on disk. This is the result of resolving name conflicts between multiple files with the same name. The filename that must be used within the snapshot is now passed directly to restic.NodeFromFileInfo. This ensures that a FutureNode already contains the correct filename.	2022-10-08 21:28:39 +02:00
Michael Eischer	2e3f1c08c5	repository: split index into a separate package	2022-10-08 21:15:34 +02:00
Michael Eischer	2e606ca70b	backup: rework read concurrency	2022-10-02 22:55:14 +02:00
Michael Eischer	4a10ebed15	archiver: reduce memory usage for large files FutureBlob now uses a Take() method as a more memory-efficient way to retrieve the futures result. In addition, futures are now collected while saving the file. As only a limited number of blobs can be queued for uploading, for a large file nearly all FutureBlobs already have their result ready, such that the FutureBlob object just consumes memory.	2022-07-23 14:45:07 +02:00
Michael Eischer	b817681a11	archiver: Incrementally serialize tree nodes That way it is not necessary to keep both the Nodes forming a Tree and the serialized JSON version in memory.	2022-07-23 14:45:07 +02:00
Michael Eischer	c206a101a3	archiver: unify FutureTree/File into futureNode There is no real difference between the FutureTree and FutureFile structs. However, differentiating both increases the size of the FutureNode struct. The FutureNode struct is now only 16 bytes large on 64bit platforms. That way is has a very low overhead if the corresponding file/directory was not processed yet. There is a special case for nodes that were reused from the parent snapshot, as a go channel seems to have 96 bytes overhead which would result in a memory usage regression.	2022-07-23 14:45:07 +02:00
Michael Eischer	32f4997733	archiver: remove unused fileInfo from progress callback	2022-07-23 14:16:23 +02:00
Michael Eischer	dcb00fd2d1	archiver: cleanup Saver interface	2022-07-23 14:16:23 +02:00
Michael Eischer	79321a195c	archiver: remove dead attribute from FutureNode	2022-07-23 14:16:23 +02:00
MichaelEischer	443cc49afd	Merge pull request #3830 from MichaelEischer/cleanup-repo Extract Load/SaveTree/JSONUnpacked from repository	2022-07-23 10:46:13 +02:00
Michael Eischer	8c11fc3ec9	crypto: move crypto buffer helpers	2022-07-17 13:42:23 +02:00
Michael Eischer	89d3ce852b	repository: extract Load/StoreJSONUnpacked A Load/Store method for each data type is much clearer. As a result the repository no longer needs a method to load / store json.	2022-07-17 13:22:00 +02:00
Michael Eischer	fbcbd5318c	repository: extract LoadTree/SaveTree The repository has no real idea what a Tree is. So these methods never belonged there.	2022-07-17 13:11:28 +02:00
Michael Eischer	ce89018902	Fix data race in blob_saver After the `BlobSaver` job is submitted, the buffer can be released and reused by another `FileSaver` even before `BlobSaver.Save` returns. That FileSaver will the change `buf.Data` leading to wrong backup statistics. Found by `go test -race ./...`: WARNING: DATA RACE Write at 0x00c0000784a0 by goroutine 41: github.com/restic/restic/internal/archiver.(FileSaver).saveFile() /home/michael/Projekte/restic/restic/internal/archiver/file_saver.go:176 +0x789 github.com/restic/restic/internal/archiver.(FileSaver).worker() /home/michael/Projekte/restic/restic/internal/archiver/file_saver.go:242 +0x2af github.com/restic/restic/internal/archiver.NewFileSaver.func2() /home/michael/Projekte/restic/restic/internal/archiver/file_saver.go:88 +0x5d golang.org/x/sync/errgroup.(Group).Go.func1() /home/michael/go/pkg/mod/golang.org/x/sync@v0.0.0-20210220032951-036812b2e83c/errgroup/errgroup.go:57 +0x91 Previous read at 0x00c0000784a0 by goroutine 29: github.com/restic/restic/internal/archiver.(BlobSaver).Save() /home/michael/Projekte/restic/restic/internal/archiver/blob_saver.go:57 +0x1dd github.com/restic/restic/internal/archiver.(BlobSaver).Save-fm() <autogenerated>:1 +0xac github.com/restic/restic/internal/archiver.(FileSaver).saveFile() /home/michael/Projekte/restic/restic/internal/archiver/file_saver.go:191 +0x855 github.com/restic/restic/internal/archiver.(FileSaver).worker() /home/michael/Projekte/restic/restic/internal/archiver/file_saver.go:242 +0x2af github.com/restic/restic/internal/archiver.NewFileSaver.func2() /home/michael/Projekte/restic/restic/internal/archiver/file_saver.go:88 +0x5d golang.org/x/sync/errgroup.(Group).Go.func1() /home/michael/go/pkg/mod/golang.org/x/sync@v0.0.0-20210220032951-036812b2e83c/errgroup/errgroup.go:57 +0x91	2022-07-03 14:47:53 +02:00
Michael Eischer	fa25d6118e	archiver: Reduce tree saver concurrency Large amount of tree savers have no obvious benefit, however they can increase the amount of (potentially large) trees kept in memory.	2022-07-02 22:42:34 +02:00
Michael Eischer	bba1e81719	archiver: Limit blob saver count to GOMAXPROCS Now with the asynchronous uploaders there's no more benefit from using more blob savers than we have CPUs. Thus use just one blob saver for each CPU we are allowed to use.	2022-07-02 22:42:34 +02:00
Michael Eischer	120ccc8754	repository: Rework blob saving to use an async pack uploader Previously, SaveAndEncrypt would assemble blobs into packs and either return immediately if the pack is not yet full or upload the pack file otherwise. The upload will block the current goroutine until it finishes. Now, the upload is done using separate goroutines. This requires changes to the error handling. As uploads are no longer tied to a SaveAndEncrypt call, failed uploads are signaled using an errgroup. To count the uploaded amount of data, the pack header overhead is no longer returned by `packer.Finalize` but rather by `packer.HeaderOverhead`. This helper method is necessary to continue returning the pack header overhead directly to the responsible call to `repository.SaveBlob`. Without the method this would not be possible, as packs are finalized asynchronously.	2022-07-02 22:42:34 +02:00
Alexander Neumann	6c4ceaf1e7	Print number of bytes added to the repo This includes optional compression and crypto overhead.	2022-07-02 18:55:12 +02:00
Alexander Neumann	99634c0936	Return real size from SaveBlob	2022-07-02 18:55:12 +02:00
MichaelEischer	bc96879d41	Merge pull request #3785 from MichaelEischer/replace-tomb-usage Remove usage of tomb package	2022-06-19 14:42:48 +02:00
greatroar	f92ecf13c9	all: Move away from pkg/errors, easy cases github.com/pkg/errors is no longer getting updates, because Go 1.13 went with the more flexible errors.{As,Is} function. Use those instead: errors from pkg/errors already support the Unwrap interface used by 1.13 error handling. Also: * check for io.EOF with a straight ==. That value should not be wrapped, and the chunker (whose error is checked in the cases changed) does not wrap it. * Give custom Error methods pointer receivers, so there's no ambiguity when type-switching since the value type will no longer implement error. * Make restic.ErrAlreadyLocked private, and rename it to alreadyLockedError to match the stdlib convention that error type names end in Error. * Same with rest.ErrIsNotExist => rest.notExistError. * Make s3.Backend.IsAccessDenied a private function.	2022-06-14 08:36:38 +02:00
Michael Eischer	e002b09d57	archiver: free workers once finished	2022-06-05 15:48:10 +02:00
Michael Eischer	408ac1a0c2	archiver: remove tomb usage	2022-06-05 15:47:52 +02:00
greatroar	0db1d11b2e	archiver: Remove cleanup goroutine from BufferPool This isn't doing anything. Channels should get cleaned up by the GC when the last reference to them disappears, just like all other data structures. Also inlined BufferPool.Put in Buffer.Release, its only caller.	2022-05-29 17:09:16 +02:00
Michael Eischer	9ffb8920f1	repository: run blackbox tests using old and new repo version	2022-04-30 11:34:10 +02:00
Alexander Neumann	db8a958991	Merge pull request #3683 from MichaelEischer/fix-golangci-lint-warnings Fix golangci lint warnings	2022-03-29 11:45:10 +02:00
Michael Eischer	c60540b196	add go:build headers everywhere	2022-03-28 22:23:47 +02:00
Michael Eischer	d6db5a1fc2	archiver: Fix test The test relied on an undeocumented sideeffect of the LoadBlob implementation	2022-03-28 22:09:49 +02:00
Alexander Neumann	fb5d9345a7	Merge pull request #3510 from MichaelEischer/fix-archiver-early-on-abort archiver: Fix TestArchiverAbortEarlyOnError test	2021-10-16 15:37:41 +02:00
greatroar	c892c0bab9	internal/restic: Don't allocate in Tree.Insert name old time/op new time/op delta BuildTree-8 34.6µs ± 4% 7.0µs ± 3% -79.68% (p=0.000 n=18+19) name old alloc/op new alloc/op delta BuildTree-8 34.0kB ± 0% 0.9kB ± 0% -97.37% (p=0.000 n=20+20) name old allocs/op new allocs/op delta BuildTree-8 108 ± 0% 1 ± 0% -99.07% (p=0.000 n=20+20)	2021-09-26 18:08:48 +02:00
Michael Eischer	e0d615c264	archiver: Fix TestArchiverAbortEarlyOnError test This can be caused when the test has uploaded four blobs, then queues two blobs for upload which are delayed. Then a seventh file can be opened which lead to a test failure.	2021-09-12 22:17:17 +02:00
Alexander Neumann	0e5f2fff71	Merge pull request #3243 from restic/fix-scanner-overlap backup: Fix total size for overlapping targets	2021-01-30 21:17:21 +01:00
Alexander Neumann	200f09522d	Add more error checks	2021-01-30 20:02:37 +01:00
Alexander Neumann	5c617859ab	backup/scanner: Fix total size for overlapping targets Before, the scanner would could files twice if they were included in the list of backup targets twice, e.g. `restic backup foo foo/bar` would could the file `foo/bar` twice. This commit uses the tree structure from the archiver to run the scanner, so both parts see the same files.	2021-01-29 11:31:36 +01:00

1 2 3 4

174 Commits