1
0
Fork 0
mirror of https://github.com/borgbackup/borg.git synced 2025-01-03 05:35:58 +00:00
Commit graph

8693 commits

Author SHA1 Message Date
Thomas Waldmann
d30fa6f28e
implement --match-archives tags:TAG1,TAG2,...
also: reduce code duplication
2024-09-26 23:09:23 +02:00
TW
2a01d29cda
Merge pull request #8428 from ThomasWaldmann/tags-fixes
fixes for tags infrastructure PR
2024-09-26 23:02:23 +02:00
Thomas Waldmann
d5b4a696bb
fixes for tags infrastructure PR 2024-09-26 22:41:21 +02:00
TW
70977a323b
Merge pull request #8426 from ThomasWaldmann/archive-tags-metadata
tags: low-level infrastructure for archive tags
2024-09-26 21:24:06 +02:00
TW
9c9a75bdf9
Merge pull request #8421 from ThomasWaldmann/rel200b11
Release 2.0.0 beta11
2024-09-26 20:06:30 +02:00
Thomas Waldmann
89c346b935
tags: low-level infrastructure for archive tags
Read or modify this set, only add validated str to it:

Archive.tags: Optional[set[str]]

borg info [--json] <archive> displays a list of comma-separated archive tags (currently always empty).
2024-09-26 19:16:43 +02:00
TW
3c1f173738
Merge pull request #8422 from ThomasWaldmann/list-more-archive-metadata
List more archive metadata
2024-09-26 17:07:16 +02:00
Thomas Waldmann
9b3a0023c3
repo-list: add hostname/username/comment to default format, reorder, adjust
borg 1.x encouraged users to put everything into the archive name:
- name of the dataset
- timestamp (usually used to make the archive name unique)
- maybe also hostname (when backing up to same repo from multiple hosts)
- maybe also username (when backing up to same repo from multiple users)

borg2 now discourages users from putting the timestamp into the name,
because we rather want same name within a series of archives - thus,
the field width for the name can be narrower.

the ID of the archive is now the only unique identifier, thus it is
moved to the leftmost place.
256bits (64 hex digits) was a bit much and as borg can also deal with
abbreviated IDs, we only show 32bits (8 hex digits) by default.

the ID is followed by the timestamp (also quite "interesting", because
it usually differs for different archives).

then following are: archive name, user name, host name - these might be
always the same if there is only one series of archives in a repo.

use 2 blanks separating the fields for better readability.
2024-09-26 15:53:36 +02:00
Thomas Waldmann
1b680531ee
repo-list --short: only print archive IDs
Needed to change this because listing just the
archive names is pretty useless if names are not
unique.

The short list is likely mostly used by scripts to
iterate over all archives, so outputting IDs is
better.
2024-09-26 09:10:52 +02:00
Thomas Waldmann
fb496bf1b7
build_usage build_man 2024-09-26 07:57:01 +02:00
Thomas Waldmann
439024cba6
update CHANGES 2024-09-26 07:54:38 +02:00
TW
84615385ce
Merge pull request #8420 from ThomasWaldmann/catch-be-error
catch BackendError raised by Store(), see #8373
2024-09-26 07:06:12 +02:00
Thomas Waldmann
628990d32c
fix markup error in change log 2024-09-25 23:53:15 +02:00
Thomas Waldmann
c3038e129d
catch BackendError raised by Store(), see #8373 2024-09-25 23:45:27 +02:00
TW
5be2485627
Merge pull request #8412 from ThomasWaldmann/update-changes
update CHANGES
2024-09-25 23:31:21 +02:00
TW
fa7a935baa
Merge pull request #8419 from ThomasWaldmann/upgrade-pyinstaller
upgrade pyinstaller to 6.10.0, fixes #8366
2024-09-25 23:31:04 +02:00
Thomas Waldmann
328cd22a13
update CHANGES 2024-09-25 23:25:34 +02:00
Thomas Waldmann
ae85f64536
upgrade pyinstaller to 6.10.0 2024-09-25 23:21:32 +02:00
TW
656124e001
Merge pull request #8418 from ThomasWaldmann/update-requirements-locked
update development.lock.txt
2024-09-25 22:48:35 +02:00
TW
fd1a7ddb76
Merge pull request #8417 from ThomasWaldmann/ids-per-chunk
increase IDS_PER_CHUNK, fixes #6945
2024-09-25 22:48:16 +02:00
Thomas Waldmann
1f866ce6c6
update development.lock.txt 2024-09-25 21:14:42 +02:00
Thomas Waldmann
015e3a43aa
increase IDS_PER_CHUNK, fixes #6945 2024-09-25 20:57:28 +02:00
TW
67b62b5989
Merge pull request #8411 from ThomasWaldmann/optimize-repo-list-usage
bugfix: remove superfluous repository.list() call
2024-09-25 11:06:09 +02:00
Thomas Waldmann
1436bbba1a
bugfix: remove superfluous repository.list() call
Because it ended the loop only when .list() returned an
empty result, this always needed one call more than
necessary.

We can also detect that we are finished, if .list()
returns less than the limit we gave to it.

Also: reduce code duplication by using repo_lister func.
2024-09-24 23:43:08 +02:00
TW
7d02fe2b8f
Merge pull request #8403 from ThomasWaldmann/cache-chunkindex
chunks index caching
2024-09-24 23:37:29 +02:00
Thomas Waldmann
36e3d63474
chunks index caching, fixes #8397
borg compact now uses ChunkIndex (a specialized, memory-efficient data structure),
so it needs less memory now. Also, it saves that chunks index to cache/chunks in
the repository.

When the chunks index is needed, it is first tried to get it from cache/chunks.
If that fails, fall back to building the chunks index via repository.list(),
which can be rather slow and immediately cache the resulting ChunkIndex in the
repo.

borg check --repair currently just deletes the chunks cache, because it might
have deleted some invalid chunks in the repo.

cache.close now saves the chunks index to cache/chunks in repo if it
was modified.
thus, borg create will update the cached chunks index with new chunks.

cache/chunks_hash can be used to validate cache/chunks (and also to validate /
invalidate locally cached copies of that).
2024-09-24 22:25:00 +02:00
TW
1e6f71f2f5
Merge pull request #8408 from helmutg/faq-slow-fat
FAQ: Why is backing up an unmodified FAT filesystem slow on Linux?
2024-09-23 13:46:46 +02:00
Helmut Grohne
70f173caa7 FAQ: Why is backing up an unmodified FAT filesystem slow on Linux? 2024-09-23 10:36:56 +02:00
TW
527454840b
Merge pull request #8405 from ThomasWaldmann/support-rclone-borgstore
add support for rclone:// repositories (via borgstore)
2024-09-22 23:27:27 +02:00
Thomas Waldmann
bd6caf835d
add support for rclone:// repositories (via borgstore) 2024-09-22 22:26:07 +02:00
TW
4d8954ecbb
Merge pull request #8404 from ThomasWaldmann/fix-build-files-cache
cache: fix crash in _build_files_cache
2024-09-22 01:34:21 +02:00
Thomas Waldmann
e5e685fd1f
cache: fix crash in _build_files_cache 2024-09-22 00:36:30 +02:00
TW
b862f2b95f
Merge pull request #8389 from ThomasWaldmann/files-cache-from-archive
files cache improvements
2024-09-21 15:17:29 +02:00
Thomas Waldmann
ec9d412756
fix race condition with data loss potential, fixes #3536
we discard all files cache entries referring to files
with timestamps AFTER we started the backup.

so, even in case we would back up an inconsistent file
that has been changed while we backed it up, we would
not have a files cache entry for it and would fully
read/chunk/hash it again in next backup.
2024-09-21 11:34:34 +02:00
Thomas Waldmann
b60378cf0e
fix race condition with data loss potential, fixes #3536
if we detect the conditions for this (rare) race,
abort reading the file and retry.

The caller (_process_any) will do up to MAX_RETRIES
before giving up. If it gives up, a warning is logged
and the file is not written to the archive and won't
be memorized in the files cache either.

Thus, the file will be read/chunked/hashed again at
the next borg create run.
2024-09-21 11:34:31 +02:00
TW
275e5e136c
Merge pull request #8399 from ThomasWaldmann/storelocking-updates
storelocking: fixes / cleanups
2024-09-20 14:28:01 +02:00
Thomas Waldmann
31e5318e66
storelocking: fixes / cleanups
- on explicit request, update .last_refresh_dt inside _create_lock / _delete_lock
- reset .last_refresh_dt if we kill our own lock
- be more precise, have exactly the datetime of the lock in .last_refresh_dt
- cosmetic: do refresh/stale time comparisons always in the same way
2024-09-20 11:49:40 +02:00
Thomas Waldmann
c100e7b1f5
files cache: update ctime, mtime of known and "unchanged" files, fixes #4915 2024-09-20 00:44:55 +02:00
Thomas Waldmann
a891559578
files cache improvements, fixes #8385, fixes #5658
- changes to locally stored files cache:

  - store as files.<H(archive_name)>
  - user can manually control suffix via env var
  - if local files cache is not found, build from previous archive.
- enable rebuilding the files cache via loading the previous
  archive's metadata from the repo (better than starting with
  empty files cache and needing to read/chunk/hash all files).
  previous archive == same archive name, latest timestamp in repo.
- remove AdHocCache (not needed any more, slow)
- remove BORG_CACHE_IMPL, we only have one
- remove cache lock (this was blocking parallel backups to same
  repo from same machine/user).

Cache entries now have ctime AND mtime.

Note: TTL and age still needed for discarding removed files.
      But due to the separate files caches per series, the TTL
      was lowered to 2 (from 20).
2024-09-20 00:40:49 +02:00
TW
385eeeb4d5
Merge pull request #8398 from ThomasWaldmann/repo-compress-using-chunkindex
repo-compress: use chunkindex rather than repository.list()
2024-09-19 19:42:41 +02:00
Thomas Waldmann
f1a39a059e
repo-compress: use chunkindex rather than repository.list()
repository.list is slow, so rather use the chunkindex,
which might be cached in future. currently, it also uses
repository.list, but at least we can solve the problem
at one place then.
2024-09-19 18:59:03 +02:00
TW
3fd1587dd7
Merge pull request #8396 from ThomasWaldmann/storelocking-debug-logging
storelocking: add debug logging
2024-09-19 16:38:26 +02:00
Thomas Waldmann
d322889972
storelocking: avoid raising a NotLocked exception while releasing the lock while handling an exception 2024-09-19 15:15:22 +02:00
Thomas Waldmann
6a283200f2
storelocking: add debug logging 2024-09-19 15:15:20 +02:00
TW
8d37c00f7b
Merge pull request #8395 from ThomasWaldmann/msys-updates
msys2: disable SETUPTOOLS_USE_DISTUTILS=stdlib hack
2024-09-19 13:29:21 +02:00
Thomas Waldmann
4c1a0b1ca0
msys2: disable SETUPTOOLS_USE_DISTUTILS=stdlib hack
The msys2 changelog says it is only needed for setuptools < 70.2.0:

https://www.msys2.org/docs/python/#known-issues

https://setuptools.pypa.io/en/stable/history.html#v70-2-0
2024-09-19 12:36:18 +02:00
TW
11b72efffe
Merge pull request #8394 from ThomasWaldmann/list-refresh-lock-frequently
repository.list: refresh lock more frequently
2024-09-19 12:19:29 +02:00
Thomas Waldmann
2a20ebeec7
repository.list: refresh lock more frequently
under all circumstances, we must avoid that the lock
gets stale due to not being refreshed in time.

there is some internal rate limiting in _lock_refresh,
so calling it often should be no problem.
2024-09-19 11:38:49 +02:00
TW
97d1e18626
Merge pull request #8393 from ThomasWaldmann/fix-compact-nonunique-names
fixes for non-unique archive names
2024-09-19 01:52:08 +02:00
Thomas Waldmann
03b139ee53
ArchiveFormatter: fix for non-unique archive names
For Archive(), always use the archive id, not the archive name!
2024-09-19 00:58:45 +02:00