[Borgbackup] Consistent unexpectedly large deduplicated size

Thomas Waldmann tw at waldmann-edv.de
Wed May 1 19:03:26 EDT 2024


> One of them has a consistent unexpectedly large deduplicated size, much larger than the sum size of the new files being added.

Besides file content data, borg also backs up file metadata.

Usually this isn't much, but in some cases it could contribute 
significantly to the overall size:

- large xattrs or ACLS changing

- files which you think are still "the same" (because that have same 
content data) have different metadata now and produce a different borg 
metadata stream (not well-deduplicating against the metadata streams of 
past backups). the usual reasons for this are: archiving atime, 
different mount point / different base dir, chown/chmod -R, different 
mtime ("touch").

- file discovery: dir tree traversal order, unstable inode numbers

- file size vs. disk usage: sparse files can be very large size and use 
little disk space.

you have excluded atimes from the backup archive, so it is not that in 
your specific case.

> Are there any ideas about how to find out?

You can check "borg debug ..." whether it has enough to analyze this.

-- 

GPG Fingerprint: 6D5B EF9A DD20 7580 5747  B70F 9F88 FB52 FAF7 B393


More information about the Borgbackup mailing list