<i>The</i> key feature of Time Machine is hard links to directories -- which is only possible on modern HFS+ (and rsync doesn't even try). Some people like the UI too, of course.<p>Without hard-linked directories, a full --link-dest backup of a decent-sized disk, with zero file changes from the previous pass, can easily consume 100MB (and take 45 minutes to perform).<p>This disk consumption might seem insignificant today, but that's 2.4GB per day if you run a standard Time Machine-equivalent backup schedule. Of course you might not choose to do that, because the previous hour's backup would only finish 15 minutes before the next one started, which is insane.<p>These numbers are from direct experience with a 2TB, approximately 60% utilized source drive.<p>That said, I use rsync, not Time Machine, for my OS X backups. You'll want a few additional switches for HFS+, and if your target drive is also HFS+, make sure you turn OFF "ignore ownership on this volume" in the Finder... but the script posted here has the right general idea. Somewhere on my project list is adding directory hardlinking to rsync.
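For context, the --link-dest scheme being measured here works roughly like this -- a minimal sketch, not the posted script, with hypothetical paths:<p><pre><code>#!/bin/sh
# Hypothetical source and backup locations; adjust for your setup.
SRC="$HOME/Documents/"
DEST="/Volumes/Backup"
NEW="$DEST/$(date +%Y-%m-%d-%H%M%S)"

# Find the most recent snapshot, if any.
LATEST=$(ls -1d "$DEST"/20* 2>/dev/null | tail -1)

if [ -n "$LATEST" ]; then
    # Unchanged files become hard links into the previous snapshot;
    # only changed files are copied. The per-file metadata work is
    # what eats the time and space described above.
    rsync -a --link-dest="$LATEST" "$SRC" "$NEW"
else
    # First pass: plain full copy.
    rsync -a "$SRC" "$NEW"
fi
</code></pre><p>Every snapshot directory looks like a full copy, but unchanged files share inodes with the previous one.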
When is this trend of "like Time Machine" backup software going to stop?<p>As can be seen here <a href="http://www.apple.com/support/timemachine/" rel="nofollow">http://www.apple.com/support/timemachine/</a>, Time Machine is tightly integrated into the OS and provides a self-defining interface and user experience.<p>This GitHub page is for a wrapper shell script around rsync, which is <i>not</i> like Time Machine.<p>This has been going on for a while now; see TimeVault (<a href="https://wiki.ubuntu.com/TimeVault" rel="nofollow">https://wiki.ubuntu.com/TimeVault</a> ), Back In Time (<a href="http://backintime.le-web.org/" rel="nofollow">http://backintime.le-web.org/</a> ) or FlyBack (<a href="http://www.flyback-project.org/" rel="nofollow">http://www.flyback-project.org/</a> ).<p>Please stop telling us your backup solution is like Time Machine when it lacks the kind of UX Time Machine offers, thanks!
This sounds a lot like rdiff-backup, which uses rsync and hard links to provide incremental backups:<p><a href="http://rdiff-backup.nongnu.org/features.html" rel="nofollow">http://rdiff-backup.nongnu.org/features.html</a>
This is really cool. For me, most of my data is "in the cloud" now. Important and frequently-accessed project files are in Dropbox, code is on GitHub, and most of my music is either streamed from the web in Spotify or my actual library is stored/streamed via iTunes Match. Because of this, I actually don't have a considerable amount of data to keep backed up, and a lightweight non-Time-Machine solution like this looks perfect. I tried rsync before but never got into a solid routine.<p>I've become obsessed with the "12factor" app approach everywhere in my digital life, so that if a device ever disappeared, was stolen, or died, I could get a replacement fully operational without any problems. Like an app server dying: just launch a new one and it will bootstrap itself.<p>I'm kinda crazy with my new "homelab" and started it off with an old 2U PowerEdge I got on eBay for about $200. It's cheaper than a Synology/Drobo, has room for 6 drives, and the dual quad-core processors + 16GB of RAM are pretty cool too. It's running FreeNAS right now in a VM with a 3TB ZFS pool. I've created an AFP share that appears to my Mac as Time Machine, and over my gigabit network it does a pretty fast backup. I really like the FreeNAS software. It's open-source, runs on FreeBSD, and the UI/admin tool is built in Django.
This looks great, doesn't seem to need anything other than rsync installed.<p>You should also check out rsnapshot: <a href="http://www.rsnapshot.org/" rel="nofollow">http://www.rsnapshot.org/</a>, it does a great job and has many of the features that this script does.
I have written a similar script that I have used for many years now. These sorts of backups are very convenient.<p>I too started with hard links, but nowadays I prefer to format my backup disk with btrfs and use btrfs snapshots instead, though I still support hard links.<p>I prefer btrfs snapshots due to their support for COW: if I decide to play with a backup, I won't mess up all the other versions of that backup. With hard links you should never write to your existing backups.<p>Lately I added a helper script to mount remote filesystems and lock MySQL databases, but it could be easier to use.<p>Most of my code is checks to make sure I won't write somewhere I shouldn't.<p><a href="https://github.com/andmarios/mrbStudio" rel="nofollow">https://github.com/andmarios/mrbStudio</a>
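The btrfs approach can be sketched like this (mount points are hypothetical, and the snapshot command needs root plus an actual btrfs filesystem -- this is the general shape, not mrbStudio itself):<p><pre><code>#!/bin/sh
# Hypothetical layout: /mnt/backup is a btrfs filesystem with a
# "current" subvolume that rsync keeps in sync with the source.
rsync -a --delete /home/ /mnt/backup/current/

# Freeze this state as a read-only snapshot. Thanks to COW the
# snapshot is nearly free, and later writes to "current" (or even
# mounting a snapshot read-write to experiment) can't corrupt the
# other versions -- unlike a tree of shared hard links.
btrfs subvolume snapshot -r /mnt/backup/current \
    "/mnt/backup/snapshots/$(date +%Y-%m-%d-%H%M)"
</code></pre><p>This needs root and a real btrfs volume, so treat it as illustration only.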
As quesera noted below, on a not-so-big modern disk with 500,000 files, the metadata can easily be in the 50-100MB range, which adds up to &gt;1GB per day for metadata alone (even when nothing has changed) if you back up every hour.<p>You should, however, consider bup (<a href="https://github.com/bup/bup" rel="nofollow">https://github.com/bup/bup</a>) - it takes less than a minute to figure out that nothing needs to be done, and it deduplicates <i>parts</i> of files (that is, if you have a 20GB virtual machine image and you've changed one byte in the middle of it, the next snapshot will take ~10KB, not 20GB). Older releases don't keep ownership/modification time, but there's a new version pending release soon that does.<p>It also works well remotely (through ssh), can do an integrity check (bup fsck), and supports redundancy (using par2; important after deduplication). And it has a FUSE frontend that makes it all accessible as a file system, as well as an ftp frontend.<p>bup is teh awesome.
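A typical bup cycle looks roughly like this (assuming bup is installed; directory names are placeholders):<p><pre><code>#!/bin/sh
# One-time: initialize the repository (defaults to ~/.bup).
bup init

# Index the tree, then save a snapshot under the name "home".
# The index is how bup decides in under a minute that nothing
# needs saving; unchanged chunks are never rewritten.
bup index /home/me
bup save -n home /home/me

# Verify repository integrity (add par2 redundancy with -g).
bup fsck

# Browse all snapshots as a filesystem via the FUSE frontend.
bup fuse /mnt/bup
</code></pre><p>Because deduplication is chunk-based, the one-byte change in the 20GB VM image above costs only the few chunks that actually differ.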
Missing some options for extended attributes (SELinux, ACLs) and (on OSX) resource forks.<p>Also, if you're doing multiple backups of the same data to a filesystem over time, it's worth doing 'cp -al' from the previous backup to the current backup destination, then rsync over the top of that - that way multiple copies of files which haven't changed don't take up any extra space.
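A minimal sketch of that cp -al pattern, with hypothetical paths (-A/-X cover the ACLs and xattrs mentioned above, on Linux rsync):<p><pre><code>#!/bin/sh
# Hypothetical layout: one dated directory per backup run.
PREV=/backup/2013-06-01
CUR=/backup/2013-06-02

# Hard-link every file from the last snapshot into the new one;
# this consumes directory entries, not data blocks.
cp -al "$PREV" "$CUR"

# Then rsync over the top: rsync writes changed files to a temp
# file and renames it into place, which breaks the hard link for
# just those files, so old snapshots stay intact.
rsync -aAX --delete /home/ "$CUR"/
</code></pre><p>The net effect is the same as --link-dest: each snapshot looks complete, but unchanged files cost no extra space.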
Link-backup does this as well, plus it knows how to build hard links to old backups even when directory structure or filenames change, effectively providing de-dup support. It does this by building a content-addressable index on the destination filesystem that backup trees hard-link against.<p><a href="http://www.scottlu.com/Content/Link-Backup.html" rel="nofollow">http://www.scottlu.com/Content/Link-Backup.html</a>
If you're looking for file snapshots and versioning, I've found Back In Time (<a href="http://backintime.le-web.org/" rel="nofollow">http://backintime.le-web.org/</a>) to be awesome.