|
Softpanorama |
May the source be with you, but remember the KISS principle ;-)
Softpanorama Search
|
| See Also | Recommended Links | Directory Syncronization | Cygwin | NetDrive | ||
| LFTP -- Scriptable FTP client | FileZilla | NcFTP | wget | Curl | Humor | Etc |
To site Everything Solaris (which provides one of the best explanation of how to use it on Solaris) Rsync is a wonderful little utility that's amazingly easy to set up on your machines.
Rsync can serve as an alternative to rcp, scp, scripted FTP sessions. Tools like NFS mounting and FTP NetDrive convert the problem in local Directory Syncronization problem and this allow regular directory synchronization tools to be used. It can be used as local utility but typical is used in client server mode: one instance of rsync is running as a server (with --daemon option) and the other as a client.
Rsync make sense mostly on large files and large directory trees -- in this case advantage of rsync is that it copies only the diffs of files that have actually changed, compressed and through ssh if you want to for security. Otherwise the tool is an overkill
Features:
Rsync is often used it to synchronize Website trees from staging to production servers and to backup key areas of the filesystems both automatically through cron and by a CGI script. Here are some other key features of rsync:
In most primitive way rsync can be used as an analog of xcopy in Linux world: the tool for copying one tree of directories to another location with the ability to "flatten" the target.
Tip: the -z option on rsync doesn’t make sense for local file transfers, it slows things down.
Joe 'Zonker' Brockmeier explained the basic functionality of rsync in his Linux.com paper Back up like an expert with rsync in the following way:
Getting to know rsync
The basic syntax for rsync is simple enough -- just run
rsync [options] source destinationto copy the file or files provided as thesourceargument to thedestination.So, for example, if you want to copy some files under your home directory to a USB storage device, you might use
rsync -a /home/user/dir/ /media/disk/dir/. By the way, "/home/user/dir/" and "/home/usr/dir" are not the same thing to rsync. Without the final slash, rsync will copy the directory in its entirety. With the trailing slash, it will copy the contents of the directory but won't recreate the directory. If you're trying to replicate a directory structure with rsync, you should omit the trailing slash -- for instance, if you're mirroring /var/www on another machine or something like that.In this example, I included the archive option (
-a), which actually combines several rsync options. It combines the recursive and copy symlinks options, preserves group and owner, and generally makes rsync suitable for making archive copies. Note that it doesn't preserve hardlinks; if you want to preserve them, you will need to add the hardlinks option (-H).Another option you'll probably want to use most of the time is verbose (
-v), which tells rsync to report lots of information about what it's doing. You can double and triple up on this option -- so using-vwill give you some information, using-vvwill give more, and using-vvvwill tell you everything that rsync is doing.rsync will move hidden files (files whose names begin with a .) without any special options. If you want to exclude hidden files, you can use the option
--exclude=".*/". You can also use the--excludeoption to prevent copying things like Vim's swap files (.swp) and automatic backups (.bak) created by some programs.Be very careful with the delete option
with it, you can whack a bunch of files without meaning to. In fact, while you're getting used to rsync, it's probably a good idea to use the --dry-run option with your commands to run through the transfer first, without actually copying or synching files. If you do start an rsync transfer and realize that you've botched the command in some way that might result in the destruction of data, press
Ctrl-cimmediately to terminate the transfer. Some files may be gone, but you may be able to save the rest.
Initiating transfere
Using Rsync Itself
http://www.humbug.org.au/talks/rsync/index.htmlNow on to actually using, or initiating an rsync transfer with rsync itself. It's the same binary as the daemon, just without the "--daemon" flag. It's simplicity is a virtue. I'll start with a commandline that I use in a script to synchronize a Web tree below.
rsync --verbose --progress --stats --compress --rsh=/usr/local/bin/ssh \ --recursive --times --perms --links --delete \ --exclude "*bak" --exclude "*~" \ /www/* webserver:simple_path_name
Example Usage
Copy files on the local host
rsync /home/henry/Kernel/linux-2.2.0.tar.gz /usr/src/linux.tgz
linux-2.2.0.tar.gzwill be copied from the user's directory to/usr/src, and renamed tolinux.tgz. In effect, this behaves exactly likecp, except the destination file would not be mindlessly overwritten; only portions of the source file that needed to be copied would be "transmitted".Copy files from a remote host to the local host
rsync -a linus@transmeta.com:/home/linus/linux-3.0.0/ /pub/ftp/linux-kernel/v3.0.0/In this example, a remote shell would be started on transmeta.com, under the username "linus". Once the connection had been established (depending on the transport agent used, there may be a prompt for a password or passphrase), the [CHECK THIS] contents of linus's
linux-3.0.0directory would be synchronised with the contents of the public FTP server directory on the local host.The
-aoption means the same thing as it does tocp---rsyncwill run in "archive" mode, which preserves permissions, file ownership, and recurses directories.Mirror a module from a remote host
rsync mirror.aarnet.edu.au:: rsync mirror.aarnet.edu.au::redhat /home/ftp/pub/redhat rsync mirror.aarnet.edu.au::debian/dists/potato /home/ftp/debian/potatoThe first of these three commands lists the modules available for mirroring on
mirror.aarnet.edu.au. After viewing the list, and deciding which modules to retrieve, the next two commands perform some transfers. The first will retrieve all of the Red Hat archive, whereas the second will only retrieve the "potato" part of the Debian archive.
rsynccommand-line switches and optionsFor a serious discussion of
rsync's options, you should turn to theREADMEfile that came with the distribution (found under/usr/doc/rsyncon Debian and Red Hat) or (preferably) to the manual page.
rsyncsupports both long and short options for most flags. Short argument forms are given first, if they exist, with long GNU-style arguments following.
By far the most useful option is
-a (--archive). This acts like the corresponding option tocp;rsyncwill:
recurse subdirectories (
-r);recreate soft links (
-l);preserve permissions (
-p);preserve timestamps (
-t);attempt to copy devices (if invoked as root) (
-D);preserve group information (
-g) (and userid, if invoked as root) (-o).The
-H (--hard-links)option is not implied by-a.
-b (--backup)will force backups to be made; the default extension is a tilde, "~". Overridden with--suffix=SUFFIX.
-S (--sparse)asksrsyncto copy sparse files efficiently.
-x (--one-file-system): Don't cross filesystems. Useful if you've got-aturned on!
-e (-rsh=COMMAND)use COMMAND instead ofrsh. Usingrsyncwithout this option to specifysshcan result in very confusing error messages.
--daemonrun in "daemon" mode, where clients may connect to receive (parts of) pre-defined "modules".
--config=FILEFILE is used as the config file for a daemon mode instead of/etc/rsyncd.conf.
rsyncd.confuid = root gid = root exclude = tmp/ exclude = temp/ [home] path = /export/home comment = All home directories read only = true hosts allow = trustedhost.plugged.org.auThis configuration file specifies an rsync daemon that runs as root, and offers /export/home to any client from
trustedhost.plugged.org.au.The other options ensure that directories named "tmp" or "temp" are never transferred, and that clients have read-only access to the server.
|
|||||||
developerWorksMuch like
Listing 1. Copy the contents of a directory verbatimcp,rsynccopies files from a source to a destination. Unlikecp, the source and destination of anrsyncoperation can be local or remote. For instance, the command in Listing 1 copies the directory /tmp/photos and its entire contents verbatim to a home directory.
$ rsync -n -av /tmp/photos ~ building file list ... done photos/ photos/Photo 2.jpg photos/Photo 3.jpg photos/Photo 6.jpg photos/Photo 9.jpg sent 218 bytes received 56 bytes 548.00 bytes/sec total size is 375409 speedup is 1370.11
The-voption enables verbose messages. The-aoption (where a stands for archive), is a shorthand for-rlptgoD(recurse, copy symbolic links as symbolic links, preserve permissions, preserve file times, preserve group, preserve owner, and preserve devices and special files, respectively). Typically,-amirrors files; exceptions occur when the destination cannot or does not support the same attributes. For example, copying a directory from UNIX to Windows® does not map perfectly. Some suggestions for unusual cases appear below.
rsynchas a lot of options. If you worry that your options or source or destination specifications are incorrect, use-nto perform a dry run. A dry run previews what will happen to each file but does not move a single byte. When you are confident of all the settings, drop the-nand proceed.Listing 2 provides an example where
-nis invaluable. The command in Listing 1 and the following command yield different results.
Listing 2. Copy the contents of a named directory
$ rsync -av /tmp/photos/ ~ ./ Photo 2.jpg Photo 3.jpg Photo 6.jpg Photo 9.jpg sent 210 bytes received 56 bytes 532.00 bytes/sec total size is 375409 speedup is 1411.31
What is the difference? The difference is the trailing slash on the source argument. If the source has a trailing slash, the contents of the named directory but not the directory itself are copied. A slash on the end of the destination is immaterial.
And Listing 3 provides an example of moving the same directory to another system.
Listing 3. Move a directory to a
$ rsync -av /tmp/photos example.com:album created directory album Photo 2.jpg Photo 3.jpg Photo 6.jpg Photo 9.jpg sent 210 bytes received 56 bytes 21.28 bytes/sec total size is 375409 speedup is 1411.31
Assuming that you have the same login name on the remote machine,
rsyncprompts you with a password and, given the proper credential, creates the directory album and copies the images to that directory. By default,rsyncuses Secure Shell (SSH) as its transport mechanism; you can reuse your machine aliases and public keys withrsync.The examples in Listing 2 and Listing 3 demonstrate two of
rsync's four modes. The first example was shell mode, also dubbed local mode. The second sample was remote shell mode and is so named because SSH powers the underlying connection and transfers.rsynchas two additional modes. List mode acts likels: It lists the contents of source, as shown in Listing 4.
Listing 4. List the contents of a source
$ drwxr-xr-x 238 2009/08/22 18:49:50 photos -rw-r--r-- 6148 2008/07/03 01:36:18 photos/.DS_Store -rw-r--r-- 71202 2008/06/18 04:51:36 photos/Photo 2.jpg -rw-r--r-- 69632 2008/06/18 04:51:45 photos/Photo 3.jpg -rw-r--r-- 61046 2008/07/14 00:31:17 photos/Photo 6.jpg -rw-r--r-- 167381 2008/07/14 00:31:56 photos/Photo 9.jpg
The fourth mode is server mode. Here, thersyncdaemon runs perennially on a machine, accepting requests to transfer files. A transfer can send files to the daemon or request files from it. Server mode is ideal for creating a central backup server or project repository.To differentiate between remote shell mode and server mode, the latter employs two colons (
:) in the source and destination names. Assuming that whatever.example.com exists, the next command copies files from the source to a local destination:
$ rsync -av whatever.example.com::src /tmp
And what exactly issrc? It's anrsyncmodule that you define and configure on the daemon's host. A module has a name, a path that contains its files, and some other parameters, such asread only, which protects the contents from modification.To run an
rsyncdaemon, type:
$ sudo rsync --daemon
Running the
rsyncdaemon as the superuser, root, is not strictly necessary, but the practice protects other files on your machine. Running as root,rsyncrestricts itself to the module's directory hierarchy (its path) usingchroot. After achroot, all other files and directories seem to vanish. If you choose to run thersyncdaemon with your own privileges, choose an unused socket and make sure its modules have sufficient permissions to allow download and/or upload. Listing 5 shows a minimal configuration to share some files in your home directory without the need forsudo. The configuration is stored in file rsyncd.conf.
Listing 5. Simple configuration for sharing files
motd file = /home/strike/rsyncd/rsync.motd_file pid file = /home/strike/rsyncd/rsyncd.pid port = 7777 use chroot = no [demo] path = /home/strike comment = Martin home directory list = no [dropbox] path = /home/strike/public/dropbox comment = A place to leave things for Martin read only = no [pickup] path = /home/strike/public/pickup comment = Get your files here!
The file has two segments. The first segment—here, the first four lines—configures the operation of thersyncdaemon. (Other options are available, too.) The first line points to a file with a friendly message to identify your server. The second line points to another file to record the process ID of the server. This is a convenience in the event you must manually kill thersyncdaemon:
kill -INT `cat /home/strike/rsyncd/rsyncd.pid`
The two files are in a home directory, because this example does not use superuser privileges to run the software. Similarly, the port chosen for the daemon is above 1000, which users can claim for any application. The fourth line turns off
chroot.The remaining segment is subdivided into small sections, one section per module. Each section, in turn, has a header line and a list of (key-value) pairs to set options for each module. By default, all modules are read only; set
read only = noto allow Write operations. Also by default, all modules are listed in the module catalog; setlist = noto hide the module.To start the daemon, run:
$ rsync --daemon --config=rsyncd.conf
Now, connect to the daemon from another machine, and omit a module name. You should see this:
rsync --port=7777 mymachine.example.com:: Hello! Welcome to Martin's rsync server. dropbox A place to leave things for Martin pickup Get your files here!
If you do not name a module after the colons (
::), the daemon responds with a list of available modules. If you name a module but do not name a specific file or directory within the module, the daemon provides a catalog of the module's contents, as shown in Listing 6.
Listing 6. Catalog output of a module's contents
rsync --port=7777 mymachine.example.com::pickup Hello! Welcome to Martin's rsync server. drwxr-xr-x 4096 2009/08/23 08:56:19 . -rw-r--r-- 0 2009/08/23 08:56:19 article21.html -rw-r--r-- 0 2009/08/23 08:56:19 design.txt -rw-r--r-- 0 2009/08/23 08:56:19 figure1.png
And naming a module and a file copies the file locally, as shown in Listing 7.
Listing 7. Name a module to copy files locally
rsync --port=7777 mymachine.example.com::pickup/ Hello! Welcome to Martin's rsync server. drwxr-xr-x 4096 2009/08/23 08:56:19 . -rw-r--r-- 0 2009/08/23 08:56:19 article21.html -rw-r--r-- 0 2009/08/23 08:56:19 design.txt -rw-r--r-- 0 2009/08/23 08:56:19 figure1.png
You can also perform an upload by reversing the source and destination, then pointing to the module for writes, as shown in Listing 8.
Listing 8. Reverse source and destination directories
$ rsync -v --port=7777 application.js mymachine.example.com::dropbox Hello! Welcome to Martin's rsync server. application.js sent 245 bytes received 38 bytes 113.20 bytes/sec total size is 164 speedup is 0.58
That's a quick but thorough review. Next, let's see how you can applyrsyncto daily tasks.rsyncis especially useful for backups. And because it can synchronize a local file with its remote counterpart—and can do that for an entire file system, too—it's ideal for managing large clusters of machines that must be (at least partially) identical.Performing backups on a frequent basis is a critical but typically ignored chore. Perhaps it's the demands of running a lengthy backup each day or the need to have large external media to store files; never mind the excuse, copying data somewhere for safekeeping should be an everyday practice.
To make the task painless, use
rsyncand point to a remote server—perhaps one that your service provider hosts and backs up. Each of your UNIX machines can use the same technique, and it's ideal for keeping the data on your laptop safe.Establish SSH keys and an
rsyncdaemon on the remote machine, and create a backup module to permit writes. Once established, runrsyncto create a daily backup that takes hardly any space, as shown in Listing 9.
Listing 9. Create daily backups
#!/bin/sh # This script based on work by Michael Jakl (jakl.michael AT gmail DOTCOM) and used # with express permission. HOST=mymachine.example.com SOURCE=$HOME PATHTOBACKUP=home-backup date=`date "+%Y-%m-%dT%H:%M:%S"` rsync -az --link-dest=$PATHTOBACKUP/current $SOURCE $HOST:PATHTOBACKUP/back-$date ssh $HOST "rm $PATHTOBACKUP/current && ln -s back-$date $PATHTOBACKUP/current"
ReplaceHOSTwith the name of your backup host andSOURCEwith the directory you want to save. ChangePATHTOBACKUPto the name of your module. (You can also embed the three final lines of the script in a loop, dynamically changeSOURCE, and back up a series of separate directories on the same system.) Here's how the backup works:
- To begin,
dateis set to the current date and time and yields a string like2009-08-23T12:32:18, which identifies the backup uniquely.- The
rsynccommand performs the heavy lifting.-azpreserves all file information and compresses the transfers. The magic lies in--link-dest=$PATHTOBACKUP/current, which specifies that if a file has not changed, do not copy it to the new backup. Instead, create a hard link from the new backup to the same file in the existing backup. In other words, the new backup only contains files that have changed; the rest are links.More specifically (and expanding all variables),
mymachine.example.com::home-backup/currentis the current archive. The new archive for /home/strike is targeted tomymachine.example.com::home-backup/back-2009-08-23T12:32:18. If a file in /home/strike has not changed, the file is represented in the new backup by a hard link to the current archive. Otherwise, the new file is copied to the new archive.If you touch but a few files or perhaps a handful of directories each day, the additional space required for what is effectively a full backup is paltry. Moreover, because each daily backup (except the very first) is so small, you can keep a long history of the files on hand.
- The last step is to alter the organization of the backups on the remote machine to promote the newly created archive to be the current archive, thereby minimizing the differences to record the next time this script runs. The last command removes the current archive (which is merely a symbolic link) and recreates the same symbolic link pointing to the new archive.
Keep in mind that a hard link to a hard link points to the same file. Hard links are very cheap to create and maintain, so a full backup is simulated using only an incremental scheme.
Sys Admin v16, i02
Distributing software packages to all of our servers is a tedious task. Currently, a release manager makes a connection to each server and transfers files using ftp. This involves entering passwords multiple times, waiting for transfers to complete, changing directories, and keeping files organized. We developed a shell script, distribute_release ( Listing 1 ), that makes the job easier.
Our script has some advantages over the ftp process:
- Directory trees can be used to organize release modules.
- A distribution network defines how files are transferred from server to server.
- When a release module is ready to be distributed, it is replicated to all of the servers in the network using rsync, which helps minimize network traffic.
- Various authentication methods can be used to avoid entering passwords for each server.
We’ll describe the directory structures including creating the distribution network. Then we’ll talk about the scripts. Finally, we’ll discuss an example.
Directory Structures
Each release module is stored in the directory /var/spool/pkg/release/[module]/. A module directory can be flat, or it can contain subdirectories. Hidden directory trees under the ./release/ directory define the distribution network. Therefore, the names of these directories cannot be used as module names.
Transport protocols supported by distribute_release include nfs, rsh, and ssh. If a release module is distributed using nfs, then the directory /var/spool/pkg/release/.nfs/[module]/ contains symbolic links corresponding to the hosts in the server’s distribution network:
/var/spool/pkg/release/.nfs/[module]/[host] -> \ /net/[host]/var/spool/pkg/release/When using nfs, rsync functions like an improved copy command, transferring files between the directories /var/spool/pkg/release/[module]/ and /var/spool/pkg/release/.nfs/[module]/[host]/[module].When using rsh or ssh, the directory structures are similar. With rsh, for example, empty files of the form /var/spool/pkg/release/.rsh/[module]/[host] define the hosts in the distribution network.
The Scripts
Before distribute_release can be called, the directory structures and the distribution network must be created. The script create_distribution ( Listing 2 ) facilitates these tasks.
One argument, the name of a release module, must be passed to create_distribution. When no options are used, the local host functions as a terminal node in the distribution network. In other words, the system may receive updates from another host, but it will not propagate those updates to downstream hosts. Downstream hosts and transport protocols may be specified with the -h and -t options respectively.
When using distribute_release, the name of a release module must be passed to the script. The -q and -v options may be used to control the amount of information displayed to the user. Hosts to be included or excluded from the distribution may be specified using the -i and -e options. The -r option may be used to determine how many times the program will recursively call itself to distribute the module to successive levels in a distribution hierarchy. When using nfs, the recursive calls are made locally. With rsh and ssh, the program calls itself on a remote server.
Distribute_release first gets the argument and any command-line options. Then, for each transport protocol, the script builds a distribution list and executes the appropriate rsync command for each host in the list. If a recursion depth is specified, then another instance of distribute_release is executed in a detached screen session, allowing the parent instance to continue running while the child processes propagate the module to other hosts.
An Example
Our example network (see Figure 1 ) contains five servers -- bambi, pongo, pluto, nemo, and goofy. One of the release modules is named TS1 (located on bambi) and the module is named TS2 (located on pluto). By executing the create_distributions script ( Listing 3 ) on each server, the complete distribution network for both modules is built using the proper create_distribution calls.
Consider the TS1 release module; after the module has been distributed to all of the systems in the network, the directory /var/spool/pkg/release/TS1/ contains the following files and subdirectories:
./README ./v1/TS1-v1.pkg ./v2/TS1-v2.pkg ./beta/TS1-v3.pkgOn bambi, the directory /var/spool/pkg/release/.ssh/TS1/ contains a file named pongo. So, executing "distribute_release TS1” on bambi synchronizes the TS1 module with pongo using ssh as the transport protocol. The TS1 module can be distributed from pongo to all servers in the network using the -r option:distribute_release -r 2 TS1When using ssh, passwords can be avoided by using public/private key pairs with empty passphrases. When using rsh, you can update /etc/hosts.equiv or the appropriate .rhosts file. Obviously, passwords are not an issue with nfs. Deciding which protocol to use depends on security concerns, potential performance issues, and configuration complexity.John Spurgeon is a software developer and systems administrator for Intel’s Factory Information Control Systems, IFICS, in Hillsboro, Oregon. He is currently preparing to ride a single-speed bicycle in Race Across America in 2007.
Ed Schaefer is a frequent contributor to Sys Admin. He is a software developer and DBA for Intel’s Factory Information Control Systems, IFICS, in Hillsboro, Oregon. Ed also hosts the monthly Shell Corner column on UnixReview.com. He can be reached at: shellcorner@comcast.net.
Apr 14, 2009 | developerworks
If you work with both a laptop and a desktop computer, you know you have to synchronize the machines to keep them up to date. In addition, you probably want to run the synchronization not only at your home but also from a remote site; in my case, whenever I travel with my laptop, I make sure that whatever I do on it gets backed up to my desktop computer. (Losing your laptop and thereby losing all your work isn't nice at all!) Many solutions to this problem exist: This article introduces one such tool—
rsync—and mentions several related tools, all of which provide easy synchronization procedures.
Rsync::Config is a module who can be used to create rsync configuration files. A configuration file (from Rsync::Config point of view) is made by atoms and modules with atoms. A atom is the smallest piece from the configuration file. This module inherits from Rsync::Config::Module .
What is Rsync?Rsync is a very useful alternative to rcp written by Andrew Tridgell and Paul Mackerras. This tool lets you copy files and directories between a local host and a remote host (source and destination can also be local if you need.) The main advantage of using Rsync instead of rcp, is that rsync can use SSH as a secure channel, send/receive only the bytes inside files that changed since the last replication, and remove files on the destination host if those files were deleted on the source host to keep both hosts in sync. In addition to using rcp/ssh for transport, you can also use Rsync itself, in which case you will connect to TCP port 873.
Whether you rely on SSH or use Rsync explicitely, Rsync still needs to be installed on both hosts. A Win32 port is available if you need, so you can have either one of the host or both be NT hosts. Rsync's web site has some good infos and links. There is also an HOWTO.
Configuring /etc/rsyncd.conf
Being co-written by Andrew Tridgell, author of Samba, it's no surprise that Rsync's configuration file looks just like Samba (and Windows' :-), and that Rsync lets you create projects that look like shared directories under Samba. Accessing remote resources through this indirect channel offers more independence, as it lets you move files on the source Rsync server without changing anything on the destination host.
Any parameters listed before any [module] section are global, default parameters.
Each module is a symbolic name for a directory on the local host. Here's an example:
#/etc/rsyncd.conf
secrets file = /etc/rsyncd.secrets
motd file = /etc/rsyncd.motd #Below are actually defaults, but to be on the safe side...
read only = yes
list = yes
uid = nobody
gid = nobody
[out]
comment = Great stuff from remote.acme.com
path = /home/rsync/out
[confidential]
comment = For your eyes only
path = /home/rsync/secret-out
auth users = joe,jane
hosts allow = *.acme.com
hosts deny = *
list = false
Note: Rsync will not grant access to a protected share if the password file (/etc/rsyncd.secrets, here) is world-readable.
Running RSYNCd
Per the manual page:
The rsync daemon is launched by specifying the --daemon option to rsync. You can launch it either via inetd or as a stand-alone daemon. When run via inetd you should add a line like this to /etc/services:
rsync 873/tcp
... and a single line something like this to /etc/inetd.conf:
rsync stream tcp nowait root /usr/bin/rsync rsyncd --daemon
You will then need to send inetd a HUP signal to tell it to reread its config file. Note that you should not send the rsync server a HUP signal to force it to reread the /etc/rsyncd.conf. The file is re-read on each client connection.
Per the HOWTO:
The rsync daemon is robust, so it is safe to launch it as a stand-alone server. The code that loops waiting for requests is only a few lines long then it forks a new copy. If the forked process dies then it doesn't harm the main daemon.
The big advantage of running as a daemon will come when the planned directory cache system is implemented. The caching system will probably only be enable when running as a daemon. For this reason, busy sites is recommended to run rsync as a daemon. Also, the daemon mode makes it easy to limit the number of concurrent connections.Since it's not included in the 2.4.3 RPM package, here's the init script to be copied as /etc/rc.d/init.d/rsyncd with symlinks to /etc/rc.d/rc3.d:
#!/bin/sh
# Rsyncd This shell script takes care of starting and stopping the rsync daemon
# description: Rsync is an awesome replication tool.
# Source function library.
. /etc/rc.d/init.d/functions
[ -f /usr/bin/rsync ] || exit 0
case "$1" in
start)
action "Starting rsyncd: " /usr/bin/rsync --daemon
;;
stop)
action "Stopping rsyncd: " killall rsync
;;
*)
echo "Usage: rsyncd {start|stop}"
exit 1
esac
exit 0
Here's an example under Linux on how to set up a replication through SSH:
rsync -avz -e ssh rsync@remote.acme.com:/home/rsync/out/ /home/rsync/from_remote
An important thing here, is that the presence or absence of a trailing "/" in the source directory determines whether the directory itself is copied, or simply the contents of this source directory.
In other words, the above means that the local host must have a directory available (here, /home/rsync/from_remote to receive the contents of /home/rsync/out sitting on the remote host, otherwise Rsync will happily download all files into the path given as destination without asking for confirmation, and you could end up with a big mess.
On the other hand, rsync -avz -e ssh rsync@remote.acme.com:/home/rsync/out /home/rsync/from_remote means that the an "out" sub-directory is first created under /home/rsync/from_remote on the destination host, and will be populated with the contents of the remote directory ./out. In this case, files will be save on the local host in /home/rsync/from_remote/out, so the former commands looks like a better choice.
Here's how to replicate an Rsync share from a remote host:
rsync -avz rsync@remote.acme.com::out /home/rsync/in
Notice that we do not use a path to give the source resource, but instead just a name ("out"), and that we use :: to separate the server's name and the resource it offers. In the Rsync configuration that we'll see just below, this is shown as a [out] section. This way, admins on remote.acme.com can move files on their server; As long as they remember to update the actual path in the [out] section (eg. PATH=/home/rsync/out to PATH=/home/outgoing), remote Rsync users are not affected.
An Rsync server displays the list of available anonymous shares through rsync remote.acme.com::. Note the ::. For added security, it is possible to prompt for a password when listing private shares, so that only authorized remote users know about the Rsync shares available from your server.
Any NT version?
The NT port only requires the latest and greatest RSYNCx.EXE and Cygnus' CYGWIN1.DLL. The easiest is to keep both in the same directory, but the DLL can be located in any directory found in your PATH environment variable.
Robert Scholte's excellent tutorial on using the NT port of Rsync can be found here.
Instructions on how to install rsync as an NT service are here.
Here's an example based on the sample above:
C:\Rsync>rsync243 -avz joe@linux.acme.com::confidential ./confidential
Password:
receiving file list ... done
./
./
wrote 109 bytes read 123 bytes 66.29 bytes/sec
total size is 0 speedup is 0.00Useful command-line switches
-v, --verbose increase verbosity
-q, --quiet decrease verbosity
-c, --checksum always checksum
-a, --archive archive mode. It is a quick way of saying you want recursion and want to preserve everything.
-r, --recursive recurse into directories
-R, --relative use relative path names
-u, --update update only (don't overwrite newer files)
-t, --times preserve times
-n, --dry-run show what would have been transferred
-W, --whole-file copy whole files, no incremental checks
-I, --ignore-times Normally rsync will skip any files that are already the same length and have the same time-stamp. This option turns off this behavior.
--existing only update files that already exist
--delete delete files that don't exist on the sending side
--delete-after delete after transferring, not before
--force force deletion of directories even if not empty
-c, --checksum always checksum
--size-only only use file size when determining if a file should be transferred
--progress show progress during transfer
-z, --compress compress file data
--exclude=PATTERN exclude files matching PATTERN
--daemon run as a rsync daemon
--password-file=FILE get password from FILEResources
- http://members.ozemail.com.au/~msteveb/rsync/
- http://www.ccp14.ac.uk/ccp14admin/rsync/
- http://optics.ph.unimelb.edu.au/help/rsync/
- http://sunsite.dk/SunSITE/guides/rsync/
- http://sunsite.dk/SunSITE/guides/rsync/rsync-mirroring02.html
- ftp://ftp.minn.net/usr/mmchen/
- http://www.uwsg.indiana.edu/security/rsync.html
- http://filewatcher.org/sec/rsync.html
- http://freeos.com/printer.php?entryID=4042
- A Tutorial on Using rsync
- Rsnapshot (higher-level backup utility based on rsync, improved ease-of-use, allows you to keep multiple snapshots in time of your data, local or remote)
In the last two months I've been traveling a lot. During the same period my main desktop computer went belly up. I would have been in trouble without rsync at my disposal -- but thanks to my regular use of this utility, my data (or most of it, anyway) was already copied offsite just waiting to be used. It takes a little time to become familiar with rsync, but once you are, you should be able to handle most of your backup needs with just a short script.What's so great about rsync? First, it's designed to speed up file transfer by copying the differences between two files rather than copying an entire file every time. For example, when I'm writing this article, I can make a copy via rsync now and then another copy later. The second (and third, fourth, fifth, etc.) time I copy the file, rsync copies the differences only. That takes far less time, which is especially important when you're doing something like copying a whole directory offsite for daily backup. The first time may take a long time, but the next will only take a few minutes (assuming you don't change that much in the directory on a daily basis).
Another benefit is that rsync can preserve permissions and ownership information, copy symbolic links, and generally is designed to intelligently handle your files.
You shouldn't need to do anything to get rsync installed -- it should be available on almost any Linux distribution by default. If it's not, you should be able to install it from your distribution's package repositories. You will need rsync on both machines if you're copying data to a remote system, of course.
When you're using it to copy files to another host, the rsync utility typically works over a remote shell, such as Secure Shell (SSH) or Remote Shell (RSH). We'll work with SSH in the following examples, because RSH is not secure and you probably don't want to be copying your data using it. It's also possible to connect to a remote host using an rsync daemon, but since SSH is practically ubiquitous these days, there's no need to bother.
2006-09-20I hate making backups by hand. It costs a lot of time and usually I have far better things to do. Long ago (in the Windows 98 era) I made backups to CD only before I needed to reïnstall the OS, which was about once every 18 months, and my code projects maybe twice as often. A lot has changed since those dark times though. My single PC expanded into a network with multiple desktops and a server, I installed a mix of Debian an Ubuntu and ditched Windows, and I have a nice broadband link - just as my friends do. Finally a lazy git like me can set up a decent backup system that takes care of itself, leaving me time to do the "better" things (such as writing about it :-)
There are already quite a few tutorials on the internet explaining various ways to backup your Linux system using built-in commands and a script of some sorts, but I could not find one that suited me so I decided to write another one - one that takes care of backing up my entire network.
rsnapshot is a filesystem snapshot utility based on rsync. It makes it easy to make periodic snapshots of local machines, and remote machines over ssh. It uses hard links whenever possible, to greatly reduce the disk space required.
Warsync (Wrapper Around Rsync) is a server replication system mainly used to sync servers in LVS clusters. It is based on rsync over ssh and has native support for Debian package synchronization.
Advanced applications of rsync by Martin Streicher
Rsync algorithm technical report
Linux.com :: Back up like an expert with rsyncLast modified: October 19, 2009