rsync compare directories?
up vote
53
down vote
favorite
Is it possible to compare two directories with rsync and only print the differences? There's a dry-run option, but when I increase verbosity to a certain level, every file compared is shown.
ls -alR
and diff
is no option here, since there are hardlinks in the source making every line different. (Of course, I could delete this column with perl.)
directory rsync file-comparison
add a comment |
up vote
53
down vote
favorite
Is it possible to compare two directories with rsync and only print the differences? There's a dry-run option, but when I increase verbosity to a certain level, every file compared is shown.
ls -alR
and diff
is no option here, since there are hardlinks in the source making every line different. (Of course, I could delete this column with perl.)
directory rsync file-comparison
Similar: serverfault.com/questions/62364/…
– reinierpost
Oct 29 '15 at 12:41
add a comment |
up vote
53
down vote
favorite
up vote
53
down vote
favorite
Is it possible to compare two directories with rsync and only print the differences? There's a dry-run option, but when I increase verbosity to a certain level, every file compared is shown.
ls -alR
and diff
is no option here, since there are hardlinks in the source making every line different. (Of course, I could delete this column with perl.)
directory rsync file-comparison
Is it possible to compare two directories with rsync and only print the differences? There's a dry-run option, but when I increase verbosity to a certain level, every file compared is shown.
ls -alR
and diff
is no option here, since there are hardlinks in the source making every line different. (Of course, I could delete this column with perl.)
directory rsync file-comparison
directory rsync file-comparison
asked Dec 1 '12 at 11:18
chris
52511017
52511017
Similar: serverfault.com/questions/62364/…
– reinierpost
Oct 29 '15 at 12:41
add a comment |
Similar: serverfault.com/questions/62364/…
– reinierpost
Oct 29 '15 at 12:41
Similar: serverfault.com/questions/62364/…
– reinierpost
Oct 29 '15 at 12:41
Similar: serverfault.com/questions/62364/…
– reinierpost
Oct 29 '15 at 12:41
add a comment |
7 Answers
7
active
oldest
votes
up vote
41
down vote
accepted
You will propably have to run something like
rsync -avun --delete
in both directions.
But what are you actually trying to accomplish?
Update:
rsync -avun --delete $TARGET $SOURCE |grep "^deleting "
will give you a list of files that do not exist in the target-directory.
"grep delet" because each line prints : deleting ..file..
rsync -avun $SOURCE $TARGET
will give you a list of "different" files (including new files).
add a comment |
up vote
42
down vote
To add to Nils's answer (for anyone coming across this via Google), by default rsync
only compares the file sizes and modification times to tell if there are any differences. (If those are different it does more, but if they're the same, it stops there.)
If you want to compare actual file contents, even for files which have the same size and last modification time, add the flag -c
to tell rsync
to compare the files using a checksum.
rsync -avnc $SOURCE $TARGET
(The -u
option tells rsync to ignore files which are newer in $TARGET
than on $SOURCE
, which you probably don't want if you're comparing contents.)
2
If you only care that the data is the same you might want to add--no-group --no-owner --no-perms --no-times
or some combination of these based on your needs.
– flungo
Sep 30 '16 at 17:31
1
@flungo, or just use a subset of the options implied by-a
instead of-a
, e.g.rsync -rlDcnv --delete $SOURCE $TARGET
– maxschlepzig
Nov 26 '16 at 22:29
Please add--delete
to list files only existing in$TARGET
– Tom Hale
Oct 31 '17 at 5:42
add a comment |
up vote
16
down vote
Just for those less familiar with rsync
:
rsync -rvnc --delete ${SOURCE}/ ${DEST}
-n
: most important bit -- do not change anything ;
-rc
: compare only the contents (otherwise use-ac
) ;
-v
: list the files )
--delete
: look for a symmetrical, not a uni-directional difference.- Finally,
/
means "look inside the directory, and compare its contents to the destination".
It will print a usual
rsync
output,
- with one <filename> on a line for every "new" file in
${SOURCE}
and one "deleting <filename>" line for each "new" file in
${DEST}
.
It may also print a few warnings, like "skipping non-regular file <filename>" for symlinks.
PS. I know it's a terrible PS -- but it was indeed added in a rush. Nevertheless, I bet one may find this useful.
PPS. Alternatively, one could also do
find $SOURCE -type f -exec md5sum {} ; | tee source.md5
find $DEST -type f -exec md5sum {} ; | tee dest.md5
If the filenames do not contain newlines, we can then sort both *.md5
files, and diff
them. ( This will work only for files, though; that is, an empty directory on either side won't be detected. )
add a comment |
up vote
2
down vote
I understand from your question that you do not want to use diff on ls, but you can also use diff recursively on directories:
diff -rq DIR1 DIR2
add a comment |
up vote
1
down vote
It took me a few tries to get this to work. Nils' answer requires that $TARGET
ends in a trailing /
, as explained by ジョージ.
Here is a version that explicitly adds the trailing /
:
rsync -avun --delete ${TARGET}/ ${SOURCE} | sed -ne 's/^deleting *//p'
This gives the list of files that exist below the ${SOURCE}
directory but not below the ${TARGET}
directory.
I use sed
here to remove the leading deleting
from the output lines, and to print only those lines.
I do not use the rsync
option -c
because comparing file contents would be much slower for my use cases, and comparing only file sizes and modification times also seems sufficient in these cases. I have no reason to suspect that my computers suffer from clock skew problems or that something maliciously changed time stamps. Also, the outcome of -c
cannot change the decision to delete a file, only the decision to update or keep a file.
I also use -u
and -a
(rather than -r
), so that I can later re-use the command line and change it to copy selected directories and files from ${SOURCE}
to ${TARGET}
, like this:
rsync -avu ${SOURCE}/{dirA,dirB,fileX} ${TARGET} # copy some files
add a comment |
up vote
1
down vote
Surprisingly no answer in 6 years uses the -i
option or gives nice output so here I'll go:
TLDR - Just show me the commands
rsync -rin --ignore-existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^[^ ]* /L /'
rsync -rin --ignore-existing "$RIGHT_DIR"/ "$LEFT_DIR"/|sed -e 's/^[^ ]* /R /'
rsync -rin --existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^/X /'
Understanding the output
Here's an example of the output:
L file-only-in-Left-dir
R file-only-in-right-dir
X >f.st...... file-with-dif-size-and-time
X .f...p..... file-with-dif-perms
Note the first character of every line:
L
/R
mean that the file/dir appears only at theL
eft orR
ight dir.
X
means that the file appears on both sides but is not the same (in which case the next 11 characters give you more info.s
,t
andp
depict differences in size, time and permissions respectively -- for more info tryman rsync
and search for--itemize-changes
).
Extra options you may wish to use
If you want to also compare the owner/group/permissions of the files add the options -o
/-g
/-p
respectively. Finally note that by default rsync considers two files the same if they have the same name, time and size. This is extremely fast and most of the times more than enough but if you want to be 100% sure add -c
to also compare the contents of files with the same name, time & size.
TLDR - Just give me a script to call
Here it is. Call it like this
diff-dirs Left_Dir Right_Dir [options]
All options mentioned above in section "Extra options you may wish to use" also apply here.
#!/bin/bash
# Compare two directories using rsync and print the differences
# CAUTION: options MUST appear after the directories
#
# SYNTAX
#---------
# diff-dirs Left_Dir Right_Dir [options]
#
# EXAMPLE OF OUTPUT
#------------------
# L file-only-in-Left-dir
# R file-only-in-right-dir
# X >f.st...... file-with-dif-size-and-time
# X .f...p..... file-with-dif-perms
#
# L / R mean that the file/dir appears only at the `L`eft or `R`ight dir.
#
# X means that a file appears on both sides but is not the same (in which
# case the next 11 characters give you more info. In most cases knowing
# that s,t,T and p depict differences in Size, Time and Permissions
# is enough but `man rsync` has more info
# (look at the --itemize-changes option)
#
# OPTIONS
#---------
# All options are passed to rsync. Here are the most useful for the purpose
# of directory comparisons:
#
# -c will force comparison of file contents (otherwise only
# time & size is compared which is much faster)
#
# -p/-o/-g will force comparison of permissions/owner/group
LEFT_DIR=$1; shift
RIGHT_DIR=$1; shift
OPTIONS="$*"
# Files that don't exist in Right_Dir
rsync $OPTIONS -rin --ignore-existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^[^ ]* /L /'
# Files that don't exist in Left_Dir
rsync $OPTIONS -rin --ignore-existing "$RIGHT_DIR"/ "$LEFT_DIR"/|sed -e 's/^[^ ]* /R /'
# Files that exist in both dirs but have differences
rsync $OPTIONS -rin --existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^/X /'
How does it work?
The core of the job is performed by calling rsync like this:
rsync -rin --ignore-existing $DIR_A/ $DIR_B/
We use: -i
to tell rsync to print one line of output for every file formated in a special way*, -n
to suppress normal behavior of rsync (to try to copy/delete/sync the two dirs), -r
to work recursively for all files/sub-dirs.
We call rsync three times:
1st call: print files that don't exist in Dir_B. We need to use --ignore-existing
to ignore files that exist on both sides.
rsync -rin --ignore-existing $DIR_A/ $DIR_B/
2nd call: Exactly as before but we swap the order of DIR_A/DIR_B.
3rd call: Finally we use --existing
to only check files that appear in both dirs.
rsync -rin --existing $DIR_A/ $DIR_B/
Notes
*: Use man rsync
and look for --itemize-changes
add a comment |
up vote
0
down vote
I have another idea of doing this:
rsync -rn --out-format=FILEDETAIL::%n $TARGET $SOURCE | grep "^FILEDETAIL"
You can match "FILEDETAIL::" with the output of the command. Also, you can change the string "FILEDETAIL::". The "%n" is the file name.
-r This tells rsync to copy directories recursively.
-n This make rsync perform a trial run that doesn't make any changes.
add a comment |
7 Answers
7
active
oldest
votes
7 Answers
7
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
41
down vote
accepted
You will propably have to run something like
rsync -avun --delete
in both directions.
But what are you actually trying to accomplish?
Update:
rsync -avun --delete $TARGET $SOURCE |grep "^deleting "
will give you a list of files that do not exist in the target-directory.
"grep delet" because each line prints : deleting ..file..
rsync -avun $SOURCE $TARGET
will give you a list of "different" files (including new files).
add a comment |
up vote
41
down vote
accepted
You will propably have to run something like
rsync -avun --delete
in both directions.
But what are you actually trying to accomplish?
Update:
rsync -avun --delete $TARGET $SOURCE |grep "^deleting "
will give you a list of files that do not exist in the target-directory.
"grep delet" because each line prints : deleting ..file..
rsync -avun $SOURCE $TARGET
will give you a list of "different" files (including new files).
add a comment |
up vote
41
down vote
accepted
up vote
41
down vote
accepted
You will propably have to run something like
rsync -avun --delete
in both directions.
But what are you actually trying to accomplish?
Update:
rsync -avun --delete $TARGET $SOURCE |grep "^deleting "
will give you a list of files that do not exist in the target-directory.
"grep delet" because each line prints : deleting ..file..
rsync -avun $SOURCE $TARGET
will give you a list of "different" files (including new files).
You will propably have to run something like
rsync -avun --delete
in both directions.
But what are you actually trying to accomplish?
Update:
rsync -avun --delete $TARGET $SOURCE |grep "^deleting "
will give you a list of files that do not exist in the target-directory.
"grep delet" because each line prints : deleting ..file..
rsync -avun $SOURCE $TARGET
will give you a list of "different" files (including new files).
edited Feb 17 '17 at 15:31
ctx
1,587314
1,587314
answered Dec 2 '12 at 22:14
Nils
12.4k63669
12.4k63669
add a comment |
add a comment |
up vote
42
down vote
To add to Nils's answer (for anyone coming across this via Google), by default rsync
only compares the file sizes and modification times to tell if there are any differences. (If those are different it does more, but if they're the same, it stops there.)
If you want to compare actual file contents, even for files which have the same size and last modification time, add the flag -c
to tell rsync
to compare the files using a checksum.
rsync -avnc $SOURCE $TARGET
(The -u
option tells rsync to ignore files which are newer in $TARGET
than on $SOURCE
, which you probably don't want if you're comparing contents.)
2
If you only care that the data is the same you might want to add--no-group --no-owner --no-perms --no-times
or some combination of these based on your needs.
– flungo
Sep 30 '16 at 17:31
1
@flungo, or just use a subset of the options implied by-a
instead of-a
, e.g.rsync -rlDcnv --delete $SOURCE $TARGET
– maxschlepzig
Nov 26 '16 at 22:29
Please add--delete
to list files only existing in$TARGET
– Tom Hale
Oct 31 '17 at 5:42
add a comment |
up vote
42
down vote
To add to Nils's answer (for anyone coming across this via Google), by default rsync
only compares the file sizes and modification times to tell if there are any differences. (If those are different it does more, but if they're the same, it stops there.)
If you want to compare actual file contents, even for files which have the same size and last modification time, add the flag -c
to tell rsync
to compare the files using a checksum.
rsync -avnc $SOURCE $TARGET
(The -u
option tells rsync to ignore files which are newer in $TARGET
than on $SOURCE
, which you probably don't want if you're comparing contents.)
2
If you only care that the data is the same you might want to add--no-group --no-owner --no-perms --no-times
or some combination of these based on your needs.
– flungo
Sep 30 '16 at 17:31
1
@flungo, or just use a subset of the options implied by-a
instead of-a
, e.g.rsync -rlDcnv --delete $SOURCE $TARGET
– maxschlepzig
Nov 26 '16 at 22:29
Please add--delete
to list files only existing in$TARGET
– Tom Hale
Oct 31 '17 at 5:42
add a comment |
up vote
42
down vote
up vote
42
down vote
To add to Nils's answer (for anyone coming across this via Google), by default rsync
only compares the file sizes and modification times to tell if there are any differences. (If those are different it does more, but if they're the same, it stops there.)
If you want to compare actual file contents, even for files which have the same size and last modification time, add the flag -c
to tell rsync
to compare the files using a checksum.
rsync -avnc $SOURCE $TARGET
(The -u
option tells rsync to ignore files which are newer in $TARGET
than on $SOURCE
, which you probably don't want if you're comparing contents.)
To add to Nils's answer (for anyone coming across this via Google), by default rsync
only compares the file sizes and modification times to tell if there are any differences. (If those are different it does more, but if they're the same, it stops there.)
If you want to compare actual file contents, even for files which have the same size and last modification time, add the flag -c
to tell rsync
to compare the files using a checksum.
rsync -avnc $SOURCE $TARGET
(The -u
option tells rsync to ignore files which are newer in $TARGET
than on $SOURCE
, which you probably don't want if you're comparing contents.)
edited Jan 7 '15 at 14:58
data:image/s3,"s3://crabby-images/624dc/624dc03dee15f9ea8c0b008c56b2b7f48126f59b" alt=""
data:image/s3,"s3://crabby-images/624dc/624dc03dee15f9ea8c0b008c56b2b7f48126f59b" alt=""
HalosGhost
3,65592135
3,65592135
answered Jan 7 '15 at 14:35
user98393
42142
42142
2
If you only care that the data is the same you might want to add--no-group --no-owner --no-perms --no-times
or some combination of these based on your needs.
– flungo
Sep 30 '16 at 17:31
1
@flungo, or just use a subset of the options implied by-a
instead of-a
, e.g.rsync -rlDcnv --delete $SOURCE $TARGET
– maxschlepzig
Nov 26 '16 at 22:29
Please add--delete
to list files only existing in$TARGET
– Tom Hale
Oct 31 '17 at 5:42
add a comment |
2
If you only care that the data is the same you might want to add--no-group --no-owner --no-perms --no-times
or some combination of these based on your needs.
– flungo
Sep 30 '16 at 17:31
1
@flungo, or just use a subset of the options implied by-a
instead of-a
, e.g.rsync -rlDcnv --delete $SOURCE $TARGET
– maxschlepzig
Nov 26 '16 at 22:29
Please add--delete
to list files only existing in$TARGET
– Tom Hale
Oct 31 '17 at 5:42
2
2
If you only care that the data is the same you might want to add
--no-group --no-owner --no-perms --no-times
or some combination of these based on your needs.– flungo
Sep 30 '16 at 17:31
If you only care that the data is the same you might want to add
--no-group --no-owner --no-perms --no-times
or some combination of these based on your needs.– flungo
Sep 30 '16 at 17:31
1
1
@flungo, or just use a subset of the options implied by
-a
instead of -a
, e.g. rsync -rlDcnv --delete $SOURCE $TARGET
– maxschlepzig
Nov 26 '16 at 22:29
@flungo, or just use a subset of the options implied by
-a
instead of -a
, e.g. rsync -rlDcnv --delete $SOURCE $TARGET
– maxschlepzig
Nov 26 '16 at 22:29
Please add
--delete
to list files only existing in $TARGET
– Tom Hale
Oct 31 '17 at 5:42
Please add
--delete
to list files only existing in $TARGET
– Tom Hale
Oct 31 '17 at 5:42
add a comment |
up vote
16
down vote
Just for those less familiar with rsync
:
rsync -rvnc --delete ${SOURCE}/ ${DEST}
-n
: most important bit -- do not change anything ;
-rc
: compare only the contents (otherwise use-ac
) ;
-v
: list the files )
--delete
: look for a symmetrical, not a uni-directional difference.- Finally,
/
means "look inside the directory, and compare its contents to the destination".
It will print a usual
rsync
output,
- with one <filename> on a line for every "new" file in
${SOURCE}
and one "deleting <filename>" line for each "new" file in
${DEST}
.
It may also print a few warnings, like "skipping non-regular file <filename>" for symlinks.
PS. I know it's a terrible PS -- but it was indeed added in a rush. Nevertheless, I bet one may find this useful.
PPS. Alternatively, one could also do
find $SOURCE -type f -exec md5sum {} ; | tee source.md5
find $DEST -type f -exec md5sum {} ; | tee dest.md5
If the filenames do not contain newlines, we can then sort both *.md5
files, and diff
them. ( This will work only for files, though; that is, an empty directory on either side won't be detected. )
add a comment |
up vote
16
down vote
Just for those less familiar with rsync
:
rsync -rvnc --delete ${SOURCE}/ ${DEST}
-n
: most important bit -- do not change anything ;
-rc
: compare only the contents (otherwise use-ac
) ;
-v
: list the files )
--delete
: look for a symmetrical, not a uni-directional difference.- Finally,
/
means "look inside the directory, and compare its contents to the destination".
It will print a usual
rsync
output,
- with one <filename> on a line for every "new" file in
${SOURCE}
and one "deleting <filename>" line for each "new" file in
${DEST}
.
It may also print a few warnings, like "skipping non-regular file <filename>" for symlinks.
PS. I know it's a terrible PS -- but it was indeed added in a rush. Nevertheless, I bet one may find this useful.
PPS. Alternatively, one could also do
find $SOURCE -type f -exec md5sum {} ; | tee source.md5
find $DEST -type f -exec md5sum {} ; | tee dest.md5
If the filenames do not contain newlines, we can then sort both *.md5
files, and diff
them. ( This will work only for files, though; that is, an empty directory on either side won't be detected. )
add a comment |
up vote
16
down vote
up vote
16
down vote
Just for those less familiar with rsync
:
rsync -rvnc --delete ${SOURCE}/ ${DEST}
-n
: most important bit -- do not change anything ;
-rc
: compare only the contents (otherwise use-ac
) ;
-v
: list the files )
--delete
: look for a symmetrical, not a uni-directional difference.- Finally,
/
means "look inside the directory, and compare its contents to the destination".
It will print a usual
rsync
output,
- with one <filename> on a line for every "new" file in
${SOURCE}
and one "deleting <filename>" line for each "new" file in
${DEST}
.
It may also print a few warnings, like "skipping non-regular file <filename>" for symlinks.
PS. I know it's a terrible PS -- but it was indeed added in a rush. Nevertheless, I bet one may find this useful.
PPS. Alternatively, one could also do
find $SOURCE -type f -exec md5sum {} ; | tee source.md5
find $DEST -type f -exec md5sum {} ; | tee dest.md5
If the filenames do not contain newlines, we can then sort both *.md5
files, and diff
them. ( This will work only for files, though; that is, an empty directory on either side won't be detected. )
Just for those less familiar with rsync
:
rsync -rvnc --delete ${SOURCE}/ ${DEST}
-n
: most important bit -- do not change anything ;
-rc
: compare only the contents (otherwise use-ac
) ;
-v
: list the files )
--delete
: look for a symmetrical, not a uni-directional difference.- Finally,
/
means "look inside the directory, and compare its contents to the destination".
It will print a usual
rsync
output,
- with one <filename> on a line for every "new" file in
${SOURCE}
and one "deleting <filename>" line for each "new" file in
${DEST}
.
It may also print a few warnings, like "skipping non-regular file <filename>" for symlinks.
PS. I know it's a terrible PS -- but it was indeed added in a rush. Nevertheless, I bet one may find this useful.
PPS. Alternatively, one could also do
find $SOURCE -type f -exec md5sum {} ; | tee source.md5
find $DEST -type f -exec md5sum {} ; | tee dest.md5
If the filenames do not contain newlines, we can then sort both *.md5
files, and diff
them. ( This will work only for files, though; that is, an empty directory on either side won't be detected. )
edited Jun 23 at 13:04
answered Mar 13 '17 at 12:43
data:image/s3,"s3://crabby-images/37381/373819a5328bd5afe0b9d18b5bdfcef8b75a7720" alt=""
data:image/s3,"s3://crabby-images/37381/373819a5328bd5afe0b9d18b5bdfcef8b75a7720" alt=""
ジョージ
41358
41358
add a comment |
add a comment |
up vote
2
down vote
I understand from your question that you do not want to use diff on ls, but you can also use diff recursively on directories:
diff -rq DIR1 DIR2
add a comment |
up vote
2
down vote
I understand from your question that you do not want to use diff on ls, but you can also use diff recursively on directories:
diff -rq DIR1 DIR2
add a comment |
up vote
2
down vote
up vote
2
down vote
I understand from your question that you do not want to use diff on ls, but you can also use diff recursively on directories:
diff -rq DIR1 DIR2
I understand from your question that you do not want to use diff on ls, but you can also use diff recursively on directories:
diff -rq DIR1 DIR2
answered Jul 3 '17 at 16:58
Camion
211
211
add a comment |
add a comment |
up vote
1
down vote
It took me a few tries to get this to work. Nils' answer requires that $TARGET
ends in a trailing /
, as explained by ジョージ.
Here is a version that explicitly adds the trailing /
:
rsync -avun --delete ${TARGET}/ ${SOURCE} | sed -ne 's/^deleting *//p'
This gives the list of files that exist below the ${SOURCE}
directory but not below the ${TARGET}
directory.
I use sed
here to remove the leading deleting
from the output lines, and to print only those lines.
I do not use the rsync
option -c
because comparing file contents would be much slower for my use cases, and comparing only file sizes and modification times also seems sufficient in these cases. I have no reason to suspect that my computers suffer from clock skew problems or that something maliciously changed time stamps. Also, the outcome of -c
cannot change the decision to delete a file, only the decision to update or keep a file.
I also use -u
and -a
(rather than -r
), so that I can later re-use the command line and change it to copy selected directories and files from ${SOURCE}
to ${TARGET}
, like this:
rsync -avu ${SOURCE}/{dirA,dirB,fileX} ${TARGET} # copy some files
add a comment |
up vote
1
down vote
It took me a few tries to get this to work. Nils' answer requires that $TARGET
ends in a trailing /
, as explained by ジョージ.
Here is a version that explicitly adds the trailing /
:
rsync -avun --delete ${TARGET}/ ${SOURCE} | sed -ne 's/^deleting *//p'
This gives the list of files that exist below the ${SOURCE}
directory but not below the ${TARGET}
directory.
I use sed
here to remove the leading deleting
from the output lines, and to print only those lines.
I do not use the rsync
option -c
because comparing file contents would be much slower for my use cases, and comparing only file sizes and modification times also seems sufficient in these cases. I have no reason to suspect that my computers suffer from clock skew problems or that something maliciously changed time stamps. Also, the outcome of -c
cannot change the decision to delete a file, only the decision to update or keep a file.
I also use -u
and -a
(rather than -r
), so that I can later re-use the command line and change it to copy selected directories and files from ${SOURCE}
to ${TARGET}
, like this:
rsync -avu ${SOURCE}/{dirA,dirB,fileX} ${TARGET} # copy some files
add a comment |
up vote
1
down vote
up vote
1
down vote
It took me a few tries to get this to work. Nils' answer requires that $TARGET
ends in a trailing /
, as explained by ジョージ.
Here is a version that explicitly adds the trailing /
:
rsync -avun --delete ${TARGET}/ ${SOURCE} | sed -ne 's/^deleting *//p'
This gives the list of files that exist below the ${SOURCE}
directory but not below the ${TARGET}
directory.
I use sed
here to remove the leading deleting
from the output lines, and to print only those lines.
I do not use the rsync
option -c
because comparing file contents would be much slower for my use cases, and comparing only file sizes and modification times also seems sufficient in these cases. I have no reason to suspect that my computers suffer from clock skew problems or that something maliciously changed time stamps. Also, the outcome of -c
cannot change the decision to delete a file, only the decision to update or keep a file.
I also use -u
and -a
(rather than -r
), so that I can later re-use the command line and change it to copy selected directories and files from ${SOURCE}
to ${TARGET}
, like this:
rsync -avu ${SOURCE}/{dirA,dirB,fileX} ${TARGET} # copy some files
It took me a few tries to get this to work. Nils' answer requires that $TARGET
ends in a trailing /
, as explained by ジョージ.
Here is a version that explicitly adds the trailing /
:
rsync -avun --delete ${TARGET}/ ${SOURCE} | sed -ne 's/^deleting *//p'
This gives the list of files that exist below the ${SOURCE}
directory but not below the ${TARGET}
directory.
I use sed
here to remove the leading deleting
from the output lines, and to print only those lines.
I do not use the rsync
option -c
because comparing file contents would be much slower for my use cases, and comparing only file sizes and modification times also seems sufficient in these cases. I have no reason to suspect that my computers suffer from clock skew problems or that something maliciously changed time stamps. Also, the outcome of -c
cannot change the decision to delete a file, only the decision to update or keep a file.
I also use -u
and -a
(rather than -r
), so that I can later re-use the command line and change it to copy selected directories and files from ${SOURCE}
to ${TARGET}
, like this:
rsync -avu ${SOURCE}/{dirA,dirB,fileX} ${TARGET} # copy some files
edited May 15 at 6:23
answered May 15 at 6:14
Orafu
113
113
add a comment |
add a comment |
up vote
1
down vote
Surprisingly no answer in 6 years uses the -i
option or gives nice output so here I'll go:
TLDR - Just show me the commands
rsync -rin --ignore-existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^[^ ]* /L /'
rsync -rin --ignore-existing "$RIGHT_DIR"/ "$LEFT_DIR"/|sed -e 's/^[^ ]* /R /'
rsync -rin --existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^/X /'
Understanding the output
Here's an example of the output:
L file-only-in-Left-dir
R file-only-in-right-dir
X >f.st...... file-with-dif-size-and-time
X .f...p..... file-with-dif-perms
Note the first character of every line:
L
/R
mean that the file/dir appears only at theL
eft orR
ight dir.
X
means that the file appears on both sides but is not the same (in which case the next 11 characters give you more info.s
,t
andp
depict differences in size, time and permissions respectively -- for more info tryman rsync
and search for--itemize-changes
).
Extra options you may wish to use
If you want to also compare the owner/group/permissions of the files add the options -o
/-g
/-p
respectively. Finally note that by default rsync considers two files the same if they have the same name, time and size. This is extremely fast and most of the times more than enough but if you want to be 100% sure add -c
to also compare the contents of files with the same name, time & size.
TLDR - Just give me a script to call
Here it is. Call it like this
diff-dirs Left_Dir Right_Dir [options]
All options mentioned above in section "Extra options you may wish to use" also apply here.
#!/bin/bash
# Compare two directories using rsync and print the differences
# CAUTION: options MUST appear after the directories
#
# SYNTAX
#---------
# diff-dirs Left_Dir Right_Dir [options]
#
# EXAMPLE OF OUTPUT
#------------------
# L file-only-in-Left-dir
# R file-only-in-right-dir
# X >f.st...... file-with-dif-size-and-time
# X .f...p..... file-with-dif-perms
#
# L / R mean that the file/dir appears only at the `L`eft or `R`ight dir.
#
# X means that a file appears on both sides but is not the same (in which
# case the next 11 characters give you more info. In most cases knowing
# that s,t,T and p depict differences in Size, Time and Permissions
# is enough but `man rsync` has more info
# (look at the --itemize-changes option)
#
# OPTIONS
#---------
# All options are passed to rsync. Here are the most useful for the purpose
# of directory comparisons:
#
# -c will force comparison of file contents (otherwise only
# time & size is compared which is much faster)
#
# -p/-o/-g will force comparison of permissions/owner/group
LEFT_DIR=$1; shift
RIGHT_DIR=$1; shift
OPTIONS="$*"
# Files that don't exist in Right_Dir
rsync $OPTIONS -rin --ignore-existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^[^ ]* /L /'
# Files that don't exist in Left_Dir
rsync $OPTIONS -rin --ignore-existing "$RIGHT_DIR"/ "$LEFT_DIR"/|sed -e 's/^[^ ]* /R /'
# Files that exist in both dirs but have differences
rsync $OPTIONS -rin --existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^/X /'
How does it work?
The core of the job is performed by calling rsync like this:
rsync -rin --ignore-existing $DIR_A/ $DIR_B/
We use: -i
to tell rsync to print one line of output for every file formated in a special way*, -n
to suppress normal behavior of rsync (to try to copy/delete/sync the two dirs), -r
to work recursively for all files/sub-dirs.
We call rsync three times:
1st call: print files that don't exist in Dir_B. We need to use --ignore-existing
to ignore files that exist on both sides.
rsync -rin --ignore-existing $DIR_A/ $DIR_B/
2nd call: Exactly as before but we swap the order of DIR_A/DIR_B.
3rd call: Finally we use --existing
to only check files that appear in both dirs.
rsync -rin --existing $DIR_A/ $DIR_B/
Notes
*: Use man rsync
and look for --itemize-changes
add a comment |
up vote
1
down vote
Surprisingly no answer in 6 years uses the -i
option or gives nice output so here I'll go:
TLDR - Just show me the commands
rsync -rin --ignore-existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^[^ ]* /L /'
rsync -rin --ignore-existing "$RIGHT_DIR"/ "$LEFT_DIR"/|sed -e 's/^[^ ]* /R /'
rsync -rin --existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^/X /'
Understanding the output
Here's an example of the output:
L file-only-in-Left-dir
R file-only-in-right-dir
X >f.st...... file-with-dif-size-and-time
X .f...p..... file-with-dif-perms
Note the first character of every line:
L
/R
mean that the file/dir appears only at theL
eft orR
ight dir.
X
means that the file appears on both sides but is not the same (in which case the next 11 characters give you more info.s
,t
andp
depict differences in size, time and permissions respectively -- for more info tryman rsync
and search for--itemize-changes
).
Extra options you may wish to use
If you want to also compare the owner/group/permissions of the files add the options -o
/-g
/-p
respectively. Finally note that by default rsync considers two files the same if they have the same name, time and size. This is extremely fast and most of the times more than enough but if you want to be 100% sure add -c
to also compare the contents of files with the same name, time & size.
TLDR - Just give me a script to call
Here it is. Call it like this
diff-dirs Left_Dir Right_Dir [options]
All options mentioned above in section "Extra options you may wish to use" also apply here.
#!/bin/bash
# Compare two directories using rsync and print the differences
# CAUTION: options MUST appear after the directories
#
# SYNTAX
#---------
# diff-dirs Left_Dir Right_Dir [options]
#
# EXAMPLE OF OUTPUT
#------------------
# L file-only-in-Left-dir
# R file-only-in-right-dir
# X >f.st...... file-with-dif-size-and-time
# X .f...p..... file-with-dif-perms
#
# L / R mean that the file/dir appears only at the `L`eft or `R`ight dir.
#
# X means that a file appears on both sides but is not the same (in which
# case the next 11 characters give you more info. In most cases knowing
# that s,t,T and p depict differences in Size, Time and Permissions
# is enough but `man rsync` has more info
# (look at the --itemize-changes option)
#
# OPTIONS
#---------
# All options are passed to rsync. Here are the most useful for the purpose
# of directory comparisons:
#
# -c will force comparison of file contents (otherwise only
# time & size is compared which is much faster)
#
# -p/-o/-g will force comparison of permissions/owner/group
LEFT_DIR=$1; shift
RIGHT_DIR=$1; shift
OPTIONS="$*"
# Files that don't exist in Right_Dir
rsync $OPTIONS -rin --ignore-existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^[^ ]* /L /'
# Files that don't exist in Left_Dir
rsync $OPTIONS -rin --ignore-existing "$RIGHT_DIR"/ "$LEFT_DIR"/|sed -e 's/^[^ ]* /R /'
# Files that exist in both dirs but have differences
rsync $OPTIONS -rin --existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^/X /'
How does it work?
The core of the job is performed by calling rsync like this:
rsync -rin --ignore-existing $DIR_A/ $DIR_B/
We use: -i
to tell rsync to print one line of output for every file formated in a special way*, -n
to suppress normal behavior of rsync (to try to copy/delete/sync the two dirs), -r
to work recursively for all files/sub-dirs.
We call rsync three times:
1st call: print files that don't exist in Dir_B. We need to use --ignore-existing
to ignore files that exist on both sides.
rsync -rin --ignore-existing $DIR_A/ $DIR_B/
2nd call: Exactly as before but we swap the order of DIR_A/DIR_B.
3rd call: Finally we use --existing
to only check files that appear in both dirs.
rsync -rin --existing $DIR_A/ $DIR_B/
Notes
*: Use man rsync
and look for --itemize-changes
add a comment |
up vote
1
down vote
up vote
1
down vote
Surprisingly no answer in 6 years uses the -i
option or gives nice output so here I'll go:
TLDR - Just show me the commands
rsync -rin --ignore-existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^[^ ]* /L /'
rsync -rin --ignore-existing "$RIGHT_DIR"/ "$LEFT_DIR"/|sed -e 's/^[^ ]* /R /'
rsync -rin --existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^/X /'
Understanding the output
Here's an example of the output:
L file-only-in-Left-dir
R file-only-in-right-dir
X >f.st...... file-with-dif-size-and-time
X .f...p..... file-with-dif-perms
Note the first character of every line:
L
/R
mean that the file/dir appears only at theL
eft orR
ight dir.
X
means that the file appears on both sides but is not the same (in which case the next 11 characters give you more info.s
,t
andp
depict differences in size, time and permissions respectively -- for more info tryman rsync
and search for--itemize-changes
).
Extra options you may wish to use
If you want to also compare the owner/group/permissions of the files add the options -o
/-g
/-p
respectively. Finally note that by default rsync considers two files the same if they have the same name, time and size. This is extremely fast and most of the times more than enough but if you want to be 100% sure add -c
to also compare the contents of files with the same name, time & size.
TLDR - Just give me a script to call
Here it is. Call it like this
diff-dirs Left_Dir Right_Dir [options]
All options mentioned above in section "Extra options you may wish to use" also apply here.
#!/bin/bash
# Compare two directories using rsync and print the differences
# CAUTION: options MUST appear after the directories
#
# SYNTAX
#---------
# diff-dirs Left_Dir Right_Dir [options]
#
# EXAMPLE OF OUTPUT
#------------------
# L file-only-in-Left-dir
# R file-only-in-right-dir
# X >f.st...... file-with-dif-size-and-time
# X .f...p..... file-with-dif-perms
#
# L / R mean that the file/dir appears only at the `L`eft or `R`ight dir.
#
# X means that a file appears on both sides but is not the same (in which
# case the next 11 characters give you more info. In most cases knowing
# that s,t,T and p depict differences in Size, Time and Permissions
# is enough but `man rsync` has more info
# (look at the --itemize-changes option)
#
# OPTIONS
#---------
# All options are passed to rsync. Here are the most useful for the purpose
# of directory comparisons:
#
# -c will force comparison of file contents (otherwise only
# time & size is compared which is much faster)
#
# -p/-o/-g will force comparison of permissions/owner/group
LEFT_DIR=$1; shift
RIGHT_DIR=$1; shift
OPTIONS="$*"
# Files that don't exist in Right_Dir
rsync $OPTIONS -rin --ignore-existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^[^ ]* /L /'
# Files that don't exist in Left_Dir
rsync $OPTIONS -rin --ignore-existing "$RIGHT_DIR"/ "$LEFT_DIR"/|sed -e 's/^[^ ]* /R /'
# Files that exist in both dirs but have differences
rsync $OPTIONS -rin --existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^/X /'
How does it work?
The core of the job is performed by calling rsync like this:
rsync -rin --ignore-existing $DIR_A/ $DIR_B/
We use: -i
to tell rsync to print one line of output for every file formated in a special way*, -n
to suppress normal behavior of rsync (to try to copy/delete/sync the two dirs), -r
to work recursively for all files/sub-dirs.
We call rsync three times:
1st call: print files that don't exist in Dir_B. We need to use --ignore-existing
to ignore files that exist on both sides.
rsync -rin --ignore-existing $DIR_A/ $DIR_B/
2nd call: Exactly as before but we swap the order of DIR_A/DIR_B.
3rd call: Finally we use --existing
to only check files that appear in both dirs.
rsync -rin --existing $DIR_A/ $DIR_B/
Notes
*: Use man rsync
and look for --itemize-changes
Surprisingly no answer in 6 years uses the -i
option or gives nice output so here I'll go:
TLDR - Just show me the commands
rsync -rin --ignore-existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^[^ ]* /L /'
rsync -rin --ignore-existing "$RIGHT_DIR"/ "$LEFT_DIR"/|sed -e 's/^[^ ]* /R /'
rsync -rin --existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^/X /'
Understanding the output
Here's an example of the output:
L file-only-in-Left-dir
R file-only-in-right-dir
X >f.st...... file-with-dif-size-and-time
X .f...p..... file-with-dif-perms
Note the first character of every line:
L
/R
mean that the file/dir appears only at theL
eft orR
ight dir.
X
means that the file appears on both sides but is not the same (in which case the next 11 characters give you more info.s
,t
andp
depict differences in size, time and permissions respectively -- for more info tryman rsync
and search for--itemize-changes
).
Extra options you may wish to use
If you want to also compare the owner/group/permissions of the files add the options -o
/-g
/-p
respectively. Finally note that by default rsync considers two files the same if they have the same name, time and size. This is extremely fast and most of the times more than enough but if you want to be 100% sure add -c
to also compare the contents of files with the same name, time & size.
TLDR - Just give me a script to call
Here it is. Call it like this
diff-dirs Left_Dir Right_Dir [options]
All options mentioned above in section "Extra options you may wish to use" also apply here.
#!/bin/bash
# Compare two directories using rsync and print the differences
# CAUTION: options MUST appear after the directories
#
# SYNTAX
#---------
# diff-dirs Left_Dir Right_Dir [options]
#
# EXAMPLE OF OUTPUT
#------------------
# L file-only-in-Left-dir
# R file-only-in-right-dir
# X >f.st...... file-with-dif-size-and-time
# X .f...p..... file-with-dif-perms
#
# L / R mean that the file/dir appears only at the `L`eft or `R`ight dir.
#
# X means that a file appears on both sides but is not the same (in which
# case the next 11 characters give you more info. In most cases knowing
# that s,t,T and p depict differences in Size, Time and Permissions
# is enough but `man rsync` has more info
# (look at the --itemize-changes option)
#
# OPTIONS
#---------
# All options are passed to rsync. Here are the most useful for the purpose
# of directory comparisons:
#
# -c will force comparison of file contents (otherwise only
# time & size is compared which is much faster)
#
# -p/-o/-g will force comparison of permissions/owner/group
LEFT_DIR=$1; shift
RIGHT_DIR=$1; shift
OPTIONS="$*"
# Files that don't exist in Right_Dir
rsync $OPTIONS -rin --ignore-existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^[^ ]* /L /'
# Files that don't exist in Left_Dir
rsync $OPTIONS -rin --ignore-existing "$RIGHT_DIR"/ "$LEFT_DIR"/|sed -e 's/^[^ ]* /R /'
# Files that exist in both dirs but have differences
rsync $OPTIONS -rin --existing "$LEFT_DIR"/ "$RIGHT_DIR"/|sed -e 's/^/X /'
How does it work?
The core of the job is performed by calling rsync like this:
rsync -rin --ignore-existing $DIR_A/ $DIR_B/
We use: -i
to tell rsync to print one line of output for every file formated in a special way*, -n
to suppress normal behavior of rsync (to try to copy/delete/sync the two dirs), -r
to work recursively for all files/sub-dirs.
We call rsync three times:
1st call: print files that don't exist in Dir_B. We need to use --ignore-existing
to ignore files that exist on both sides.
rsync -rin --ignore-existing $DIR_A/ $DIR_B/
2nd call: Exactly as before but we swap the order of DIR_A/DIR_B.
3rd call: Finally we use --existing
to only check files that appear in both dirs.
rsync -rin --existing $DIR_A/ $DIR_B/
Notes
*: Use man rsync
and look for --itemize-changes
edited Nov 23 at 18:22
answered Aug 17 at 15:51
data:image/s3,"s3://crabby-images/2679e/2679e9991e4403b0ec674696d232004457f8b2c2" alt=""
data:image/s3,"s3://crabby-images/2679e/2679e9991e4403b0ec674696d232004457f8b2c2" alt=""
ndemou
507414
507414
add a comment |
add a comment |
up vote
0
down vote
I have another idea of doing this:
rsync -rn --out-format=FILEDETAIL::%n $TARGET $SOURCE | grep "^FILEDETAIL"
You can match "FILEDETAIL::" with the output of the command. Also, you can change the string "FILEDETAIL::". The "%n" is the file name.
-r This tells rsync to copy directories recursively.
-n This make rsync perform a trial run that doesn't make any changes.
add a comment |
up vote
0
down vote
I have another idea of doing this:
rsync -rn --out-format=FILEDETAIL::%n $TARGET $SOURCE | grep "^FILEDETAIL"
You can match "FILEDETAIL::" with the output of the command. Also, you can change the string "FILEDETAIL::". The "%n" is the file name.
-r This tells rsync to copy directories recursively.
-n This make rsync perform a trial run that doesn't make any changes.
add a comment |
up vote
0
down vote
up vote
0
down vote
I have another idea of doing this:
rsync -rn --out-format=FILEDETAIL::%n $TARGET $SOURCE | grep "^FILEDETAIL"
You can match "FILEDETAIL::" with the output of the command. Also, you can change the string "FILEDETAIL::". The "%n" is the file name.
-r This tells rsync to copy directories recursively.
-n This make rsync perform a trial run that doesn't make any changes.
I have another idea of doing this:
rsync -rn --out-format=FILEDETAIL::%n $TARGET $SOURCE | grep "^FILEDETAIL"
You can match "FILEDETAIL::" with the output of the command. Also, you can change the string "FILEDETAIL::". The "%n" is the file name.
-r This tells rsync to copy directories recursively.
-n This make rsync perform a trial run that doesn't make any changes.
edited Sep 19 at 11:44
user88036
answered Sep 19 at 11:37
data:image/s3,"s3://crabby-images/89e87/89e876870282ef5867aa7dc9cf6d38b0e2721e13" alt=""
data:image/s3,"s3://crabby-images/89e87/89e876870282ef5867aa7dc9cf6d38b0e2721e13" alt=""
zhao Tony
1
1
add a comment |
add a comment |
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f57305%2frsync-compare-directories%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Similar: serverfault.com/questions/62364/…
– reinierpost
Oct 29 '15 at 12:41