aoe:rsync

rsync of Scientific Linux (SL)

https://www.scientificlinux.org/download/mirroring/

mkdir -p /export/pub/linux/scientific/52/  # create the target directory first, one time; repeat for each tree mirrored below
rsync -avzlH --delete --exclude=sites/Fermi --exclude=archive/debuginfo --exclude=archive/obsolete rsync://rsync.scientificlinux.org/scientific/53/ /export/pub/linux/scientific/53/
rsync -avzlH --delete --exclude=sites/Fermi --exclude=archive/debuginfo --exclude=archive/obsolete rsync://rsync.scientificlinux.org/scientific/52/ /export/pub/linux/scientific/52/
rsync -avzlH --delete --exclude=sites/Fermi --exclude=archive/debuginfo --exclude=archive/obsolete rsync://rsync.scientificlinux.org/scientific/5x/ /export/pub/linux/scientific/5x/
rsync -avzlH --delete --exclude=sites/Fermi --exclude=archive/debuginfo --exclude=archive/obsolete rsync://rsync.scientificlinux.org/scientific/5rolling/ /export/pub/linux/scientific/5rolling/
rsync -avzlH --delete --exclude=sites/Fermi --exclude=archive/debuginfo --exclude=archive/obsolete rsync://rsync.scientificlinux.org/scientific/livecd/52/ /export/pub/linux/scientific/livecd/52/
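The commands above differ only in the tree being mirrored, so they could be generated from a loop. The following is a minimal sketch, not a tested mirror script: the names SRC, DEST, EXCLUDES and mirror_cmd are made up for illustration, and the script only prints each command rather than running it.

```shell
#!/bin/sh
# Sketch only: prints the rsync command for each tree instead of running it.
# SRC, DEST, EXCLUDES and mirror_cmd are illustrative names, not part of rsync.
SRC=rsync://rsync.scientificlinux.org/scientific
DEST=/export/pub/linux/scientific
EXCLUDES="--exclude=sites/Fermi --exclude=archive/debuginfo --exclude=archive/obsolete"

mirror_cmd() {
    # $1 is the tree to mirror, e.g. 53 or livecd/52
    echo "rsync -avzlH --delete $EXCLUDES $SRC/$1/ $DEST/$1/"
}

for tree in 53 52 5x 5rolling livecd/52; do
    mirror_cmd "$tree"
done
```

Once the printed commands look right, the output can be piped to sh to run them. Each target directory still has to exist first (mkdir -p, as above).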

lftp alternative

wget to mirror a web site

wget -rkp -l6 -np -nH -N http://example.com/

http://blogs.techrepublic.com.com/opensource/?p=883

The -rkp argument combines three options: -r tells wget to download recursively, -k to convert links in the downloaded HTML pages so they point to the local files, and -p to fetch all images and other files needed to render each page properly.

The -l6 option tells wget to recurse to a maximum of six nested levels, while -np tells it not to recurse to the parent directory. The -nH option tells wget not to create host directories; this means that the files will be downloaded to the current directory rather than a directory named after the hostname of the site being mirrored.

Finally, -N tells wget to use time-stamping, which is its way of trying to prevent downloading the same unchanged file more than once. Unfortunately, with dynamic sites being the norm, this may not work very well, but it’s worth adding, regardless.
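For readability in scripts, the same command can be spelled with wget's long option names. The sketch below wraps it in a shell function; the name mirror_site is hypothetical, and the options are the long forms of -rkp -l6 -np -nH -N.

```shell
# Hypothetical helper: mirror_site is an illustrative name, not part of wget.
mirror_site() {
    # $1 is the URL to mirror, e.g. http://example.com/
    wget --recursive --convert-links --page-requisites \
         --level=6 --no-parent --no-host-directories \
         --timestamping "$1"
}
```

It would be called as: mirror_site http://example.com/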

Wget is capable of mirroring HTTP, HTTPS, and FTP sites. It can do so anonymously or with authentication for all of these protocols.
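For a site that needs a login, wget accepts --user together with --ask-password, which prompts for the password instead of leaving it in the shell history. This is a sketch under assumptions: the function name mirror_site_auth and the user name in the usage line are placeholders, and --ask-password needs a reasonably recent wget.

```shell
# Hypothetical helper: mirror_site_auth is an illustrative name, not part of wget.
mirror_site_auth() {
    # $1 is the user name, $2 is the URL to mirror
    wget --user="$1" --ask-password -rkp -l6 -np -nH -N "$2"
}
```

It would be called as: mirror_site_auth someuser http://example.com/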

aoe/rsync.txt · Last modified: 1970/01/18 07:09 by 127.0.0.1