Get Entire Websites On Your Hard Drive With Wget
September 9th, 2008 | 2 Comments
You know that wget is on my list of essential of command-line apps, and Linux Journal just put out an article on how to download an entire website to your hardrive for offline viewing, backup, or whatever.
Check it out:
$ wget \
–recursive \
–no-clobber \
–page-requisites \
–html-extension \
–convert-links \
–restrict-file-names=windows \
–domains website.org \
–no-parent \
www.website.org/tutorials/html/
This would download the file at www.website.org/tutorials/html/. For a full explanation of all the options used, make sure to check the full story.
Isn’t Wget cool? Do you have any awesome tips or tricks for Wget?
Full Story: Downloading an Entire Web Site with wget












At work, I use the following to mirror a page or so onto my hard drive, when I need to back it up before I pull it down (I only have access to the site via a CMS–I can’t touch the code; and I can’t hide pages–I can only delete them):
wget -pk http://thepage.com/Iwant.htmThat gets all the stuff I need w/o grabbing the whole site or following any sidebar links.
The other day in the podcast you said something about resuming downloads with wget…how’s that work?
wget -c /url/to/download/
-c = continue
Nice, huh?