This site is currently on indefinite hiatus. For fresh, new Linux & productivity content, head on over to nathanrhale.com/linux/ and subscribe to the Linux-only feed.

CLI Fun

Get Entire Websites On Your Hard Drive With Wget

September 9th, 2008 | 2 Comments

You know that wget is on my list of essential of command-line apps, and Linux Journal just put out an article on how to download an entire website to your hardrive for offline viewing, backup, or whatever.

Check it out:

$ wget \
–recursive \
–no-clobber \
–page-requisites \
–html-extension \
–convert-links \
–restrict-file-names=windows \
–domains website.org \
–no-parent \
www.website.org/tutorials/html/

This would download the file at www.website.org/tutorials/html/. For a full explanation of all the options used, make sure to check the full story.

Isn’t Wget cool? Do you have any awesome tips or tricks for Wget?

Full Story: Downloading an Entire Web Site with wget

Share and Enjoy:
  • Digg
  • del.icio.us
  • Furl
  • NewsVine
  • Slashdot
  • StumbleUpon
  • Technorati
  • Facebook
  • TwitThis
  • YahooMyWeb

2 Responses to “Get Entire Websites On Your Hard Drive With Wget”

  1. At work, I use the following to mirror a page or so onto my hard drive, when I need to back it up before I pull it down (I only have access to the site via a CMS–I can’t touch the code; and I can’t hide pages–I can only delete them):

    wget -pk http://thepage.com/Iwant.htm

    That gets all the stuff I need w/o grabbing the whole site or following any sidebar links.

    The other day in the podcast you said something about resuming downloads with wget…how’s that work?

  2. wget -c /url/to/download/

    -c = continue

    Nice, huh?