Using the Wayback Machine to Archive (and Backup) WordPress
Sometimes, WordPress backups fail and restoring your many blog posts and pages can become a brutal challenge. Fortunately, archiving them using the Wayback (time) Machine can make them easily recoverable.
Sure, you can’t actually go back in time, but using the Wayback Machine isn’t far off. It archives public web documents to preserve human culture for future generations.
In this post, I’ll go into detail about the Wayback Machine, what it is, how you can use it to automatically or manually archive your blog posts and pages, and also how you can retrieve archived content. I’ll also show you a few plugins you can use for easy archiving.
What is the Way Back Machine?
The Wayback Machine is a three-dimensional index that archives publicly accessible web pages by crawling them, similar to search engines. It was created in 1996 as a non-profit project by The Internet Archive.
The Wayback Machine was named to reference Mr. Peabody’s WABAC machine from the popular cartoon Rocky and Bullwinkle. In the show, the machine was pronounced as “way back,” which is where the index got its name.
Archiving your blog posts and pages using the Wayback Machine can be useful if your site breaks and your backups fail. While you can’t archive all dynamic content, the text on your posts and pages are saved, which means you can copy and paste it into a new post.
You can recover the posts and content you’re missing while also contributing to a non-profit project. By archiving your site, you’re preserving information and artifacts from the cultures and heritages of humanity for future generations and civilizations. Future human beings can take a look at the Wayback Machine and everything that was archived so they can learn from us just as archaeologists uncover ancient artifacts so we can learn from the past to create a better future.
When and What’s Archived?
The Wayback Machine only crawls public web pages and can’t access content that’s password-protected or on a secure, private server. It also doesn’t crawl sites that discourage search engines from crawling them.
Popular sites that get a lot of traffic are automatically crawled, but you can manually archive pages in a few seconds.
The only prerequisite is that you need to make sure your WordPress website is set up to let crawlers go through your pages and posts. To ensure your site can be archived:
- Go to your WordPress admin dashboard and click on Settings > Reading.
- Under Search Engine Visibility, uncheck the box for the setting Discourage search engines from indexing this site, then click Save Changes.
If you have any plugins installed and activated that have a similar setting, be sure to change it to let crawlers through.
Once that’s done, you’re ready to archive your posts and pages.
Archiving Your Blog Posts and Pages
There are two main ways to archive your site using the Wayback Machine.
The first method is by typing
web.archive.org/save/ in front of the URL in your browser’s address bar. You also don’t need to omit the http:// or https:// at the beginning of the web address.
You can also go to the Wayback Machine Web Archive page and enter the URL of the page or post you want to archive in the field under Save Page Now. Then, Click the Save Page button.
In either case, the process takes a few seconds but can take a bit longer depending on the size of the page. Once the archiving has been completed, you should see a direct URL you can copy and save to directly access the archived post or page later.
Accessing Your Archived Content
Once you have archived your posts and pages, you can access them by visiting the Wayback Machine. Keep in mind that it can take several days for a page to get fully archived so you may not be able to access the content you archived right away, but it should be there later on.
You can search for archived pages and posts by clicking on the web icon. Then, enter a URL into the field that dynamically appears toward the top of the page and press Enter on your keyboard.
If you don’t remember the exact URL of the post or page you’re trying to recover, you can enter only your main web address or the link to your blog. The Wayback Machine should pull up all the results related to the address you entered, including URL strings.
The search results return a calendar with colored circles to highlight the days where content was archived. You can hover over one of these circles to view a list of pages that were indexed on that day.
You can click on one of the hyperlinked times that are listed to view the archived page.
From there, you can copy and paste the text into your post or page editor and save a new copy of your content to recover your site.
Voilà! Your site is fully recovered.
Plugins for Archiving with Speed
If you would like different ways to archive your posts and pages, check out these plugins. Not all of them archive to the Wayback Machine, but some of them offer other complimentary capabilities to archiving.
The Archiver plugin takes the Way Back Machine and places it right in your admin dashboard. You can archive posts and pages through your admin toolbar and send them to the Way Back Machine. It’s an easy way to archive pages that doesn’t involve remembering any URLs.
The Archive plugin doesn’t connect to the Way Back Machine, but if you want a way to archive your posts internally, you can use this plugin. Instead of deleting old posts you don’t need, but still want to keep, you can archive them and keep them in your database with this plugin.
This plugin doesn’t archive anything, but it can help you figure out what pages or posts are missing on your site since it searches for broken links. Once you know what’s been lost, you search for it in the Way Back Machine. Then, you can copy and paste your text content into a new page or post and replace the old links with the new ones.
It’s not a real time machine, but if you’re having troubles restoring your posts or page content after your site breaks, searching the Wayback Machine for your previously archived content can help you get it back. Archiving your site can act as a backup to your backups in case disaster strikes and your site isn’t able to be fully restored.
Obviously, archiving your site with the Wayback Machine is not a solid solution for backing up your websites. If you’re looking for a more reliable solution, check out our managed backups and all-new storage plans that back backing up your sites not only a no-brainer but simple and affordable.
WIN a Share of $5K
Subscribe to our blog this #hostingmonth for a chance to win one of 5 prizes of $1,000 WPMU Dev credit! Learn More.