Tips & Tricks: downloading from the Wayback Machine

Get the original HTML without the Wayback Machine interface. Just add id_ after the timestamp:
web.archive.org/web/20150901185758id_/http://www.example.com/
Note that this does not always work.
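
If you need to build many of these URLs, the pattern above can be sketched in a few lines of Python. This is only an illustration: the helper name raw_snapshot_url is made up for this example, and it simply assumes the timestamp is a string of digits like the one shown above.

```python
def raw_snapshot_url(timestamp: str, original_url: str) -> str:
    """Build a Wayback Machine URL with id_ appended to the timestamp."""
    # Assumption for this sketch: timestamps are plain digit strings,
    # e.g. 20150901185758 (YYYYMMDDhhmmss).
    if not timestamp.isdigit():
        raise ValueError("timestamp should contain only digits")
    return f"https://web.archive.org/web/{timestamp}id_/{original_url}"


print(raw_snapshot_url("20150901185758", "http://www.example.com/"))
```

As noted above, the resulting URL may not work for every capture, so it is worth checking the response before relying on it.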

Download all files from a site, optionally with extra options such as an upper timestamp limit:

Install this tool:

https://github.com/hartator/wayback-machine-downloader

Execute this command:
wayback_machine_downloader http://example.com -d /home/user/Desktop/webs/example_archive --to 20150901185758

Download only certain files:

Download all the zip/rar files from a site, or from a single directory:
wayback_machine_downloader http://elhacker.org -d /home/user/Desktop/webs/example --only "/\.(zip|rar)$/i"
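
To get a feel for what the filter "/\.(zip|rar)$/i" selects, here is a small Python sketch of the same pattern. It only illustrates the regular expression itself; it does not claim to reproduce how the tool applies the filter internally, and the sample URLs are invented.

```python
import re

# Python equivalent of the Ruby-style filter /\.(zip|rar)$/i:
# the URL must end in .zip or .rar, ignoring case.
ARCHIVE_PATTERN = re.compile(r"\.(zip|rar)$", re.IGNORECASE)


def matching_urls(urls):
    """Return only the URLs that end in .zip or .rar."""
    return [u for u in urls if ARCHIVE_PATTERN.search(u)]


# Invented sample URLs, for illustration only.
sample = [
    "http://example.com/files/tool.zip",
    "http://example.com/files/PACK.RAR",
    "http://example.com/index.html",
    "http://example.com/files/tool.zip?download=1",
]

for url in matching_urls(sample):
    print(url)
```

Note that the last sample URL is not selected, because the pattern is anchored to the end of the string and the query string comes after the extension.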

You can find more information in this article:
https://exposureninja.com/blog/extract-urls-archive-org/

Dump the list of archived URLs to a JSON or text file
For JSON format:
http://web.archive.org/cdx/search/cdx?url=example.com*&output=json

For TXT format:
http://web.archive.org/cdx/search/cdx?url=example.com*&output=txt
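
The JSON dump can also be fetched and read from a script. The following is a minimal Python sketch using only the standard library. The function names are made up for this example, and it assumes the JSON output is a list of rows whose first row holds the field names, so check that against the response you actually receive.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

CDX_ENDPOINT = "http://web.archive.org/cdx/search/cdx"


def cdx_query_url(site: str, output: str = "json") -> str:
    """Build the CDX query URL for every capture under the given site."""
    # safe="*" keeps the trailing wildcard unescaped, as in the URLs above.
    query = urlencode({"url": site + "*", "output": output}, safe="*")
    return CDX_ENDPOINT + "?" + query


def rows_to_dicts(rows):
    """Turn [header, row, row, ...] into a list of dicts keyed by the header."""
    if not rows:
        return []
    header, *captures = rows
    return [dict(zip(header, row)) for row in captures]


def fetch_captures(site: str):
    """Download and parse the JSON dump (performs a network request)."""
    with urlopen(cdx_query_url(site)) as response:
        return rows_to_dicts(json.load(response))


print(cdx_query_url("example.com"))
```

Calling fetch_captures("example.com") would then return one dict per capture. For large sites the response can be big, so you may prefer to save it to a file first.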
