BetaArchive Logo
Navigation Home Screenshots Image Uploader Server Info FTP Servers Wiki Forum RSS Feed Rules Please Donate
UP: 41d, 3h, 26m | CPU: 13% | MEM: 5462MB of 12279MB used
{The community for beta collectors}

Post new topic Reply to topic  [ 14 posts ] 
Author Message
 PostPost subject: Download a website off the wayback machine?        Posted: Thu Nov 26, 2009 8:48 am 
FTP Access
User avatar
Offline

Joined
Sun Aug 16, 2009 2:47 am

Posts
563

Location
Illinois

Favourite OS
Android 4.1 Jelly Bean
i'm trying to download some archived websites from 1998 to transfer to my Windows '98 machine (I.E This one), but i can't figure out how to get Visual Wget to download all the subpages/subdirectories. how do i make it do that?

_________________
I REALLY SHOULD POST HERE MORE OFTEN!
---
My Av: Unused icon from the Mac 128k's rom


Top  Profile
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Thu Nov 26, 2009 10:17 am 
Donator
User avatar
Offline

Joined
Sun Jan 11, 2009 3:29 am

Posts
2314

Favourite OS
Maemo 5 PR1.3
Recusive download? (on *nix systems)

_________________
Program run condition: collect keys. Deadline: 2 days.


Top  Profile
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Thu Nov 26, 2009 11:27 am 
Administrator
User avatar
Offline

Joined
Fri Aug 18, 2006 11:47 am

Posts
11977

Location
Merseyside, United Kingdom

Favourite OS
Microsoft Windows 7 Ultimate x64
I've tried downloading from the way back machine before, it doesn't allow you to do it because of their directory structure. I don't know how but it just didn't seem to work.

_________________
Image


Top  Profile  WWW
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Thu Nov 26, 2009 1:35 pm 
Donator
User avatar
Offline

Joined
Sun Jan 11, 2009 3:29 am

Posts
2314

Favourite OS
Maemo 5 PR1.3
Recursive download on wget and curl (tested on Ubuntu and SUA) does not work on Internet Archive.
Weird dir structure, I guess (404, even if that /is/ the URL of the file.)

_________________
Program run condition: collect keys. Deadline: 2 days.


Top  Profile
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Thu Nov 26, 2009 2:16 pm 
Donator
Offline

Joined
Sat Aug 22, 2009 4:28 pm

Posts
4403
Offline Explorer.

But it's *cough* not free *cough* Image

_________________
Longhorn Packet 1.21 - Solves most of the problems with Longhorn Setup
[GUIDE] How to dump clean/untouched images from CD discs
Longhorn Music Album (FLAC) | 523.31 MB | 17 tracks | Donators Discussion Forum


Top  Profile
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Thu Nov 26, 2009 5:59 pm 
Donator
User avatar
Offline

Joined
Fri Oct 02, 2009 9:41 pm

Posts
1350

Favourite OS
Windows 7\Slackware
I've had some problems with offline explorer, it isn't worth paying for IMO, but might be worth a shot.

_________________
Previously known as effy11


Top  Profile  WWW
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Thu Nov 26, 2009 6:20 pm 
FTP Access
User avatar
Offline

Joined
Sun Aug 16, 2009 2:47 am

Posts
563

Location
Illinois

Favourite OS
Android 4.1 Jelly Bean
i suppose i could get all the pages manually with Visual WGet, but that would take forever.

_________________
I REALLY SHOULD POST HERE MORE OFTEN!
---
My Av: Unused icon from the Mac 128k's rom


Top  Profile
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Wed Dec 02, 2009 11:41 pm 
Donator
Offline

Joined
Sun Aug 24, 2008 6:36 pm

Posts
84

Location
UK

Favourite OS
Mountain Lion
Take a look at http://www.httrack.com/

All you do is put in the web address you want to download, enter a bit more info (such as file types to download) and it does it :)


Top  Profile  WWW
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Thu Dec 03, 2009 12:34 am 
Administrator
User avatar
Offline

Joined
Fri Aug 18, 2006 11:47 am

Posts
11977

Location
Merseyside, United Kingdom

Favourite OS
Microsoft Windows 7 Ultimate x64
Craig wrote:
Take a look at http://www.httrack.com/

All you do is put in the web address you want to download, enter a bit more info (such as file types to download) and it does it :)


Same problem Craig, it doesn't work. Tried it myself a few times only to find it fails.

_________________
Image


Top  Profile  WWW
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Thu Dec 03, 2009 1:03 am 
Donator
User avatar
Offline

Joined
Sun Sep 27, 2009 7:55 pm

Posts
1209
THAT program, ugg, It never worked for me. Wouldn't copy worth a damn. If I really need to copy a website, I use a program called Blue Crab, (sorry MacOS only.)

for OSX users:

http://www.apple.com/downloads/macosx/i ... ecrab.html

Man, there was this one program for windows, if only I could remember the name of it, It would be prob. (O) am before I could find it.

_________________
Laugh, monkeys. Laugh. / RoL IRC


Top  Profile
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Sat Dec 05, 2009 1:57 am 
Sorry, but no that's not possible. It would be encrypted. I wish I could though. Could you imagine downloading the entire internet and storing it on your computer? :-D


Top
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Sat Dec 05, 2009 3:09 am 
Donator
User avatar
Offline

Joined
Sun Sep 27, 2009 7:55 pm

Posts
1209
What would be encrypted?

_________________
Laugh, monkeys. Laugh. / RoL IRC


Top  Profile
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Sat Dec 05, 2009 3:44 am 
Donator
Offline

Joined
Wed Mar 21, 2007 2:42 am

Posts
762

Location
Guelph, ON, Canada
its odd rthat it wouldn't let you download stuff from a website through the wayback machine, it wouldn't be the robots.txt that sort of thing blocks all access, but I guess one has to do it the old fashioned way of going through and downloading every page and image...


Top  Profile
 PostPost subject: Re: Download a website off the wayback machine?        Posted: Mon Dec 07, 2009 9:03 am 
This site can be loaded correctly with Offline Explorer. The only suggestion to make the download correct is to use the Project Properties dialog - URL Filters - Directory section and add the server name to the Included keywords list, like:
www.server.com
This will limit the download to the site you want and not follow outside links. All scripts and special codes on the site will be correctly downloaded and processed.


Top
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 14 posts ] 




Who is online

Users browsing this forum: No registered users and 5 guests


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum

Search for:
Jump to:  

All views expressed in these forums are those of the author and do not necessarily represent the views of the BetaArchive site owner.

Powered by phpBB® Forum Software © phpBB Group

Copyright © 2006-2014

 

Sitemap | XML | RSS