Jump to content

Need a way downloading file from Wayback Machine with preserving original timestamp


Recommended Posts

I bet there's some easy way which I'm missing. From what I've checked so far...

-Firefox overwrites the date modified property of the file when downloading with the current one.
-There was some add-on to preserve that, but, obviously, it's XUL and does not work with modern FF.
-I tried IE for that... no luck, same effect.
-I successfully used Powershell's Start-BitsTransfer for that, but for some reason it refuses to download this way from Wayback Machine.
-I tried wget for windows, but it returned access denied.
-And obviously, DownloadThemAll! for WebExtensions will never happen.
-And I do not use Chrome, just to be as Google-less as possible.

Any quick hints? BW, I have Basilisk installed next to Firefox, so maybe that's the key...

 

Link to comment
Share on other sites

  • 1 month later...

Hmmm.

If you could post a link to the file, that would surely help.

Free Download Manager used to have such an option, but I believe it is not there anymore in 5.x version, you might need an old version, try using the "lite" 3.9.7:

https://www.freedownloadmanager.org/download.htm

.Of course it may also depend on server side.

jaclaz

Link to comment
Share on other sites

Well, I tested Free Download Manager, and despite I checked the "preserve timestamp form the server" tickbox, it didn't work, and the date is recent :/ I cross-checked it with other, non-wayback machine file, and with those ones it worked well.

Might it be that Internet Archive is changing those timestamps?

oh, and bunch of links for testing:

https://web.archive.org/web/20190726124453/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-nld_bb32e4ab293d32208635cc0bb9dffdb3b7a8560a.exe
https://web.archive.org/web/20190726124514/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ptg_c4580d1a146f572e7857018e4298a9990cdb9740.exe
https://web.archive.org/web/20190726124532/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe
https://web.archive.org/web/20190726124547/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-jpn_4675cf4de607daa7e33473efb6e0fc8ccf95cffe.exe
https://web.archive.org/web/20190726124605/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-chs_34cde03ee16057673b89196585ab13f155742b63.exe
https://web.archive.org/web/20190726124620/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-fra_09e7dd707ad91d01deb9daf9eeeda0285035f57f.exe
https://web.archive.org/web/20190726124633/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-esn_2292207334a7f27e02edbe334b8411b54fe460dd.exe
https://web.archive.org/web/20190726124648/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-enu_33be765b1a232875d327a79834a1c24e46b54745.exe
https://web.archive.org/web/20190726124705/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-rus_64afafbce906d91f6590dff3d86bedb8627e1f55.exe
https://web.archive.org/web/20190726124720/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-trk_472c95d7e28cef7a0704724c0deda9ba29e1ff0a.exe
https://web.archive.org/web/20190726124733/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-cht_93673cea321e3995260b1867881141638ac197d4.exe
https://web.archive.org/web/20190726124748/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-sve_45eb4801a83f5ac3727c71f86a08bfa4e4ec2715.exe
https://web.archive.org/web/20190726124803/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-deu_c7f71b42fe0153e88f433c44fb82021eb7fbd2d3.exe
https://web.archive.org/web/20190726124826/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-kor_e1b4361e12a92992d3886309ac3648831e5b6888.exe
https://web.archive.org/web/20190726124841/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-hun_8586de439f104796026a4a80b0d02f268f60c239.exe
https://web.archive.org/web/20190726124859/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ptb_cc9f4110cd6a5e96013c0d1f443a32346dcb8bba.exe
https://web.archive.org/web/20190726124915/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-csy_1f25a4497caa29e82b7f93d90a8ba17b6a0f233f.exe
https://web.archive.org/web/20190726124930/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-plk_32b54437cc6b0de87f4dc4533ac3b9f9401d1c31.exe

 

Link to comment
Share on other sites

Maybe it is the way the Wayback Machine sends the original file/date/time.

I tried the:

https://web.archive.org/web/20190726124532/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe

Actually  I tried the http link :

http://web.archive.org/web/20190726124532/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe

With good/ol' Net Vampire 3.3:

https://www.fileplanet.com/archive/p-43170/Net-Vampire-3-3

and it couldn't fetch the original date :( BUT looking at its log ;):

Quote

I 14/08/2019 15.52.32   ------------------------- Attempt 1 ------------------------- 
P 14/08/2019 15.52.32  Connecting to web.archive.org ...
P 14/08/2019 15.52.32  Connected to web.archive.org [207.241.226.190]
S 14/08/2019 15.52.32  GET /web/20190726124532/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe HTTP/1.1
S 14/08/2019 15.52.32  Connection: close
S 14/08/2019 15.52.32  Host: web.archive.org
S 14/08/2019 15.52.32  Accept: */*
S 14/08/2019 15.52.32  Pragma: no-cache
S 14/08/2019 15.52.32  Cache-Control: no-cache
S 14/08/2019 15.52.32  Referer: http://web.archive.org/web/20190726124532/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/
S 14/08/2019 15.52.32  User-Agent: Mozilla/4.04 [en] (Win95; I ;Nav)
S 14/08/2019 15.52.32  
R 14/08/2019 15.52.44  HTTP/1.1 200 OK
R 14/08/2019 15.52.44  Server: nginx/1.15.8
R 14/08/2019 15.52.44  Date: Wed, 14 Aug 2019 13:52:57 GMT
R 14/08/2019 15.52.44  Content-Type: application/octet-stream
R 14/08/2019 15.52.44  Transfer-Encoding: chunked
R 14/08/2019 15.52.44  Connection: close
R 14/08/2019 15.52.44  X-Archive-Orig-Cache-Control: public,max-age=172800
R 14/08/2019 15.52.44  X-Archive-Orig-Content-Length: 3469528
R 14/08/2019 15.52.44  X-Archive-Orig-Last-Modified: Tue, 02 Sep 2014 22:33:03 GMT
R 14/08/2019 15.52.44  X-Archive-Orig-Accept-Ranges: bytes
R 14/08/2019 15.52.44  X-Archive-Orig-ETag: "807994dbfdc6cf1:0"
R 14/08/2019 15.52.44  X-Archive-Orig-X-Cache: TCP_MISS
R 14/08/2019 15.52.44  X-Archive-Orig-Server: Microsoft-IIS/10.0
R 14/08/2019 15.52.44  X-Archive-Orig-X-Powered-By: ASP.NET
R 14/08/2019 15.52.44  X-Archive-Orig-X-CID: 7
R 14/08/2019 15.52.44  X-Archive-Orig-X-CCC: US
R 14/08/2019 15.52.44  X-Archive-Orig-X-MSEdge-Ref: Ref A: F92FC3436A024076A0AB5678875C166C Ref B: PAOEDGE0308 Ref C: 2019-07-26T12:45:32Z
R 14/08/2019 15.52.44  X-Archive-Orig-Date: Fri, 26 Jul 2019 12:45:32 GMT
R 14/08/2019 15.52.44  X-Archive-Orig-Connection: close
R 14/08/2019 15.52.44  Cache-Control: max-age=1800
R 14/08/2019 15.52.44  X-Archive-Guessed-Content-Type: application/x-dosexec
R 14/08/2019 15.52.44  Memento-Datetime: Fri, 26 Jul 2019 12:45:32 GMT
R 14/08/2019 15.52.44  Link: <http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe>; rel="original", <http://web.archive.org/web/timemap/link/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe>; rel="timemap"; type="application/link-format", <http://web.archive.org/web/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe>; rel="timegate", <http://web.archive.org/web/20190726124532/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe>; rel="first memento"; datetime="Fri, 26 Jul 2019 12:45:32 GMT", <http://web.archive.org/web/20190726124532/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe>; rel="memento"; datetime="Fri, 26 Jul 2019 12:45:32 GMT", <http://web.archive.org/web/20190726124532/http://download.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-kb2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe>; rel="last memento"; datetime="Fri, 26 Jul 2019 12:45:32 GMT"
R 14/08/2019 15.52.44  Content-Security-Policy: default-src 'self' 'unsafe-eval' 'unsafe-inline' data: blob: archive.org web.archive.org analytics.archive.org pragma.archivelab.org

the needed info seems to be actually transferred.

Now all you need is finding a tool that actually parses the info in "X-Archive-Orig-Last-Modified", but now we have a good search term, thus:

https://gitlab.com/gnuwget/wget2/issues/205

So, the gnu wget seemingly can parse it, why woudn't a Windows wget? :dubbio:

 

As a matter of fact I just tried this:

https://eternallybored.org/misc/wget/

Version 1.9.4 as I am on XP:

https://eternallybored.org/misc/wget/releases/old/wget-1.19.4-win32.zip

And I got this:

Quote

C:\appoggio\wget>wget https://web.archive.org/web/20190726124532/http://download
.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver2003-k
b2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe
--2019-08-14 16:54:02-- https://web.archive.org/web/20190726124532/http://downl
oad.windowsupdate.com/c/msdownload/update/software/secu/2014/09/windowsserver200
3-kb2972207-x86-ita_d5cbff29878fc957eae4b256f167918417a88e2d.exe
Resolving web.archive.org (web.archive.org)... 207.241.226.190
Connecting to web.archive.org (web.archive.org)|207.241.226.190|:443... connecte
d.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [application/octet-stream]
Saving to: 'windowsserver2003-kb2972207-x86-ita_d5cbff29878fc957eae4b256f1679184
17a88e2d.exe'

windowsserver2003-k     [              <=>   ]   3,31M   305KB/s    in 11s

2019-08-14 16:54:17 (299 KB/s) - 'windowsserver2003-kb2972207-x86-ita_d5cbff2987
8fc957eae4b256f167918417a88e2d.exe' saved [3469528]

C:\appoggio\wget>dir
 Il volume nell'unità C è Disco locale
 Numero di serie del volume: C08C-CFD9

 Directory di C:\appoggio\wget

14/08/2019  16.54    <DIR>          .
14/08/2019  16.54    <DIR>          ..
14/08/2019  16.51            13.113 help.txt
23/01/2018  20.34         3.895.184 wget.exe
03/09/2014  00.33         3.469.528 windowsserver2003-kb2972207-x86-ita_d5cbff29
878fc957eae4b256f167918417a88e2d.exe
               3 File      7.377.825 byte
               2 Directory  129.531.092.992 byte disponibili

Hep!

0.jpg

 

Now, the attentive reader will have noticed that the timestamp is not actually correct, the Wayback Machine sent:

Tue, 02 Sep 2014 22:33:03 GMT

Whilst the file timestamp I got is:

03/09/2014  00.33

But this is inherent to the way filesystem timestamp change depending on local time settings, the actual NTFS timestamp is UTC, i.e. roughly GMT, but it appears according to my local settings.

jaclaz

 

Edited by jaclaz
Link to comment
Share on other sites

As I initially wrote, I tried wget initially, and it was throwing out error... and sigh, I'm quite ashamed what it was. it turns out that "access denied" I initially received cam from the fact that I ran wget from user account, and wget was located in c: program files (x86) ... i did it on windows 7, GNU Wget 1.11.4.
So yeah, actually I downloaded some file today, but still with wrong timestamp... as I'm not fresh enough to play with parsing content headers myself, I juts think I gotta try 1.9.4 ran from XP...

Link to comment
Share on other sites

Get the wget from the page I posted (and NOT from any other place):

https://eternallybored.org/misc/wget/

The XP version I used is 1.19.4, (I posted with a typo :blushing: before, it is not 1.9.4)

Current Windows Vista+ is 1.20.4.

If you actually ran 1.11.4, it is WAY OLDER and very likely has not the feature that was added to it in 2015.

jaclaz

Edited by jaclaz
Link to comment
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...