Skip navigation

Converting a webarchive to html

6052 Views 8 Replies Latest reply: Nov 22, 2012 1:17 AM by Fabrizio Bartolomuci RSS
Fabrizio Bartolomuci Level 1 Level 1 (0 points)
Currently Being Moderated
Nov 20, 2012 1:24 AM

I managed to collect the behavior of a complex web site into a webarchive. Thereafter I would like to turn that webarchive into an html set of nested directory. Yet, when I did it both with Waf and with a commercial software bought on the the Apple store, what I got was just the nested directory with the html page at the bottom and no images, nor css nor working links. If you are interested the webarchive document is at:

http://www.miafoto.it/it/GiroMilano.webarchive

while the weak product of the extraction is at:

http://www.miafoto.it/it/Giromilano/Pagine/default.aspx

and the empty directories above.

In addition to the different look, the webarchive displays the same behavior as the official web site - when a listbox vales is selected and then the button pushed - while the extracted version just produces a page with no contents, by reloading itself rather than the official page. As you may see the webarchive is over 1MB while the product of the extraction just little over 1 KB.

What is wrong with it and how may I perform such an apparently trivial business with usable results?

Thanks, Fabrizio

MacBook Pro, Mac OS X (10.7.1)
  • andyBall_uk Level 6 Level 6 (17,595 points)
    Currently Being Moderated
    Nov 20, 2012 8:03 AM (in response to Fabrizio Bartolomuci)

    http://sourceforge.net/projects/webarchivext/files/ seemingly works better than waf or the later https://github.com/robrohan/WebArchiveExtractor/downloads

    on the example file you gave, although you lose the perfect map image, although scrolling in that doesn't work offline even in the webarchive format

     

    if a current site, Camino's save-as webpage complete is easier

  • andyBall_uk Level 6 Level 6 (17,595 points)
    Currently Being Moderated
    Nov 20, 2012 9:13 AM (in response to Fabrizio Bartolomuci)

    I think I follow what you want, & no, although the application produces most all of the referenced files, it doesn't create a page that will call up a new    ...atm.it/it/Giromilanoone one like the webarchive does.

  • andyBall_uk Level 6 Level 6 (17,595 points)
    Currently Being Moderated
    Nov 21, 2012 6:20 AM (in response to Fabrizio Bartolomuci)

    no magic, but as you've seen, the converter/folderizer apps aren't consistent.

     

    you can surely edit the html using any raw editor, or even a plist editor.

     

    since you rely on the site still existing/being accesssible - can you use an iframe to display the actual site, instead ?

  • andyBall_uk Level 6 Level 6 (17,595 points)
    Currently Being Moderated
    Nov 21, 2012 11:24 AM (in response to Fabrizio Bartolomuci)

    http://www.suavetech.com/0xed/0xed.html is quite nice on a mac, & free; textwrangler should be more useful, certainly easier to read.

     

    http://www.nightproductions.net/prefsetter.html will open .webarchives & show how they're arranged, just have to add.plist to the name for it to recognise them. I expect that other plist editors will do the same

     

    if your saved webarchive 'remembers' date & other settings ( seems to?), is it possible to save one for each required value?

Actions

More Like This

  • Retrieving data ...

Bookmarked By (0)

Legend

  • This solved my question - 10 points
  • This helped me - 5 points
This site contains user submitted content, comments and opinions and is for informational purposes only. Apple disclaims any and all liability for the acts, omissions and conduct of any third parties in connection with or related to your use of the site. All postings and use of the content on this site are subject to the Apple Support Communities Terms of Use.