Author Topic: Printing multiple web pages ... aka save me from this boredom!  (Read 1774 times)

ABlipInContinuity

Printing multiple web pages ... aka save me from this boredom!
« on: 31 December, 2008, 01:52:30 pm »
I have a lit of about 2000 URLs. They are all similar, but the URL contains a parameter with a numeric identifier passed in.

This is perfect for scripting.

I thought a macro would just about do it. Any pointers?

Re: Printing multiple web pages ... aka save me from this boredom!
« Reply #1 on: 02 January, 2009, 12:29:53 am »
I can't think of an elegant solution, but here's a slightly clumsy one:

Write a script that
- converts each HTML file to PDF (e.g. using htmltopdf)
- merges the PDF files into one file using pdftk
- deletes the intermediate PDFs

Then simply open your (2000 page) PDF document, and hit print.

Re: Printing multiple web pages ... aka save me from this boredom!
« Reply #2 on: 02 January, 2009, 07:06:49 pm »
For simple text-only web pages you can use "lynx -dump" to format the html, but for most web pages you need something more complicated. How about the firefox command line print add-on torisugari - Command Line Print. With that you can use
Code: [Select]
firefox -print www.example.com/page123.htmlto print to a PDF or direct to a printer.

Woofage

  • Tofu-eating Wokerati
  • Ain't no hooves on my bike.
Re: Printing multiple web pages ... aka save me from this boredom!
« Reply #3 on: 02 January, 2009, 07:14:08 pm »
do you just need the text?
if so, wget the html files and concatenate.
Pen Pusher