Processed World PDF Torture Test, Part Three: iPad And HP TouchPad

And if you want to play along at home:

Processed World (Nov-82) [choose PDF (5.9 M) or right-click Save As here]

This is a follow-up to: The PDF Torture Test The iPad 2 And iPhone 4s Failed and iPad PDF Torture Test: GoodReader Vs. Processed World.



And we have a bonus too, the PDF being tried on an HP TouchPad!

So this post is also now a follow-up to:
Fondle: HP TouchPad
HP TouchPad Vs. iPad 2 In PDF Smackdown Test

This video was shot by HP TouchPad maven Jonathan Ezor, aka @webOSquire:

That’s very exciting. It shows only minimal delays in displaying the PDF, which makes it acceptable for reading on the HP TouchPad!

It’s clear that for best results, some PDFs need to be opened up and tweaked to make them faster for a tablet. This is not that much work and is worth it to be able to read things that are no longer in print and highly likely to remain that way!

Of course, as tablets continue to become more powerful, and PDF software continues to evolve, this kind of conversion will no longer be necessary.

If you want to play along at home with this new version:

Google Docs: PWPDF3bHigh.pdf – approx. 14 MBs

Some background on how this was all done.

I searched and wound up downloading a free program called PDF-XChange Viewer.

I Exported all the pages:

Let’s pause to look at the size of each page’s image scan:

They’re big!

And I saved them as JPEG (they’re originally in JPEG2000):

Note that they’re 300 dpi. I didn’t change the size or DPI, just the file format.

And told the software to give each one a unique page number in the filename:

And off it went:

I have a crap slow PC, so this all took about forty minutes. On modern machines, I’m sure it’d take a few minutes.

Thumbnail view of exported page scans:

And a view by file size list:

Here’s one of them opened in another free program I already had, IrfanView:

Given the dimensions it’s reporting in the lower left corner, it seems exporting to JPEG made them even larger. However, I’m no expert at images and using a grab-bag of programs brings a lot of uncertainties into the process. Anyway, look at how sharp that image is even when zoomed-in so much!

Unfortunately, this left me with a set of page scans that was over a whopping fifty megabytes! And a PDF created from them was about the same size.

I tried doing an Export, cutting the DPI down to 100:

But something went awry in the process. It got hung up on one page and then every page after that one turned out blank!

Instead of going through that again, I decided to cheat by just using a dedicated program to bulk resize (and rename) the already-exported JPEGs. I used FastStone Photo Resizer. And what I did was specify a width of 768 pixels, the same width as an iPad screen.
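For the curious, the arithmetic behind a fixed-width resize can be sketched in a few lines of Python. This is only a minimal sketch of the idea, not what FastStone Photo Resizer does internally. The function name `scaled_size` and the 2550 x 3300 source page (8.5 x 11 inches at 300 dpi) are assumptions for illustration; the real page dimensions may differ.

```python
def scaled_size(width, height, target_width=768):
    """Return (target_width, new_height), keeping the aspect ratio.

    768 is the iPad screen width used in this post.
    """
    if width <= 0 or height <= 0 or target_width <= 0:
        raise ValueError("dimensions must be positive")
    # Scale the height by the same factor applied to the width.
    new_height = round(height * target_width / width)
    return target_width, max(1, new_height)


# Hypothetical source page: 2550 x 3300 pixels (an assumption, see above).
print(scaled_size(2550, 3300))  # (768, 994)
```

The actual resampling of the pixels would still need an image tool or library; this only shows how the output dimensions fall out of the chosen width.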

And here are the renamed page scans listed by file size:

Zooming in a lot shows the reduced resolution:

But that doesn’t really matter. What does is being able to read them conventionally without page rendering taking up to twelve seconds!
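A rough way to see why smaller page images help is to count the bytes a viewer has to hold once a page is decoded. The sketch below assumes 24-bit RGB (3 bytes per pixel) and ignores everything else a reader app does, including the cost of JPEG2000 decoding itself. The function name `decoded_bytes` is made up for illustration. The 5616 x 3744 figure is the size a commenter below reports for the original scans, and 768 x 1024 is simply an image the size of the iPad screen, not a measured page.

```python
def decoded_bytes(width, height, bytes_per_pixel=3):
    """Size of an uncompressed image in bytes (assumes 24-bit RGB by default)."""
    return width * height * bytes_per_pixel


MIB = 2 ** 20  # bytes per mebibyte

# Original scan size as reported in the comments on this post.
print(round(decoded_bytes(5616, 3744) / MIB, 2))  # 60.16

# An image exactly the size of the iPad screen (assumed for comparison).
print(round(decoded_bytes(768, 1024) / MIB, 2))  # 2.25
```

The first number lines up with the roughly 60 MB per page mentioned in the comments.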

Then I used a demo version of Nitro PDF to create the PDF file (which I also used for the bloated fifty-meg PDF).

It went from about 6MBs to over 14MBs, but still: Mission accomplished. Now it can be read without all those irritating rendering delays.

For those interested, here’s a peek at the metadata of the original PDF from the Internet Archive:

Thanks again to Laura Fullton and to Jonathan Ezor for the testing and videos!


4 responses to “Processed World PDF Torture Test, Part Three: iPad And HP TouchPad”

  1. MacArthur

    I’m not sure if I understand this pdf test. I downloaded that Processed World pdf and tried it in Google Play Books on both MediaTek and Intel hardware. Having seen your struggles with this pdf I thought it could be a good test for all the new devices I have with me at the moment. But, cheapest possible Chinese Intel Z3735F tab, the Teclast X80HD has no problem with it at all. Not after the first loading in Google Books. It’s actually very snappy with only short page loading delay. This is a 35k AnTuTu score tab. Also Adobe Acrobat Reader has no problems with it, but a little bit more delay than Google Books. I made a quick video of it and posted to YouTube:
    Also the MediaTek device I tried had no problems with it after initial loading.
    What am I missing?
    On the other hand I see that some very popular reader apps like the Aldiko Book Reader can’t open this pdf at all on any of my devices.

    • Which version did you use? The original has JPEG2000 images that would kill your tablets. I think you grabbed the version I did where I converted the JPEG2000 images to the easier JPEG. What is the file size?

      Looking at your YT video, it has the filename of the one with the JPEG2000 images. I have to admit I’ve never tried the Google Play Books app on it! Thanks for the video.

      • MacArthur

        Aah, that explains it. I downloaded the 6 MB pdf file because I thought it was really a test with a pdf reader reading a pdf you did. The 100 MB original file named processedworld06proc_orig_jp2 contains 76 files (pages) in the graphical format jpg2000. As that is not pdf format it can’t be opened in the Google Books app. Not without joining and converting to pdf. I misunderstood you. When you talk about jpg2000 I thought you meant that the 6 MB pdf was just that. Same resolution (72 dpi) but compressed to pdf. The original jpg2000 files are set to 72 dpi when opened in Photoshop, but on the other hand they are 198 x 132 cm big = 5616 x 3744 pixels. I just saw the 72 dpi setting, not the total amount of pixels. In pixel dimensions that equals 60 MB per page decompressed. A bit heavy for any app to open instantly.

      • Here is the Internet Archive page for the PDF I mean:

        Their PDF is just ~6MBs and has JPEG2000 images.

