We have been working on a secret project since early 2010. We already have a great Java library for printing, viewing and rasterizing PDF files. How difficult would it be to create a new version which can convert PDF to HTML?
The advent of Mobile devices and the maturity of the HTML specification has made it much more feasible and advantageous. HTML 5 offers a canvas, Javascript and decent featureset to handle PDF content. PDF is a great file format for display but the viewers are not great on mobile platforms.
Java does not run on Apple’s IPad/IPhone, and Android does not support several critical Java features. So porting our Java PDF viewer to either platform would be a huge amount of work. It is also not clear which platform will win and whether there is a commercial market.
PDF to HTML conversion makes a great deal of sense for our customers who face the same issue – they can make content available on multiple platforms easily and do not have to pick which they think will win. So we have taken our Java parser and built a new product to generate HTML.
The full details and access to the software can be found here.
We are publishing lots of blog articles on things we have learnt, the specifics of conversion, and even some of the legal issues involved. If any topic is of particular interest, please let us know. What would you like to know?
Click here to see all the article in the PDF to HTML5 conversion series.
Latest posts by Mark Stephens (see all)
- Saving your settings in our online PDF to HTML5 and SVG converter - May 20, 2013
- PDF teasers – how would you handle this stack problem? - May 15, 2013
- Where do your PDF objects start in a PDF file? - May 8, 2013
- Version 5 release – Swing and javaFX - April 26, 2013
- Which languages should have examples when documenting a web service? - April 24, 2013

I publish books online in PDF and I would be very interested in testing the beta version of your PDF to HTML5 conversion software.
It is now live and you can download the trial version from http://www.jpedal.org/html_download.php
There will be another new release next week
Regards,
MArk