PDF to HTML5 conversion

We have been working on a secret project since early 2010. We already have a great Java library for printing, viewing and rasterizing PDF files. How difficult would it be to create a new version which can convert PDF to HTML?

The advent of mobile devices and the maturity of the HTML specification have made this much more feasible and advantageous. HTML5 offers a canvas, JavaScript and a decent feature set to handle PDF content. PDF is a great file format for display, but the viewers are not great on mobile platforms.

Java does not run on Apple’s iPad/iPhone, and Android does not support several critical Java features. So porting our Java PDF viewer to either platform would be a huge amount of work. It is also not clear which platform will win and whether there is a commercial market.

PDF to HTML conversion makes a great deal of sense for our customers who face the same issue – they can make content available on multiple platforms easily and do not have to pick which they think will win. So we have taken our Java parser and built a new product to generate HTML.

The full details and access to the software can be found here.

We are publishing lots of blog articles on things we have learnt, the specifics of conversion, and even some of the legal issues involved. If any topic is of particular interest, please let us know. What would you like to know?

Click here to see all the articles in the PDF to HTML5 conversion series.

This post is part of our “HTML5 Article index”. In these articles, we aim to help you understand the world of HTML5.

 


Mark Stephens

System Architect and Lead Developer at IDRSolutions
Mark Stephens has been working with Java and PDF since 1999 and has diversified into HTML5, SVG and JavaFX. He also enjoys speaking at conferences and has been a speaker at user groups, Business of Software, Seybold and JavaOne conferences. He has a very dry sense of humor and an MA in Medieval History for which he has not yet found a practical use.


2 thoughts on “PDF to HTML5 conversion”

  1. I publish books online in PDF and I would be very interested in testing the beta version of your PDF to HTML5 conversion software.
