Sometimes you do not want to generate a full HTML page when you convert from PDF to HTML5. For example, if the content is being fed into a content management system (CMS) which then adds its own page layout, you want just the content, with no HTML, HEAD or BODY tags. So we have added a flag in the latest release to allow you to do this. Here is the new code in our ExtractPagesAsHTML example. The line is commented out by default, so the full HTML structure is still generated unless you enable it.
/**
 * exclude HTML and body tags
 * (you will need to call draw method yourself)
 */
//HTMLoutput.setBooleanValue(HTMLDisplay.ExcludeMetadata, true);
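To illustrate why a body-less fragment is useful, here is a minimal sketch of what the receiving side might do with it. Everything in it is an assumption made for illustration: the class FragmentWrapper, the method wrapInLayout and the sample fragment are hypothetical and are not part of our library or of any particular CMS.

```java
public class FragmentWrapper {

    /**
     * Hypothetical CMS-side helper (not part of any real API):
     * wraps a body-less HTML fragment in a simple page layout.
     * The title is inserted as-is, so it must already be safe HTML text.
     */
    static String wrapInLayout(String title, String fragment) {
        return "<!DOCTYPE html>\n"
                + "<html>\n"
                + "<head><title>" + title + "</title></head>\n"
                + "<body>\n"
                + fragment + "\n"
                + "</body>\n"
                + "</html>\n";
    }

    public static void main(String[] args) {
        // Stand-in for converter output written with the HTML, HEAD and BODY tags excluded
        String fragment = "<div id=\"page1\">Converted page content</div>";

        // The CMS supplies the page structure, so the fragment must not contain its own
        System.out.println(wrapInLayout("My CMS page", fragment));
    }
}
```

If the fragment still carried its own HTML and BODY tags, the wrapped page would contain nested document structure, which is the situation the flag is intended to avoid.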