Sometimes you do not want to generate a full HTML page when you convert from PDF to HTML5. For example, if the content is being fed into a Content Management system which then adds its own page layout, you would want just the content (no HTML, HEAD or BODY tags). So we have added a flag in the latest release to allow you to do this. Here is the new code in our ExtractPagesAsHTML example (the code is commented out by default which will include the HTML structure).
/** * exclude HTML and body tags * (you will need to call draw method yourself) **///HTMLoutput.setBooleanValue(HTMLDisplay.ExcludeMetadata, true);