Mark Stephens Mark has been working with Java and PDF since 1999 and is a big NetBeans fan. He enjoys speaking at conferences. He has an MA in Medieval History and a passion for reading.

PDF to HTML5 conversion – duplicate text in PDF files for bold effects

24 sec read

A popular trick in PDF files is to print some text twice (with the second character moved slightly) to create a bold effect.

pdf text

You cannot do this in HTML5 so all you get is double text overlapping. How ugly!

html text

So we add some ‘intelligence’ into the conversion to ignore these characters (it needs to be smart enough to work correctly when we get genuine double characters like following or moon so we look at the position and gap between the letters).

This gives a much better representation of the text 🙂

html text

The PDF file format uses lots of tricks which work very well for PDF but need care in being translated in HTML5.

 

IDRsolutions develop a Java PDF Viewer and SDK, an Adobe forms to HTML5 forms converter, a PDF to HTML5 converter and a Java ImageIO replacement. On the blog our team post anything interesting they learn about.

Mark Stephens Mark has been working with Java and PDF since 1999 and is a big NetBeans fan. He enjoys speaking at conferences. He has an MA in Medieval History and a passion for reading.

Converting your PDF files to HTML5 with BuildVu 

Recently we announced our updated product range for 2018 and are rebranding some existing products, like JPDF2HTML5 which has been renamed to BuildVu. It...
Georgia Ingham
2 min read

Favourite resources from our HTML development team

As the web progresses and grows, so do the technologies that come along with it. Trying to keep on top of everything you need...
Ovidijus Okinskas
1 min read

Leave a Reply

Your email address will not be published. Required fields are marked *

IDRsolutions Ltd 2019. All rights reserved.