All Stories

What does the ActualText dictionary tag do?

Text is defined in the PDF file format as a display value (normally what you see onscreen) and an extraction value. It is useful...
Mark Stephens
29 sec read

PDF to HTML conversion – matching PDF page size

PDF files are designed to be resolution independent – they are defined using resolution-independent units so that the page will always appear...
Mark Stephens
52 sec read

Debugging Java – what is the difference between a…

This blog article was posted in response to a question on our forum about the difference between our debug and non-debug jars. We you...
Mark Stephens
1 min read

Java CMYK to RGB conversion – speed comparison of…

Some optimisations are best left to Java. For example, System.arraycopy is the fastest way to create a new version because it is optimised...
Mark Stephens
1 min read

PDF to HTML conversion – relative positioning of content

One of the most interesting things about developing the PDF to HTML convertor is the number of ideas and enhancements which arise from actual...
Mark Stephens
50 sec read