Portable Document Format (PDF) is a file format used to present documents in a manner independent of application software, hardware, and operating systems.

PDF file

Understanding the PDF file Format – ActualText

Text is defined in the PDF file format as a display value (normally what you see onscreen) and an extraction value. It is useful...
Mark Stephens
40 sec read

PDF to HTML conversion – matching PDF page size

PDF files are designed to be resolution independent – they are defined using resolution-independent units so that the page will always appear...
Mark Stephens
57 sec read

There are several versions of each image inside your…

When you look at a PDF file you see images displayed. In fact there are ‘several’ versions of each image… Firstly there is the...
Mark Stephens
1 min read

What happens if the CropBox is smaller than the…

Many of the issues you see in PDF files come from the interaction of different parts. Here is a really good example I came...
Mark Stephens
51 sec read

Convert PDF to HTML – a question of scale

When we convert a PDF to HTML we take the PDF page size. So for a standard PDF file, this might well be around...
Mark Stephens
48 sec read