Portable Document Format (PDF) is a file format used to present documents in a manner independent of application software, hardware, and operating systems.

PDF file

Where do your PDF objects start in a PDF…

In theory, this is a really easy question to answer for non-compressed PDF files. There is a reference table to all the PDF objects...
Mark Stephens
1 min read

No Startxref found in last 1024 bytes opening file…

A PDF file contains a couple of key pointers. One of them is a pointer to the start of the table which describes the...
Mark Stephens
44 sec read
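
As a rough illustration of the pointer this post describes, here is a minimal Python sketch of how a reader might look for the `startxref` keyword near the end of a file. The function name `find_startxref`, the `window` parameter, and the tiny hand-built sample are assumptions made for this example; this is not code from the article or from any particular PDF library, and it ignores compressed cross-reference streams and damaged files.

```python
import re


def find_startxref(data: bytes, window: int = 1024):
    """Return the offset written after the last 'startxref' keyword
    in the final `window` bytes of `data`, or None if it is absent."""
    tail = data[-window:]
    pos = tail.rfind(b"startxref")
    if pos == -1:
        # This is the situation behind a "No startxref found" style error.
        return None
    match = re.match(rb"startxref\s+(\d+)", tail[pos:])
    return int(match.group(1)) if match else None


# Hypothetical miniature file: a body, then a reference table, then the pointer.
body = b"%PDF-1.4\n1 0 obj\n<< >>\nendobj\n"
xref_offset = len(body)  # the table starts right after the body
sample = (
    body
    + b"xref\n0 1\n0000000000 65535 f \n"
    + b"trailer\n<< /Size 1 >>\n"
    + b"startxref\n" + str(xref_offset).encode("ascii") + b"\n%%EOF\n"
)

offset = find_startxref(sample)
print(offset, sample[offset:offset + 4])
```

If trailing junk pushes the keyword further than `window` bytes from the end of the file, the function returns `None`, which is one plausible cause of the error named in the title.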

Multiple trailers in a PDF file

A PDF file consists of a ‘dump’ of PDF objects and a reference table defining where they are located in the PDF file and...
Mark Stephens
1 min read

Customer question – Extracting overlapping clipped images from a…

We had an interesting inquiry about extracting clipped images from a PDF file this week. We already have an example to do this, but...
Mark Stephens
3 min read

How are Embedded CMAP tables defined in a PDF…

Every glyph inside a PDF file can have a display value and a different extraction value. This is useful because often you need to...
Mark Stephens
2 min read