Mark Stephens

  https://www.idrsolutions.com Mark founded the company and has worked with Java and PDF since 1997. The original creator of the core code, he is also a NetBeans enthusiast who enjoys speaking at conferences and reading. He holds an Athletics Blue and an MA in Mediaeval History from St. Andrews University.

   



193 Stories by Mark Stephens

Why convert PDF magazines to HTML5? – Part 1. Gain control of your content

Over the last few months, we have been working with several magazines on our PDF to HTML5 converter (stay tuned for lots of interesting...
1 min read

No Startxref found in last 1024 bytes opening file – what does this error message mean with a PDF file?

A PDF file contains a couple of key pointers. One of them is a pointer to the start of the table which describes the...
44 sec read

How to identify a PDF file

The best way to identify a PDF file is to scan the first line of the file. In theory the first line of a...
49 sec read

Multiple trailers in a PDF file

A PDF file consists of a ‘dump’ of PDF objects and a reference table defining where they are located in the PDF file and...
1 min read

Table order in OTF fonts

As part of our TrueType to OpenType font conversion (we need this for PDF to HTML5 conversion to ensure fonts display on all browsers),...
49 sec read

How to extract Structured text from PDF files in Java (Tutorial)

Developers hoping to extract content from PDF documents whilst maintaining the structure of the text should follow this tutorial. Some (but not all) PDF...
1 min read