193 Stories by Mark Stephens
Over the last few months, we have been working with several magazines on our PDF to HTML5 converter (stay tuned for lots of interesting... A PDF file contains a couple of key pointers. One of them is a pointer to the start of the table which describes the... The best way to identify a PDF file is to scan the first line of the file. In theory the first line of a... A PDF file consists of a ‘dump’ of PDF objects and a reference table defining where they are located in the PDF file and... As part of our TrueType to OpenType font conversion (we need this for PDF to HTML5 conversion to ensure fonts display on all browsers),... Developers hoping to extract content from PDF documents whilst maintaining the structure of the text should follow this tutorial. Some (but not all) PDF...