Oracle and Java

We always felt that things needed to change in the Java world so have been keenly waiting to see what Oracle would do with Java… Well we got our first clues, last week with a letter from Oracle. Sent by express Fedex from California, the good news is that Oracle clearly has money to spend…

Read More

Don’t blame the PDF file format

I see a lot of complaints about the PDF file format on various forums, moaning about it. They tend to focus mainly on 2 issues:- 1. The PDF file format is complicated. 2. Extraction, especially of text, is not always straight-forward. Both of these, I think, are essentially unfair. PDF arose out of Postscript and…

Read More

Why we need to see your PDF files…

What makes writing a PDF parser especially interesting (ie complex) is that the specification is often ambiguous and that PDF is a very complex structure. To Display a PDF file requires the parser to correctly scan the PDF object data structure, to correctly decode and assemble all the data, and then parse the stream of…

Read More