PDF extraction

Don’t blame the PDF file format

I see a lot of complaints about the PDF file format on various forums. They tend to focus mainly on 2...
Brendan
1 min read

Understanding the PDF file format – Text, shapes and…

I have recently been looking at an issue for a potential client that required the generation of different views of the page. This is...
Mark Stephens
1 min read

PDF mystery – what is the correct value for…

I came across an interesting issue with PDF text fields while debugging a file this week. We were sent a 2-page document created...
Chris Wade
1 min read

PDF format and style information

Because PDF is very much an output and display format, it does not contain much formatting information, such as paragraph breaks and spaces, unless...
Mark Stephens
1 min read