The main body of a book or article.

Text

Search PDF Files With Regular Expressions – Generating Teasers

Recently I have had some questions on how it would be possible to display search results including two words from either side of the...
Kieran France
55 sec read

PDF to HTML5 conversion – 2 ways to display…

One of our key considerations with PDF to HTML conversion was the quality of the text. We have seen too many attempts which just...
Mark Stephens
1 min read

Problems with using non standard characters from unicode 3.0

Recently I have been looking at an issue for one of our potential clients. The text extraction was not working correctly due to an...
Kieran France
1 min read

Punctuation ?

So what is punctuation? This may seem like a simple question yet I find myself asking it more and more often whilst working on...
Kieran France
1 min read