I started IDRsolutions while working for the Times Newspaper group in the 1990s, so I know that the complex layout of newspaper pages raises a whole load of special issues. It also provides some really good case studies for honing our technology. Here is one I would like to share.
This example has one line highlighted. You will notice there are big spaces between the words on that line. It comes from a live newspaper page (reproduced with permission) and shows the issue.
If we split out the individual words, we get this, which does not look too good.
So let us be fussier about which breaks we allow and try to keep the text as a single block.
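To make the idea of being fussier about breaks a little more concrete, here is a minimal sketch in Java. It is not the code in our library; the `LineGrouper` class, the `Fragment` type and the `maxGap` threshold are all hypothetical names made up for illustration. It assumes each fragment is a whole word with a known left edge and width, already sorted left to right, and it only starts a new block when the gap to the previous word is wider than the threshold.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch: grouping the words on one line by horizontal gap. */
final class LineGrouper {

    /** One word with its left edge and width in page units (illustrative type). */
    static final class Fragment {
        final String text;
        final double x;
        final double width;

        Fragment(String text, double x, double width) {
            this.text = text;
            this.x = x;
            this.width = width;
        }

        double right() {
            return x + width;
        }
    }

    /**
     * Joins words (assumed sorted left to right) into blocks. A new block is
     * started only when the gap to the previous word exceeds maxGap; otherwise
     * the word is appended to the current block with a single space.
     */
    static List<String> group(List<Fragment> line, double maxGap) {
        List<String> blocks = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        Fragment previous = null;
        for (Fragment f : line) {
            if (previous != null) {
                double gap = f.x - previous.right();
                if (gap > maxGap) {
                    // Gap too wide for this threshold: close the block.
                    blocks.add(current.toString());
                    current = new StringBuilder();
                } else {
                    current.append(' ');
                }
            }
            current.append(f.text);
            previous = f;
        }
        if (current.length() > 0) {
            blocks.add(current.toString());
        }
        return blocks;
    }

    public static void main(String[] args) {
        // A justified line: the gaps between words are 12 units wide.
        List<Fragment> line = List.of(
                new Fragment("complex", 0, 40),
                new Fragment("page", 52, 25),
                new Fragment("layout", 89, 35));
        System.out.println(group(line, 5));   // tight threshold: words split apart
        System.out.println(group(line, 20));  // looser threshold: one block
    }
}
```

With the tight threshold the stretched gaps on a justified line are treated as breaks and every word ends up on its own. With the looser one the line stays together as a single block. In practice a fixed number is probably too crude, and the threshold would likely need to depend on things like font size and the gaps seen elsewhere on the line.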
The result still needs some more work and tuning, but it is definitely a step in the right direction. What do you think?
IDRsolutions develop a Java PDF library, a PDF forms to HTML5 converter, a PDF to HTML5 or SVG converter and a Java Image Library that doubles as an ImageIO replacement. On the blog our team post about anything interesting they learn.