Text Archives - Java PDF Blog

The main body of a book or article.

Text

Apache Tika PDF support in JPedal

JPedal now contains an Apache Tika Parser which can parse and extract unstructured text from PDF files. How to use an Apache Tika PDF...

Jacob Collins
Jan 24, 2023 29 sec read

Understanding the PDF File Format

We have been working with PDF files since 1999 and developed complex software to display PDF files. We have learnt a lot about the...

Leon Atherton
Mar 17, 2022 3 min read

Tutorial : How To Copy Text in JavaFX and…

At IDRSolutions we have a PDF Viewer that has the ability to highlight and copy text, because we are developing a JavaFX implementation of our PDF Viewer we required...

Nathan Howard
Jun 26, 2014 1 min read

PDF to HTML5 conversion – Extracting PDF text and…

Text works differently in PDFs and in HTML files, which can make it a surprisingly complex problem to get great output during PDF to...

Sam Howard
Nov 14, 2013 1 min read

Why are fonts so complicated?

I do a lot of work with fonts, and when I tell people this they often seem surprised – surely there can’t be that...

Sam Howard
Oct 8, 2013 2 min read