The main body of a book or article.

Text

Improvements to Our PDF Inspector

In the last two releases of JPedal, we added some exciting new features to our PDF Inspector tool which can help you debug those...
Jacob Collins
1 min read

How to extract text from a PDF as Markdown

Some PDF files can be “tagged”, which means they contain information about the structure of the file. This structure is embedded as metadata within...
Jacob Collins
1 min read

How to translate PDF files in Java (Tutorial)

Today I will demonstrate a worked example to show how you can create a PDF translator using our PDF toolkit JPedal and Translator. This...
Jacob Collins
1 min read

How to extract text from a PDF as JSON

Some PDF files can be “tagged”, which means they contain information about the structure of the file. This structure is embedded as metadata within...
Jacob Collins
1 min read

How to process PDFs for use with AI (Tutorial)

As Artificial Intelligence becomes more popular for processing large bodies of text, it becomes apparent that PDF files pose a challenge. PDF is a...
Jacob Collins
1 min read

How to search for text in a PDF file…

Can you determine if a PDF is searchable for text without opening it? Well, you will need some special software. This might be useful...
Jacob Collins
48 sec read