Jacob Collins

  Jacob is a Java developer and the product manager of JPedal

 



25 Stories by Jacob Collins

How to convert PDF files to ePUB

Not all PDFs are created equally. Some go beyond simple visual layouts and include internal tags that describe the document’s structure. These are known...
1 min read

How to extract text from a PDF as JSON

Some PDF files can be “tagged” which means they contain information about the structure of the file. This structure is embedded as metadata within...
1 min read

How to process PDFs for use with AI (Tutorial)

As Artificial Intelligence becomes more popular for processing large bodies of text, it becomes apparent that PDF files pose a challenge. PDF is a...
1 min read

What is inside a PDF file?

A PDF document structure consists of several components that determines how text, images and other elements are stored and displayed. It is a binary...
1 min read

How to sign PDF files in Java (Tutorial)

PDF files are not natively supported in Java, so to digitally sign them, you will need some special software. This tutorial demonstrates how to...
27 sec read

PDFBox alternative

Working with PDF documents in Java, developers often encounter the choice between open-source libraries like Apache PDFBox and commercial solutions like JPedal. While PDFBox...
1 min read