jpedal

How to Read PDF files in Java (Step-by-Step Guide)

TL;DR Java has no native PDF support, so you need a library. Apache PDFBox is free and adequate for basic extraction, and iText adds...
Jacob Collins
3 min read

Working with PDF Files in Java: A Complete Guide…

Portable Document Format (PDF) files are the standard for sharing and preserving documents across the internet and other platforms, but working with them programmatically...
Jacob Collins
3 min read

How to remove text from a PDF in Java…

TL;DR True PDF redaction in Java requires two things: hiding the text visually and removing it from the content stream. This tutorial shows how...
Jacob Collins
2 min read

How to remove unused objects from PDF file (Tutorial)

PDF files sometimes contain objects which are never used. This increases the file size and is wasteful. This tutorial shows you how to reduce...
Jacob Collins
44 sec read

How to extract text from a PDF as JSON

Some PDF files can be “tagged” which means they contain information about the structure of the file. This structure is embedded as metadata within...
Jacob Collins
1 min read

How to convert PDF files to ePUB

Not all PDFs are created equally. Some go beyond simple visual layouts and include internal tags that describe the document’s structure. These are known...
Jacob Collins
1 min read