57 Stories by Jacob Collins
We’ve been making progress on improving our PDF merge tool JPedal (JPedal is the best Java PDF library for developers), and I was curious... This blog post shows you how to convert a PDF to YAML, by extracting marked content with structure tags. What is structured PDF? Some... Not all PDFs are created equally. Some go beyond simple visual layouts and include internal tags that describe the document’s structure. These are known... Some PDF files can be “tagged” which means they contain information about the structure of the file. This structure is embedded as metadata within... As Artificial Intelligence becomes more popular for processing large bodies of text, it becomes apparent that PDF files pose a challenge. PDF is a... A PDF document structure consists of several components that determines how text, images and other elements are stored and displayed. It is a binary...