Portable Document Format (PDF) is a file format used to present documents in a manner independent of application software, hardware, and operating systems.

PDF

How to extract text from PDF files as YAML…

This blog post shows you how to convert a PDF to YAML by extracting marked content with structure tags. What is structured PDF? Some...
Jacob Collins
47 sec read

Choosing the Best Document Viewer for Your Web App

Choosing the right document viewer is a crucial step in developing a web application. The document viewer should be...
Leon Atherton
2 min read

How to process PDFs for use with AI (Tutorial)

As artificial intelligence becomes more popular for processing large bodies of text, it is increasingly apparent that PDF files pose a challenge. PDF is a...
Jacob Collins
1 min read

What is inside a PDF file?

A PDF document structure consists of several components that determine how text, images and other elements are stored and displayed. It is a binary...
Jacob Collins
1 min read

PDF.js Alternative for PDF Documents

Display Documents on Web Apps: PDF.js is maintained by Mozilla for Firefox, but it works reasonably well with other browsers. It allows you to...
Leon Atherton
1 min read

How to sign PDF files in Java (Tutorial)

PDF files are not natively supported in Java, so to digitally sign them, you will need some special software. This tutorial demonstrates how to...
Jacob Collins
29 sec read