Extraction relates to the extraction of fonts, images, etc from PDF, HTML5, SVG, etc.

Extraction

How to Compare PDF files

Asking how to compare PDF files is a frequent question on the PDF forums. It is important to understand what you are trying to...
Mark Stephens
1 min read

How to find out if a PDF file has…

Because it turned out that people wanted to make PDF files accessible and extract content from PDF documents (and not just view them), Adobe...
Mark Stephens
49 sec read

Why PDF to HTML conversion does not work very…

When people convert PDF files into HTML files, they tend to be disappointed with the results. The main reason for this tends to be...
Mark Stephens
1 min read

Understanding the PDF file format – Text, shapes and…

I have been looking at an issue for a potential client recently which required the generation of different views of the page. This is...
Mark Stephens
1 min read

What text format and style information is in a…

Because PDF is very much an output and display format it does not contain much text formatting information such as paragraph breaks and spaces...
Mark Stephens
39 sec read