This category covers the extraction of fonts, images, and other content from PDF, HTML5, SVG, and similar formats.

Extraction

How are images displayed in a PDF file?

Images are not stored inside a PDF file as TIFF, PNG, or JPG images. They are stored as the binary pixel data along...
Mark Stephens
1 min read

How to Compare PDF files

Asking how to compare PDF files is a frequent question on the PDF forums. It is important to understand what you are trying to...
Mark Stephens
1 min read

How to find out if a PDF file has…

Because it turned out that people wanted to make PDF files accessible and extract content from PDF documents (and not just view them), Adobe...
Mark Stephens
49 sec read

Why PDF to HTML conversion does not work very…

When people convert PDF files into HTML files, they tend to be disappointed with the results. The main reason for this tends to be...
Mark Stephens
1 min read

Understanding the PDF file format – Text, shapes and…

I have recently been looking at an issue for a potential client that required the generation of different views of the page. This is...
Mark Stephens
1 min read