structured

How to extract text from a PDF as Markdown

Some PDF files can be “tagged” which means they contain information about the structure of the file. This structure is embedded as metadata within...
Jacob Collins
1 min read

How to process PDFs for use with AI (Tutorial)

As Artificial Intelligence becomes more popular for processing large bodies of text, it becomes apparent that PDF files pose a challenge. PDF is a...
Jacob Collins
1 min read

How to search for text in a PDF file…

Can you determine if a PDF is searchable for text without opening it? Well you will need some special software. This might be useful...
Jacob Collins
48 sec read