PDF extraction
JPedal now contains an Apache Tika Parser which can parse and extract unstructured text from PDF files. How to use an Apache Tika PDF... I have been looking at an issue for a potential client recently which required the generation of different views of the page. This is... I came across an interesting issue with PDF Text fields while debugging a file this week. We were sent a 2 page document created... Because PDF is very much an output and display format it does not contain much text formatting information such as paragraph breaks and spaces...