Mark Stephens Mark founded the company and has worked with Java and PDF since 1997. The original creator of the core code, he is also a NetBeans enthusiast who enjoys speaking at conferences and reading. He holds an Athletics Blue and an MA in Mediaeval History from St. Andrews University.

What does the ActualText dictionary tag do?

Updated: May 28, 2026 29 sec read

Text is defined in the PDF file format as a display value (normally what you see onscreen) and an extraction value. It is useful to have 2 options because some characters are displayed differently to what you extract (for example fl is one glyf onscreen but two in extracted text).

But did you know you can have an additional value set to show the actual Text enclosed in a Tj command? If you are using Marked Content, the Dictionary can contain a Dictionary value /ActualText which will be used in place of whatever is shown in the Tj command, which is ignored for extraction purposes. So if you want to add something different (or just ensure that the value comes out exactly as you want, it is a useful feature. What would you use it for?

Our software libraries allow you to

Convert PDF files to HTML

Use PDF Forms in a web browser

Convert PDF Documents to an image

Work with PDF Documents in Java

Read and write HEIC and other Image formats in Java

Mark Stephens Mark founded the company and has worked with Java and PDF since 1997. The original creator of the core code, he is also a NetBeans enthusiast who enjoys speaking at conferences and reading. He holds an Athletics Blue and an MA in Mediaeval History from St. Andrews University.

« PDF to HTML conversion – matching PDF page size

How does a decodeArray work? »

What is PDF/A?

Mark Stephens
Jul 24, 2026 3 min read

The Best PDF Inspector Tools for Developers

Jacob Collins
Jun 19, 2026 6 min read

How to make a PDF file manually

chika
May 14, 2026 15 min read