Unicode is an international encoding standard for use with different languages and scripts, by which each letter, digit, or symbol is assigned a unique numeric value that applies across different platforms and programs.

Unicode

How is text stored in a PDF file?

Text is defined in PDF files by a Font object and a set of TJ commands. So you will see something like this in...
Mark Stephens
55 sec read