pdfbox-users mailing list archives

From Tilman Hausherr <THaush...@t-online.de>
Subject Re: Determining why a PDF is large
Date Fri, 17 May 2019 04:04:32 GMT

> same "CID" and the same font name ("AAAROV+ArialUnicodeMS" -- no more
> "AAA???+ArialUnicodeMS" coming up multiple times with slightly
> different names). 

Some more explanation:

The "AAAROV+" prefix or similar (always ending with "+") indicates that 
the font is a subset. If you load the same font several times, each 
copy becomes its own subset, and these subsets are likely to contain 
many of the same glyphs. That is why reusing one font object helps. The 
subsetting is done at save time, i.e. by then PDFBox "knows" which 
glyphs you used and creates a new, smaller font.
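
If you already have a list of the font names in a file, a quick way to 
spot this kind of duplication is to group the names by base font and 
count the distinct subset prefixes. The following is only a rough 
sketch using plain Java, not PDFBox API: the class and method names are 
made up for illustration, "AAAXYZ+ArialUnicodeMS" is an invented second 
name, and it assumes the usual subset prefix of six uppercase letters 
followed by "+".

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of PDFBox.
class SubsetFontNames {
    // Assumed subset naming: six uppercase letters, '+', then the base font name.
    private static final Pattern SUBSET = Pattern.compile("^([A-Z]{6})\\+(.+)$");

    // True if the name carries a subset prefix such as "AAAROV+".
    static boolean isSubset(String fontName) {
        return SUBSET.matcher(fontName).matches();
    }

    // Strips the subset prefix if present, otherwise returns the name unchanged.
    static String baseName(String fontName) {
        Matcher m = SUBSET.matcher(fontName);
        return m.matches() ? m.group(2) : fontName;
    }

    // Maps each base font name to the distinct subset prefixes seen for it.
    // Names without a subset prefix are ignored.
    static Map<String, Set<String>> tagsByBaseFont(List<String> fontNames) {
        Map<String, Set<String>> result = new LinkedHashMap<>();
        for (String name : fontNames) {
            Matcher m = SUBSET.matcher(name);
            if (m.matches()) {
                result.computeIfAbsent(m.group(2), k -> new TreeSet<>()).add(m.group(1));
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Example input; in practice these would be the font names found in the PDF.
        List<String> names = Arrays.asList(
                "AAAROV+ArialUnicodeMS",
                "AAAXYZ+ArialUnicodeMS", // invented name for illustration
                "AAAROV+ArialUnicodeMS", // same subset referenced again
                "Helvetica");
        for (Map.Entry<String, Set<String>> e : tagsByBaseFont(names).entrySet()) {
            if (e.getValue().size() > 1) {
                System.out.println(e.getKey() + " has " + e.getValue().size()
                        + " separate subsets: " + e.getValue());
            }
        }
    }
}
```

A base font that shows up with more than one prefix is a candidate for 
loading the font once and reusing it.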

The built-in Type 1 "standard 14" fonts are not embedded in the file, 
because one assumes that the viewer has them available, e.g. from the 
OS. So they don't use much space.

Reusing resources in a document also helps with images, e.g. a logo 
that appears on each page.

