Question: Find/identify PDFs with corrupted text layers?
Does anyone know a good way to check PDF files and find out which ones have corrupted text layers?
For example, when perfectly good text has been replaced with a mess like this:
.1918., « » .!
, " # , " # $ :«% - "» # # "«& ' »( ,
# " ( .&..).
Sometimes the originals have bad text layers and require OCR. Sometimes other applications, such as Preview or Ghostscript, corrupt the text layers, but it's hard to tell that this has happened until I need to search a file or copy and paste from it into translation software.
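To make the kind of check I have in mind concrete, here is a rough sketch in Python. It assumes the text layer has already been extracted to a plain string by some other tool (for example `pdftotext file.pdf -` from Poppler, which is an assumption on my part and not something I have set up). The function names and the 0.6 threshold are hypothetical and would need tuning; the idea is only that a corrupted layer like the sample above is almost all punctuation, while real text is mostly letters and digits.

```python
import unicodedata


def letter_ratio(text: str) -> float:
    """Return the fraction of non-whitespace characters that are letters or digits."""
    visible = [ch for ch in text if not ch.isspace()]
    if not visible:
        return 0.0
    # Unicode general categories starting with "L" are letters, "N" are numbers,
    # so this also counts non-Latin scripts such as Cyrillic as real text.
    alnum = sum(1 for ch in visible if unicodedata.category(ch)[0] in ("L", "N"))
    return alnum / len(visible)


def looks_corrupted(text: str, threshold: float = 0.6) -> bool:
    """Flag text whose letter/digit share falls below a hypothetical threshold."""
    if not text.strip():
        # No text at all means there is no text layer to judge (the file may
        # simply need OCR), so it is not reported as corrupted here.
        return False
    return letter_ratio(text) < threshold


if __name__ == "__main__":
    sample = ".1918., « » .! , # $ :«% - » # # «& »( ,"
    print(looks_corrupted(sample))
    print(looks_corrupted("Perfectly good text has been replaced."))
```

This would only catch layers that degrade into punctuation, like my example. A layer that has been remapped to wrong but still alphabetic characters would pass, so it is a first filter at best.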
MacBook Air (11-inch Mid 2013), macOS Sierra (10.12.6)