Read text content from image in PDF


Post by Birju »


I would like to read the text content of an image embedded in a PDF.
I tried converting the PDF to an image and reading the content from that, but where the PDF's text is inside an image, the corresponding portion of the generated image comes out completely black.


Re: Read text content from image in PDF

Post by snibgo »

It would help if you showed your PDF file, and told us what version of IM you are running and what command you used.

My guess is that you have black text on a transparent background and are saving it to a format that removes transparency, so the transparent areas turn black too. If so, the cure is to flatten against white, or whatever colour you want.
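To illustrate the guess above, here is a minimal Python sketch of what "flatten against white" does to individual pixels. It does not call ImageMagick and is not taken from the poster's setup: the pixel values, the function names and the file names are all assumptions made for illustration. The `flatten_command` helper is hypothetical too. It only builds an argument list using the ImageMagick options `-background white -alpha remove -alpha off`, assuming the IM6-style `convert` program name (IM7 installs use `magick` instead).

```python
# Sketch only: pixels are (r, g, b, a) tuples with 8-bit channels,
# where a == 0 is fully transparent and a == 255 is fully opaque.

def drop_alpha(pixel):
    """Keep only the colour channels, as a format without transparency would."""
    r, g, b, _a = pixel
    return (r, g, b)

def flatten(pixel, background=(255, 255, 255)):
    """Composite one pixel over an opaque background (the "over" operator)."""
    r, g, b, a = pixel
    alpha = a / 255.0
    return tuple(
        round(fg * alpha + bg * (1.0 - alpha))
        for fg, bg in zip((r, g, b), background)
    )

def flatten_command(pdf_path, out_path, density=300):
    """Hypothetical helper: argument list that flattens each page against white.

    Assumes the IM6 "convert" program name; the paths and the density
    value are placeholders, not values from the original post.
    """
    return [
        "convert", "-density", str(density), pdf_path,
        "-background", "white", "-alpha", "remove", "-alpha", "off",
        out_path,
    ]

# Assumed example pixels: opaque black text and a transparent background
# whose hidden colour is also black.
text_pixel = (0, 0, 0, 255)
empty_pixel = (0, 0, 0, 0)

# Dropping alpha makes text and background identical: everything is black.
print(drop_alpha(text_pixel), drop_alpha(empty_pixel))

# Flattening against white keeps the text black and turns the background white.
print(flatten(text_pixel), flatten(empty_pixel))
```

The argument list from `flatten_command("input.pdf", "page.png")` could be passed to `subprocess.run` on a machine where ImageMagick and a PDF delegate such as Ghostscript are installed. `-alpha remove` is used here rather than `-flatten` because `-flatten` merges all pages of a multi-page PDF into a single image.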