pdfjpegpdf-conversion

Why is there more content found when converting a pdf to jpeg?


When I convert a pdf file to JPG format, there are extra contents at the top of the image but this content is not found in the pdf file. enter image description here

The above screenshot if for the pdf file.enter image description here

The above image is of the jpg file ( converted from pdf - the first image).

Any idea why there is some extra content coming for this file ? This happens only for this file. For all other files I convert using the pdf2image python library (or any method), the jpg is similiar to the pdf. Please help ?


Solution

  • The extra region that is shown when converting to an image format is called the non printable region. In the pdf file, only the printable region is visible. The non printable region will not be visible in the pdf file. When converted to another format (eg: jpeg/png), the non printable region is also converted and is shown in the image file. You will need to crop the image using the markings provided above the printable region (+).