问题说明了一切。是否有包含不同 dpi(每英寸点数)图像的 PDF 文档?
还是假设如果我知道一张图像的 dpi,我就知道整个文档?
最佳答案
我赞成@ypnos 的回答,这是完全正确的。
但我想通过展示 pdfimages
的一个最近的新功能来补充它。公用事业。pdfimages
以前已知能够从 PDF 文件中提取图像(这是它唯一有用的目的)。 但是,现在您还可以使用它来调查有关所用图像的更多详细信息,而无需提取它们。
使用下一个命令,我使用新的 -list
查询某个 PDF 文件第 7 页和第 8 页上所有图像的数据。范围:
pdfimages -list -f 7 -l 8 ct-magazin-14-2012.pdf page num type width height color comp bpc enc interp object ID --------------------------------------------------------------------- 7 0 image 581 838 rgb 3 8 jpeg no 39 0 7 1 image 4 4 rgb 3 8 image no 40 0 7 2 image 314 332 rgb 3 8 jpx no 44 0 7 3 image 358 430 rgb 3 8 jpx no 45 0 7 4 image 4 4 rgb 3 8 image no 46 0 7 5 image 4 4 rgb 3 8 image no 47 0 7 6 image 4 6 rgb 3 8 image no 48 0 7 7 image 596 462 rgb 3 8 jpx no 49 0 7 8 image 4 6 rgb 3 8 image no 50 0 7 9 image 4 4 rgb 3 8 image no 51 0 7 10 image 8 10 rgb 3 8 image no 41 0 7 11 image 6 6 rgb 3 8 image no 42 0 7 12 image 113 27 rgb 3 8 jpx no 43 0 8 13 image 582 839 gray 1 8 jpeg no 2080 0 8 14 image 344 364 gray 1 8 jpx no 2079 0
Note, however: this version of pdfimages
is the one from Poppler (the one from XPDF does not (yet?) support this new feature):
pdfimages -version pdfimages version 0.20.2 Copyright 2005-2012 The Poppler Developers - http://poppler.freedesktop.org Copyright 1996-2011 Glyph & Cog, LLC
The -list
option appeared for the first time in Poppler v0.19.0, released on March 1st, 2012.
Now, the above list does not directly tell you the resolution ("dpi") of the image. That value is dependent on: at which size is this image rendered on the PDF page?
A PDF can easily have the same image used at different spots of a PDF file, using a different rendering size for each occasion. The image needs to be embedded into the PDF only once but can be used/rendered 'by reference' multiple times (inefficiently constructed PDFs may still contain the same image multiple times, but that's a different topic...)
Now let's clear up the questions which may arise from looking at the respective column headings. What do they mean?
page
- The page number in the PDF containing the image.
num
- The image number of the current listing.
type
- The image type. Possible values are:
image
(an opaque image),mask
(a monochrome image mask),smask
(a soft-mask image) andstencil
(a monochrome mask image used for painting a color or a pattern). Note: Transparency in PDF for images is created by using two separate PDF objects: one for the image and one for the mask or smask. The mask/smask belonging to a transparent image always directly follows image in the listing.
width
- The image width in pixels.
height
- The image height in pixels.
color
- The image color space. Possible values are
gray
,rgb
,cmyk
,lab
(L*a*b),icc
(ICC based),index
(indexed colors),sep
(separation) anddevn
(DeviceN).
comp
- The number of color components used by the image.
bpc
- The bits per color component used by the image.
enc
- The encoding (compression) used by the image. Possible values are:
image
(a raster image -- may internally use the generic/Flate
or/LZW
compression, but not a special image encoding),jpeg
(JPEG compression),jpx
(JPEG2000 compression),jbig2
(JBIG2 compression) andccitt
(Fax compression).
interp
- Is
yes
if interpolation was requested when scaling up the image.
object ID
- The image's PDF object ID (with "generation number") inside the file.
Update (March 2016)
As of Poppler v0.25.0 (released December 11, 2013) and later versions, the command pdfimages -list
now includes new columns which indicate the automatically calculated x-ppi
(horizontal) and y-ppi
(vertical) resolutions for each embedded image as displayed within the PDF page by the PDF renderer.
In addition, the size (in Bytes/kBytes) used by each image (when uncompressed) as well as its size compression ratio (as embedded in PDF) are indicated.
To show the result (using Poppler v0.42.0) for the same file as above:
page num type width height color comp bpc enc interp objectID x-ppi y-ppi size ratio
------------------------------------------------------------------------------------
7 0 image 581 838 rgb 3 8 jpeg no 39 0 73 73 2107B 0.1%
7 1 image 4 4 rgb 3 8 image no 40 0 150 150 54B 112%
7 2 image 314 332 rgb 3 8 jpx no 44 0 150 150 19.0K 6.2%
7 3 image 358 430 rgb 3 8 jpx no 45 0 150 150 15.7K 3.5%
7 4 image 4 4 rgb 3 8 image no 46 0 150 150 62B 129%
7 5 image 4 4 rgb 3 8 image no 47 0 150 150 51B 106%
7 6 image 4 6 rgb 3 8 image no 48 0 150 150 62B 86%
7 7 image 596 462 rgb 3 8 jpx no 49 0 150 150 40.7K 5.0%
7 8 image 4 6 rgb 3 8 image no 50 0 150 150 86B 119%
7 9 image 4 4 rgb 3 8 image no 51 0 150 150 62B 129%
7 10 image 8 10 rgb 3 8 image no 41 0 150 150 157B 65%
7 11 image 6 6 rgb 3 8 image no 42 0 150 150 82B 76%
7 12 image 113 27 rgb 3 8 jpx no 43 0 151 152 1090B 12%
8 13 image 582 839 gray 1 8 jpeg no 2080 0 72 72 319B 0.1%
8 14 image 344 364 gray 1 8 jpx no 2079 0 150 150 4325B 3.5%
x-ppi
y-ppi
size
ratio
关于pdf - PDF 文档可以包含具有不同 DPI 的图像吗?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/11696747/