pdf - PDF 文档可以包含具有不同 DPI 的图像吗?

标签 pdf dpi poppler

问题说明了一切。是否有包含不同 dpi(每英寸点数)图像的 PDF 文档?

还是假设如果我知道一张图像的 dpi,我就知道整个文档?

最佳答案

我赞成@ypnos 的回答,这是完全正确的。

但我想通过展示 pdfimages 的一个最近的新功能来补充它。公用事业。
pdfimages以前已知能够从 PDF 文件中提取图像(这是它唯一有用的目的)。 但是,现在您还可以使用它来调查有关所用图像的更多详细信息,而无需提取它们。

使用下一个命令,我使用新的 -list 查询某个 PDF 文件第 7 页和第 8 页上所有图像的数据。范围:

pdfimages -list -f 7 -l 8  ct-magazin-14-2012.pdf

  page   num  type   width height color comp bpc  enc interp  object ID
  ---------------------------------------------------------------------
     7     0 image     581   838  rgb     3   8  jpeg   no        39  0
     7     1 image       4     4  rgb     3   8  image  no        40  0
     7     2 image     314   332  rgb     3   8  jpx    no        44  0
     7     3 image     358   430  rgb     3   8  jpx    no        45  0
     7     4 image       4     4  rgb     3   8  image  no        46  0
     7     5 image       4     4  rgb     3   8  image  no        47  0
     7     6 image       4     6  rgb     3   8  image  no        48  0
     7     7 image     596   462  rgb     3   8  jpx    no        49  0
     7     8 image       4     6  rgb     3   8  image  no        50  0
     7     9 image       4     4  rgb     3   8  image  no        51  0
     7    10 image       8    10  rgb     3   8  image  no        41  0
     7    11 image       6     6  rgb     3   8  image  no        42  0
     7    12 image     113    27  rgb     3   8  jpx    no        43  0
     8    13 image     582   839  gray    1   8  jpeg   no      2080  0
     8    14 image     344   364  gray    1   8  jpx    no      2079  0

Note, however: this version of pdfimages is the one from Poppler (the one from XPDF does not (yet?) support this new feature):

pdfimages -version

  pdfimages version 0.20.2
  Copyright 2005-2012 The Poppler Developers - http://poppler.freedesktop.org
  Copyright 1996-2011 Glyph & Cog, LLC

The -list option appeared for the first time in Poppler v0.19.0, released on March 1st, 2012.

Now, the above list does not directly tell you the resolution ("dpi") of the image. That value is dependent on: at which size is this image rendered on the PDF page?

A PDF can easily have the same image used at different spots of a PDF file, using a different rendering size for each occasion. The image needs to be embedded into the PDF only once but can be used/rendered 'by reference' multiple times (inefficiently constructed PDFs may still contain the same image multiple times, but that's a different topic...)

Now let's clear up the questions which may arise from looking at the respective column headings. What do they mean?

page

  • The page number in the PDF containing the image.

num

  • The image number of the current listing.

type

  • The image type. Possible values are: image (an opaque image), mask (a monochrome image mask), smask (a soft-mask image) and stencil (a monochrome mask image used for painting a color or a pattern). Note: Transparency in PDF for images is created by using two separate PDF objects: one for the image and one for the mask or smask. The mask/smask belonging to a transparent image always directly follows image in the listing.

width

  • The image width in pixels.

height

  • The image height in pixels.

color

  • The image color space. Possible values are gray, rgb, cmyk, lab (L*a*b), icc (ICC based), index (indexed colors), sep (separation) and devn (DeviceN).

comp

  • The number of color components used by the image.

bpc

  • The bits per color component used by the image.

enc

  • The encoding (compression) used by the image. Possible values are: image (a raster image -- may internally use the generic /Flate or /LZW compression, but not a special image encoding), jpeg (JPEG compression), jpx (JPEG2000 compression), jbig2 (JBIG2 compression) and ccitt (Fax compression).

interp

  • Is yes if interpolation was requested when scaling up the image.

object ID

  • The image's PDF object ID (with "generation number") inside the file.

Update (March 2016)

As of Poppler v0.25.0 (released December 11, 2013) and later versions, the command pdfimages -list now includes new columns which indicate the automatically calculated x-ppi (horizontal) and y-ppi (vertical) resolutions for each embedded image as displayed within the PDF page by the PDF renderer.

In addition, the size (in Bytes/kBytes) used by each image (when uncompressed) as well as its size compression ratio (as embedded in PDF) are indicated.

To show the result (using Poppler v0.42.0) for the same file as above:

page num type width height color comp bpc enc interp objectID x-ppi y-ppi size ratio
------------------------------------------------------------------------------------
   7  0 image   581   838  rgb     3   8 jpeg   no       39 0    73    73 2107B 0.1%
   7  1 image     4     4  rgb     3   8 image  no       40 0   150   150   54B 112%
   7  2 image   314   332  rgb     3   8 jpx    no       44 0   150   150 19.0K 6.2%
   7  3 image   358   430  rgb     3   8 jpx    no       45 0   150   150 15.7K 3.5%
   7  4 image     4     4  rgb     3   8 image  no       46 0   150   150   62B 129%
   7  5 image     4     4  rgb     3   8 image  no       47 0   150   150   51B 106%
   7  6 image     4     6  rgb     3   8 image  no       48 0   150   150   62B  86%
   7  7 image   596   462  rgb     3   8 jpx    no       49 0   150   150 40.7K 5.0%
   7  8 image     4     6  rgb     3   8 image  no       50 0   150   150   86B 119%
   7  9 image     4     4  rgb     3   8 image  no       51 0   150   150   62B 129%
   7 10 image     8    10  rgb     3   8 image  no       41 0   150   150  157B  65%
   7 11 image     6     6  rgb     3   8 image  no       42 0   150   150   82B  76%
   7 12 image   113    27  rgb     3   8 jpx    no       43 0   151   152 1090B  12%
   8 13 image   582   839  gray    1   8 jpeg   no     2080 0    72    72  319B 0.1%
   8 14 image   344   364  gray    1   8 jpx    no     2079 0   150   150 4325B 3.5%

x-ppi
  • 图像在 PDF 页面上呈现时的水平分辨率(以每英寸像素为单位)。

  • y-ppi
  • 图像在 PDF 页面上呈现时的垂直分辨率(以每英寸像素为单位)。

  • size
  • PDF 文件中嵌入图像的大小。正在使用以下后缀:“B”字节、“K”千字节、“M”兆字节和“G”千兆字节。

  • ratio
  • 嵌入图像的压缩率。
  • 关于pdf - PDF 文档可以包含具有不同 DPI 的图像吗?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/11696747/

    相关文章:

    android - MonoDroid 应用程序不使用正确的密度绘图

    Android:如何从 native 代码获取设备 DPI

    python - 如何使用 Poppler 以真实比例显示 PDF?

    javascript - 从pdf文件中读取注释

    HTML5 到 PDF 服务器端

    c# - GUI 故障 : Checkbox text shown incomplete in Windows 8

    c++ - 在 Qt Creator 中利用 poppler - 帮助

    ghostscript - PDF 到 PNG 与 PDF 到 JPG 之间的巨大速度差异

    html - 在网页上嵌入 PDF 并防止下载

    ios - 使用 Apple 的 PDFKit 框架更改 PDF 中的文本