pdf:images
Differences
This shows you the differences between two versions of the page.
pdf:images [2016/02/22 10:29] – [Implementation] christian | pdf:images [2016/03/02 15:15] – [Implementation] christian | ||
---|---|---|---|
Line 2: | Line 2: | ||
Bitmapped images are described in chapter 8.9 //Images// on page 203 of the {{pdf32000_2008.pdf# | Bitmapped images are described in chapter 8.9 //Images// on page 203 of the {{pdf32000_2008.pdf# | ||
+ | |||
+ | The implementation is in package '' | ||
===== Usage ===== | ===== Usage ===== | ||
Line 23: | Line 25: | ||
renderer paintXObject: | renderer paintXObject: | ||
</ | </ | ||
+ | |||
+ | {{demo20_ImagesUsage.pdf}} shows the result with an example image. | ||
+ | |||
+ | {{demo21_Images.pdf}} shows some PDF features of images (masking, rotation, interpolation, | ||
===== Object Models ===== | ===== Object Models ===== | ||
Line 95: | Line 101: | ||
The conversion methods are implemented in the '' | The conversion methods are implemented in the '' | ||
+ | |||
+ | {{ : | ||
The default behavior is to transfer the pixels one by one. For each pixel, the bits are read from the specified location in the source image bytes and interpreted as color ('' | The default behavior is to transfer the pixels one by one. For each pixel, the bits are read from the specified location in the source image bytes and interpreted as color ('' | ||
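The per-pixel read described above can be sketched in a workspace. This is only an illustration: it assumes a hypothetical row of packed bytes with 2 bits per pixel, stored high-order bit first, and a depth of 1, 2, 4 or 8 so that no pixel straddles a byte. The variable names are made up and are not taken from the package.

```smalltalk
| rowBytes depth x bitOffset byte shift pixel |
"Hypothetical packed row: 8 pixels of 2 bits each, high-order bits first."
rowBytes := #[2r11100100 2r00011011].
depth := 2.
"Zero-based index of the pixel within the row."
x := 5.
"Position of the first bit of the pixel, counted from the start of the row."
bitOffset := x * depth.
"Fetch the byte holding the pixel (Smalltalk indexes start at 1)."
byte := rowBytes at: bitOffset // 8 + 1.
"Number of bits to the right of the pixel within that byte."
shift := 8 - depth - (bitOffset \\ 8).
"Move the pixel bits to the low end and mask off everything else."
pixel := (byte bitShift: shift negated) bitAnd: (1 bitShift: depth) - 1.
pixel
```

With x set to 5, the sketch selects the second pixel of the second byte, whose bits are 01. A real conversion would then interpret that value as a color, for example by looking it up in the palette.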
Line 100: | Line 108: | ||
This default implementation ('' | This default implementation ('' | ||
- | Some conversions can be greatly sped up by exploiting the internal byte organization of the image bits and transferring them directly. While this is possible for many useful forms, it is not possible in general (think of a Smalltalk image with a big palette of more than 255 colors). The following conversions are currently optimized: | + | Some conversions can be greatly sped up (one or two orders of magnitude) |
+ | |||
+ | The following conversions are currently optimized: | ||
* Depth1Image for Black and white images and masks | * Depth1Image for Black and white images and masks | ||
* Depth24Image with 8 bit RGB | * Depth24Image with 8 bit RGB | ||
* Depth32Image for 8 bit RGB and BGR images taken from the '' | * Depth32Image for 8 bit RGB and BGR images taken from the '' | ||
* Depth32Image for 8 bit ARGB and ABGR | * Depth32Image for 8 bit ARGB and ABGR | ||
- | ===== Disclaimer ===== | + | * Depth{2 4 8}Image with a MappedPalette. |
- | Not covered | + | The direct conversion of an image with a mapped palette is special. Since RGB color components |
+ | |||
+ | In the optimized conversion of such an image, the palette is converted and the pixels keep their indexes, so the image bytes can be reused directly. As a result, the /Indexed colorspace may contain several entries for the same color. Converting such an ImageXObject back to Smalltalk will not recreate the least significant 5 bits, leading to colors that differ slightly from the original. For 8 bit RGB usage, this makes no difference. Although this does not feel proper, it makes little difference in practice, and the speedup of the optimization is worth it.
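The loss of the least significant 5 bits can be seen in a small workspace sketch. It assumes, as an inference from the text rather than a statement about the package, that a Smalltalk color component carries 13 bits while the PDF palette entry keeps only 8. The sample value and variable names are hypothetical.

```smalltalk
| component13 component8 restored lost |
"Hypothetical 13-bit color component as held on the Smalltalk side."
component13 := 2r1101010111100.
"Converting to PDF keeps only the 8 most significant bits."
component8 := component13 bitShift: -5.
"Converting back can only put zeros into the 5 low bits."
restored := component8 bitShift: 5.
"What the round trip loses: the original low 5 bits."
lost := component13 - restored.
lost
```

Two palette entries that differ only in those low 5 bits end up with the same 8 bit value, which is why the /Indexed colorspace may hold several entries for one color.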
+ | ===== To be done ===== | ||
+ | |||
+ | ==== Filter ==== | ||
+ | |||
+ | Although all Smalltalk images can be used for PDF, not all PDF images can be transformed to Smalltalk images. For one, several | ||
* **RunLengthDecode** 8 bit monochrome images | * **RunLengthDecode** 8 bit monochrome images | ||
* **CCITTFaxDecode** CCITT encoded 1 bit monochrome images | * **CCITTFaxDecode** CCITT encoded 1 bit monochrome images | ||
* **JBIG2Decode** JBIG2 encoded 1 bit monochrome images | * **JBIG2Decode** JBIG2 encoded 1 bit monochrome images | ||
* **DCTDecode** JPEG encoded 8 bit grayscale or color images | * **DCTDecode** JPEG encoded 8 bit grayscale or color images | ||
- | * **JPXDecode** JPEG2000 encoded grayscale or color images | + | * **JPXDecode** JPEG2000 encoded grayscale or color images. |
+ | |||
+ | This means that it is not possible to extract such images from PDF. Nor is it possible to store images in the most efficient way in a PDF. This feature is valuable and I hope to implement some of the filters in the not too distant future. | ||
- | These are not implemented (yet), so that it is not possible to extract | + | Secondly, PDF can have images in other colorspaces than RGB or Grayscale; most notable |
+ | ==== Inlined Images ==== | ||
+ | Images in PDF can be inlined in the /Contents stream instead of being stored in the /Resources as an /XObject. Only a subset of legal PDF images can be inlined, and inlining is discouraged for large images. Even though I have not seen such an image in a real-world PDF, this feature should be implemented for completeness. |
/var/www/virtual/code4hl/html/dokuwiki/data/pages/pdf/images.txt · Last modified: 2016/03/02 15:19 by christian