DUMPPDF(1) | PDFMiner Manual | DUMPPDF(1) |
dumppdf - dump the internal contents of PDF files
dumppdf [option...] file...
dumppdf dumps the internal contents of a PDF file in pseudo-XML format. This program is primarily intended for debugging, but it can also be used to extract some meaningful content.
-a
-i objno[,objno,...]
-p pageno[,pageno,...]
-r, -b, -t
With the -r option, the “raw” stream contents are dumped without decompression. With the -b option, the decompressed contents are dumped as a binary blob. With the -t option, the decompressed contents are dumped in a text format similar to the output of Python's repr(). When -r or -b is given, no stream header is displayed, which makes it easier to save the output to a file.
-T
-P password
-d
Dump all the headers and contents, except stream objects:
$ dumppdf -a test.pdf
Dump the table of contents:
$ dumppdf -T test.pdf
Extract a JPEG image:
$ dumppdf -r -i6 test.pdf > image.jpeg
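Compare the three stream output formats for one object. This is only a sketch: the helper name dump_stream_variants and the output file names are hypothetical, and it assumes the given object ID refers to a stream object. It uses only the -r, -b, -t and -i options described above.

```shell
# Hypothetical helper, not part of dumppdf: dump one stream object
# in each of the three formats selected by -r, -b and -t.
dump_stream_variants() {
    pdf=$1      # input PDF file
    objno=$2    # object ID of the stream to dump

    # -r: raw stream contents, without decompression
    dumppdf -r -i"$objno" "$pdf" > "obj$objno.raw"

    # -b: decompressed contents as a binary blob
    dumppdf -b -i"$objno" "$pdf" > "obj$objno.bin"

    # -t: decompressed contents in a repr()-like text format
    dumppdf -t -i"$objno" "$pdf" > "obj$objno.txt"
}
```

For instance, dump_stream_variants test.pdf 6 would write obj6.raw, obj6.bin and obj6.txt to the current directory. The .raw and .bin files are identical only if the stream is stored uncompressed.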
Jakub Wilk <jwilk@debian.org>
Yusuke Shinyama <yusuke@cs.nyu.edu>
08/09/2020 | dumppdf |