Generate PDF Add-On 
Generate a PDF from a single TWiki page
This is a substantial re-write of the TWiki:Codev.PrintUsingPDF pdf script.
It essentially accomplishes the same goal but with more integration with TWiki
(e.g. use of TWiki variables for configuration and the TWiki rendering methods
instead of using a hacked version of the
 pdf script.
It essentially accomplishes the same goal but with more integration with TWiki
(e.g. use of TWiki variables for configuration and the TWiki rendering methods
instead of using a hacked version of the TWiki::view for generating HTML)
as well as more modular code structure (i.e. most of the logic is moved to a perl package).
We began using the TWiki:Codev.PrintUsingPDF pdf script to publish pages but
found it somewhat TWiki unfriendly (no offense to the authors as it's a great idea).
This version allows most of the script configuration (such as page size, table of contents, etc.)
via preference variables in TWiki. It also allows header, footer, and title page configuration via TWiki topics.
 pdf script to publish pages but
found it somewhat TWiki unfriendly (no offense to the authors as it's a great idea).
This version allows most of the script configuration (such as page size, table of contents, etc.)
via preference variables in TWiki. It also allows header, footer, and title page configuration via TWiki topics.
 Usage 
 
-  The simplest way to activate PDF printing, is to add genpdfto front of theSKINsetting. This will replace the PatternSkin 'Print Version' action with 'Generate PDF'
-  eg Set SKIN=genpdf,pattern
 
-  Like the TWiki:Codev.PrintUsingPDF pdf script, a page can be published by substituting pdf script, a page can be published by substitutinggenpdfforviewin the topic URL.
-  To make it even easier for novice TWiki users to use, we added a link (like edit and attach) to our view.tmplto publish the current page (using the current topic as the document title). The line we added to our template is:
-  <a href="%SCRIPTURLPATH%/genpdf%SCRIPTSUFFIX%/%WEB%/%TOPIC%?pdftitle=%TOPIC%">PDF</a>
 
-  You may also choose to replace the Printable (?skin=print) targets in your favourite skin with
[[%SCRIPTURL{genpdf}%/%WEB%/%TOPIC%][Printable]]
 
-   You can now generate a PDF document containing all of the descendents of the base topic as separate chapters. For example, if you create a ParentTopic, then create FirstChild and SecondChild with ParentTopic as their parent topic, then create GrandChildOne with FirstChild as its parent (and so on), you get a tree like this: You can now generate a PDF document containing all of the descendents of the base topic as separate chapters. For example, if you create a ParentTopic, then create FirstChild and SecondChild with ParentTopic as their parent topic, then create GrandChildOne with FirstChild as its parent (and so on), you get a tree like this:
ParentTopic
   - FirstChild
      - GrandChildOne
      - GrandChildTwo
   - SecondChild
      - GrandChildThree
   If you add 
'?pdfrecursive=on' to the URL parameters, all of the topics will be rolled into the PDF.
 Configuring the Script 
The script can be configured either via URL parameters or web preference variables. URL parameters have precedence over web preference variables. If neither of these are present, the script will use hard-coded default variables. The general script configuration variables are explained in the next section while header/footer and title page configuration are explained in subsequent sections.
 General Configuration 
The following table shows the various configuration variables and their meaning. The first column gives the variable to use if passed in the URL. The second column shows the variable to use if using a TWiki preference variable (i.e. Set VARIABLE = ). The third column gives the default value if neither the URL nor TWiki preference variable is used. Note that URL variables have precedence over TWiki preference variables. For a more detailed description of the HTMLDOC related variables, see the HTMLDOC documentation at 
http://www.htmldoc.org
.
| URL Variable | Twiki Preference Variable | Default Value | Example | Explanation | 
| pdfbanner | GENPDFADDON_BANNER |  | Foobar Documentation System | Used to override the banner of a title page. | 
| pdftitle | GENPDFADDON_TITLE |  | Writing Foobars |  | 
| pdfsubtitle | GENPDFADDON_SUBTITLE |  | A short guide to creating foobar documents |  | 
| pdfheadertopic | GENPDFADDON_HEADERTOPIC |  | GenPDFHeaderFooterTopic | The name of a topic that defines headers and footers using <-- HEADER LEFT "foobar"  syntax | 
| pdftitletopic | GENPDFADDON_TITLETOPIC |  | GenPDFTitleTopic | The name of a topic that defines the layout of the title page | 
| skin | GENPDFADDON_SKIN | pattern | nat | Default skin to use for PDF generation | 
| cover | GENPDFADDON_COVER | print | print.nat | Default cover to use for PDF generation | 
| pdfrecursive | GENPDFADDON_RECURSIVE | undef | on | Include children of the base topic in the PDF | 
| pdfformat | GENPDFADDON_FORMAT | pdf14 | pdf12 | HTMLDOC output format | 
| pdftoclevels | GENPDFADDON_TOCLEVELS | 5 | 3 | Number of levels to include in the PDF table of contents (use 0 to disable the generation of a table of contents). Note that HTMLDOC generates a table of contents based on HTML headers in the page. | 
| pdfpagesize | GENPDFADDON_PAGESIZE | a4 | letter | The page size for PDF output | 
| pdforientation | GENPDFADDON_ORIENTATION | portrait | landscape | The page orientation (e.g. landscape or portrait) | 
| pdfwidth | GENPDFADDON_WIDTH | 860 | 1060 | The pixel width of the browser (used to scale images--images wider than this will be truncated) | 
| pdfheadershift | GENPDFADDON_HEADERSHIFT | 0 | +3 | Shift all headers up or down (for negative values) by this amount (e.g. H1 would become H3 for a value of 2). | 
| pdfkeywords | GENPDFADDON_KEYWORDS | %FORMFIELD{"KeyWords"}% | 'foo, bar, baz, zip' | Used for PDF Keywords META info to help search engines | 
| pdfsubject | GENPDFADDON_SUBJECT | %FORMFIELD{"TopicHeadline"}% | 'Foobar document creation' | Used for PDF Subject META info to help search engines | 
| pdftocheader | GENPDFADDON_TOCHEADER | ... | l.. | See http://www.htmldoc.org/   | 
| pdftocfooter | GENPDFADDON_TOCFOOTER | ..i | .i. | See http://www.htmldoc.org/   | 
| pdfheader | GENPDFADDON_HEADER |  | .1. | Specify content of header, see http://www.htmldoc.org   | 
| pdffooter | GENPDFADDON_FOOTER |  | .1. | Specify content of footer, see http://www.htmldoc.org   | 
| pdfheadfootfont | GENPDFADDON_HEADFOOTFONT |  | Helvetica-Bold | Font specification for headers and footers. | 
| pdfheadfootsize | GENPDFADDON_HEADFOOTSIZE |  | 12 | Sets the size of the header and footer text in points (1 point = 1/72nd inch) | 
| pdfbodyimage | GENPDFADDON_BODYIMAGE |  | http://my.server.com/path/to/background.jpeg | The image that will appear tiled in the background of every page | 
| pdflogoimage | GENPDFADDON_LOGOIMAGE |  | http://my.server.com/path/to/logo.jpeg | The logo that will appear in a header or footer if you specify 'l' in the string (see http://www.htmldoc.org  ) | 
| pdfnumberedtoc | GENPDFADDON_NUMBEREDTOC | undef | True | Option flag for getting numbered headings and Table of Contents. Set it to anything for true. | 
| pdfduplex | GENPDFADDON_DUPLEX | undef | True | Option flag to set up the document for duplex printing. Headers and footers will swap position on alternating pages. Set it to anything for true. | 
| pdfpermissions | GENPDFADDON_PERMISSIONS | undef | print,no-copy | PDF Security permissions to disable print/copy etc. By default the PDF is not protected. | 
| pdfmargins | GENPDFADDON_MARGINS | undef | top:0.5in,bottom:2.5cm,left:12pt,right:15mm | Specify the page margins (white space to edge of page) | 
| pdfbodycolor | GENPDFADDON_BODYCOLOR | undef | #CCff99 | Specify the background colour of all pages | 
| pdfstruct | GENPDFADDON_STRUCT | book | webpage | use bookfor structured topics, i.e. when rendering a bunch of topics recursively; usewebpagewhen printing a topic without a specific heading structure, i.e. if it is just a normal webpage or if it has got a special VIEW_TEMPLATE | 
If using TWiki preference variables, copy them to the appropriate web preferences page.
 
-  Settings for the GenPDFAddOn Plugin 
-  #Set GENPDFADDON_BANNER  = 
-  #Set GENPDFADDON_TITLE  = 
-  #Set GENPDFADDON_SUBTITLE  = 
-  #Set GENPDFADDON_HEADERTOPIC  = 
-  #Set GENPDFADDON_TITLETOPIC  = 
-  #Set GENPDFADDON_SKIN  = 
-  #Set GENPDFADDON_COVER  = 
-  #Set GENPDFADDON_RECURSIVE  = 
-  #Set GENPDFADDON_FORMAT  = 
-  #Set GENPDFADDON_TOCLEVELS  = 
-  #Set GENPDFADDON_PAGESIZE  = 
-  #Set GENPDFADDON_ORIENTATION  = 
-  #Set GENPDFADDON_WIDTH  = 
-  #Set GENPDFADDON_HEADERSHIFT  = 
-  #Set GENPDFADDON_KEYWORDS  = 
-  #Set GENPDFADDON_SUBJECT  = 
-  #Set GENPDFADDON_TOCHEADER  = 
-  #Set GENPDFADDON_TOCFOOTER  = 
-  #Set GENPDFADDON_HEADER  = 
-  #Set GENPDFADDON_FOOTER  = 
-  #Set GENPDFADDON_HEADFOOTFONT  = 
-  #Set GENPDFADDON_HEADFOOTSIZE  = 
-  #Set GENPDFADDON_BODYIMAGE  = 
-  #Set GENPDFADDON_LOGOIMAGE  = 
-  #Set GENPDFADDON_NUMBEREDTOC  = 
-  #Set GENPDFADDON_DUPLEX  = 
-  #Set GENPDFADDON_PERMISSIONS  = 
-  #Set GENPDFADDON_MARGINS  = 
-  #Set GENPDFADDON_BODYCOLOR  = 
-  #Set GENPDFADDON_STRUCT  = 
 
 Limiting the PDF Generation Region 
The add-on allows the user to define the region of the topic that should be included in the PDF generation (much like the TWiki %STARTINCLUDE% and %STOPINCLUDE% variables. In this case, HTML comments are used instead. Everything between these two comments will be included in the PDF generation. The rest of topic will be excluded.
 
-  Use <!-- PDFSTART -->to mark the starting point in the topic for PDF generation.
-  Use <!-- PDFSTOP -->to mark the stopping point in the topic for PDF generation.
Note that there can be multiple PDFSTART and PDFSTOP comment pairs in a single topic to selectively include/exclude multiple sections of the topic in the PDF document. (If you view this page in 
raw mode or edit it, you'll see an example of multiple PDFSTART/PDFSTOP sections to exclude the TWiki table of contents). If no PDFSTART/PDFSTOP comment pair appears in the topic, the entire topic text is used. In general, this should not be a problem except for title topics that include forms as the form meta-data will show up in a fairly illegible manner at the end of the document. Therefore, for topics that reference forms, a PDFSTART comment should be placed at the beginning of the topic and a PDFSTOP should be placed at the end.
 NOTE:
 NOTE: all %META: tags are removed from the base topic. If you want to display form data, you should add %FORMFIELD{"field"}% tags to the topic or title topic.
 Creating and Configuring a Title Page 
The add-on allows the user to use a topic as a title page for PDF generation. Earlier versions of the add-on required that the title page be expressed using pure HTML as the title page topic was not TWiki rendered. The latest version of add-on, however, does full TWiki rendering of the title topic page like any other TWiki topic. In addition, the following variables can be passed with the URL to override their settings elsewhere (e.g. in the web preferences or TWiki preferences pages).
Also note that the PDFSTART and PDFSTOP HTML comments should be placed at the beginning and end of title topic. An example title page can be found at 
GenPDFExampleTitleTopic.
 Creating and Configuring Headers and Footers 
The add-on also allows the user to configure header and footer formats for both the main section of the document and the table of contents. Configuring the main header and footer is much like configuring a title page. You can select a TWiki topic to use for the header and footer. Remember to wrap the HTML comments that HTMLDOC uses for the header and footer between <!-- PDFSTART 
 and <!-- PDFSTOP 
 tags. The add-on will perform TWiki common variable substition within the HTMLDOC header/footer HTML comments. This will allow TWiki variables (such as %REVINFO{web="%WEB%" topic="%BASETOPIC%"}%) to be embedded in the headers and footers.
See the HTMLDOC documentation at 
http://www.htmldoc.org
 for details of the format of the header and footers. In addition, the 
genpdf script will perform variable substition for the %GENPDFADDON _BANNER%, %GENPDFADDON _TITLE%, and %GENPDFADDON _SUBTITLE% variables as it does for the title page. Finally, the PDFSTART and PDFSTOP HTML comments should be placed at the beginning and end of header/footer topic. An example header/footer page can be found at 
GenPDFExampleHeaderFooterTopic.
 Frequently Asked Questions 
 
-  How do I stop the table of contents from being generated?  
-  For some topics, like User topics, it doesn't make any sense to have a table of contents generated so add 'pdftoclevels=0'as a URL parameter.
-  When I do a recursive PDF of WebHome it doesn't include all topics  
-  That's because some topics distributed with TWiki don't have a parent association. If you really want to include every topic in the web, you should reparent them all with WebHome as the parent.
 Add-On Installation Instructions 
Note: You do not need to install anything on the browser to use this add-on. The following instructions are for the administrator who installs the add-on on the server where TWiki is running. 
 
-  Install htmldoc from http://www.htmldoc.org/ (optionally use the patch in the Addon's zip file for headers on every page) (optionally use the patch in the Addon's zip file for headers on every page)
-  Download the ZIP file from the Add-on Home (see below)
-  Unzip GenPDFAddOn.zipin your twiki installation directory. Content:         | File: | Description: |     | data/TWiki/GenPDFAddOn.txt | Add-on topic |     | data/TWiki/GenPDFExampleTitleTopic.txt | Example title topic |     | data/TWiki/GenPDFExampleHeaderFooterTopic.txt | Example header and footer topic |     | bin/genpdf | Add-on script |     | lib/TWiki/Contrib/GenPDF.pm | Add-on perl package |     | lib/TWiki/Contrib/GenPDFAddOn/Config.spec |  |  
 
-  Adjust the script ownership to match your webserver configuration (e.g. chown nobody genpdf) if needed.
-  Make sure the script is executable (e.g chmod 755 genpdf).
-  Adjust the perl path in the genpdf script to match your perl installation location.
-   BEFORE TWiki-4 Add the variable BEFORE TWiki-4 Add the variable$htmldocCmd = "/path/to/htmldoc";to yourlib/TWiki.cfgfile so TWiki can find the location of the htmldoc executable. Look for $fgrepCmd.
-   TWiki-4 and later Configure the TWiki-4 and later Configure the$TWiki::cfg{Extensions}{GenPDFAddOn}{htmldocCmd} = "/path/to/htmldoc";usingconfigure(in theExtensionssection)
-   Copy the preferences from above and paste them into TWikiPreferences, or the WebPreferences topic for a single web. Copy the preferences from above and paste them into TWikiPreferences, or the WebPreferences topic for a single web.
-  Test if the installation was successful: 
-  Try loading this page  
-  If it doesn't work, check your webserver logs for any problems. The most common issue is probably an htmldoc installation problem.
 
 Known Bugs 
 
-  Verbatim text runs off the page. This is a limitation of HTMLDOC. Preformatted text may run off the edge of the page and be truncated.
-  HTMLDOC crashes with segmentation faults. Eg it fails to generate TWikiDocumentation. I managed to get it to work a few times, but it generally fails. The error returned is Conversion failed: 'Inappropriate ioctl for device' at /var/www/twiki/lib/TWiki/Contrib/GenPDF.pm line XXX
-  Some pages don't have a header. HTMLDOC breaks the page for every level 1 heading (Eg. <h1>) but it doesn't write a header for the new page, so topics with lots of level 1 headings and not much content don't seem to have any headers. Therefore I patched htmldoc-1.8.24 to force a header for every new page:
*** htmldoc-1.8.24/htmldoc/ps-pdf.cxx   Sat Oct 30 05:53:59 2004
--- htmldoc-1.8.24/htmldoc/ps-pdf_force_header.cxx      Tue Jun 13 02:12:28 2005
***************
*** 1465,1471 ****
      pspdf_prepare_heading(page, print_page, pages[page].header, top,
                            page_text, sizeof(page_text),
!                         page > chapter_starts[chapter] ||
                              OutputType != OUTPUT_BOOK);
      pspdf_prepare_heading(page, print_page, pages[page].footer, 0,
                            page_text, sizeof(page_text));
--- 1465,1472 ----
      pspdf_prepare_heading(page, print_page, pages[page].header, top,
                            page_text, sizeof(page_text),
! /*                      page > chapter_starts[chapter] || */
!                         1 || /* force heading onto chapter front page */
                              OutputType != OUTPUT_BOOK);
      pspdf_prepare_heading(page, print_page, pages[page].footer, 0,
                            page_text, sizeof(page_text));
 Add-On Info 
| Add-on Author: | TWiki:Main/BrianSpinar  , TWiki:Main/WadeTurland   | 
| Add-on Version: | 02 Jul 2008 | 
| Change History: |  | 
| 02 Jul 2008 | added support for VIEW_TEMPLATEandCOVER;                  fixed rendering of anchor and img tags;                  addedpdfstructparameter to print unstructured webpages as well;                  TWiki:Main.MichaelDaum  | 
| 25 Jun 2008 | added templateactivation and Configure script spec file TWiki:Main.SvenDowideit  | 
| 25 Jun 2008 | security and TWiki 4.2 fixes TWiki:Main.SvenDowideit   | 
| 2 Nov 2007 | Added new header and footer control (Bugs:Item4916) and fixed generation of wrong TWiki page (Bugs:Item4915) | 
| 23 Oct 2007 | Fixed Bugs:Item4452 & Bugs:Item4885, compatibility with Perl 5.6 and  missing images with SSL certificates | 
| 31 Aug 2007 | Fixed Bugs:Item4530, improper rendering of lists | 
| 13196 | Removed noptags before sending to htmldoc, fixed Bugs:Item3549 | 
| 11673 | TWiki:Main/RickMach  updated MIME type to pdf from x-pdf, fixed bug preventing disabling the TOC | 
| 9716 | TWiki:Main/CrawfordCurrie  added content-disposition header to files, so they download using a sensible file name | 
| 9683 | TWiki:Main/CrawfordCurrie  updated for TWiki-4 | 
| Version 0.6 (28 Jun 2005) | Less aggressive regex for removing twikiNewLink spans so it doesn't break when using the TWiki:Plugins.SpacedWikiWordPlugin Added 'recursive' preference to include chapters for all descendents of the base topicUse File::Spec->tmpdir to specify the default directory for temporary files
 | 
| Version 0.5 (16 Jun 2005) | Redirect to 'oops' URLs if permission denied or topic does not exist.Removed twikiNewLink spans from title page so they render as normal text (without the blue ? mark).Fully qualify image/href URLs on the title page.Changed temp files to use OO style 'new File::Temp;' for better code portability.
 | 
| Version 0.4 (13 Jun 2005) | Better security (now calls system($TWiki::htmldocCmd, @htmldocArgs))Checks return code of htmldoc and returns an error on failureValidation of preferences Preferences changed to comply with TWikiPlugins standardBetter HTML3.2 compatibility for htmldoc 1.8.24 (downgrades a few elements)Full integration of PDF META tags (optionally using 2 FORMFIELDs):   ==%FORMFIELD{"TopicHeadine"}%== for PDF Subject field==%FORMFIELD{"KeyWords"}%== for PDF Keywords fieldall other PDF fields use topic infoMore htmldoc options (margins, permissions, numbered TOC, logoimage, headfootfont) using preferencesRemoved %TOC%fields so it only uses HTMLDOC's TOCTitle topic and header/footer topics are now fully renderedFixed the heading shifting functionFully qualify links, making the document portableHTMLDOC output goes to a temp file instead of stdoutTemp files now use 'GenPDFAddOn' prefix. (Eg. /tmp/GenPDFAddOn1wr3C48ibd.html)
 | 
| Version 0.3 (12 Apr 2005) | Added full TWiki rendering to title topic pageAdded TWiki common variable expansion to header/footer topic page
 | 
| Version 0.2 (26 Mar 2005) | Fixed bug with table of contents generation (i.e. it was always generated even if pdftoclevelswas set to 0)Now allow multiple PDFSTART/PDFSTOPpairs within a single page to include/exclude multiple sections of the same pageAdded Brent Robert's fix to allow operation with latest version (1.8.24) of HTMLDOC
 | 
| Version 0.1 (30 Jan 2005) |  | 
| CPAN Dependencies: | File::Temp (if for some reason you don't already have it installed ) | 
| Other Dependencies: | HTMLDOC (http://www.htmldoc.org  ) | 
| Perl Version: | 5.005 or above | 
| License: | GPL | 
| Add-on Home: | http://TWiki.org/cgi-bin/view/Plugins/GenPDFAddOn   | 
| Feedback: | http://TWiki.org/cgi-bin/view/Plugins/GenPDFAddOnDev   | 
| Appraisal: | http://TWiki.org/cgi-bin/view/Plugins/GenPDFAddOnAppraisal   | 
Related Topic: TWikiAddOns
-- 
TWiki:Main/WadeTurland
 - 28 Jun 2005
 
-- 
TWiki:Main/BrianSpinar
 - 12 Apr 2005