RFC Beautification Working GroupIndependent Submission R. GiebenInternet-DraftRequest for Comments: 7328 GoogleIntended status:Category: InformationalOctober 15, 2013 Expires: April 18,August 2014 ISSN: 2070-1721 Writing I-Ds and RFCsusingUsing Pandoc and abitBit of XMLdraft-gieben-pandoc2rfc-03Abstract This document presents a technique for using a Markdown syntax variant, called Pandoc, and a(bit)bit of XML[RFC2629](as defined in RFC 2629) as a source format for documentsin thethat are Internet-Drafts (I-Ds)and Request for Comments (RFC) series.or RFCs. The goal of this technique (which is called Pandoc2rfc) is to let an author of an I-D focus on the main body of text without being distracted too much by XMLtags,tags; however, it doeshowevernot alleviate the need to typeset some files in XML. Status of This Memo ThisInternet-Draftdocument issubmitted in full conformance withnot an Internet Standards Track specification; it is published for informational purposes. This is a contribution to theprovisionsRFC Series, independently ofBCP 78any other RFC stream. The RFC Editor has chosen to publish this document at its discretion andBCP 79. Internet-Draftsmakes no statement about its value for implementation or deployment. Documents approved for publication by the RFC Editor areworking documentsnot a candidate for any level oftheInternetEngineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The listStandard; see Section 2 of RFC 5741. Information about the currentInternet- Drafts is at http://datatracker.ietf.org/drafts/current/. Internet-Drafts are draft documents valid for a maximumstatus ofsix monthsthis document, any errata, and how to provide feedback on it may beupdated, replaced, or obsoleted by other documentsobtained atany time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." This Internet-Draft will expire on April 18, 2014.http://www.rfc-editor.org/info/rfc7328. Copyright Notice Copyright (c)20132014 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document.Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.Table of Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2 2. Pandoc to RFC . . . . . . . . . . . . . . . . . . . . . . . .32 2.1. Dependencies . . . . . . . . . . . . . . . . . . . . . .45 3. Building an Internet-Draft . . . . . . . . . . . . . . . . .45 4. Supported Features . . . . . . . . . . . . . . . . . . . . . 5 5. Unsupported Features and Limitations . . . . . . . . . . . .67 6. Pandoc Style . . . . . . . . . . . . . . . . . . . . . . . . 7 6.1. Figures . . . . . . . . . . . . . . . . . . . . . . . . . 7 6.2. Tables . . . . . . . . . . . . . . . . . . . . . . . . . 7 6.3. References . . . . . . . . . . . . . . . . . . . . . . . 77. Acknowledgements . . . . . . . . . . . .6.4. Index . . . . . . . . . .8 8. Security Considerations . . .. . . . . . . . . . . . . . . . 89. IANA Considerations . . . . . . . . . . . . . . . . . . .7. Acknowledgements . .8 10. Normative References. . . . . . . . . . . . . . . . . . . . 8Appendix A. Changelog . .8. Security Considerations . . . . . . . . . . . . . . . . . . . 8A.1. -00 . .9. References . . . . . . . . . . . . . . . . . . . . . . . . . 8A.2. -01 . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 A.3. -02 . . . . . . . . . . . . . . . . . . .9.1. Normative References . . . . . . . .9 A.4. -03. . . . . . . . . . 8 9.2. Informative References . . . . . . . . . . . . . . . . . 9 AppendixB.A. Cheat Sheet . . . . . . . . . . . . . . . . . . . .9 Author's Address . . . . . . . . . . . . . . . . . . . . . . . .10 1. Introduction This document presents a technique for using a Markdown [Markdown] syntax variant, called Pandoc [Pandoc], and a bit of XML [RFC2629] as a source format for documentsin thethat are Internet-Drafts (I-Ds)and Request for Comments (RFC) series.or RFCs. The goal of this technique is to let an author of an I-D focus on the main body of text without being distracted too much by XMLtags,tags; however, it doeshowevernot alleviate the need to typeset some files in XML. Pandoc isana format that is almost plain textformatandtherefortherefore particularly well suited for editing RFC-like documents. The syntax itself is asuper setsuperset of the syntax championed by Markdown. 2. Pandoc to RFC Pandoc's syntax is easy to learn andwritewrite, and it can be translated to numerous output formats, including, but not limited to: HTML, EPUB, (plain)MarkdownMarkdown, and DocBook XML. Pandoc2rfc allows authors to write in Pandoc syntaxwhichthat is then transformed to XML and given to xml2rfc. The conversions are, in awayway, amusing, as we start off with (almost) plain text, use elaborateXMLXML, and end up with plain text again. +-------------------+ pandoc +---------+ | ALMOST PLAIN TEXT | ------> | DOCBOOK | +-------------------+ 1 +---------+ | | non-existent | 2 | xsltproc faster way | | v v +------------+ xml2rfc +---------+ | PLAIN TEXT | <-------- | XML | +------------+ 3 +---------+ Figure 1: Attempt to justifyPandoc2rfc.Pandoc2rfc The output of step 2 in Figure 1 is XMLwhichthat is suitable for inclusion in either the "middle" or "back" section of an RFC. Even though Pandoc2rfc abstracts away a lot of XML details, there are still places left where XML files needs to beedited. Mostedited -- most notably in the "front" section of an RFC. The simplest way to start using Pandoc2rfc is to create a template XML file and include the appropriate XML for the "front","middle""middle", and "back" section: <?xml version='1.0' ?> <!DOCTYPE rfc SYSTEM 'rfc2629.dtd' [ <!ENTITY pandocAbstract PUBLIC '' 'abstract.xml'> <!ENTITY pandocMiddle PUBLIC '' 'middle.xml'> <!ENTITY pandocBack PUBLIC '' 'back.xml'> <!ENTITY rfc.2629 PUBLIC '' 'reference.RFC.2629.xml'> ]> <rfc ipr='trust200902' docName='draft-gieben-pandoc2rfc'> <front> <title>Writing I-Ds and RFCs using Pandoc</title> <author> <organization/> <address><uri>http://www.example.com</uri></address> </author> <date/> <abstract> &pandocAbstract; </abstract> </front> <middle> &pandocMiddle; </middle> <back> <references title="Normative References"> &rfc.2629; </references> &pandocBack; </back> </rfc> Figure 2: A minimaltemplate.xml.template.xml In thiscasecase, you will need to edit four documents: 1. "abstract.mkd" - contains the abstract; 2. "middle.mkd" - contains the main body of text; 3. "back.mkd" - holds the appendices (if any); 4.Andand this "template.xml"--- probably a fairly staticfile,file; among other things, it holds the author(s) and the references.Up to dateUp-to-date source code for Pandoc2rfc can be found at[Pandoc2rfc],[Pandoc2rfc]; this includes the style sheet"transform.xsl""transform.xsl", which is used for the XML transformation (also see Section 3). 2.1. Dependencies Pandoc2rfc needs "xsltproc" [XSLT] and "pandoc" [Pandoc] to be installed. The conversion to xml2rfc XML is done with a style sheet based on XSLT version 1.0 [W3C.REC-xslt-19991116]. When using the template from Figure22, xml2rfc version 2 (or higher) must be used. 3. Building an Internet-Draft Assuming the setup from Section 2, we can build an I-D as follows (in a Unix-like environment): for i in abstract middle back; do pandoc -st docbook $i.mkd | xsltproc --nonet transform.xsl - > $i.xml done xml2rfc template.xml -f draft.txt --text # create text output xml2rfc template.xml -f draft.html --html # or create HTML output xml2rfc template.xml -f draft.xml --exp # or create XML output Figure 3: Building anI-D.I-D Note that the output file names (abstract.xml,middle.xmlmiddle.xml, and back.xml) must match the names used as the XML entities in"template.xml""template.xml". (See the "!ENTITY" lines in Figure2).2.) The Pandoc2rfc source repository includes a shell script that incorporates the above transformations. Creating a "draft.txt" or a "draft.xml" can be done with "pandoc2rfc *.mkd" and "pandoc2rfc -X*.mkd"*.mkd", respectively. 4. Supported Features The full description of Pandoc's syntax can be found in [PandocGuide]. The following features of xml2rfc are supported by Pandoc2rfc (also see Table 1 in AppendixBA for a "cheat sheet"): o Sections with an anchor and title attributes; o Severallistslist styles: * style="symbols", use "* " for each item; * style="numbers", use digits: "1. " for each item; * style="empty", use "#. " for each item; * style="format %i", useromanlowercase Roman numerals: "ii. "; * style="format (%d)", useromanuppercase Roman numerals "II. "; * style="format ...", use strike-through text at the start in the first element, "1. ~~REQ%d.~~ "; * style="letters", use lower- or uppercase letters: "a. " and "A. " (note: two spaces as mandated by Pandoc); * style="hanging", use the Pandoc definition list syntax: Term 1 : Definition 1Figure 4: Pandoc syntax used for a hanging paragraph.o Spanx style="verb",style="emph"style="emph", and style="strong",respectivelyrespectively, use: "`text`", "_text_" or "**text**"; o Blockquotequote, which is converted to a paragraph within a "<liststyle="empty">" paragraph;style="empty">"; o Figures with an anchor and title (Section 6.1); o Tables with an anchor and title (Section 6.2); o References (Section 6.3) * external ("<eref>"); *internalcross-reference ("<xref>"), to: + sections (handled by Pandoc); + figures (handled by XSLT); + tables (handled by XSLT). o Index, by using footnotes and superscript text (Section 6.4); o Citations, by usinginternal references;cross-references; o Processing Instructions("PI"s: "<?rfc?>"),(PIs), which appear as "<?rfc?>", may be used after a sectionheader, theyheader. They are carried over to the generatedXML;XML. o The"<vspace>"-tag"<vspace>" tag is supported and carried over to the generated XML. 5. Unsupported Features and Limitations WithPandoc2rfcPandoc2rfc, an author of an I-D can get a long way without needing to input XML, but it is not a 100% solution. The initial setup and the reference library stillforcesforce the author to edit XML files. Themeta datametadata feature (Pandoc's "Title Block" extension) is not used in Pandoc2rfc. This information (authors, date,keywordkeyword, and URLs) should be put in the "template.xml". Some other quirks: oAn index is not supported; oComments are supported via HTML comments in the Pandoc sourcefiles;files. o Citations are supported viainternal references,cross-references; the citation syntax of Pandoc is notused;used. o Authors still need to know how to deal with possible errors from xml2rfc. 6. Pandoc Style The following sections detail how to use the Pandoc syntax for figures,tablestables, and references to get the desired output. 6.1. Figures Indent the paragraph with 4 spaces as mandated by Pandoc. If you add an inline footnote _directly_ after the figure, the artwork gets a title attribute with the text of that footnote (and a possible anchor). 6.2. Tables A table can be entered by using Pandoc's table syntax. You can choose multiple styles as input, but they all are converted to the same style table (plain "<texttable>")tablein xml2rfc. If you add an inline footnote _directly_ after the table, it will get a title attribute with the text of that footnote (and a possible anchor). Thebuilt inbuilt-in syntax of Pandoc to create a caption with "Table:" should not be used. 6.3. References Pandoc provides a syntax that can be used for references. Its syntax is repeated in this paragraph. Any referencelike:like "[Clickhere](URI)",here](URI)" is an external reference. An internal(i.e. see Section X)reference (i.e., "see Section X") is typesetwith:with "[](#localid)". For referencing RFCs (and othercitations),documents), you will need to add the reference source in thetemplate,template as an external XMLentity,entity; Figure 2 provides an example. Afterthatthat, you can usean internal reference:the following syntax to create a citation: "[](#RFC2629)" toreferencecite RFC 2629. There is no direct support for referencing tables,figuresfigures, and artworks, butpandoc2rfcPandoc2rfc employs the following "hack". If an inline footnote is added after the figure or table, the text of the footnote is used as the title. The first word up until a double colon "::" will be used as the anchor. If a figure has ananchoranchor, it will be centered on the page. Figure22, for instance, is followed by this inline footnote: ^[fig:minimal::A minimal template.xml.] 6.4. Index An index can be generated by using the following syntax: ^[ ^item^ subitem ] where "subitem" is optional. 7. Acknowledgements The following people have helped shape Pandoc2rfc: Benno Overeinder, Erlend Hamnaberg, MatthijsMekkingMekking, and Trygve Laugstoel. 8. Security Considerations This document raises no security issues. 9.IANA Considerations This document has no actions for IANA. 10.References 9.1. Normative References[Markdown] Gruber, J., "Markdown", 2004, <http://daringfireball.net/projects/markdown/>. [Pandoc2rfc] Gieben, R., "Pandoc2rfc git repository", October 2012, <http://github.com/miekg/pandoc2rfc>. [PandocGuide] MacFarlane, J., "Pandoc User's Guide", 2006, <http://johnmacfarlane.net/pandoc/README.html>. [Pandoc] MacFarlane, J., "Pandoc, a universal document converter", 2006, <http://johnmacfarlane.net/pandoc/>.[RFC2629] Rose, M., "Writing I-Ds and RFCs using XML", RFC 2629, June 1999. [W3C.REC-xslt-19991116] Clark, J., "XSL Transformations (XSLT) Version 1.0", World Wide Web Consortium Recommendation REC-xslt-19991116, November 1999, <http://www.w3.org/TR/1999/REC-xslt-19991116>. [XSLT] Veillard, D., "The XSLT C library for GNOME", 2006, <http://xmlsoft.org/XSLT/xsltproc2.html>. 9.2. Informative References [Markdown] Gruber, J., "Markdown", 2004, <http://daringfireball.net/projects/markdown/>. [Pandoc] MacFarlane, J., "Pandoc, a universal document converter", 2006, <http://johnmacfarlane.net/pandoc/>. [Pandoc2rfc] Gieben, R., "Pandoc2rfc git repository", October 2012, <http://github.com/miekg/pandoc2rfc>. [PandocGuide] MacFarlane, J., "Pandoc User's Guide", 2006, <http://johnmacfarlane.net/pandoc/README.html>. Appendix A.Changelog [This section should be removed by the RFC editor before publishing] A.1. -00 1. Initial document. A.2. -01 1. Lots of updates; 2. Added the style sheet use in an appendix. A.3. -02 1. Make "template.xml" actually valid XML; 2. Removed the style sheet from the appendix; 3. Make more explicit that typesetting some XML files is still needed; 4. Fix blockquote text and conversion; 5. Overhauled the way references to figures and tables work; 6. Cleaned up and removed duplicate text. A.4. -03 1. Change affiliation for R. Gieben. Appendix B.Cheat Sheet +---------------------+-----------------+--------------+ | Textual construct | Pandoc syntax | Text output | +---------------------+-----------------+--------------+ | Section Header | "# Section" | 1. Section | | Unordered List | "* item" | o item | | Unordered List | "#. item" | item | | Ordered List | "1. item" | 1. item | | Ordered List | "a. item" | a. item | | Ordered List | "ii. item" | i. item | | Ordered List | "II. item" | (1) item | | Ordered List | "A. item" | A. item | | Ordered List | "1. ~~REQ%d.~~" | REQ1. | | Emphasis | "_text_" | _text_ | | Strong Emphasis | "**text**" | *text* | | Verbatim | "`text`" | "text" | | Block Quote | "> quote" | quote | | External Reference | "[Click](URI)" | Click [1] | | Internal Reference | "[](#id)" | Section 1 | | Figure Anchor | "^[fid::text]" | N/A | | Figure Reference | "[](#fid)" | Figure 1 | | Table Anchor | "^[tid::text]" | N/A | | Table Reference | "[](#tid)" | Table 1 | | Citations | "[](#RFC2119)" |[RFC2199][RFC2119] | | Table | Tables | * | | Figures | Code Blocks | * | | Definition List | Definition | * | | Index | ^[ ^item^ ] | * | +---------------------+-----------------+--------------+ * This construct creates output too voluminous to show in the table. Table 1: The most important textual constructs that can be used inPandoc2rfc. The bottom three create output to voluminous to show in this table.Pandoc2rfc Author's Address R. (Miek) Gieben GoogleEmail:EMail: miek@google.com