Word documents to XML via upCast
Following Rice's example, using upCast to convert Word documents to upCast (Windows machine).
- Once upCast is installed: Resources > templates > Word to DocBook > open template.
- Documentation for upCast: http://upcast.de/iloop/assets/content/products/upcast/765b1744/doc/manual/html/index.html#N2004D
- Documentation also in the Word to DocBook folder
- Change the Catalog (under Pipeline Settings) to
${pipeline:PipelineBase}/resources/schema/catalog
- Strip the title page, TOC from the Word documents; Large documents kept failing/timing-out so I had to break them in two.
- Choose the file (even though it says rtf to DocBook v 5.0 it seemed to convert .docx files just fine).
- I didn't select any options
- Table Model: CALS
- DocBook structure: book > chapter > section
DocBook Notes
- Fix hierarchy:
- Chapters are Intro, Standard Docs (overview), Maintenance Docs (overview), Appendix
- Section X.X would be each individual standard doc or maintenance doc or item in the appendix.
- Section X.X.X are sub-titles for each document: Getting Started, Process Overview, Document Layout, Business Rules and Routing (all are on the same level in the hierarchy).
- Removed TOC description tables under chapters
- Removed extra TOC anchors (TOC_xxxxx...) from Word. Check anchors
- Put red arrows in note element; remove the red arrow images
- Put tips into tip elements
- Table titles need to have emphasis as bold
- Changed column widths on tables to 1.0*
- Changed "phrase role="strong"" to "emphasis role="bold""
- Added the Kuali copyright comment to each book from Rice xml files (kept the copyright from 0.8 docs; need to determine which/what is needed)
CSS Stylesheet
From Rice, https://svn.kuali.org/repos/rice/trunk/src/site/docbook/, did some minor modifications to their css (mostly removing non-applicable/OLE-book-breaking elements).
OLEdocbooks.css is my work in progress.
Availability
Can we follow this format: DocBook Environment Setup? - Jeff Caddel has experience with checking in and working with DocBook xml in SVN. He is looking into hooking the DocBook output into OLE.