Upload
others
View
2
Download
0
Embed Size (px)
Citation preview
Copyright 2011 Inera Incorporated. All Rights Reserved
Challenges in eBook Production
Presented by
Bruce D. Rosenblum
CEO
Inera Incorporated
STM International, 1 December 2011
Copyright 2011 Inera Incorporated. All Rights Reserved
Copyright 2011 Inera Incorporated. All Rights Reserved
Copyright 2011 Inera Incorporated. All Rights Reserved
So You Want an e-Book Workflow…
Introduction
DTD selection
XML and Composition
File organization and naming
Metadata
Fronts and backs
Body content challenges
Copyright 2011 Inera Incorporated. All Rights Reserved
Journals Look… Well… Plain
Copyright 2011 Inera Incorporated. All Rights Reserved
Books Look… Well… "Designy"
Copyright 2011 Inera Incorporated. All Rights Reserved
Book XML: Why Now?
Market demand
Multi-platform delivery
Production costs
Delivery time
Copyright 2011 Inera Incorporated. All Rights Reserved
Book XML Rationale
Journal XML workflow focus
Online delivery
Metadata delivery
Book XML workflow
Production efficiency
ePub creation
Copyright 2011 Inera Incorporated. All Rights Reserved
Journal/Book Issues Comparison
Journals
Standard designs
3B2, XPP batch
pagination
Automation,
automation,
automation
Books
Designers rule
InDesign centric
Let's do this all by
hand
XML production requirements inherently contradict traditional
book production. We are starting to address this issue…
Copyright 2011 Inera Incorporated. All Rights Reserved
DTD Selection
Unlike journals, no single standard DTD for books
Some choices
TEI
DocBook
NLM/JATS
DITA
Roll your own proprietary DTD
How do I choose?
Copyright 2011 Inera Incorporated. All Rights Reserved
DTD Selection
Based on
Your content
Front list vs. back list vs. historical content
Discipline(s), e.g. Humanities vs. Life sciences
Your XML use-cases
Tools you may want to use
Copyright 2011 Inera Incorporated. All Rights Reserved
TEI DTD
Origins: Academic community (Brown University)
Widely used in humanities
Great for historical materials
E.g. preserving line break/pagination information
Poetry
Least-known by suppliers
Weakest commercial tool support
Copyright 2011 Inera Incorporated. All Rights Reserved
DocBook DTD
Origins: Technical publication (O’Reilly)
Great for technical and trade books
Lots of commercial tool support
FrameMaker, ArborText
Well-known by suppliers
OASIS standard
Copyright 2011 Inera Incorporated. All Rights Reserved
NLM/JATS DTD
Origins: Scholarly journal archiving & publication
Widely used by journal publishers
Great for
Science publications, multi-author works
Publishers doing books and journals
Content with structured references
Well-known by suppliers
NISO standard (journal tag suite)
Book model not as mature; BITS revision 2012
But 3.0 is useable
Copyright 2011 Inera Incorporated. All Rights Reserved
DTD Commonalities
Any of these DTDs work well for simple
monographs
All of these DTDs are designed for customization,
if necessary
Copyright 2011 Inera Incorporated. All Rights Reserved
XML Creation
Author – Really?
After PDF
Common, but less robust
Final XML is not proofed
Before copy-editing – requires XML editors
After copy-editing, before composition
Best compromise
Content edited in Word
Proofed PDF created from XML
Copyright 2011 Inera Incorporated. All Rights Reserved
Typesetting
Books are an InDesign world
But InDesign is a limited XML platform
Difficult to import/export richly tagged XML
Automate InDesign XML-driven page layout
Custom scripts
Customizable off-the-shelf commercial software
Allows automatic layout and manual tweaking
Copyright 2011 Inera Incorporated. All Rights Reserved
XML, Composition, & Corrections
Easy (relatively speaking): XML InDesign
Hard
Corrections in InDesign
InDesign richly tagged XML
Options
Correct in InDesign; export IDML; XSLT final XML
Correct in XML and re-flow content to InDesign
Works best when InDesign page layout is automated
Copyright 2011 Inera Incorporated. All Rights Reserved
File Organization
Is an XML book
One large file?
One chapter per file?
It depends…
Can individual chapters stand alone?
Will you sell individual chapters?
Will you re-package individual chapters?
The linking problem
Inter-chapter references "See chapter 9"
Back-of-the-book reference list
Copyright 2011 Inera Incorporated. All Rights Reserved
File Naming
Bad
No system; Random editor's choice
Author names Accented letters, disambiguation
Good
Book ID Internal ID, e.g. BK12345.xml
ISBN, e.g. 978-1-4094-1940-2.xml
Chapters E.g. 978-1-4094-1940-2_C001.xml, 978-1-4094-1940-2_intro.xml
No special characters or spaces
Copyright 2011 Inera Incorporated. All Rights Reserved
Artwork File Names
Design a system that is
Logical and consistent
Unambiguous
Robust
Minimizes manual work in file preparation Bad: "Insert BK12345-Fig2.5.jpg"
Good: "Figure 2.5: Figure title"
Transform auto-generates: <graphic xlink:href="BK12345_C002_FG005"/>
Avoids possible mis-typing of figure name
File extensions: XML is better without
E.g. TIFF for PDF, GIF for online delivery
Script adds appropriate extension when rendering
Copyright 2011 Inera Incorporated. All Rights Reserved
Example Image Names
Follow and extend book conventions
Figure: 978-1-4094-1940-2_C001_FG003
But what about other types
Plate: 978-1-4094-1940-2_C001_PL003
Map: 978-1-4094-1940-2_C001_MP003
Exhibit: 978-1-4094-1940-2_C001_EX003
Equation: 978-1-4094-1940-2_C001_EQ003
Consider all artwork types across publications
Copyright 2011 Inera Incorporated. All Rights Reserved
Special Artwork Naming
Unnumbered images
Figure: 978-1-4094-1940-2_C001_UN003
Special cases
When the "name" is better
Lots of unnumbered images
Field guide
<graphic xlink:href=
"Eusynthemis-tillyardi-male"/>
Copyright 2011 Inera Incorporated. All Rights Reserved
Equations
Format options
MathML
TeX
Images
Consider composition and delivery requirements
InDesign works best with images
eBook readers can't render MathML or TeX
Future proof
MathML + images
Copyright 2011 Inera Incorporated. All Rights Reserved
Metadata
Identifiers ISBN
Print, PDF, ePub… How many? Consult www.bisg.org
Book ID (internal publisher ID)
DOI Chapters, figures, tables, too?
Book "type" (publisher classification)
Publisher and imprint information
Copyright, publication date, edition
Authors, editors, translators Book-level
Chapter-level
More… Each publisher has unique requirements
Copyright 2011 Inera Incorporated. All Rights Reserved
Fronts
Title and half-title page
Copyright page
Tables of contents
Regular and Expanded
List of figures, tables, etc.
Prelims
Preface
Introduction
Dedication
Acknowledgements
Copyright 2011 Inera Incorporated. All Rights Reserved
Title and Half Title Page
<book-title>The Instrumental Music
of Schmeltzer, Biber, Muffat
and their Contemporaries</book-title>
<contrib contrib-
type="author"><name><surname>Brewer</surname>
<given-names>Charles E.</given-
names></name></contrib>
<aff>Associate Professor of Musicology, The Florida
State University, USA</aff>
<p content-type="cat-data-title">British Library
Cataloguing in Publication Data</p>
<p content-type="cat-data> Brewer, Charles E.
(Charles Everett) </p>
<p content-type="cat-data"> The
instrumental music of Schmeltzer, Biber, Muffat and
their contemporaries. </p>
<p content-type="cat-data"> 1.
Instrumental music – 17th century – History and
criticism. </p>
<p content-type="cat-data-indent">I. Title </p>
<p content-type="cat-data-indent>784'.09032-dc22
</p>
Copyright 2011 Inera Incorporated. All Rights Reserved
Tables of Contents
To XML or Not?
Included in XML
Data redundancy and corrections
Simple, but risk of error
Excluded from XML
Build automatically from chapters
Expanded TOC information in each chapter
Requires some script expertise, but more robust
Copyright 2011 Inera Incorporated. All Rights Reserved
Prelims
Introduction, Preface, etc.
"Mini-chapters"
Usually very simple
But…
Unnumbered artwork
End signature
One file or many?
Copyright 2011 Inera Incorporated. All Rights Reserved
Backs
Bibliography and References
Notes
Glossary
Index
Copyright 2011 Inera Incorporated. All Rights Reserved
Bibliography and References
Linking to CrossRef
Not required, but desirable
Linking from chapters
Consider
chapter-level reference lists
Back-of-the-book bibliography
Copyright 2011 Inera Incorporated. All Rights Reserved
Notes
Best to place in each chapter
Avoids linking problems
Script can collect all for back-of-book
Copyright 2011 Inera Incorporated. All Rights Reserved
Glossary
XML setup
Back-chapter
Inline definition
Allows marginalia presentation
Script can collect
for back-of-book
Copyright 2011 Inera Incorporated. All Rights Reserved
Glossary XML Example
<xref ref-type=“glossary” rid=“gl002”>Cognitions</xref> represent
any "knowledge…"
<book-part id=“glossary” book-part-type=“glossary”>
<back><glossary>
<def-list>
<def-item id=“gl002”>
<term>Cognitions</term>
<def>A person's knowledge, opinions, or beliefs.</def>
</def-item>
</def-list>
</glossary></back></book-part>
Copyright 2011 Inera Incorporated. All Rights Reserved
Indexes
The hand-curated index, not auto-generated index
Vestige of print or useful scholarly tool?
Print index
Used to find useful information
Used to evaluate book contents
Electronic index
Used in a "search" world?
Perhaps to evaluate book contents
The index isn't dead yet, is it?
Copyright 2011 Inera Incorporated. All Rights Reserved
The Index Workflow Problem
Integrated creation in Word
Authors not index specialists
Hard to integrate into editorial
Can't create until book fully paginated
Index specialists are not XML experts
How to markup richly linked index?
It's not easy
Perhaps unlinked text index is OK?
Copyright 2011 Inera Incorporated. All Rights Reserved
Body Content Challenges
Complex boxes
Table formatting
Discontinuous Lists
"See page…"
Copyright 2011 Inera Incorporated. All Rights Reserved
Box 1: Box Title
This is some text in a
sidebar box. Boxes
may also contain
figures, lists, equations,
or even sub-boxes
inside
Complex Boxes
Copyright 2011 Inera Incorporated. All Rights Reserved
Text in Boxes <boxed-text id=“exh1" content-type="exhibit“>
<label>Exhibit 1</label><title>Factors</title>
<list id="L1" list-type="bullet">
<list-item>conditions</list-item>
<list id="L2" list-type="bullet">
<list-item>high</list-item>
<list-item>significant</list-item>
</list>
</list>
</boxed-text>
Copyright 2011 Inera Incorporated. All Rights Reserved
Table Formatting
Shaded cells
CSS attributes in HTML model table
<styled-content style="1" style-type="shade">Cell
content</styled-content>
Special cell borders
E.g. double-underline in financial tables
CSS attributes in HTML model table
CALS requires custom setup
Copyright 2011 Inera Incorporated. All Rights Reserved
Discontinuous Lists
<list id="L1" list-type="order">
<label>1</label>list-item><p>Item
1</p></list-item>
<label>2</label>list-item><p>Item
2</p></list-item>
<label>3</label>list-item><p>Item
3</p></list-item>
</list>
<p>Some interesting text in the middle
of a list, but not part of a list item</p>
<list id="L2" list-type="order"
continued-from="L1">
<list-item>
<label>4</label>list-item><p>Item
4</p></list-item>…
Images courtesy CFA Institute
1. Item 1
2. Item 2
3. Item 3
Some interesting text in the middle
of a list, but not part of a list item
4. Item 4
5. Item 5
Copyright 2011 Inera Incorporated. All Rights Reserved
"See page" Problem
In print: "See page 253"
What does this mean in an eBook?
Link to
A paragraph
A section head
An arbitrary point
No good solution
Except, perhaps, author education?
Copyright 2011 Inera Incorporated. All Rights Reserved
Afterthought: Book Errata
We all make mistakes…
Provide
Errata URL in front of book (DOI is better)
Form to report errors
Update errata page as errors are found
Discussion:
http://www.linkedin.com/groupAnswers?viewQuestionAndAnswer
s=&discussionID=81521719&gid=65026&trk=eml-anet_dig-b_nd-
pst_ttle-cn&ut=0u5NFofoH3hl01
Example:
http://www.berkshirepublishing.com/brw/product.asp?projID=65
Copyright 2011 Inera Incorporated. All Rights Reserved
Conclusions
eBooks are here, now
Production more complex than journals
XML requirements
Workflow requirements
InDesign Limitations
But all can be overcome
While adding XML as a product driver
And gaining production efficiencies and cost savings
Copyright 2011 Inera Incorporated. All Rights Reserved
Questions?
Bruce Rosenblum
Inera Incorporated
+1 (617) 932 - 1932
www.inera.com