The Long Road to JATS Paul Donohoe – Senior XML Developer Jenny Sherman – XML Developer Ashwin Mistry - XML Developer Macmillan Science and Education

Embed Size (px)

Citation preview

  • Slide 1
  • The Long Road to JATS Paul Donohoe Senior XML Developer Jenny Sherman XML Developer Ashwin Mistry - XML Developer Macmillan Science and Education
  • Slide 2
  • In the beginning over half a million articles (increased to a million by the time the project started) 143 journals (increased to 180) three in-house DTDs five typesetters numerous workflows and production systems teams based across the world
  • Slide 3
  • Mapping process
  • Slide 4
  • Slide 5
  • Slide 6
  • Mapping to JATS the value of examining the content Example date element 19991125 Feasibility report mapping Content-based mapping Nothing. Used for only 10 articles in one supplement.
  • Slide 7
  • Standardization to JATS List type mapping from JATS to AJ or NPG AJJATSNPG Numbers1 2 3 Numbers with parens1) 2) 3) Lettersa b c Letters with parensa) b) c) Uppercase lettersA B C Roman numeralsi ii iii Roman numerals with parensi) ii) iii) Uppercase Roman numeralsI II III Bullets
  • Slide 8
  • Program kickoff
  • Slide 9
  • External customers
  • Slide 10
  • Internal customers
  • Slide 11
  • Existing workflow StagingLive
  • Slide 12
  • Existing to final desired workflow StagingLive Content Hub
  • Slide 13
  • Suggested interim workflow StagingLive Staging
  • Slide 14
  • Current hybrid workflow Staging Live
  • Slide 15
  • Opportunities
  • Slide 16
  • Slide 17
  • Scrum Development Process
  • Slide 18
  • (Re)Write a test Does the test fail? (Re)Write production code Run all tests Clean up code Test Succeeds Test Fails Tests Fail All Tests Succeed Repeat Test Driven Development
  • Slide 19
  • Unit Testing
  • Slide 20
  • XMLUnit
  • Slide 21
  • Slide 22
  • Slide 23
  • Slide 24
  • MarkLogic Content Ingestion Service Content Hub API Asset Service Triple- store Content Gateway MongoDB Transformation Service Validation Service FTP Hot Folder New Publishing Platform incl Article Rendering Search Product Set Up tool MySQL Content Work Flow Tool File System Ontologies Content Hub Architecture
  • Slide 25
  • Core Ontology
  • Slide 26
  • The Big Picture
  • Slide 27
  • New Publishing Platform
  • Slide 28
  • New Search old searchnew search
  • Slide 29
  • Other benefits Marketing site launch reduced from two months to two weeks Full journal launch reduced from eight months to four months
  • Slide 30
  • Slide 31
  • Continuous Modelling
  • Slide 32
  • Four DTDs
  • Slide 33
  • Next Steps Single environment publishing Data sends for third parties Archive conversion
  • Slide 34
  • Lessons Learned Dont start from here! Do not rely on DTD to DTD mappings Make use of an XML database to establish actual content markup Agile development methodology for tool development and do it properly
  • Slide 35
  • Lessons Learned (2) JATS is the best journal article DTD we know!
  • Slide 36
  • Thank you! [email protected] [email protected] [email protected] Digital Systems, Science and Scholarly Macmillan Science and Education Transforming Learning and Discovery The Macmillan Campus, 4 Crinan St, London, N1 9XW