35
The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September 28, 2010

The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Embed Size (px)

Citation preview

Page 1: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

The Future of Informatics in Digital Literature – or Literature

and it’s (Digital) Future

Donat Agosti and Terrance CatapanoPlazi

TDWG, Woods Hole, September 28, 2010

Page 2: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Literature, the tool to formalize our knowledge, and make it

part of the global knowledgebase.

Page 3: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

< 15% of taxonomists opt for Open Access

Source: Zootaxa, publisher of ca 15% of all new taxonomic names

Page 4: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

“The current scholarly communication system is

nothing but a scanned copy of the paper based system.”

Van de Sompel & Lagoze, 2009, The Forth Paradigm.

Page 5: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

E.g. BHL‘s emphasis on scanning and images of text…

Page 6: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

E.g. BHL‘s emphasis on scanning and images of text…

… and little efforts (by third parties) to provide better

access

Page 7: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

„An articulated semantic structure facilitates simpler algorithms acting on World Wide Web text and data

and is more feasible in the near term than building a layer of complex

artificial intelligence to interpret free-form human ideas using some

probabilistic approach.“

Ginsparg, 2009, The Forth Paradigm.

Page 8: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Quantity vs precission

Page 9: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Howard Ratner, Nature: Nature on Mobile: http://river-valley.tv/conferences/stm-innovations-seminar-2009

Page 10: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

A semantically enhanced, linked XML document based on

clean OCR

Page 11: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

TaxonX

Page 12: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Text XML document

<tax:treatment> <tax:nomenclature> <tax:name> <tax:xid source="HNS" identifier="193329"/> <tax:xmldata> <dc:Genus>Mystrium</dc:Genus> <dc:Species>leonie</dc:Species> </tax:xmldata> Mystrium leonie </tax:name> <tax:status>n. sp.</tax:status> Fig 1 D - F </tax:nomenclature> <tax:div type="description"> <tax:p>HOLOTYPE WORKER: TL 3.95, HL 1.02, HW 0.95, CI 93, SL 1.30, SI 137, PW 0.73, ML 0.38. Mandible outer margin strongly curving to a sharp apical tooth, the apex parallel to the anterior clypeal margin. (Holotype with material in mandibles, so mandibles and anterior clypeus $ described below from paratypes.) Median clypeus....</treatment>

Page 13: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Treatment

Page 14: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Treatment≠©

Page 15: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

- Get LSID from Hymenoptera Name Server for names; ZooBank?-Add new names

- Get bibliographic Metadata from HNS (MODS)

- Get bibliographic Guids from bioguid (or EDIT?)

- Get geographic long/lat from geonames.org

Plazi workflow: GoldenGate mark up as an example

-Get Guids for - CBOL- NCBI- specimen- images- .....

Page 16: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Plazi Search and Retrieval Server: Access to data

TAPIR, SPM

You

You

You

human

machine

Page 17: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Materials examined from literature in GBIF

Page 18: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Facebook tool to mark

up legacy publications

Page 19: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Mark-up comes at an (exorbitant) cost…

Page 20: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Mark-up comes at an (exorbitant) cost, if done at the

wrong time

Page 21: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Shift from legacy to prospective publishing

Page 22: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Taxpub NLM DTD

Page 23: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Taxpub NLM DTD:a collaboration between

National Library of MedicineZookeys

Plazi

Page 24: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Taxpub NLM DTD: taxonomic domain specific

extension of the NLM Publishing and Archiving DTD

NLM DTD

Taxpub DTD

Page 25: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Taxpub/NLM DTD+ production worklow

Zookeys

Page 26: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Taxpub/NLM DTD+ production worklow

Zookeys

XMLPrint PDF HTML Other Sites

External resources

Page 27: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Treatment + external links

Page 28: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Treatment + external links:GUID / LSID

Page 29: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Now that we will have LSIDs in your content in PMC, I was looking for an LSID resolver so that we can build links to all of

this content.

But, the only place that I was able to resolve your LSIDs was on your zoobank.org/?lsid= service. I could not resolve them on

lsid.tdwg.org or bioguid.info/lsid.php. Perhaps I don’t understand how LSIDs are supposed to work, but I thought that

any LSID resolver should be able to resolve them. If only your local resolver resolves them, then are they really LSIDs or are

they just zoobank IDs dressed up like LSIDs?

Email from Jeff Beck, NCBI

Page 30: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Why do we do all that?

Page 31: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Technically, we are far beyond the doable

Page 32: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Technically, we are far beyond the doable, we need your

input:Why do you want to have a

(taxonomic) publication?

Page 33: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Why do you want to have a (taxonomic) publication?

External links?Materials Citations?

Descriptions?Credit?

Page 34: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

Where is your data so that it can be linked? How will it be

be standardized?

Page 35: The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September

http://plazi.org

Thank you very much!

Donat Agosti and Terrance Catapano

[email protected]