2007.Sci Foo

Preview:

DESCRIPTION

V. S. Smith. Science publishing for the MySpace generation: MySpecies and the Encyclopedia of Life

Citation preview

Science Publishing forthe MySpace Generation

Vincent S. Smith

MySpecies & the Encyclopedia of Life

Biodiversity ScienceThe foundation for all biological disciplines

Mission…• Inventory the Earth’s species• Understand their relationships• Create predictive information systems from these data

Data set…• 1.8M described species (10M names)

• 300M pages (over last 250 years)

• 1.5-3B specimens

Staff…• 4-6,000 scientists• 30-40,000 amateurs• Many more citizen scientists?

Biodiversity ScienceThe foundation for all biological disciplines

250 yr progress report…• Up to 87% of life on Earth is still undescribed

• 6% of biodiversity scientists cover 80% of the worlds biodiversity

• At present rates most species will be extinct long before we describe them

Biodiversity ScienceThe foundation for all biological disciplines

250 yr progress report…• Up to 87% of life on Earth is still undescribed

• 6% of biodiversity scientists cover 80% of the worlds biodiversity

• At present rates most species will be extinct long before we describe them

Problems…• Communities working on biodiversity are highly distributed & fragmented

• So are the data they publish

• The “publication” process for biodiversity data is broken

Most biodiversity (data) is hidden

“Paper Minds”The bottleneck of traditional publication

1,000’s of journals addressinga common set of questions

What is a species? How many species are there? Where are species distributed? How have species distributions changed? How are species related? How have species characters changed? To what extent is are species relationships predictive?

DATA

“Paper Minds”The bottleneck of traditional publication

1,000’s of journals addressinga common set of questions

Mol. Phyl. Evol.21,964 pp. since 2000

Menopon gallinaeNumidicola antennatusAmyrsidea ventralisSomaphantus lusiusMenacanthus stramineusColimenopon urocoliusTrinoton anserinumMeromenopon meropisGruimenopon longumHoazineus armiferusCopocephalum zebraComatomenopon elbeli/elongatumPsittacomenopon poicephalusOdoriphila clayae/phoeniculiArdeiphilus trochioxusCuculiphilus fasciatusCiconiphilus quadripustulatusEomenopon denticulatumPiagetiella bursaepelecaniOsborniella crotophagaeHohorstiella lataNeomenopon pteroclurusMachaerilaemus laticorpus/latifronsAustromenopon crocatumEidmanniella pellucidaHolomenopon brevithoracicumDennyus hirundinisMyrsidea victrixAncistrona vagelliPseudomenopon pilosumBonomiella columbaeChapinia robustaPlegadiphilus threskiornisActornithophilus uniseriatusMEGAMENOPONRediella mirabilisLatumcephalum lesouefi/macropusParaboopia flavaParaheterodoxus insignisBoopia tarsataTherodoxus oweniLaemobothrion maximumRicinus fringillaeTrochiliphagus abdominalisTrochiloecetes rupununiLiposcelis bostrychophilus

What is a species? How many species are there? Where are species distributed? How have species distributions changed? How are species related? How have species characters changed? To what extent is are species relationships predictive?

“Paper Minds”The bottleneck of traditional publication

1,000’s of journals addressinga common set of questions

What is a species? How many species are there? Where are species distributed? How have species distributions changed? How are species related? How have species characters changed? To what extent is are species relationships predictive?

“Species Name”The universal linker

RAW DATA > Logically interconnectedbut presently fragmented by thepublication process

Other problems…• Time & money• Audience mismatch• Findability & reusability

Encyclopedia of Life (EOL)“The ultimate life list” - Mitch Leslie, Science

Nothing couldpossibly go wrong!

http://www.eol.org/

• A web page for every species

• Vision of EO Wilson

• $50m funding (5 years)- MacArthur and Sloan Foundations

• Megascience mashup- First draft 2008, complete 2018!

• Mass collaboration- Science & outreach

EOL Deja Vu

http://ecoport.org/http://www.all-species.org/

http://www.ispecies.org/ http://species.wikimedia.org/

A web page for every species

Vision of EO Wilson

Lots of money

Megascience mashup

Mass collaboration

EOL Content

http://www.biodiversitylibrary.org/

Biodiversity Heritage Library (BHL)

Content managed by

Since May 07: - 323 titles - 3,316 volumes - 1,302,530 pages

“The Internet Archive”

Digitizing the 10 largestNatural History libraries

Since 1469: - 5.4M books - 800,000 monographs - 40,000 journal titles

EOL Content

http://www.biodiversitylibrary.org/

Biodiversity Heritage Library (BHL)

Content managed by

Since May 07: - 323 titles - 3,316 volumes - 1,302,530 pages

“The Internet Archive”

Digitizing the 10 largestNatural History libraries

Since 1469: - 5.4M books - 800,000 monographs - 40,000 journal titles

Are we digitizingthe right stuff?

EOL ContentMickey mouse copyright laws

C 1923

EOL ContentMost published content cannot be legally digitized

1923

In Copyright

DOI

Publications onants

EOL ContentMost published content cannot be legally digitized

DOI

Publications onants

1890

In Copyright(Europe)

Can EOL succeed - define success?

RSScommunity integrative

intuitive

The potential of EOL can only be realized if we rethink “publication”

licensable

MySpeciesA prototype self publication tool for EOL?

A community publication tool to intuitively create, manageand share biodiversity data on the web

http://myspecies.info/

MySpecies

Multi-site CMSconfiguration

A prototype self publication tool for EOL?

Added tools & services

A prototype self publication tool for EOL?MySpecies

MySpeciesA prototype self publication tool for EOL?

Automated site creation

CC LicensingNo content control *

No brandingCitableHelp

MySpecies

… & more

• Birds• Bees• Cockroaches• Corals

• Dung beetles• Fungus gnats• Lice• Milichiid flies

• Mosquitoes• Nanofossils• Polychaetes• Solanaceae

Supporting 22 communities of biologists & counting

http://myspecies.info/SitesList

MySpecies & its successorsA new publishing model for biodiversity data?

Traditional(filter > publish)

• Fractionally “published”• Story telling• Fragmented• Low findability & reusability• Branded• Expensive

Web(publish > filter)

• 100% “published”• Smaller units of information• Findable & reusable• Meaningfully citable (data)• Unbranded• Cheap

MySpecies & its successorsA new publishing model for biodiversity data?

Traditional(filter > publish)

• Fractionally “published”• Story telling• Fragmented• Low findability & reusability• Branded• Expensive

Web(publish > filter)

• 100% “published”• Smaller units of information• Findable & reusable• Meaningfully citable (data)• Unbranded• Cheap

But,What aboutpeer review!

MySpecies & Peer ReviewHow can we provide quality assurance?

Web(publish > filter)

Data algorithmically checked

Peer used orignored

Traditional(filter > publish)

Data ignored / stories checked

MySpecies & Peer ReviewHow can we provide quality assurance?

Web(publish > filter)

Data algorithmically checked

Peer used orignored

Traditional(filter > publish)

Data ignored / stories checked

Questions?

Recommended