57
Wikis at work A short introduction to 'wikis' and wikis for biology Dan Bolser and Paolo Romano URL for slides: http://

Wikis at work

Embed Size (px)

Citation preview

Page 1: Wikis at work

Wikis at workA short introduction to 'wikis' and

wikis for biology

Dan Bolser and Paolo Romano

URL for slides: http://

Page 2: Wikis at work

Overview of today's course

Session 1 (35 min): What is a wiki? – A gentle introduction to the wiki

concept, and a look at Wikipedia.

Session 2 (35 min)

Biological wikis! – A review of some of the most important wikis for biology (BioWikis).

BREAK (15 min)

Page 3: Wikis at work

Overview of today's course

Session 3 (35 min)

Semantic MediaWiki – Software for 'data wikis'. Cell Lines Wiki – A pilot scientific wiki.

Practical session (60 min)

Editing Wikipedia (for the first time?) Working with Cell lines Wiki – A gentle introduction

to some of the key features.

Page 4: Wikis at work
Page 5: Wikis at work

Session 1

Page 6: Wikis at work

What is a wiki?

Page 7: Wikis at work

Wiki means QUICK!

Page 8: Wikis at work

Old way:

Wiki is a quicker way to let peopleput content on the web

Page 9: Wikis at work

Wiki is a quicker way to let peopleput content on the web

Old way:

Page 10: Wikis at work

Wiki is a quicker way to let peopleput content on the web

Old way:

Page 11: Wikis at work

Wiki is a quicker way to let peopleput content on the web

Old way:

Host

HTML editor

FTP software

Domain name

Page 12: Wikis at work

Wiki is a quicker way to let peopleput content on the web

Old way:

Page 13: Wikis at work

Wiki is a quicker way to let peopleput content on the web

Old way:

Wiki way:

Page 14: Wikis at work

Wiki means QUICK!

Page 15: Wikis at work

There is no longer one single 'point of control' for managing web content.

Content is managed by a decentralized community of participating users.

Wiki is radically different!

Page 16: Wikis at work

Is this good or bad?

Page 17: Wikis at work

Other advantages of wiki

There are many other advantages over 'traditional' web publishing...

Notification of changes History of changes Discussion of changes

Page 18: Wikis at work

The rise of the wiki

Page 19: Wikis at work

Condensed history

1994: Cunningham coined the term 'wiki'. A site (for software developers) with pages that can

be edited via the browser, each with a page history.

Over the next five years it spawned alternative wiki applications and websites (wiki culture).

By 2000, it had developed lots of of spin-off content, most notably MeatballWiki (for general discussion).

2001, Wikipedia launched.

2007, Wikipedia in the worlds top 10 web sites.

Page 20: Wikis at work
Page 21: Wikis at work

Wikipedia

Size Stats.

Growth Research.

Rules e.g. Deleting content.

Culture e.g. Anatomy of a talk page.

Almost all of its articles can be edited by anyone with access to the site, and it has about 90,000 regularly active contributors.

Page 22: Wikis at work

Wikipedia

Size 19.7 million articles

3.7 million in English 847,069 in Italian (4th out of 282)

2.7 billion monthly hits from the US alone.

7th most popular site in the world. The largest and most popular

general reference work.

Page 23: Wikis at work

Wikipedia

Growth (Kittur) Wikipedia has been

growing exponentially since 2002 (Voss)

“wisdom of crowds” or “elite users”

a large number of people with a small number of edits, or

a core group who do most of the work?

Page 24: Wikis at work

Wikipedia

Rules Their are no rules! The 'five pillars' of

Wikipedia' Standard deletion

procedure...

Page 25: Wikis at work

Wikipedia

Culture Their are no rules! The 'five pillars' of

Wikipedia' Discussion!

(Not bureaucracy)

Page 26: Wikis at work

Moving on...

Is WP, or something like it, the future for science?

Lets find out in Session 2!

Page 27: Wikis at work

Session 1, References

http://wikipedia.org/wiki/History_of_wikis

http://wikipedia.org/wiki/Wikipedia

1) Voss, J. Measuring Wikipedia. In Proceedings of the ISSI 2005 (Stockholm, Sweden, July 24-28, 2005).

2) Kittur A, Chi EH, Pendleton BA, Suh B, Mytkowicz T. Power of the few vs. wisdom of the crowd: Wikipedia and the rise of the bourgeoisie. CHI 2007.

http://wikipedia.org/wiki/Wikipedia:Five_pillars

http://meta.wikimedia.org/wiki/ Research:Newsletter/2011-07-25

Page 28: Wikis at work
Page 29: Wikis at work

Session 2

Page 30: Wikis at work

Scientific wikis

Page 31: Wikis at work

31

Community annotation

Page 32: Wikis at work

32

Annotation

Page 33: Wikis at work

33

Annotation

Page 34: Wikis at work

34

Community annotation

Driven by two key factors:

The vast increase in biological data

The clear success of Wikipedia

Page 35: Wikis at work

35

BioMoore's Law

Over time: Cost per unit of information can be decreased by

orders of magnitude. Throughput is increased by orders of magnitude.

Fan et al. 2006. Nat Rev Genet.

Comprehensive disease studies that might require ~1bn genotypes would now cost only a few million dollars. Revolution in human genetics.

Page 36: Wikis at work

36

BioMoore's Law

Over time: Cost per unit of information can be decreased by

orders of magnitude. Throughput is increased by orders of magnitude.

Fan et al. 2006. Nat Rev Genet.

Comprehensive disease studies that might require ~1bn genotypes would now cost only a few million dollars. Revolution in human genetics.

Page 37: Wikis at work

37

Community annotationis essential

Centralised databases can't cope with annotating the influx of data.

Less investment in more specialised data. Fewer people with a stake. Specialists more disparate.

Communities are smaller and more focused.

Do wikis hold the answer? Wikipedia as a model…

Page 38: Wikis at work

38

But Wikipedia isn’t always the answer ...

• Wikipedia is an educational resource.

– All articles are encyclopaedic in style.

– Explicitly forbids data from ‘original research’:

• http://wikipedia.org/wiki/Wikipedia:No_original_research

– “Wikipedia does not publish original research”.

– No tools for the specific analysis, presentation, or collection of ‘biological’ data.

• BioWikis!

Page 39: Wikis at work

39

BioWikis

Wikis with a biological subject matter, customized for analysis, presentation and collection of specific biological data and biological data types:

Page 40: Wikis at work

Wikis for biology

Page 41: Wikis at work

Proteopedia

http://proteopedia.org/

Aim – To make knowledge about proteins accessible.

Features: Interactive 3D viz. Contributions linked to

publications.

Problems Doesn't work on all

browsers. Can be slow.

Reference Hodis, E. et al., 2008. Proteopedia

- a scientific “wiki” bridging the rift between three-dimensional structure and function of biomacromolecules. Genome biology, 9(8), p.R121.

Page 42: Wikis at work

WikiPathways

http://wikipathways.org

Aim – The curation of biological pathways.

Features: Interactive pathway

editing. Integrated to many

biological databases.

Problems Doesn't work on all

browsers. Can be slow.

Reference Pico, A.R. et al., 2008.

WikiPathways: pathway editing for the people. PLoS biology, 6(7), p.e184.

Page 43: Wikis at work

EcoliWiki

http://ecoliwiki.net/

Aim – Share info related to non-pathogenic E. coli.

Features: Very extensive and

well structured domain information.

Referencing is good.

Problems Big resource Specific focus, could it

be applied to others?

Reference EcoliWiki, 2007. EcoliHub’s

subsystem for community annotation. http://ecoliwiki.net.

Page 44: Wikis at work

CHDWiki

http://goo.gl/info/Zxg8H

Aim(?) – Geneticists, clinicians, and mol. biologists working on Congenital Heart Defects.

Features: Curated gene lists. PPI browser.

Problems Very old MediaWiki

fork Used?

Reference Barriot, R. et al., 2010.

Collaboratively charting the gene-to-phenotype network of human congenital heart defects. Genome medicine, 2(3), p.16.

Page 45: Wikis at work

SEQwiki

http://seqwiki.org

Aim – a catalogue of analysis tools and, technologies for NGS.

Features Structured data. Linked to an

established forum.

Problems Data can become

error prone. Data is difficult to

centrally manage.

Reference Li J.W., et. al. 2012. The

SEQanswers wiki: A wiki database of tools for high throughput sequencing analysis. Nucleic Acids Research 2012 Database special issue.

Page 46: Wikis at work

Listing of biowikis...

http://bioinformatics.org/wiki/BioWiki

Page 47: Wikis at work

BioWikis

There is a variety of different systems.

All seek to 'recognize' contributors (biologists) in a way more familiar to scientists than Wikipedia.

Most have features not found in Wikipedia. Some projects use the base of Wikipedia to

successfully build integrated 'sub-projects'...

Page 48: Wikis at work

The MCB Portal in Wikipedia

http://en.wikipedia.org/wiki/Wikipedia:MCB

Aim – Better organize information related to molecular and cell biology on Wikipedia.

Features Integrated with WP

Problems The project can be

confusing to those unfamiliar with Wikipedia.

Reference ...

Page 49: Wikis at work

Gene Wiki

http://en.wikipedia.org/wiki/Portal:Gene_Wiki

Aim – applying community intelligence to gene annotation.

Special features: Semi-automatic gene

curation system.

Problems The project can be

confusing to those unfamiliar with Wikipedia.

Reference Huss, J.W. et al., 2008. A gene

wiki for community annotation of gene function. PLoS biology, 6(7), p.e175.

Page 50: Wikis at work

Wikis for Cancer?

Page 51: Wikis at work

Some aditional references

1. Fan JB, Chee MS, Gunderson KL. Highly parallel genomic assays. Nature reviews. Genetics 7, 632-44 (2006).

2. The Molecular modelling blog post http://rosettadesigngroup.com/blog/373/scientific-wikis-part-i/

1. http://bifx.org/wiki/BioWiki2. Daub, J. et al., 2008. The RNA WikiProject: community annotation of RNA families. RNA

(New York, N.Y.), 14(12), pp.2462-4. 3. Stehr, H. et al., 2010. PDBWiki: added value through community annotation of the Protein

Data Bank. Database, p.baq009-baq009.4. Flórez, L.A. et al., 2009. A community-curated consensual annotation that is continuously

updated: the Bacillus subtilis centred wiki SubtiWiki. Database : the journal of biological databases and curation, p.bap012

Page 52: Wikis at work
Page 53: Wikis at work

Session 3

Page 54: Wikis at work

Semantic MediaWiki

Page 55: Wikis at work

Semantic MediaWiki(a data wiki engine)

What is SMW?

Motivation

Frontend What you see as a

user of SMW.

Backend What you do as a

SMW site developer...

Data

Properties and types

Classes

Templates

Forms

Queries

Page 56: Wikis at work

References

SMW Homepage: http://semantic-mediawiki.org

MW Templates: http://www.mediawiki.org/wiki/Help:Templates

SF Homepage: http://www.mediawiki.org/wiki/ Extension:Semantic_Forms

SMW #ask: http://semantic-mediawiki.org/wiki/ Help:Inline_queries

SMW demo sites:

http://pgscdemo.referata.com

http://discoursedb.org

http://sandbox.referata.com

Page 57: Wikis at work

Cell Lines Wiki