Vincent Smith & Simon Rycroft Taxonomy & Scratchpads


Page 1: Vincent Smith & Simon Rycroft Taxonomy & Scratchpads

Vincent Smith & Simon Rycroft

Taxonomy & Scratchpads

Page 2

Biological taxonomy
Findability, relationships (ontology) & tagging

• Scale
• Metadata

Page 3

• Multi-host Drupal (5) site
• Drupal customized for taxonomists
• Communities apply for a site
• 65+ sites, 750+ users, 130k nodes
• Taxonomy central to many features

Page 4

Import / Export
Excel Template (CSV file) & uBio (XML feed)

(Our) Taxonomy Import: http://svn.scratchpads.eu/viewvc/trunk/sites/all/modules/taxonomy_import/
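
To make the CSV import route concrete, here is a minimal sketch of turning term rows into a hierarchy. The slides do not show the Excel template's layout, so the two column names (`term`, `parent`) and both function names are assumptions made for illustration; this is not the taxonomy_import module's code.

```python
import csv
import io
from collections import defaultdict

# Hypothetical template layout: one term per row, with the name of its parent.
SAMPLE_CSV = """term,parent
Insecta,
Phthiraptera,Insecta
Pediculus,Phthiraptera
"""

def parse_term_rows(csv_text):
    """Return {parent_name: [child_name, ...]}; root terms use "" as parent."""
    children = defaultdict(list)
    for row in csv.DictReader(io.StringIO(csv_text)):
        term = (row.get("term") or "").strip()
        parent = (row.get("parent") or "").strip()
        if term:  # skip rows without a term name
            children[parent].append(term)
    return dict(children)

def render_tree(children, parent="", depth=0):
    """Return the hierarchy as indented lines, depth-first."""
    lines = []
    for name in children.get(parent, []):
        lines.append("  " * depth + name)
        lines.extend(render_tree(children, name, depth + 1))
    return lines

if __name__ == "__main__":
    print("\n".join(render_tree(parse_term_rows(SAMPLE_CSV))))
```

Keying the rows by parent name keeps the example short; a real import would more likely key on identifiers, since two terms in a vocabulary can share a name.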

Page 5

Management
Taxonomy Manager & Taxonomy Core

• Principle good, but no one uses it
• Confusing and slow (HCI issues)
• Major cross browser issues (Firefox)
• Requires a number of “fixes”…

• Flexible metadata on terms (core)
• Treat synonyms as full terms (core)
• Link nodes as term attributes (e.g. biblio)
• Improve manager HCI (drag-and-drop)
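
The three data-model wishes above (flexible metadata, synonyms as full terms, nodes linked as term attributes) can be pictured with a small sketch. Every name here (`Term`, `accepted_tid`, `linked_nodes`, `resolve_accepted`) is hypothetical and chosen only for illustration; none of it is Drupal's taxonomy schema.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

@dataclass
class Term:
    tid: int
    name: str
    # Flexible metadata: arbitrary key/value pairs rather than fixed columns.
    metadata: Dict[str, str] = field(default_factory=dict)
    # A synonym is a full term that points at the term it is a synonym of.
    accepted_tid: Optional[int] = None
    # Node ids (e.g. bibliographic records) attached as attributes of the term.
    linked_nodes: List[int] = field(default_factory=list)

def resolve_accepted(terms, tid):
    """Follow synonym links until reaching a term that is not a synonym."""
    seen = set()
    while terms[tid].accepted_tid is not None:
        if tid in seen:
            raise ValueError("synonym links form a cycle")
        seen.add(tid)
        tid = terms[tid].accepted_tid
    return terms[tid]

terms = {
    1: Term(1, "Puma concolor", metadata={"rank": "species"}, linked_nodes=[101]),
    2: Term(2, "Felis concolor", metadata={"rank": "species"}, accepted_tid=1),
}
```

Because the synonym is itself a `Term`, it can carry its own metadata and linked nodes, which is the practical difference from storing synonyms as plain strings on the accepted term.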

Page 6

Search & Browse
Navigation for finding tagged content

Automatically creates a mini-menu (block) for a vocabulary, with a configurable default term

• Intuitive
• Small footprint
• Integrates with a term’s page

Augments the default search with a tab for term searches (includes term autocomplete)


Page 7

Autotagging
Automated tagging of content

• Quick & intuitive
• Two step submission
• Fast (but could be quicker)
• Encourages tagging

Untagged node

Use or ignore discovered tags (drag & drop or add)
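
As a rough illustration of the two step flow, the sketch below first discovers vocabulary terms in a node's text and then applies the author's keep-or-ignore decision. The matching rule (case-insensitive whole-phrase match) and both function names are assumptions; the slides do not describe how the autotagging module finds terms.

```python
import re

def discover_tags(text, vocabulary):
    """Step 1: return vocabulary terms found in the text, in order of appearance."""
    found = []
    for name in vocabulary:
        pattern = r"\b" + re.escape(name) + r"\b"
        match = re.search(pattern, text, flags=re.IGNORECASE)
        if match:
            found.append((match.start(), name))
    return [name for _, name in sorted(found)]

def apply_choices(suggested, ignored):
    """Step 2: keep every suggested tag the author did not ignore."""
    return [name for name in suggested if name not in ignored]

vocabulary = ["Homo sapiens", "Pediculus humanus", "Felis catus"]
text = "Notes on Pediculus humanus collected from Homo sapiens hosts."
suggested = discover_tags(text, vocabulary)
kept = apply_choices(suggested, ignored={"Homo sapiens"})
```

Scanning the text once per term is fine for a small vocabulary but would not scale to the very large vocabularies discussed elsewhere in the deck; an index over term names would be needed there.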


Page 8

Mega-Vocabularies
Sites with a million plus terms

Current Taxonomy Problems

• PHP requires too much memory for large hierarchies
• Very slow, especially above 50k terms
• Some sites with 300k terms (unusable)
• 1.8 million known species (6-80M est.)

Possible Solution

• Taxonomy LeftandRight module
• Implements nested sets
• Overrides 3 taxonomy core functions
  - taxonomy_get_tree
  - taxonomy_overview_terms
  - taxonomy_select_nodes
• Very fast (in use with 2 million terms)
• Solves insertion problem with decimals

e.g. http://catlife.myspecies.info (2 million+ terms)
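
The slide names nested sets and decimal values without showing how they combine, so the following is one plausible reading rather than the LeftandRight module's implementation. Each term stores a left and right boundary, its descendants are exactly the terms whose boundaries fall inside that interval, and a new child is placed in the unused gap inside its parent so that no existing term is renumbered. `NestedSetTree` and its methods are hypothetical names, and Python's `Fraction` stands in for a decimal column to keep the arithmetic exact.

```python
from fractions import Fraction

class NestedSetTree:
    def __init__(self):
        self.bounds = {}  # term name -> (left, right)

    def add_root(self, name):
        self.bounds[name] = (Fraction(0), Fraction(1))

    def add_child(self, parent, name):
        p_left, p_right = self.bounds[parent]
        # The free gap begins after the right-most term already inside the
        # parent, or at the parent's own left boundary if it has no children.
        start = max(
            (r for (l, r) in self.bounds.values() if p_left < l and r < p_right),
            default=p_left,
        )
        gap = p_right - start
        # Use the middle third of the gap, leaving room on both sides for
        # later insertions. No other term's boundaries change.
        self.bounds[name] = (start + gap / 3, start + 2 * gap / 3)

    def descendants(self, name):
        """All terms nested inside `name`, ordered by left boundary."""
        left, right = self.bounds[name]
        inside = [n for n, (l, r) in self.bounds.items() if left < l and r < right]
        return sorted(inside, key=lambda n: self.bounds[n][0])

tree = NestedSetTree()
tree.add_root("Life")
tree.add_child("Life", "Insecta")
tree.add_child("Life", "Arachnida")
tree.add_child("Insecta", "Phthiraptera")
```

With integer boundaries, inserting a term means shifting the left and right values of every term to its right, which is the insertion problem for a vocabulary of millions of terms. Fractional boundaries avoid that shift. In a database the descendant lookup would be a single range query on an indexed column rather than the linear scan used here.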


Page 9

Sprint Expectations
What we are looking to achieve

• Import and export of terms (TCS-XML?) from a repository

• Improved & flexible term metadata

• Handle synonyms as full terms

• Link nodes as attributes of terms

• Term and metadata management

• Permissions on terms (low priority?)

Page 12

Search & Browse 2
Split Layout TreeMaps

e.g. http://scratchpads.eu/progress

• Intuitive
• Small footprint
• Integrates with a term’s page
• Potentially integrates multi-site content
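
The slide gives no layout algorithm, so as a hedged illustration of the treemap idea, here is the simplest "slice" layout: a rectangle is divided among terms in proportion to how much content is tagged with each. The function name, the use of node counts as the size measure, and the sample counts are all assumptions.

```python
def slice_layout(counts, x, y, width, height):
    """Split a rectangle into side-by-side strips, one per term.

    counts maps a term name to its number of tagged nodes. Returns
    {name: (x, y, width, height)}, with strip widths proportional to counts.
    """
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("need at least one tagged node to lay out")
    rects = {}
    offset = 0.0
    for name, count in counts.items():
        strip = width * count / total
        rects[name] = (x + offset, y, strip, height)
        offset += strip
    return rects

# Illustrative counts only, not data from any Scratchpads site.
rects = slice_layout({"Insecta": 60, "Arachnida": 30, "Mollusca": 10}, 0, 0, 100, 50)
```

A full treemap would apply the same split recursively to each term's children, alternating between horizontal and vertical strips at each level.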