Upload
amanda-buckler
View
215
Download
0
Tags:
Embed Size (px)
Citation preview
Strategies LLCTaxonomy
February 21, 2007 Copyright 2007 Taxonomy Strategies LLC. All rights reserved.
Tagging:
It’s the Interface Stupid!
2Taxonomy Strategies LLC The business of organized information
Who I am
Over 25 years in the business of organized information Founder & Principal, Taxonomy Strategies Director, Solutions Architecture, Interwoven VP, Infoware, Metacode Technologies Program Manager, Getty Foundation Manager, Pricewaterhouse Assistant Director for Technical Services, Hampshire College Chief, Technical Services, Paul Weiss Rifkind Wharton & Garrison
Metadata & taxonomies community leadership. President, American Society for Information Science & Technology Trustee, Dublin Core Metadata Initiative Co-Founder, Networked Knowledge Organization Systems/Services Adviser, National Research Council Computer Science and
Telecommunications Board Reviewer, National Science Foundation Division of Information and
Intelligent Systems
3Taxonomy Strategies LLC The business of organized information
Recent & current projects
Government Chelan County Public Utilities District Commodity Futures Trading Commission Federal Aviation Administration Federal Reserve Bank of Atlanta Head Start Infocomm Development Authority of Singapore
(http://mysearch.internet.gov.sg/) NASA (nasataxonomy.jpl.nasa.gov) U.S. Defense Intelligence Agency U.S.D.A. Economic Research Service U.S.D.A. e-Government Program (
www.usda.gov) U.S. Dept of Education ERIC U.S. D.H.S. Citizenship and Immigration
Services U.S. Environmental Protection Agency U.S. Forest Service U.S. GSA Office of Citizen Services (
www.usa.gov) U.S. Small Business Administration U.S. Social Security Administration
Commercial Agency.com Amway Albertsons Allstate Insurance Baker Hughes BHP Billiton Blue Shield of California Campbell Soup Company Capital One Debevoise & Plimpton Dell Halliburton Hewlett Packard Microsoft Motorola Oracle PeopleSoft Pricewaterhouse Coopers Siderean Software Sprint Time Inc.
NGO’s Dewy Decimal Classification European Committee for Standardization IDEAlliance International Monetary Fund National Association of Realtors OCLC
5Taxonomy Strategies LLC The business of organized information
For us, taxonomy work includes:
Metadata Scheme. Data fields for describing content so that it can be found and used.
Vocabularies. Collections of terms that are used to specify some of the metadata properties.
Relationships between content, fields or terms (hierarchical, equivalence, & associative)
Some vocabularies are big & hierarchical, some are small and flat.
Application Profile. Formal representation of metadata & vocabularies.
6Taxonomy Strategies LLC The business of organized information
Agenda
Content Tagging Tagging Interface
7Taxonomy Strategies LLC The business of organized information
Tagging Overview
Tagging is better than the words that happen to occur in a piece of content.
All tagging is useful End user tagging Tagging by librarians Automated tagging by OS and algorithms
Content should be tagged throughout its lifecycle, each time the content is handled and used so that it accrues value or its significance is diminished.
8Taxonomy Strategies LLC The business of organized information
MS Office: File Properties
How many people fill this in?
9Taxonomy Strategies LLC The business of organized information
Organize
How many people click on this?
10Taxonomy Strategies LLC The business of organized information
What is social tagging?
End user tagging Easy, intuitive tagging interfaces Almost instantaneous feedback
Enables people to tag & re-tag content … in response to seeing their tags in context with other tags.
Emergent categories Resembles open card sort process in which patterns emerge … rather than validating categories using closed card sorts.
11Taxonomy Strategies LLC The business of organized information
Social tagging innovators
flickr founders Caterina Fake Stewart Butterfield
del.icio.us founder Joshua Schachter
del.icio.us & flickr are now both part of Yahoo! As of April 2006 flickr had 130 million photos posted by 3
million registered users.
12Taxonomy Strategies LLC The business of organized information
Four tagging rules for end users
Rule Description
Use specific terms
Apply the most specific terms when tagging content. Specific terms can always be generalized, but generic terms cannot be specialized.
Use multiple terms
Use as many terms as necessary to describe What the content is about & Why it is important.
Use appropriate terms
Only fill-in the facets & values that make sense. Not all facets apply to all content.
Consider how content will be used
Anticipate how the content will be searched for in the future, & how to make it easy to find it. Remember that search engines can only operate on explicit information.
13Taxonomy Strategies LLC The business of organized information
Agenda
Content Tagging Tagging Interface
14Taxonomy Strategies LLC The business of organized information
Requirements for a tagging interface
Automated form fill-in (automatically fills in known data) Tagging precedents (see tags already assigned by
others) Controlled vocabularies, e.g., with pull-down list Multi-valued tags Geo-tagging Group tagging Clean-up tag tools, e.g., alpha list Batch editing Share/Don’t share (Public/Private) Identified owner (who can be emailed) Almost immediate feedback, e.g., tag cloud
15Taxonomy Strategies LLC The business of organized information
Form fill-in: Automatically filled-in known data
16Taxonomy Strategies LLC The business of organized information
Form fill-in: Automatically filled-in known data
Manual form fill-in w/ check boxes, pull-down lists, etc.
Auto keyword & summarization
17Taxonomy Strategies LLC The business of organized information
Form fill-in: Automatically filled-in known data
Auto-categorization
Parse & lookup (recognize names)
Rules & pattern matching
18Taxonomy Strategies LLC The business of organized information
Tagging precedents: See tags assigned by others
25Taxonomy Strategies LLC The business of organized information
Bulk tagging
ID collection of related content items by pattern or context Then, apply same attributes to all content items
26Taxonomy Strategies LLC The business of organized information
Tag a folder
Drag & drop content items into folder Then, content items inherit properties of folder
27Taxonomy Strategies LLC The business of organized information
Workflow
Approve & improve mindset
Review & Improve
Review & Improve
Add Metadata
Create Content Publish
28Taxonomy Strategies LLC The business of organized information
Interactive rewards
Almost instantaneous exposure of tags in simple user interfaces on the web provides positive reinforcement for user tagging that simply did not exist before.
For example, Most popular Tag clouds Alerts
29Taxonomy Strategies LLC The business of organized information
Most popular
Another example is most emailed from, e.g., the NY Times.
31Taxonomy Strategies LLC The business of organized information
Alerts
New (content selected by date) Subscriptions (content selected by tags) Interest (content selected by other people) Individual (content selected for you by other people)
Strategies LLCTaxonomy
February 21, 2007 Copyright 2007 Taxonomy Strategies LLC. All rights reserved.
Is faceted indexing the future of social tagging?
33Taxonomy Strategies LLC The business of organized information
Summary
There are lessons to be learned from web tagging about how to get good metadata in document and content management applications.
Document and content management system tagging must be simple, and it must be almost instantaneously easier to find relevant work products.
Strategies LLCTaxonomy
February 21, 2007 Copyright 2007 Taxonomy Strategies LLC. All rights reserved.
Questions?
Joseph A. Busch
415-377-7912
35Taxonomy Strategies LLC The business of organized information
Tagging Overview
Tagging, any kind of tagging is better than the words that happen to occur in a piece of content. End user tagging is useful, so is tagging by librarians, as are tags automatically assigned by operating systems and language processing algorithms. Content should be tagged throughout its lifecycle, each time the content is handled and used so that it accrues value or its significance is diminished.
Almost instantaneous exposure of tags in simple user interfaces on the web provides positive reinforcement for user tagging that simply did not exist before. It should not be surprising that a good user interface improves usability.
As content users flock to websites that help to organize the content on the web, advertisements and value added content services follow. The bottleneck in the semantic web has been not enough tagged content. The end user tagging revolution may begin to address this shortcoming.
There are lessons to be learned from web tagging about how to get good metadata in document and content management applications. Document and content management system tagging must be simple, and it must be almost instantaneously easier to find relevant work products.