Taxonomy Strategies LLC
28 August 2007 Copyright 2007 Taxonomy Strategies LLC. All rights reserved.
Metadata and Controlled Vocabularies
Global Corporate Circle Working Session
Joseph Busch
Taxonomy Strategies LLC: The business of organized information
Focus of this session
Best practices for specifying and using controlled vocabularies in DC-compliant information management applications.
Tradeoffs and best practices around organization-dependent vs. sharable common controlled vocabularies.
Tagging content for internal vs. external audiences using the same metadata and controlled vocabularies.
When and how to map different taxonomies to each other.
For us, taxonomy work includes:
Metadata specification defines the properties needed to describe content so that it can be found & used.
Vocabularies are collections of terms that are used to specify some of the metadata properties.
Some vocabularies are big and hierarchical, some are small and flat.
An application profile specifies what metadata & vocabularies are required, and then represents them formally.
Best practices (1)
Intranet and public taxonomies should be based on a common metadata specification and shared value vocabularies.
Some metadata attributes are directly mappable to DC, some will be local (locally declared).
Use qualified Dublin Core attributes.
Some vocabularies are sharable industry standards, while others will be organization-dependent.
Some value vocabularies will be particularly relevant to intranet content.
FDA Metadata specification (excerpt)
Element | Data Type | Length | Req./Repeat | Source | Purpose
Asset Metadata
Title | String | Variable | 1 | User supplied | Text search & results display.
Content Type | String | Variable | 1 | Local Value Voc | Group & filter search results.
Center | String | Variable | 1 | Local Value Voc |
Date | Date | Fixed | 1 | System supplied | Publish, feature, review content.
Subject Metadata
Activity | String | Variable | * | Local Value Voc | Search for, browse, group & filter search results.
Law | String | Variable | * | Standard Value Voc |
Product | String | Variable | * | Standard Value Voc |
Brand | String | Variable | * | Standard Value Voc |
Company | String | Variable | * | Standard Value Voc |
Condition | String | Variable | * | Standard Value Voc |
Topic | String | Variable | * | Local Value Voc |
Link Metadata
Relation | String | Variable | * | Validate by lookup | Reference related resources.
Use Metadata
Audience | String | Variable | * | Local Value Voc | Target, personalize content.
Geography | String | Variable | * | Standard Value Voc |
Legend: ? – 1 or more; * – 0 or more
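The Req./Repeat column of the specification can be checked mechanically. The following is a minimal sketch, not the FDA's or Taxonomy Strategies' implementation: the `Element` class, the `SPEC` list, and the `validate` function are hypothetical names, and only a subset of the elements is included. It assumes "1" means exactly one value and "*" means zero or more.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    name: str
    required: bool    # True for "1" in the Req./Repeat column
    repeatable: bool  # True for "*" in the Req./Repeat column


# Subset of the excerpted specification; cardinalities copied from the table.
SPEC = [
    Element("Title", required=True, repeatable=False),
    Element("Content Type", required=True, repeatable=False),
    Element("Center", required=True, repeatable=False),
    Element("Date", required=True, repeatable=False),
    Element("Activity", required=False, repeatable=True),
    Element("Topic", required=False, repeatable=True),
    Element("Audience", required=False, repeatable=True),
]


def validate(record: dict[str, list[str]]) -> list[str]:
    """Return a list of problems; an empty list means the record conforms."""
    problems = []
    known = {element.name for element in SPEC}
    for element in SPEC:
        values = record.get(element.name, [])
        if element.required and len(values) == 0:
            problems.append(f"{element.name}: required value missing")
        if not element.repeatable and len(values) > 1:
            problems.append(f"{element.name}: only one value allowed")
    for name in record:
        if name not in known:
            problems.append(f"{name}: not defined in the specification")
    return problems
```

Calling `validate` on a record that lacks a Title, or that carries two Content Type values, returns one message per violation. A fuller version would also check each value against its vocabulary.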
FDA Metadata specification (excerpt): Dublin Core mapping and vocabulary sources
Element | Dublin Core | Vocabulary source
Title | DC.Title |
Content Type | DC.Type |
Center | DC.Publisher |
Date | DC.Date |
Activity | Local |
Law | Local | Blue Book
Product | Local | Orange Book
Brand | Local | Orange Book
Company | Local | Yellow Book
Condition | Local | ICD9
Topic | DC.Subject |
Relation | DC.Relation |
Audience | DCterms.Audience |
Geography | DC.Coverage | USGS
DC.Format=“text/html”, DC.Language=“en”
FDA* Taxonomy: All facets and sub-facets
* U.S. Food and Drug Administration
Facets: Audience, Type, Geography, Center, Subject
Subject sub-facets: Activity, Product, Condition, Law, Brand, Company, Topic
FDA* Taxonomy: Intranet facets, a taxonomy subset
* U.S. Food and Drug Administration
Audience: Consumers; Employees; Healthcare; Industry
Activity: Administration; Application & Approval; Grant-Making & Sponsorship; Investigation & Enforcement; Public Awareness; Research; Rule-Making; Training & Education
Type: Directories; Dockets; Forms; Instructions & How-To; Job Information; News; Policies & Procedures; Product Alerts; Product Information; Product Lists; Publications; Recalls; Subject Indexes; Tools & Databases; Transcripts & Statements; Warning Letters
FDA.gov tagging example: Information about what to do about bad spinach.
Taxonomy Facet | Tag Values
DC.Type | Recalls
DC.Publisher | CFSAN
DC.Subject.Activity | Public Awareness
DC.Subject.Law | n/a
DC.Subject.Product | Food: Produce
DC.Subject.Brand | n/a
DC.Subject.Company | n/a
DC.Subject.Condition | Gastroenteritis
DC.Subject.Topic | Food Safety
DCterms.Audience | Consumers
DC.Coverage | n/a
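The slides do not say how the tags are stored or published. HTML `meta` elements are one common way to embed Dublin Core in a web page, so the sketch below assumes that serialization. The `record` dictionary and the `to_meta_tags` function are hypothetical names; the tag values are the ones from the spinach example, with "n/a" facets left out.

```python
from html import escape

# Tag values from the spinach example; facets marked "n/a" are omitted.
record = {
    "DC.Type": ["Recalls"],
    "DC.Publisher": ["CFSAN"],
    "DC.Subject.Activity": ["Public Awareness"],
    "DC.Subject.Product": ["Food: Produce"],
    "DC.Subject.Condition": ["Gastroenteritis"],
    "DC.Subject.Topic": ["Food Safety"],
    "DCterms.Audience": ["Consumers"],
}


def to_meta_tags(tags: dict[str, list[str]]) -> str:
    """Render one <meta> element per tag value, escaping attribute text."""
    lines = []
    for name, values in tags.items():
        for value in values:
            lines.append(
                f'<meta name="{escape(name)}" content="{escape(value)}">'
            )
    return "\n".join(lines)


html_head = to_meta_tags(record)
print(html_head)
```

Repeatable facets produce one element per value, so the two-valued Audience tag in the Accutane example would yield two `DCterms.Audience` lines.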
FDA.gov tagging example: Information on “Accutane” for patients.
Taxonomy Facet | Tag Values
DC.Type | Product Information
DC.Publisher | CDER
DC.Subject.Activity | Public Awareness
DC.Subject.Law | n/a
DC.Subject.Product | Drugs: Prescription Drugs
DC.Subject.Brand | Accutane; isotretinoin
DC.Subject.Company | n/a
DC.Subject.Condition | Disease: Acne
DC.Subject.Topic | Drug Information; Consumer Education
DCterms.Audience | Healthcare; Consumers
DC.Coverage | n/a
Inside.FDA tagging example: Instructions on how to replace a security badge.
Taxonomy Facet | Tag Values
DC.Type | Forms; Instructions & How-To
DC.Publisher | [applicable organizational dept]
DC.Subject.Activity | Administration
DC.Subject.Law | n/a
DC.Subject.Product | n/a
DC.Subject.Brand | n/a
DC.Subject.Company | n/a
DC.Subject.Condition | n/a
DC.Subject.Topic | n/a
DCterms.Audience | Employees
DC.Coverage | n/a
Best practices (2)
Intranet and internet content should share a common repository, but not replicate the same content in two places.
Tag content for appropriate audiences, e.g., Public, Internal, Confidential.
[Diagram labels: Intranet, Internet, Content, Public, Internal, Conf.]
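One way to serve both sites from a single repository is to filter on the access tag at delivery time. The sketch below is illustrative only: the `Item` class, the `VISIBLE` table, the `view` function, and the sample titles are hypothetical. The visibility policy (internet shows Public only, intranet shows Public and Internal, Confidential appears on neither) is an assumption, since the slide names the tags but not the rules.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    title: str
    access: str  # one of "Public", "Internal", "Confidential"


# Assumed visibility rules; the slide names the tags but not the policy.
VISIBLE = {
    "internet": {"Public"},
    "intranet": {"Public", "Internal"},
}


def view(repository: list[Item], site: str) -> list[str]:
    """Return titles from the single shared repository visible on a site."""
    allowed = VISIBLE[site]
    return [item.title for item in repository if item.access in allowed]


# Hypothetical sample content, stored once and tagged once.
repository = [
    Item("Product recall notice", "Public"),
    Item("Security badge replacement", "Internal"),
    Item("Draft enforcement memo", "Confidential"),
]

print(view(repository, "internet"))
print(view(repository, "intranet"))
```

Because each item is stored once, a change to a Public item shows up on both sites without any copying.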
Mapping taxonomies
More complicated approach than multiple attributes with multiple value vocabularies.
Cases:
– One-to-one.
– One-to-many.
– Parallel, independent hierarchies.
If mapping is done, then business rules can be used to:
– Automatically add attribute values.
– Improve search.
– Create multiple views into the same content.
An ontology specifies typed associative relationships.
– Typically “Is a” relationships.
Taxonomy mapping
Case | Level | Benefit | Example
One-to-one | Easy | Automatic switching | Ivory Coast = Côte d’Ivoire
One-to-many | Medium | Automatic hedging (broadening/narrowing) | Czechoslovakia = Czech Republic; Slovakia
Parallel, independent hierarchies | Hard | Multiple views of same information space | Geographic vs. Political
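The first two cases in the table can be sketched as lookup tables. This is a minimal illustration under stated assumptions, not a mapping tool: `ONE_TO_ONE`, `ONE_TO_MANY`, `map_term`, and `expand_query` are hypothetical names, the only data is the two examples from the table, and the parallel-hierarchies case is not covered.

```python
# One-to-one: equivalent labels that can be switched automatically.
ONE_TO_ONE = {"Ivory Coast": "Côte d’Ivoire"}

# One-to-many: a term that corresponds to several terms in the other vocabulary.
ONE_TO_MANY = {"Czechoslovakia": ["Czech Republic", "Slovakia"]}


def map_term(term: str) -> list[str]:
    """Translate a source term into target-vocabulary terms."""
    if term in ONE_TO_ONE:
        return [ONE_TO_ONE[term]]
    if term in ONE_TO_MANY:
        return list(ONE_TO_MANY[term])
    return [term]  # unmapped terms pass through unchanged


def expand_query(terms: list[str]) -> list[str]:
    """Expand search terms with their mapped equivalents, keeping order."""
    expanded = []
    for term in terms:
        for candidate in [term, *map_term(term)]:
            if candidate not in expanded:
                expanded.append(candidate)
    return expanded


print(map_term("Ivory Coast"))
print(expand_query(["Czechoslovakia"]))
```

Switching replaces a term with its equivalent, while expansion keeps the original term and adds the mapped ones, which is one way to read the "hedging" benefit in the table.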
Advanced relations
[Diagram: Taxonomy with facets Audience, Type, Location, Organization, Products; further labels Product Line, Application, Technology, Industry, Person; relation “Is a” Groups of Products]
Product relationships provide tagging rules for product groupings
Product | Product Line | Technology | Application | Industry
Oracle Business Activity Monitoring | Oracle Fusion Middleware | Application Server; Middleware; SOA | |
PeopleSoft Collaborative Supply Management | PeopleSoft Enterprise | | Supplier Relationship Management |
Siebel Clinical | Siebel Clinical | | | Life Sciences & Pharma
Callouts: Product names are consistent labels. Generic labels.
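A tagging rule of this kind can be sketched as a lookup from product name to implied facet values. The code below is a hypothetical illustration, not Oracle's implementation: `PRODUCT_RULES` and `apply_rules` are invented names, and the assignment of each value to the Product Line, Technology, or Application column is an assumption drawn from the flattened slide table.

```python
# Hypothetical rule table keyed by product name; the column assignment of
# each value is an assumption drawn from the flattened slide table.
PRODUCT_RULES = {
    "Oracle Business Activity Monitoring": {
        "Product Line": ["Oracle Fusion Middleware"],
        "Technology": ["Application Server", "Middleware", "SOA"],
    },
    "PeopleSoft Collaborative Supply Management": {
        "Product Line": ["PeopleSoft Enterprise"],
        "Application": ["Supplier Relationship Management"],
    },
}


def apply_rules(tags: dict[str, list[str]]) -> dict[str, list[str]]:
    """Return a copy of tags with values implied by each product added."""
    result = {facet: list(values) for facet, values in tags.items()}
    for product in tags.get("Product", []):
        for facet, implied in PRODUCT_RULES.get(product, {}).items():
            bucket = result.setdefault(facet, [])
            for value in implied:
                if value not in bucket:
                    bucket.append(value)
    return result


tagged = apply_rules({"Product": ["Oracle Business Activity Monitoring"]})
print(tagged)
```

An editor then tags only the product, and the grouping facets are filled in by rule, which keeps the generic labels consistent across content.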
press room application http://pressroom.oracle.com/prNavigator.jsp
“Is a” Groups of Products
events application http://events.oracle.com/
“Is a” Groups of Products
“Is located” powers Google Maps mash-up
Questions
+1-415-377-7912
GCC (Global Corporate Circle) Topics
Change focus to large organizations including governments & government agencies.
– Enterprise-Wide Metadata Applications Community (EnMAC)
– Is this agreeable?
2007-2008 activities.
– Best practices case studies.
– Identify and describe projects that are using DC.
– What is the best way to do this?
– Other activities?