“INDEXING” NON-TEXT ASSETS

Download resource handout for seminar at: http://www.controlledvocabulary.com/sla/sla-chi.pdf

David Riecks, Project Leader, http://photometadata.org; Chief Technical Advisor, PLUS (http://www.useplus.org); IPTC Photo Metadata Working Group Member; and Founder of http://ControlledVocabulary.com/

Twitter: @davidriecks

www.controlledvocabulary.com · twitter: @davidriecks

THIS IS NOT A WORD

© David Riecks

This is not a Word!

WHAT IS A DIGITAL IMAGE?

WHAT IS METADATA?

• Standard definition:
  • “Metadata is data about data”
• Better definition:
  • Metadata is information about a thing, apart from the thing itself
• Metadata surrounds us…

REAL-LIFE EMBEDDED METADATA

MAKING “SMART” ASSETS

• What makes an asset smart?
  • A description that tells you about the asset
  • “Controlled” keywords are used to “tag” its “aboutness”
  • You can easily find the creator and copyright holder
  • You know how to credit the asset if published
  • You know what rights you have licensed (if not your own)
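
The checklist above can be sketched as a simple completeness test. This is a minimal illustration, not part of the seminar material: the field names in `SMART_FIELDS` and the helper `missing_smart_fields` are hypothetical stand-ins for whatever properties your chosen schema actually defines.

```python
# Hypothetical field names chosen for this sketch; real schemas such as
# IPTC Core or PLUS define their own property names.
SMART_FIELDS = (
    "description",
    "keywords",
    "creator",
    "copyright_holder",
    "credit_line",
    "licensed_rights",
)


def missing_smart_fields(metadata):
    """Return the names of "smart" fields that are absent or empty."""
    missing = []
    for field in SMART_FIELDS:
        value = metadata.get(field)
        if isinstance(value, str):
            value = value.strip()  # whitespace-only text counts as empty
        if not value:
            missing.append(field)
    return missing


# Made-up sample record, for illustration only.
record = {
    "description": "Sample description of the asset",
    "keywords": ["feet", "shoes", "girl"],
    "creator": "Example Photographer",
    "copyright_holder": "",
}
print(missing_smart_fields(record))
```

An asset with nothing missing would count as “smart” in the sense of the list above. In practice the rights field only matters for assets you have licensed from someone else.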

• How do we add this “smartness”?
  • By using “Standard” Metadata Schemas
    • Exif
    • IPTC-IIM
    • IPTC Core
    • IPTC Extension
    • Dublin Core
    • PLUS
    • PMI (PRISM Metadata for Images)

For details visit: http://www.photometadata.org/META-101-metadata-types

  • By embedding the metadata values with software like:
    • Adobe Bridge
    • Adobe Creative Suite Apps (Photoshop/Illustrator/InDesign)
    • Photo Mechanic (Camerabits.com)
    • Media Pro (PhaseOne.com)
    • Apple Aperture
    • Or Other DAM Software

• “Smart” Assets are ideal for distribution:
  • Self-describing
  • Recipient can see all (or nearly all) the data you can
  • All “derivative” files inherit this “smartness” as well

Read about a new initiative…

EMBEDDED METADATA MANIFESTO

• The Five Guiding Principles:
  • 1. Metadata is essential to describe, identify and track digital media and should be applied to all media items which are exchanged as files or by other means such as data streams.
  • 2. Media file formats should provide the means to embed metadata in ways that can be read and handled by different software systems.
  • 3. Metadata fields, their semantics (including labels on the user interface) and values, should not be changed across metadata formats.
  • 4. Copyright management information metadata must never be removed from the files.
  • 5. Other metadata should only be removed from files by agreement with their copyright holders.

View more at: http://www.embeddedmetadata.org/

TESTING ASSETS FOR METADATA PRESERVATION

• Test by Reading One or All After Processing:
  • Use off-the-shelf DAM “Cataloging” software
    • Media Pro
    • IDimager
    • Extensis Portfolio
    • Canto Cumulus
  • Use Originating software (Photoshop / Bridge)
  • Use Command Line tools (ExifTool)
  • Use free reader tools
    • Mac: Apple Preview, Spotlight, Search function in Finder
    • Windows: IrfanView, Exifer, etc.
    • AIR: EMET (Embedded Metadata Extraction Tool)
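
When none of the tools above is at hand, a very crude first check is to look for the byte signatures that mark the embedded metadata blocks in a JPEG. The sketch below is an illustration under stated assumptions, not a replacement for a real reader: the function names are hypothetical, it scans the raw bytes rather than parsing JPEG segments, and it only reports that a block exists, not which fields it holds. A tool such as ExifTool is still needed for a field-by-field readout.

```python
# Identifier strings that introduce each metadata block inside a JPEG file.
SIGNATURES = {
    "Exif": b"Exif\x00\x00",                    # APP1 Exif header
    "XMP": b"http://ns.adobe.com/xap/1.0/\x00",  # APP1 XMP namespace header
    "Photoshop IRB": b"Photoshop 3.0\x00",       # APP13 block that may hold IPTC-IIM
}


def embedded_metadata_blocks(data):
    """Report which metadata block signatures occur anywhere in the bytes."""
    return {name: signature in data for name, signature in SIGNATURES.items()}


def check_file(path):
    """Read a file from disk and report which block signatures it contains."""
    with open(path, "rb") as handle:
        return embedded_metadata_blocks(handle.read())
```

Running `check_file` on the same image before and after a processing step gives a quick signal that a whole block was stripped. A signature that is still present does not prove that every field inside the block survived.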

• ALWAYS! Test After Moving or Posting Online

Jeffrey’s Online Metadata Viewer

http://regex.info/exif.cgi

Sample image available at:

http://www.controlledvocabulary.com/socialmedia/cv-testbed_social-media.jpg

See the base URL http://www.controlledvocabulary.com/socialmedia/ for info on how the various social media and photo sharing sites fare.
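
Once the metadata has been read out before and after posting, with whichever viewer you prefer, the comparison itself is simple. The sketch below assumes the two readouts are already available as plain dictionaries. The function `compare_metadata` and the field names in the sample are hypothetical and are not taken from any particular tool.

```python
def compare_metadata(before, after):
    """Sort the fields of `before` by what happened to them in `after`."""
    report = {"preserved": [], "changed": [], "removed": []}
    for field in sorted(before):
        if field not in after:
            report["removed"].append(field)
        elif after[field] != before[field]:
            report["changed"].append(field)
        else:
            report["preserved"].append(field)
    return report


# Made-up readouts, for illustration only.
original = {
    "Creator": "Example Photographer",
    "Copyright Notice": "© Example Photographer",
    "Keywords": ["feet", "shoes"],
}
downloaded = {
    "Creator": "Example Photographer",
    "Keywords": ["feet"],
}
print(compare_metadata(original, downloaded))
```

A non-empty `removed` list that includes the copyright field is the case the Embedded Metadata Manifesto warns about.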

WHAT KIND OF ASSETS CAN WE MAKE “SMART”?

• Image Files
  • JPEG
  • TIFF
  • PSD
  • EPS
  • DNG
• Document / Illustration Files
  • PDF
  • Adobe Illustrator (AI)
  • Adobe InDesign (INDD)
  • EPS
• Audio Files*
  • XMP or ID3 metadata tags?
  • mp3, aiff, aif, wav, m4p, m4a, snd
• Video Files*
  • XMP or QuickTime wrapper?
  • mov, mpg, mp4, divx, qtz, avi, wmv, dv

*These file formats have decreased interoperability

WHAT DO YOU ADD TO MAKE AN ASSET “SMART”?

Use the “Guide To Photo Metadata Fields”

http://photometadata.org/META-Resources-Field-Guide-to-Metadata

Use the “IPTC / CEPIC Image Metadata Handbook”

http://www.iptc.org/goto?imagemetadatahandbook

QUICK REVIEW: UPSIDES

• Advantages to Embedded Metadata?
  • Metadata makes it easy to find/locate assets in collections
  • Find the File >> Find the Info
  • Metadata travels with any “derivative” files you distribute
  • Can be “leveraged” in downstream uses

QUICK REVIEW: DOWNSIDES

• What are the Downsides to embedding metadata?
  • Not all applications “know” to use this info
  • Some applications may inadvertently remove some/all info
  • Increased time to update info in large files (photos or videos)

HOW DO I FIND OR CREATE METADATA / TAGS?

• Tips to create “Smart” Collections
  • Add metadata before placing assets into database
  • Make adding metadata part of a documented workflow
  • Require users to annotate and tag assets they contribute
  • “Batch” the addition of metadata whenever possible
  • Use a “controlled vocabulary” when adding keywords
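
The “batch” tip above can be illustrated with a small sketch. It assumes each asset's metadata is held as a dictionary, and that a shared template carries the values common to the whole batch (creator, copyright notice and so on). The function `apply_template` and the field names are hypothetical; cataloging tools offer their own template features.

```python
def apply_template(records, template):
    """Fill empty or missing fields in each record from a shared template."""
    filled = []
    for record in records:
        merged = dict(template)  # start from the shared batch values
        for field, value in record.items():
            if value:  # keep anything the record already states
                merged[field] = value
        filled.append(merged)
    return filled


# Made-up batch, for illustration only.
template = {"creator": "Example Photographer", "copyright_notice": "© Example Photographer"}
batch = [
    {"title": "one", "creator": ""},
    {"title": "two", "creator": "Second Photographer"},
]
for item in apply_template(batch, template):
    print(item)
```

The template never overwrites a value that is already present, so per-asset details entered earlier in the workflow are not lost.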

• How do you want to “Tag” your Assets?
  • Manually: In-house
  • Manually:
    • Request to Suppliers
    • Out-source to Third Party
  • Auto-magically

MAKING “SMART” ASSETS WITH CONTROLLED VOCAB

Start with:

feet, shoes, girl, youth, grass, humor, outside

With Controlled Vocabulary:

child, juvenile, 4-12 years old, people, human beings, humans, person, body, foot, feet, fashion, clothing, apparel, clothes, womens clothing, womens apparel, women’s clothing, attire, shoes, shoe, female, girl, lass, plants, grass, humor, outside, outdoors
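
The step from the short starting list to the expanded list can be sketched as a lookup against a vocabulary that stores synonyms and a broader term for each entry. The tiny `VOCAB` table below is a hypothetical fragment invented for illustration. Its hierarchy is an assumption and is not the structure of any published controlled vocabulary.

```python
# Hypothetical vocabulary fragment: each term lists its synonyms and the
# next broader term (None at the top of a branch).
VOCAB = {
    "girl": {"synonyms": ["lass"], "broader": "female"},
    "female": {"synonyms": [], "broader": "people"},
    "people": {"synonyms": ["human beings", "humans", "person"], "broader": None},
    "shoes": {"synonyms": ["shoe"], "broader": "clothing"},
    "clothing": {"synonyms": ["apparel", "clothes", "attire"], "broader": "fashion"},
    "feet": {"synonyms": ["foot"], "broader": "body"},
    "grass": {"synonyms": [], "broader": "plants"},
    "outside": {"synonyms": ["outdoors"], "broader": None},
}


def expand_keywords(seeds):
    """Add synonyms and all broader terms for each seed, without duplicates."""
    result = []
    seen = set()

    def add(term):
        if term not in seen:
            seen.add(term)
            result.append(term)

    for seed in seeds:
        term = seed
        walked = set()  # guards against a cycle in the vocabulary
        while term is not None and term not in walked:
            walked.add(term)
            add(term)
            entry = VOCAB.get(term, {})
            for synonym in entry.get("synonyms", []):
                add(synonym)
            term = entry.get("broader")
    return result


print(expand_keywords(["girl", "shoes", "humor"]))
```

Terms that are not in the vocabulary, such as "humor" here, pass through unchanged. In a real workflow they might instead be flagged for review.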

DOCUMENTING YOUR WORKFLOW

• Use the IPTC / CEPIC package
  • Documents give sample workflow overviews
  • Special PDF allows you to make and print your own

http://www.iptc.org/goto?imagemetadatahandbook

DATA EXPORT & IMPORT

• Code Replacement & Variables in Photo Mechanic used to import keywords

• See handout for URL

QUESTIONS?
