Exposing Hidden Relationships: Practical Work in Linked Data using Digital Collections

  • Published on
    17-Jul-2015

  • View
    177

  • Download
    8

Embed Size (px)

Transcript

<p>Draft Presentation</p> <div><p>Exposing Hidden Relationships: Practical Work in Linked Data using Digital Collections</p><p>Cory Lampert and Silvia Southwick</p><p>UNLV University Libraries Digital Collections</p><p>April 23, 2015</p><p>Linked Data &amp; RDF: New Frontiers in Metadata and Access Conference</p></div> <div><p>Overview </p><p>Video Demo</p><p>UNLV Linked Data Project </p><p>Digital Collections Metadata: Source of Rich (But Hidden) Relationships </p><p>Video Demo</p><p>Next Steps, Future</p><p>Questions</p></div> <div><p>Video demo</p><p>This short video (no sound, just image) will give a preview of what linked data may look like to users.</p><p>It shows the Virtuoso Pivot Viewer software acting upon UNLVs Linked Open Data triplestore. </p><p>Think about how this is similar/different to how users currently view data in library systems.</p><p>[PLAY PIVOTVIEWER.mp4]</p></div> <div><p>Exploring LOD: Taking theory to practice</p><p>How we started</p><p>Goals set</p><p>What we accomplished</p></div> <div><p>How we began</p><p>Conferences and buzz</p><p>Curiousity and professional development</p><p>Exploration and pilot project</p><p>Compelling results; sharing impact of what weve learned</p><p>Assessment </p><p>Much more to do... A sense of humor is helpful!</p><p>Photo: Five men with burros, circa 1900, Tonopah/Goldfield Collection</p></div> <div><p>Motivation</p><p>Information encapsulated in records</p><p>Records contained in collections</p><p>Very few links are created within and/or across collections</p><p>Links have to be manually created</p><p>Existing links do not specify the nature of the relationships among records</p><p>This structure hides potential context (links) within and across collections</p><p>Free metadata from silos</p><p>Expose rich relationships</p><p>Leverage powerful, seamless, interlinking of data from multiple sources</p><p>Discover and query data in new ways</p><p>More precise searching</p><p>More opportunities to repurpose data</p><p>Current Practice</p><p>LOD Potential</p></div> <div><p>Poll</p><p>Please use the agree/disagree button, available from the pull down menu at the top of the screen to respond to the statement below:</p><p>Statement: There is interest in doing practical work with linked open data at my institution.</p></div> <div><p>Foundation of pilot</p><p>Our digital collections consist of unique materials documenting the history of Southern Nevada stored in CONTENTdm; project focused on LOD for visual material collections</p><p>Definition of LOD we are using: Linked Data refers to a set of best practices for publishing and interlinking data on the Web.</p><p>A good way to better understand this is the 5-Star Data diagram: http://5stardata.info/ </p></div> <div><p>Preparing for departure</p><p>Before we launch into a discussion of how we created our linked data, lets take a short trip. </p><p>We will start in our current data: digital collections metadata records, and end in the new world of linked open data.</p><p>Photo: Photograph of Howard Hughes in cockpit of the second XF-11, April 4, 1947, Howard Hughes Collection</p></div> <div><p>Graphical Representation: Part of a Record</p></div> <div><p>10</p></div> <div><p>Examples of records</p><p>Showgirls</p><p>Menus</p><p>Dreaming the</p><p>Skyline</p></div> <div><p>11</p></div> <div><p>December 12, 1915 </p><p>Exposing Hidden Links</p></div> <div><p>12</p></div> <div><p>Poll</p><p>Please use the agree/disagree button, available from the pull down menu at the top of the screen to respond to the statement below:</p><p>Statement: The diagrams helped me to see how linked data helps to reveal hidden relationships in existing metadata.</p></div> <div><p>UNLV Linked Open Data Project Goals</p><p>Study the feasibility of developing a common process that would allow the conversion of our collection records into linked data preserving their original expressivity and richness</p><p>Publish data from our collections in the Linked Open Data Cloud to improve discoverability and connections across our collections and with data from other related data sets on the Web</p></div> <div><p>ActionsTechnologies</p><p>Clean data</p><p>Export data</p><p>CONTENTdm</p><p>Open Refine</p><p>Import data</p><p>Prepare data</p><p>Reconcile</p><p>Generate triples</p><p>Export RDF</p><p>Import data</p><p>Publish</p><p>Mulgara /</p><p>Virtuoso</p><p>Phase 1</p><p>Phase 2</p><p>Phase 3</p></div> <div><p>15</p></div> <div><p>WHAT WE LEARNED</p><p>With interest and motivation, Linked Open Data is a feasible goal</p><p>Visualization tools help convey the benefits of LOD work </p><p>A pilot quickly turned into a project and then into production</p><p>Moving into the next phase required careful examination of current practice focusing on expressing links (relationships)</p><p>Photo: Film transparency of a chimpanzee with slot machines at the Sands Hotel, Las Vegas, circa late 1950s, Sands Collection</p></div> <div><p>LOD Approach after the pilot</p><p>After learning the concepts, applying a model, and testing technologies, the LOD transformation process becomes repeatable</p><p>Sustainability of process depends upon data quality</p><p>Data begins with existing metadata in current collections; there are many lessons from the pilot that should inform revisions to current practice (even if LOD is more in future than present)</p></div> <div><p>Mining the metadata</p><p>Application profile</p><p>Shared Vocabularies</p><p>Managing Controlled Vocabularies</p><p>Managing Linked Data</p></div> <div><p>When should we start preparing metadata for Linked Data?</p><p>Evolution of metadata </p></div> <div><p>Our focus is on metadata</p><p>Why?</p><p>Metadata is essential for establishing relationships</p><p>Any metadata?</p><p>Ability of discovering relationships is directly affected by metadata quality</p><p>It is critical to:</p><p>Use well-established Controlled Vocabularies (particularly if they are linked data ready)</p><p>Rigorously control local terms</p><p>Re-use URIs</p><p>Assign URIs for local terms</p></div> <div><p>Metadata creation common Approaches</p><p>Focus is on the collection being created</p><p>Usually metadata consistency is managed within collections</p><p>Not much rigor is used to enter controlled vocabulary terms</p><p>Exs.: Misspellings, use of terms that do not match the preferred terms, etc.</p><p>Limited control of local terms</p><p>Implications:</p><p>Ability to identify relationships within and across collections is decreased</p></div> <div><p>When should we start preparing metadata for Linked Data?</p><p>what can we do to create sapient metadata?</p><p>Application</p><p>Profile</p><p>Re-design strategies to</p><p>manage and use CVs</p></div> <div><p>What do I do with my legacy metadata?</p><p>Adjust metadata according</p><p>to the</p><p>Application</p><p>Profile</p><p>Apply strategies to</p><p>manage and use CVs effectively</p></div> <div><p>Metadata Milestones AT UNLV Libraries</p><p>Adopted an approach that considers each individual digital collection as part of an integrated digital library. </p></div> <div><p>The UNLV Application Profile</p><p>Specifies:</p><p>which metadata terms UNLV Libraries uses for its digital collections</p><p>the source of metadata terms</p><p>how metadata should be expressed</p><p>labels to be used for each metadata field</p><p>Benefits:</p><p>Increases consistency of content across digital collections</p><p>Improves user interactions with digital collections</p><p>Indexing guidelines are easy to generate</p><p>Facilitates transformation to Linked Data</p><p>Increases compliance with regional and national aggregators</p></div> <div><p>Outcomes</p><p>Well-established CVs allow re-use of URIs</p><p>Rigorous rules of data entry facilitate reconciliation</p><p>Local Controlled vocabularies allow interlinking among local terms / names within collections</p><p>Shared vocabularies allow interlinkage among local terms / names across collections</p></div> <div><p>All these actions: </p><p>allow creation of a single process to transform digital collections into linked data</p><p>Video: [PLAY SUPER-SKELETON-WHH.mp4]</p></div> <div><p>Moving From experimentation to Implementation</p><p>Cleaning and sharing controlled vocabularies from legacy collections (time consuming)</p><p>Re-training metadata creators</p><p>Re-designing workflows</p><p>Delegating additional data management responsibilities</p></div> <div><p>Data Management</p><p>Maintenance of local URIs</p><p>Terms</p><p>Authoritative Names</p><p>Design and implementation of new processes to maintain synchronization between digital library and linked data set</p><p>Design processes to enrich relationships with external data sets</p></div> <div><p>Next Steps</p><p>Future Activities</p><p>Resources</p><p>Video Demo</p></div> <div><p>Future Activities</p><p>Publish data</p><p>Interlinking with other data sets</p><p>Documentation</p><p>Collaborative activities (regional controlled vocabularies)</p><p>Training and staff skill development</p><p>Interface design and development</p><p>Work with hierarchical data</p></div> <div><p>Video demo</p><p>This short video (no sound, just image) will give a preview of what linked data may look like to users.</p><p>It shows the Relfinder software acting upon UNLVs Linked Open Data triplestore. </p><p>Think about how this is similar/different to how users currently view data in library systems.</p><p>[PLAY SHOWING RELATIONSHIPS.mp4]</p></div> <div><p>The Linked Data Cloud</p></div> <div><p>resources</p><p>Leading to Linking: Introducing Linked Data to Academic Library Digital Collections: http://www.tandfonline.com/doi/pdf/10.1080/19386389.2013.826095 </p><p>A Guide for Transforming Digital Collections Metadata into Linked Data Using Open Source Technologies: </p><p>http://www.tandfonline.com/doi/pdf/10.1080/19386389.2015.1007009 </p><p>UNLV Linked Data Blog (videos posted here): https://www.library.unlv.edu/linked-data </p><p>Contact us!</p></div> <div><p>Thank you!</p><p>Contact Us:</p><p>Cory Lampert </p><p>cory.lampert@unlv.edu</p><p>Silvia Southwick</p><p>silvia.southwick@unlv.edu </p><p>UNLV Digital Collections</p><p>www.d.library.unlv.edu </p><p>Questions?</p><p>Photo: Photograph of Bluebells posing outside of Pan Am jet, 1958, Donn Arden Collection</p></div> <div><p>Questions?</p><p>Contact:</p><p>Cory Lampert </p><p>cory.lampert@unlv.edu</p><p>Silvia Southwick</p><p>silvia.southwick@unlv.edu </p><p>UNLV Digital Collections</p><p>www.d.library.unlv.edu </p></div>

Recommended

View more >