Upload
paul-albert
View
268
Download
1
Tags:
Embed Size (px)
Citation preview
Leveraging VIVO DataVisualizations, queries and reports
Paul [email protected] Cornell Medical College
Step #1Get good data.
Step #2Create something useful.
exceptional
What can we really learn from bad data?
Administrators: no fansof the error bar are they
Native Visualizations in VIVO
Co-author visualization
Publication sparklines
Co-investigator visualization
Temporalgraphs
Map of Science
DemoTime
Caveats for publication visualizations
• Is the data complete?
• Map of Science looks at the topic of the journal, not necessarily the article
• Agnostic about:- type of publication (e.g., editorial,
academic article)- significance of finding- author contribution
Generating Reports
Westheimer's Law
A few months in the laboratory can save a few hours in the library.
Corollary of Westheimer's Law
A few hours of SPARQL query construction can save you a few seconds of searching.
This does not work.
This does.
Object property should take user to all faculty members
who attended Tufts
Strongly consider linking out your data properties as well.
The tool of choice for fancy questions
SPARQL Query Builder
Most needs for data are not articulated
Articulated question #1Show me a current list of
publications by the following people.
Articulated question #2Which researchers have published the most research articles within a given set of basic science journals
within the last five years?
Articulated question #3How many journal articles have researchers published between October 2011 and March 2012?
Articulated question #4What is my H-index?
Identifying unarticulated research questions
extrapolateinvestigate
hypothesizeinfer
interviewcollaborate
I have not failed. I've just found 10,000 ways that won't work.
Thomas A. Edison Source: Yahoo Answers
We have proposed several research questions that elicited enthusiasm.
Dean’s Representative
Proposed question #1How has the number of publications co-authored with other institutions
changed year to year?
Proposed question #2Publications appearing in journals of
a given impact factor
Proposed question #3In any given year, which paper has
the most incoming citations?
Proposed question #4Which papers that have received
federal funding are not deposited in PubMed Central?
Proposed question #5Who are our institutions’ open
access key opinion leaders?
Proposed question #6Which PIs have the most
collaborations based on grant support?
Proposed question #7Which faculty have dual
appointments?
Modes of Delivery• Emailed attachments
- Spreadsheet - PDF- Word file- Bibliographic tool (e.g., .ris file)
• Link to VIVO web interface
• Third-party “data dashboard”
Goals for Reporting
1. Define the key questions. Goal: a set of fundamental questions
we ask about our researchers.
2. Allow for more key questions to be addressed
via browsing
Ingest more data in key areas and...
In areas where VIVO hasn’t quite figured out how to do something important...
3. Create third-party framework that will ingest
VIVO data and readily answer these questions.
Framework will be open source and could be incorporated into VIVO in future
releases.
Data Dashboard Project for Our Summer CS Students
• build on Miles Worthington’s Drupal/VIVO work
• tackle some of the proposed questions that have not made it into core
• can be customized for di!erent viewers
• collaboration welcome
library.weill.cornell.edu
Is VIVO Facebook for Researchers?
• No Mendeley is… (Ha)
Questions?