29
……………………………………………………………………………………………………………………………….…………………………….. ………………………………………………………………………………………………………………………………………….……………………..… UK DATA ARCHIVE MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….…………………………….. ……………………………………………………………….…... LOUISE CORTI UK DATA SERVICE UNIVERSITY OF ESSEX ……………………………………………….……………..……. University of Sussex, Why share your research data 14 February 2013

MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

Embed Size (px)

Citation preview

Page 1: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

MANAGING & SHARING RESEARCH DATA……………………………………………………………………………………………………………………………….……………………………..

……………………………………………………………….…...

LOUISE CORTIUK DATA SERVICEUNIVERSITY OF ESSEX……………………………………………….……………..…….

University of Sussex, Why share your research data14 February 2013

Page 2: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

OVERVIEW FOR TODAY

• Why share data• Data management planning• Looking after your data • Formatting and organising your data• Storage, security, transfer and file sharing • Copyright of data

Page 3: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

UK DATA SERVICE

• UK Data Archive has over forty years experience in selecting, ingesting, curating and providing access to social science data

• huge experience of supporting researchers and data creators of social science data and related disciplines

• Been managing data sharing for the ESRC Data Policy since 1995, including the Rural Economy and Land Use Programme (2004-2012)

• our best practice approaches to making data shareable are based on:• challenges faced by researchers to share data • handling research data – quantitative and qualitative

• highly skilled staff comprising researchers, technical and information specialists

www.data-archive.ac.uk/create-manage

Page 4: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

OUR MANAGING AND SHARING DATA RESOURCES

Managing and sharing guidance• sections• references• training programme

www.data-archive.ac.uk/create-managewww.data-archive.ac.uk/media/2894/managingsharing.pdf

Training resources: • presentations• exercises and discussions / answers

www.data-archive.ac.uk/create-manage/training-resources

Page 5: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

• Data created from research are valuable resources that can be used and re-used for future scientific and educational purposes.

• Sharing data:

• facilitates new scientific inquiry

• avoids duplicate data collection

• provides rich real-life resources for education and training

• credits the data creator with intellectual effort

• can help to support evidence of impact

• is something to be proud of!

BENEFITS OF MANAGING AND SHARING YOUR DATA

Page 6: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

A data management and sharing plan helps researchers consider when research is being designed and planned, how data will be managed during the research process and shared afterwards with the wider research community

Areas of coverage:

• Data management planning why & how and the research lifecycle

• Data management checklist• Roles and responsibilities• Costing data management

DATA LIFECYCLE & DATA MANAGEMENT PLANNING

Page 7: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

WHY DATA MANAGEMENT PLANNING?

• Research funders require planning for data management and data sharing, e.g. UK Research Councils• which data• how manage• how share, preserve, curate• rights to access, use,….• roles & responsibilities

• Research benefits• think what to do with research data, how collect, how look after • keep track of research data (e.g. staff leaving)• identify support, resources, services needed• plan storage, short & long-term• plan security, ethical aspects• be prepared for data requests (FoI, funder)

Page 8: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

COMPLETING A DMP

• Funder template for DMPs

• ESRC DMP requirements in data policy and DMP guidance• MRC DMP guidance and template• AHRC technical plan requirements

• DCC’s DMPonline tool

• Good to share best practice in your own departments e.g

what does an ideal DMP looks like?

UK Data Archive data management checklist

Page 9: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

KEY DATA MANAGEMENT INTERVENTION POINTS

Green and Gutmann, 2007

Sign off consent form

Agree data & metadata templates

Shared data sharing protocols

Licensing, terms and conditions for sharing, formal documentation

Data formats, data migration

Page 10: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

ROLES & RESPONSIBILITIES

Assign, not presume roles or responsibilities for data management

Who?

• PI• Research staff / students - collecting, creating, processing,

analysing data • External contractors - data collection, collation, processing; e.g.

transcribers • Support staff - managing, administering research • Local Information Services - data storage, security, back-up

services • External/institutional data centres / archives - data sharing

Page 11: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

COSTING

• Cost data management and sharing into research

• Identify resources needed to make research data shareable beyond primary research team - above planned standard research procedures and practices

• Resources = people, equipment, infrastructure, tools to manage, document, organise, store and provide access to data

• Early planning can reduce costs

• See our data management costing tool

Page 12: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

FORMATTING YOUR DATA

• Using standard and interchangeable or open lossless data formats ensures long-term usability of data

• High quality data are well organised, structured, named and versioned

• Authenticity of master files must be identified

• File formats and conversions• Organising files and folders• File naming• Version control and authenticity• Transcription

Page 13: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

CAN YOU UNDERSTAND/USE THESE DATA?

SrvMthdDraft.doc

SrvMthdFinal.doc

SrvMthdLastOne.doc

SrvMthdRealVersion.doc

Page 14: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

FILE FORMATS

• Choice of software format for digital data:• planned data analyses• software availability/cost• hardware used e.g. audio capture• discipline-specific standards and customs

• Digital data endangered by obsolescence of software/ hardware

• Best formats for long-term preservation - standard formats, interchangeable formats, open formats

e.g. tab-delimited, comma-delimited (CSV), ASCII, RTF, PDF/A, OpenDocument format, SPSS portable, XML

see UK Data Archive optimal file formats for various data types

• Beware of errors or losses of data when converting formats!

Page 15: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

EXAMPLE: FORMAT CONVERSION

MS Excel (.XLSX) format using colour highlighting for annotation

Tab-delimited text format, and loss of colour annotation

Loss of annotation

Format change

Page 16: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

ORGANISING DATA

• Plan in advance how best to organise data• Use a logical structure and ensure collaborators understand

Examples• hierarchical structure of files, grouped in folders, e.g. audio,

transcripts and annotated transcripts• survey data – spreadsheet, SPSS,

relational database• interview transcripts - individual

well-named files

Page 17: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

FILE NAMING

• file name - principal identifier of file• use logical naming i.e. easy to identify, locate, retrieve, access• naming provides organisation, context & consistency• name elements: version nr, date, content description, creator

name

Best practice• name independent of location• brief & relevant• no special characters, dots or spaces• for separation use underscores _  • versioning via filename: ordinal and decimal version numbers• use names to classify broad types of files• avoid very long file names

Page 18: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

VERSION CONTROL

Keep track of different copies or versions of data files

Which method:• single site vs. across locations• different versions to be stored vs. files to be synchronised

Best practice:• unique identifiers for files (naming convention keeping track)• record file status/versions• record relationships between files

e.g. data file and documentation; similar data files• keep track of file locations

e.g. laptop vs. PC

Various tools available for versioning and syncronising files

Page 19: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

EXAMPLE : VERSION CONTROL TABLE

Page 20: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

AUDIO TRANSCRIPTION

• adopt a uniform layout throughout the research project

• compatibility with import features of Computer Assisted Qualitative Data Analysis Software (CAQDAS)

• role of transcription varies by discipline. What to transcribe?• verbal and non-verbal?• turn-taking?• ‘interruptions’

• who does it – researcher, service? Need rules

• implications of technologies – video, multiple camera, screen capture, webcams

Page 21: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

STORING YOUR DATA

• Looking after research data for the longer-term and protecting them from unwanted loss requires having good strategies in place for:

• backing-up• transmission• secure storage• disposal

Page 22: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

BACKING-UP DATA

• Why do back-ups? Risk of loss and change - would your data survive a disaster?

• Protect against: software failure, hardware failure, malicious attack, natural disasters

• Back-ups are additional copies that can be used to restore originals

• It’s not backed-up unless backed-up with a strategy

Page 23: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

DIGITAL BACK-UP STRATEGY

Consider• what’s backed-up? - all, some, just the bits you change?• where? - original copy, external local and remote copies• what media? - CD, DVD, external hard drive, tape, etc.• how often? – assess frequency and automate the process• for how long is it kept? Data retention policies that might apply?

know your personal/institutional back-up strategy• verify and recover - never assume, regularly test a restore

Backing-up need not be expensive• 1Tb external drives are around

£50, with back-up software

Consider non-digital storage too!

Page 24: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE24

ENCRYPTION and SECURITY

Always encrypt personal or sensitive data• when moving data files e.g. interview transcripts• when or storing files e.g. shared areas, mobile devices

Free softwares that are easy to use• encrypt hard drives, partitions, files and folders • encrypt portable storage devices such as USB flash drives• Safehouse, Truecrypt, Axcrypt

Protect data from unauthorised access, use, change, disclosure and destruction

• control access to all computers devices• control physical access to buildings, rooms, cabinets• restrict access to sensitive materials e.g. consent forms

Proper disposal of equipment and media• even reformatting the hard drive is not sufficient

Page 25: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE25

FILE SHARING & COLLABORATIVE ENVIRONMENTS

Sharing data between researchers and teams

• too often email attachments

• Yousendit, Dropbox – consider if appropriate as services can be hosted outside the EU (DPA for personal data), e.g. encrypt

• Virtual Research Environments• MS SharePoint• Sakai

• file transfer protocol (ftp)

• physical media

• Essex ZendTo

Page 26: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

OPTIONS FOR SHARING CONFIDENTIAL DATA

Researchers to consider

• obtaining informed consent, also for data sharing and preservation / curation

• protecting identities e.g. anonymisation, not collecting personal data

• restricting / regulating access where needed (all or part of data) e.g. by group, use, time period.

UK Data Archive has gradation of access controls

• securely storing personal or sensitive data

Consider jointly and in dialogue with participants

Plan early in research

Page 27: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

COPYRIGHT AND DATA SHARING

• Copyright permissions sought and granted prior to data sharing / archiving

• Clearing copyright – reach agreement with copyright holder

• Repositories only publish data – they hold no copyright

• Copyright holders give permission to data archives to preserve data and make them accessible to users

• For secondary use, copyright clearance before data can be reproduced

• Exception - fair dealing - for non-commercial research, private study, teaching, quotations, criticism or review; then author and source must be cited

Page 28: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

AN ARCHIVED RESEARCH COLLECTION

• Foot and Mouth in Cumbria dataset – qualitative study

• http://beta-discovery.data-service.ac.uk/Home/DataCatalogueTab/5407

• High quality hard to replicate/collect data

• Added value from reports, methods and links to publications

Page 29: MANAGING & SHARING RESEARCH DATA ……………………………………………………………………………………………………………………………….……………………………

……………………………………………………………………………………………………………………………….……………………………..

………………………………………………………………………………………………………………………………………….……………………..…

UK DATA ARCHIVE

CONTACT

UK DATA ARCHIVEUNIVERSITY OF ESSEXWIVENHOE PARKCOLCHESTERESSEX CO4 3SQ

……………………….…………………….….T: +44 (0)1206 872001E: [email protected]: www.data-archive.ac.uk……………………………….………………..