Upload
innovationoecd
View
93
Download
0
Embed Size (px)
Citation preview
Looking forward? What data infrastructures and
partnerships
Andrea BonaccorsiUniversity of Pisa and RISE
In collaboration with Cinzia Daraio, University of Roma La Sapienza
OECD Blue Sky ConferenceGhent, 19-21 September 2016
When Luc Soete meets Scott Stern
I created a dataset from Business registrations and patents that can be used to examine the entrepreneurial ecosystem
Wonderful. They are trusted data. Why not merge these data with other trusted data, such as scientific production data?
Why is data integration important in policy making?
• Impact analysis (e.g. impact of scientific production on innovation)
• Granularity of data• Geographic• Sectoral• Institutional
• Cross-referencing (e.g. integration of data from the scientific system with data on innovation and/or industrial activities)
Authors’ name and affiliation
Inventor name and address Assignee name
Inconsistent spelling of address and mismatch of authors, inventors, and assignees
....More than 10,000 PROs in Europe, appr.1/3 funding
No geo-referentiation available
Traditional approach to data integration
Creation of a new database for each of the policy requirements
- Ad hoc- Not scalable- Not interoperable- Limited time
validityx
Big Data….. Bigger problems
Data integration problems for Research and Innovation:
• Heterogeneity of data sources• Lack of interoperability• Disambiguation problems• Different classifications schemes• Development of concordance table at the desired level
of granularity
Steps
1.Authority files2.Correspondence tables3.(Semi) automatic disambiguation
Authority files
An official and validated list of entiities
- Higher education institutions (HEIs) - Public Research Organisations (PROs)- R&D spenfing firms - Inventors
Key features:
• Censuses• Validated by official authorities (NSA,. Eurostat, OECD…)• Publicly available• Maintained and updated regularly
Scopus subjects areas/ NACE Codes (3 digits) 010020030050060070080100110120130140150160170180190200210220230240250261262263264265266267268270280290300310350360370380390410420430450460470490500510520530550560580610620630640650660680690700710720730740770790ACOUSTICS AND ULTRASONICS x x x x x x x x X X X X x X x x x x x X x xADVANCED AND SPECIALISED NURSING X X X X X xAEROSPACE ENGINEERING x x x x x x x x x X X x X x x x x x x x x x xAGEING X x x x x xAGRICOLTURE AND BIOLOGICAL SCIENCES MISCELLANEOUS X x x x x x x x X X x x x x x x x x x x x x xAGRONOMY AND CROP SCIENCE X x x x X XALGEBRA AND NUMBER THEORY x x x x x x x x x X X X X x x x x x x x xANALYSIS x x x x x x x x x X X X X x x x x x x x xANALYTICAL CHEMISTRY x x x x X X x x x x x x x x xANATOMYANESTHESIOLOGY AND PAIN MEDICINE x X X X X xANIMAL SCIENCE AND ZOOLOGY X x x X X X x x x x x x x x x x x x x x x x x xAPPLIED MATHEMATICS x x x x x x x x x X X X X x x x x x x x xAPPLIED MICROBIOLOGY AND BIOTECHNOLOGY x x x xAQUATIC SCIENCE X x x X X X x x x x x X x x X X X x x x x X X X x X XARCHITECTURE x x x x x x x x x xARTIFICIAL INTELLIGENCE x x x x x x x x x X X X X x x x x x x x xASTRONOMY AND ASTROPHYSICS x x x x x x x x X X X X x X x x x x x X x xATMOSPHERIC SCIENCE x x X X x x SATOMIC AND MOLECULAR PHYSICS, AND OPTICS x x x x x x x x X X X X x X x x x x x X x xAUTOMOTIVE ENGINEERING x x x x x x x x x X X x X x x x x x x x x x xBEHAVIORAL NEUROSCIENCEBIOCHEMISTRY X x x x x xBIOCHEMISTRY MEDICAL X x x x x xBIOENGINEERING x x x x x x x x x X X x X x x x x x x x x x xBIOLOGICAL PSYCHIATRYBIOMASS AND BIOFUEL X x x x x x x X X x X X x X X X x X X x X X X x X X xBIOMATERIALS x X x x x x x x X X x x x x x x x X X X X X X X x X X X x x X X x X X X X X X xBIOMEDICAL ENGINEERING x x x x X X X x x x x x X X X X X X X x X x x x x X X x x x X x xBIOPHYSICS X x x x x xBIOTECHNOLOGY X x x x x xBUILDING AND CONSTRUCTION x x x x X x x X X X X X X X X X X XCANCER RESEARCH X x x x x xCARDIOLOGY AND CARDIOVASCULAR MEDICINE X X X X X xCATALYSIS X xCELL BIOLOGY X x x x x x x x x x x x x x x x x x x x x x xCELLULAR AND MOLECULAR NEUROSCIENCECERAMICS AND COMPOSITES x X x x x x x x X X x x x x x x x X X X X X X X x X X X x x X X x X X X X X X xCIVIL AND STRUCTURAL ENGINEERING x x x x X x x X X X X X X X x x X X X X X X SCLINICAL BIOCHEMISTRY X X X X X xCLINICAL NEUROLOGY X X X X X xCOGNITIVE NEUROSCIENCECOLLOID AND SURFACE CHEMISTRY X xCOMMUNITY AND HOME CARE X X X X X xCOMPLEMENTARY AND ALTERNATIVE MEDICINE X X X X X xCOMPUTATIONAL MATHEMATICS x x x x x x x x x X X X X x x x x x x x xCOMPUTATIONAL MECHANICS x x x x x x x x x X X x X x x x x x x x x x xCOMPUTATIONAL THEORY AND MATHEMATICS x x x x x x x x x X X X X x x x x x x x xCOMPUTER GRAPHICS AND COMPUTER-AIDED DESIGN x x x x x x x x x X X X X x x x x x x x xCOMPUTER NETWORKS AND COMMUNICATIONS x x x x x x x x x X X X X x x x x x x x xCOMPUTER SCIENCE APPLICATIONS x x x x x x x x x X X X X x x x x x x x xCOMPUTER SCIENCE MISCELLANEOUS x x x x x x x x x X X X X x x x x x x x xCOMPUTER VISION AND PATTERN RECOGNITION x x x x x x x x x X X X X x x x x x x x xCOMPUTERS IN EARTH SCIENCES x x x x X x x x x X X X X x X XCONDENSED MATTER PHYSICS x x x x x x x x X X X X x X x x x x x X x xCONTROL AND OPTIMIZATION x x x x x x x x x X X X X x x x x x x x xCONTROL AND SYSTEMS ENGINEERING x x x x X X X x x x x x X X X X X X X x X x x x x X X x x x X x xCRTICAL CARE AND INTENSIVE CARE MEDICINE X X X X X xDERMATOLOGY X X X X X xDEVELOPMENTAL BIOLOGY X x x x x x x x x x x x x x x x x x x x x x xDEVELOPMENTAL NEUROSCIENCEDISCRETE MATHEMATICS AND COMBINATORICS x x x x x x x x x X X X X x x x x x x x xDRUG DISCOVERY X xEARTH AND PLANETARY SCIENCES MISCELLANEOUS X X x X x x x x X X X X X X X X X x X X X X X x X XEARTH-SURFACE PROCESSES X X x X X x X X X X X X X X X X X x X XECOLOGICAL MODELING X x x x x x x X X x X X x X X X x X X x X X X x X X xECOLOGY X x x x x x x X X x X X x X X X x X X x X X X x X X xECOLOGY, EVOLUTION, BEHAVIOR AND SYSTEMATICS X x x x x x x x x x x x x x x x x x x x x x xECONOMIC GEOLOGY X X x X x X X XECONOMICS AND ECONOMETRICS x x x x x x x x x x x x x x x X x x x x x X X x x x X X x x x X X X X X X X X WELECTRICAL AND ELECTRONIC ENGINEERING x x x x X X X x x x x x X X X X X X X x X x x x x X X x x x X x xELECTROCHEMISTRY x x x x X X x x x x x x x x xELECTRONIC, OPTICAL AND MAGNETIC MATERIALS x X x x x x x x X X x x x x x x x X X X X X X X x X X X x x X X x X X X X X X xEMBRYOLOGY X X X X X xEMERGENCY MEDICINE X X X X X xENDOCRINE AND AUTONOMIC SYSTEMSENDOCRINOLOGY X x x x x xENDOCRINOLOGY, DIABETES AND METABOLISM X X X X X xENERGY ENGINEERING AND POWER TECHNOLOGY x x x x x x x x x X X x X x x x x x x x x x xENGINEERING MISCELLANEOUS x x x x x x x x x X X x X x x x x x x x x x xENVIRONMENTAL CHEMISTRY X x x x x x x X X x X X x X X X x X X x X X X x X X xENVIRONMENTAL ENGINEERING X x x x x x x X X x X X x X X X x X X x X X X x X X xENVIRONMENTAL SCIENCE MISCELLANEOUS X x x x x x x X X x X X x X X X x X X x X X X x X X xEPIDEMIOLOGY X X X X X x x x x xEQUINE x x x X X X x x x x x x x x xFAMILY PRACTICE X X X X X x x x x xFILTRATION AND SEPARATION X xFINANCE x x x x x x x x x x x x x x x X x x x x X X x x x X X x x x X X X X X X X X WFLUID FLOW AND TRANSFER PROCESSES X xFOOD ANIMALS x x x X X X x x x x x x x x xFOOD SCIENCE X x x x X XFORESTRY x X X x x xFUEL TECHNOLOGY x x x x x x x x x X X x X x x x x x x x x x xGASTROENTEROLOGY X X X X X xGENETICS X x x x x x x x x x x x x x x x x x x x x x xGENETICS CLINICAL X X X X X xGEOCHEMISTRY AND PETROLOGY X X x X x X X XGEOLOGY X X x X x X X XGEOMETRY AND TOPOLOGY x x x x x x x x x X X X X x x x x x x x xGEOPHYSICS X X x X x X X XGEOTECHNICAL ENGINEERING AND ENGINEERING GEOLOGY x x X x x x X X X x x x x x x x x x x X x xGERIATRICS AND GERONTOLOGY X X X X X xGERONTOLOGY X X X X X xGLOBAL AND PLANETARY CHANGE X x x x x x x X X x x X X x X X X x x X X x x X X x X X X x X X x SHARDWARE AND ARCHITECTURE x x x x x x x x x X X X X x x x x x x x xHEALTH INFORMATICS X X X X X xHEALTH POLICY X X X X X x x x x x
Concordance table (Subject categories vs NACE-CLIO industrial sectors)
Take away
Big Data is not the solution per se
We need to manage the transition to the block-chain model of disintermediation
Indicators are in need of public trust in their validity, reliability and replicability
We need a relatively stable layer- i.e. institutions (stable, well known, trusted).
Vision 2026
• Reputation-based indicators• Funding agencies• Individual researchers’ repositories• Community-based co-creation
Individual-based repositories and reputation-based systems will not gain general acceptance unless they demonstrate their validity with respect to the universe. After that critical point (tipping point) they will grow in a self-sustainable way.
Census-based authority files will offer the benchmark against which these systems will demonstrate their coverage with respect to the universe of entities (i.e. researchers, institutions).
We have to prepare the day where full disintermediation will be in place.