12
The Emergence of Innovation Complexity at Different Geographical and Technological Scales Emanuele Pugliese 1 , Lorenzo Napolitano 1,2 , Matteo Chinazzi 3 , Guido L. Chiarotti 1,4 1 Institute for Complex Systems, Consiglio Nazionale delle Ricerche - Rome, Italy. 2 Institute of Economics, Scuola Superiore Sant’Anna - Pisa, Italy. 3 MOBS Lab, Network Science Institute, Northeastern University - Boston, MA, USA. 4 Department of Environmental, Land and Infrastructure Engineering, Politecnico di Torino - Torino, Italy. Abstract We define a novel quantitative strategy inspired by the ecological notion of nestedness to single out the scale at which innovation complexity emerges from the aggregation of specialized building blocks. Our analysis not only suggests that the innovation space can be interpreted as a natural system in which advantageous capabilities are selected by evo- lutionary pressure, but also that the emerging structure of capabilities is not independent of the scale of observation at which they are observed. Expanding on this insight allows us to understand whether the capabilities characterizing the innovation space at a given scale are compatible with a complex evolutionary dynamics or, rather, a set of essentially indepen- dent activities allowing to reduce the system at that scale to a set of disjoint non interacting sub-systems. This yields a measure of the innovation complexity of the system, i.e. of the degree of interdependence between the sets of capabilities underlying the system’s building blocks. 1 arXiv:1909.05604v1 [econ.GN] 12 Sep 2019

The Emergence of Innovation Complexity at Different

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: The Emergence of Innovation Complexity at Different

The Emergence of Innovation Complexity at DifferentGeographical and Technological Scales

Emanuele Pugliese1, Lorenzo Napolitano1,2, Matteo Chinazzi3,Guido L. Chiarotti1,4

1Institute for Complex Systems, Consiglio Nazionale delle Ricerche - Rome, Italy.2Institute of Economics, Scuola Superiore Sant’Anna - Pisa, Italy.

3MOBS Lab, Network Science Institute, Northeastern University - Boston, MA, USA.4Department of Environmental, Land and Infrastructure Engineering,

Politecnico di Torino - Torino, Italy.

Abstract

We define a novel quantitative strategy inspired by the ecological notion of nestednessto single out the scale at which innovation complexity emerges from the aggregation ofspecialized building blocks. Our analysis not only suggests that the innovation space canbe interpreted as a natural system in which advantageous capabilities are selected by evo-lutionary pressure, but also that the emerging structure of capabilities is not independent ofthe scale of observation at which they are observed. Expanding on this insight allows us tounderstand whether the capabilities characterizing the innovation space at a given scale arecompatible with a complex evolutionary dynamics or, rather, a set of essentially indepen-dent activities allowing to reduce the system at that scale to a set of disjoint non interactingsub-systems. This yields a measure of the innovation complexity of the system, i.e. of thedegree of interdependence between the sets of capabilities underlying the system’s buildingblocks.

1

arX

iv:1

909.

0560

4v1

[ec

on.G

N]

12

Sep

2019

Page 2: The Emergence of Innovation Complexity at Different

1 IntroductionWe define a novel quantitative strategy to single out the scale at which irreducible diversifica-tion – i.e. complexity – emerges from the aggregation of specialized building blocks. We thenapply this newly defined methodology to study the innovation space as described by patentedinventions and show the existence of a non-trivial interaction between the geographical andtechnological scales at which innovation complexity emerges. To capture the emergence of in-novation complexity, we leverage on a property shared by many ecological and socio-economicsystems: nestedness.

In many ecosystems, populations of pollinator insects and plants coexist, some of whichare specialized while others are more generalist. A commonly observed characteristic of theseecosystems is that specialist insects, which pollinate few species of plants, tend to prefer mostlygeneralist plants, which in turn are targeted by many different pollinator species, both specialistand not [1]. The resulting general lack of specialist-specialist interactions is called nestednessand it is addressed by several ecological studies about the distribution of communities of speciesin geographically accessible sites [2, 3, 4, 5, 6] and the structural organization of species inter-action networks [1, 7, 8, 9, 10, 11, 12, 13].

Nestedness can be rephrased in terms of the way in which traits are selected by evolutionarypressure. For instance, if we observed specialist insects interacting mostly with specialist plants,we could infer that characteristics (i.e. capabilities) which benefit foraging from – or beingpollinated by – some species tend to hinder interaction with others; this would translate toa plant-pollinator system consisting of non-interacting sub-systems. Nestedness implies thatcapabilities are not mutually exclusive, but rather that they imply each other through cumulativecausation. Consequently, a nested plant-pollinator system will consist of non-separable sub-systems characterized by non-independent capability sets and emerging complexity.

Interestingly, the complexity in the structure of capabilities that emerges from the observa-tion of an ecological niche at a scale of observation might not emerge at different scales. Forexample, zooming into the social structure of honeybee communities highlights a strong divi-sion of labor involving specialization and non-overlapping capabilities between the fertile butdependent queen, the female workers, and the stingless male drones. Indeed, the nested capa-bilities making honeybees generalist pollinators emerge only via aggregation from the compart-mentalized capabilities behind their social cast structure and in the framework of the interactionbetween their social structures and the rest of the ecosystem.

Nestedness has been shown to characterize also social systems [14, 15], allowing to ranktheir agents on the basis of their fitness [16]. Similarly, in the innovation space the presenceof nestedness in the network of relations linking geographical areas hosting inventors to thetechnologies embedded in their patented inventions is a sign that more complex activities implythe capability to successfully perform simpler ones. This implies the presence of a complexand irreducible system of interactions, in contrast with the intuition of mainstream models ofeconomic specialization. In a system characterized by clustered dependencies between tech-nologies (i.e. low nestedness) innovators could afford to specialize in a small set of arbitrary

2

Page 3: The Emergence of Innovation Complexity at Different

patents. Conversely, in a system governed by complex interactions between aggregates, thenested nature of the set of capabilities would force the fittest agents to treat simpler activities asstepping stones to master more complex innovations. In this paper we show that this is preciselythe case when the system is observed at the appropriate scale. This way, we aim to inform therich debate economic literature concerning the role of knowledge spillovers in fostering inno-vation [17, 18, 19, 20] by pointing out the role that the geographical scale of the analysis playsin driving empirical results and eventually contribute to defining a solid empirical basis [21] forfuture theoretical as well as policy-making efforts.

There is a rich theoretical literature describing the innovation space as a complex system inwhich the intricate interconnectedness between fields of knowledge rises both directly – fromthe recombination of concepts from different fields and the influence that innovations have oncollective behavior and thinking patterns [22, 23] – and indirectly – through the multiplicativeeffect that some innovations (e.g. in computing, instrument building, information communica-tion technology) have on the potential to innovate in unrelated fields [24, 25]. Our methodologyadds to this insight by measuring the nestedness of the innovation system at multiple resolutionsand showing that the scale of observation not only matters, but that there is a non-trivial interac-tion between geographical and technological scale. Such interaction determines the emergenceof a well-defined frontier separating the scale of observation characterized by diversification ofthe fittest (high nestedness) from the one displaying evolutionary pressure towards specializa-tion (low nestedness). To our best knowledge, this is the first time that the geographical scale atwhich complexity emerges is identified in a quantitative manner.

2 Representing the Innovation SpacePatent applications, being one of the main sources of codified information concerning inventions[26], are an ideal source of data for our analysis because the associated metadata provides uswith a standardized classification system – the International Patent Classification (IPC) scheme1

– that allows to associate the claims of innovativeness of each invention to one or more techno-logical fields. Furthermore, we are able to link patent inventors or assignees to their physicalresidence (or seat) at the time the patent application was filed and therefore to assign codes tospecific geographical regions. This allows us to represent the innovation space as a bipartitegraph connecting two types of nodes: geographical areas and technological fields .

The geographical and technological dimensions thus constructed are inherently hierarchicaland, hence, suitable for the exploration of the data at the variety of scale combinations requiredfor our investigation. For all available scale combinations, we use IPC codes to decomposepatents into sets of technological fields and assign patentees to their geographical regions. Anequal share of each invention is attributed to all the involved location-technology pairs. Eachpair maps to a cell of the so-called Innovation Map (IM ) which represents the innovation spaceat a given scale combination the same way in which geographic maps represent the Earth surface

1http://www.wipo.int/classifications/ipc/en/

3

Page 4: The Emergence of Innovation Complexity at Different

at different levels of resolution. Innovation maps can also be interpreted as incidence matricesof a bipartite network in which rows correspond to geograhical locations, columns correspondto technology fields, and entries are nonzero whenever the corresponding location-technologycombination displays a revealed competitive advantage 2 and is thus well-equipped to innovatein a specific technological field. Figure 1 depicts some examples of IMs computed at specificgeographical and technological resolutions. Given these representations of the innovation space, we measure their nestedness [27] to uncover the footprint of innovation complexity.

3 Shuffling Capabilities: a Null ModelIn order to test the statistical significance of the characteristic nestedness of a specific scaleof observation, we need a suitable null hypothesis accounting for the fact that some nested-ness carries over to any random rearrangement of the data contained in the IMs from lowerresolutions. As a result, the excess nestedness we measure at any scale is a result of enhanc-ing the resolution of data representation. To this aim, we contrast the nestedness of a highresolution empirical innovation map, IMhighRes, with the expected nestedness given the dis-tribution of technological capabilities observed in its lower resolution counterpart IM lowRes.In particular, we generate a null distribution of nestedness values from a set of synthetic IMsof the same shape as IMhighRes which preserve the distribution of technological competenceswithin IM lowRes. If the nestedness of IMhighRes is not significantly different to that implied bythe null distribution, we deduce that zooming-in brings out no further nestedness with respectto IM lowRes and hence highlights no further structural property of the system. On the otherhand, finding a significantly higher or lower nestedness than the null distribution suggests thatIMhighRes is either significantly more or significantly less nested than implied by IM lowRes andhence that changing scale yields information about the system.

Put into the context of the innovation space, our null hypothesis implies that innovativecapabilities relative to a specific technological field that are observed in a given geographi-cal area (e.g. a nation) are randomly shuffled between the child sub-regions of said area (e.gregions or states). The resulting null model – which we call Reshuffled Capabilities model(ReCap) – allows to define a distribution of nestedness values that are compatible with thenestedness measured at a coarser geograhical resolution. Assessing the significance of the nest-edness of IMhighRes with respect to the null distribution amounts to washing out the nestednessof IM lowRes from IMhighRes thus allowing to test the alternative hypothesis that observingthe system at a higher resolution uncovers unexpectedly high (or low) nestedness in the geo-graphical structure of technological capabilities (which we interpret as a signal of irreduciblediversification) against the null hypothesis that the nestedness observed in IMhighRes is nothingmore than the residual of that already measured in the more aggregated matrix.

The ReCap fixes the ubiquities of the technological codes (i.e. the column sums) withinIMhighRes while letting the locations (i.e the row sums) vary freely within the blocks of rows

2The results are robust to different binarization strategies.

4

Page 5: The Emergence of Innovation Complexity at Different

corresponding to the rows of IM lowRes. In other words, building a null matrix from IMhighRes

amounts to splitting it into its constituent sub-matrices, shuffling each one according to thenull model, and then recomposing the mosaic. The nestedness temperature [28, 27] of the nullmatrices is measured and used to construct the null distribution, against which the nestednesstemperature of IMhighRes is compared to assess significance.

Figure 2 contains a schematic representation of the overall procedure starting from collect-ing the geographical and technological information from patent applications to applying ReCapto measure the significance of observed nestedness.

4 ResultsOur analysis shows that, for a given technological disaggregation, zooming to a finer geograph-ical scale produces a structure that is less and less significantly nested with respect to a randomreallocation of technological competitiveness among the geographical subunits of the systemrepresented at the coarser scale. Eventually, zooming-in along the geograhical dimension of thesystem yields a matrix that is consistent with a process fostering specialization over diversifica-tion. This implies that the less common and more complex technology codes are predominantlytargeted by firms localized in specialized areas when such areas are sufficiently narrow. Forexample, when we consider technologies at the IPC sub-class scale, we observe two regimes.When the geographical scale is set to identify nations as units of analysis, the individual geo-graphical areas display a more-than-random level of diversification (i.e. the innovation spaceis nested). Instead, when we zoom-in at the regional level of disaggregation, specializationemerges and we find that rare technologies are pursued in specialized industrial districts (i.e.the innovation space is anti-nested).

If we perform the same exercise as above but let the technological scale vary while keep-ing the geographical dimension fixed, we observe a similar yet mirrored situation, in whichzooming-out produces a structure that is less and less significantly nested and more specializedthan what is predicted by the null model. For example, at the regional level of geographicalaggregation, we observe that nestedness and hence diversification are higher at a finer techno-logical scale, while the system is significantly less nested and more specialized at the broadestlevel of technological aggregation. The overall picture that we obtain is that diversification pre-vails in the upper-left corner of the matrix, while specialization dominates in the lower-rightcorner. In other words, the scales at which the phenomenon is observed define a frontier sepa-rating the scale combinations that are significantly more nested than would be expected giventhe nestedness of a more coarse-grained representation of the same system from the scale com-binations that are significantly less nested. Figure 3 summarizes our findings.

5

Page 6: The Emergence of Innovation Complexity at Different

5 DiscussionIt is well known in economics that diversification is typically observed in macroeconomic ag-gregates, while specialization characterizes the behavior of microeconomic agents. However,while intuition suggests that countries are highly diversified entities and that the firms consti-tuting their productive tissue are bound to a higher degree of specialization due to capacityconstraints, it is not a trivial task to locate the boundary separating diversified scales character-ized by complex interactions between capabilities from specialized scales at which capabilitiesare clustered in relatively independent sets. Our work adds insight in this direction by show-ing that there are significant interactions between geographical and technological scales and wedemonstrate that the prevalence of specialization vis a vis diversification in the fittest locationsdepends on the scale of observation.

In particular, we can measure whether the aggregated capabilities linking locations to tech-nologies at a given scale are nested – suggesting a complex (evolutionary) behavior of thesystem – or rather the result of essentially independent activities allowing to reduce the systemat that scale to a set of disjoint non interacting sub-systems. For example, we find that at theregional level of aggregation, locations are specialists if we observe the system at a coarsertechnological scale (e.g. IPC Section) but they become generalists if we observe the system ata finer technological scale (e.g. IPC groups). Therefore, diversification and specialization arenot elements of a dichotomy, but rather the extremes of a continuum that is uncovered when thegeographical and technological granularity of the innovation system are defined appropriately.

Our study has direct implications for the literature on National and Regional Systems ofInnovation [29, 30] and can be used to inform policy makers about the dependence of the tech-nological scope to be pursued depending on the geographical scale at which they operate. Morein general, although the present analysis focuses on technological innovation, our methodologycan be generalized to a broader class of social and natural systems characterized by emergentinteractions between aggregates at different scales (e.g. geographical, technological, temporal,etc..) to uncover the seeds of irreducible complexity.

6

Page 7: The Emergence of Innovation Complexity at Different

References[1] Jordi Bascompte, Pedro Jordano, Carlos J Melian, and Jens M Olesen. The nested as-

sembly of plant–animal mutualistic networks. Proceedings of the National Academy ofSciences, 100(16):9383–9387, 2003.

[2] Bruce D Patterson and Wirt Atmar. Nested subsets and the structure of insular mammalianfaunas and archipelagos. Biological Journal of the Linnean Society, 28(1-2):65–82, 1986.

[3] David H Wright and Jaxk H Reeves. On the meaning and measurement of nestedness ofspecies assemblages. Oecologia, 92(3):416–428, 1992.

[4] Alan Cutler. Nested faunas and extinction in fragmented habitats. Conservation Biology,5(4):496–504, 1991.

[5] Alan H Cutler. Nested biotas and biological conservation: metrics, mechanisms, andmeaning of nestedness. Landscape and Urban Planning, 28(1):73–82, 1994.

[6] David H Wright, Bruce D Patterson, Greg M Mikkelson, Alan Cutler, and Wirt Atmar.A comparative analysis of nested subset patterns of species composition. Oecologia,113(1):1–20, 1997.

[7] Yoko L Dupont, Dennis M Hansen, and Jens M Olesen. Structure of a plant–flower-visitornetwork in the high-altitude sub-alpine desert of tenerife, canary islands. Ecography,26(3):301–310, 2003.

[8] Jeff Ollerton, Duncan McCollin, Daphne G Fautin, and Gerald R Allen. Finding nemo:nestedness engendered by mutualistic organization in anemonefish and their hosts. Pro-ceedings of the Royal Society of London B: Biological Sciences, 274(1609):591–598,2007.

[9] JEFF Ollerton, Steven D Johnson, Louise Cranmer, and SAM Kellie. The pollination ecol-ogy of an assemblage of grassland asclepiads in south africa. Annals of Botany, 92(6):807–834, 2003.

[10] Paulo R Guimaraes, Victor Rico-Gray, Sergio Furtado Dos Reis, and John N Thompson.Asymmetries in specialization in ant–plant mutualistic networks. Proceedings of the RoyalSociety of London B: Biological Sciences, 273(1597):2041–2047, 2006.

[11] Paulo R Guimaraes, Cristina Sazima, Sergio Furtado Dos Reis, and Ivan Sazima. Thenested structure of marine cleaning symbiosis: is it like flowers and bees? Biology Letters,3(1):51–54, 2007.

[12] Thomas M Lewinsohn, Paulo Inacio Prado, Pedro Jordano, Jordi Bascompte, and JensM Olesen. Structure in plant–animal interaction assemblages. Oikos, 113(1):174–184,2006.

7

Page 8: The Emergence of Innovation Complexity at Different

[13] KC Burns. Network properties of an epiphyte metacommunity. Journal of Ecology,95(5):1142–1151, 2007.

[14] Cesar A Hidalgo, Bailey Klinger, A-L Barabasi, and Ricardo Hausmann. The productspace conditions the development of nations. Science, 317(5837):482–487, 2007.

[15] Ricardo Hausmann and Cesar A Hidalgo. The network structure of economic output.Journal of Economic Growth, 16(4):309–342, 2011.

[16] Andrea Tacchella, Matthieu Cristelli, Guido Caldarelli, Andrea Gabrielli, and LucianoPietronero. A new metrics for countries’ fitness and products’ complexity. Scientificreports, 2:723, 2012.

[17] Ron Boschma, Pierre-Alexandre Balland, and Dieter Franz Kogler. Relatedness and tech-nological change in cities: the rise and fall of technological knowledge in us metropolitanareas from 1981 to 2010. Industrial and corporate change, 24(1):223–250, 2014.

[18] Ron Boschma, Gaston Heimeriks, and Pierre-Alexandre Balland. Scientific knowledgedynamics and relatedness in biotech cities. Research Policy, 43(1):107–114, 2014.

[19] Pierre-Alexandre Balland, Ron Boschma, and Koen Frenken. Proximity and innovation:From statics to dynamics. Regional Studies, 49(6):907–920, 2015.

[20] Pierre-Alexandre Balland and David Rigby. The geography of complex knowledge. Eco-nomic Geography, 93(1):1–23, 2017.

[21] Koen Frenken and Ron A Boschma. Why is economic geography not an evolutionaryscience? towards an evolutionary economic geography. In Economy, pages 127–156.Routledge, 2017.

[22] Deborah Strumsky, Jose Lobo, and Sander Van der Leeuw. Using patent technology codesto study technological change. Economics of Innovation and New technology, 21(3):267–286, 2012.

[23] Francesca Tria, Vittorio Loreto, Vito Domenico Pietro Servedio, and Steven H Strogatz.The dynamics of correlated novelties. Scientific reports, 4, 2014.

[24] Timothy F Bresnahan and Manuel Trajtenberg. General purpose technologies engines ofgrowth? Journal of econometrics, 65(1):83–108, 1995.

[25] Lorenzo Napolitano, Evangelos Evangelou, Emanuele Pugliese, Paolo Zeppini, and Gra-ham Room. Technology networks: the autocatalytic origins of innovation. Royal Societyopen science, 5(6):172445, 2018.

[26] Zvi Griliches. Patent statistics as economic indicators: a survey. Technical report, NationalBureau of Economic Research, 1990.

8

Page 9: The Emergence of Innovation Complexity at Different

[27] Miguel A Rodrıguez-Girones and Luis Santamarıa. A new algorithm to calculate the nest-edness temperature of presence–absence matrices. Journal of Biogeography, 33(5):924–935, 2006.

[28] Wirt Atmar and Bruce D Patterson. The measure of order and disorder in the distributionof species in fragmented habitat. Oecologia, 96(3):373–382, 1993.

[29] Christopher Freeman. Technology policy and economic performance. Pinter PublishersGreat Britain, 1989.

[30] Philip Cooke. Regional innovation systems, clusters, and the knowledge economy. Indus-trial and corporate change, 10(4):945–974, 2001.

9

Page 10: The Emergence of Innovation Complexity at Different

Figure 1: Innovation Maps at different scale combinations. This figure, reports some Inno-vation Maps (IMs) at different levels of resolution (zooms), both along the geographical andtechnological dimensions. The relation matrices are built by joining information about the tech-nological content of patents (IPC Codes) and the seat (or residence) of patentees . The coarsestIM reported in this figure (top-right) consists of 74 rows and 129 columns, while the highestresolution IM (bottom-left) represents 230 regions and 643 technological classes. The matri-ces have been reordered to emphasize nestedness by means of the algorithm proposed by [16].Note that the nonzero entries (colored dots) are concentrated in the top-left corner at all scales,though to different extents, suggesting the presence of widespread nestedness throughout thesystem.

10

Page 11: The Emergence of Innovation Complexity at Different

Figure 2: From Patents to Nestedness Temperature This is a schematic representation ofthe overall procedure starting from collecting the geographical and technological informationfrom patent applications to applying the Reshuffled Capabilities (ReCap) null model to measurethe significance of observed nestedness. In this example, the empirical matrix is blue becauseits measured nestedness temperature is significantly lower (around 3σ) than the average nullmatrix, implying that it is significantly more nested.

11

Page 12: The Emergence of Innovation Complexity at Different

Figure 3: Statistical significance of measured Innovation Map temperatures at all scalecombinations. Each row reflects the analysis conducted at a specific geographical aggregation,while each column reflects the analysis conducted at a specific technological aggregation. Thecolor of the graph spans from blue to red for empirical matrices that show higher (blue) or lower(red) degree of nestedness than expected under the ReCap null model. The darker the color, thehigher the statistical significance with respect to the null model. For example, the upper left bluecell in the array indicates that the system is significantly more nested (i.e. more diversified) thana random case in which the null matrix has as many rows as the number of nations included inthe data and as many columns as the number of technology codes defined according to the finestclassification. The above results are robust across time and also hold under different definitionsof the patent sample and different binarization strategies for the IMs.

12