4
LETTERS Population context determines cell-to-cell variability in endocytosis and virus infection Berend Snijder 1,2 , Raphael Sacher 1,2 , Pauli Ra ¨mo ¨ 1 , Eva-Maria Damm 1 , Prisca Liberali 1 & Lucas Pelkmans 1 Single-cell heterogeneity in cell populations arises from a combina- tion of intrinsic and extrinsic factors 1–3 . This heterogeneity has been measured for gene transcription, phosphorylation, cell mor- phology and drug perturbations, and used to explain various aspects of cellular physiology 4–6 . In all cases, however, the causes of heterogeneity were not studied. Here we analyse, for the first time, the heterogeneous patterns of related cellular activities, namely virus infection, endocytosis and membrane lipid composi- tion in adherent human cells. We reveal correlations with specific cellular states that are defined by the population context of a cell, and we derive probabilistic models that can explain and predict most cellular heterogeneity of these activities, solely on the basis of each cell’s population context. We find that accounting for population-determined heterogeneity is essential for interpret- ing differences between the activity levels of cell populations. Finally, we reveal that synergy between two molecular components, focal adhesion kinase and the sphingolipid GM1, enhances the population-determined pattern of simian virus 40 (SV40) infection. Our findings provide an explanation for the origin of heterogeneity patterns of cellular activities in adherent cell populations. Virus infection can be considered the end result of the concerted action of many cellular processes, including endocytosis and the establishment of membrane lipid composition 7,8 . It is well accepted that these activities display heterogeneity within a population of cells, but the underlying causes of this heterogeneity are not known. We can therefore not predict how such activities behave in a population of cells 9 . To study the heterogeneity of these activities, we examined large populations of single monoclonal adherent human cells after three days of growth. These were infected by rotavirus, dengue virus, mouse hepatitis virus (MHV) or SV40. MHV uses clathrin-mediated endo- cytosis (CME) for infectious entry into the cell 10 , and SV40 binds to the sphingolipid GM1 for host cell attachment and entry 11,12 . To ana- lyse CME, we allowed cells to internalize fluorescent transferrin 13 . To determine the amount of GM1, which is enriched in lipid rafts on the cell surface 14 , we exposed cells to fluorescent cholera toxin B 15 . We observed that the cell-to-cell variation or heterogeneity of these activities displays specific patterns in both cancer- (HeLa and A431) (Fig. 1A) and non-cancer-derived (MCF10A) cells (see below). The patterns could not have been the result of physical constraints on the accessibility of virus particles or fluorescent probes to particular cells, because different types of pattern were observed. A potential source of single-cell heterogeneity is the deterministic interplay between the phenotypic state of a cell and its activities 2,16 , mediated by sensors and signal transduction networks. Growing iso- genic adherent mammalian cells create cell islets and regions that are sparsely or densely populated, to which cells adapt their size, shape and rate of proliferation 17,18 (Supplementary Movie 1), and display several discrete subpopulations 19 . Together, this implies that patterns of varying cellular states can arise from the adaptation of individual human cells to the particular ‘population context’ in which they reside. The coupling between cell size and timing of meiosis in yeast 20 , between cell size and the determination of phage infection outcome in Escherichia coli 21 , and the self-assembly of prokaryotic cells into complex colony patterns 22 , indicate that such mechanisms also operate in unicellular organisms. We therefore hypothesized that the heterogeneity patterns that we observed are caused by regulatory mechanisms that couple the phenotypic state of a cell to intracellular processes on which virus infection depends, such as endocytosis and the lipid composition of the cell surface. To reveal the presence of such mechanisms, we used three data- driven modelling approaches on several single-cell measurements obtained by computerized image analysis and supervised machine learning from multiple large populations of cells. (See Fig. 1B for an overview of the methods used, and Supplementary Methods for detailed information.) We first measured for each cell the size of the population to which it belonged, its local cell density, its position on a cell islet edge, its cell size, its mitotic state and its apoptotic state (microenvironment and cell state). By delineating how these para- meters interact with each other, we quantified how the characteristics of a cell population are determined. This showed that each para- meter, in particular population size, local cell density, position on a cell islet edge, and cell size, represents a population-determined property of an individual cell. The same interactions were observed in cancer cells (Supplementary Fig. 1), normal diploid cells (see below) and primary cells (not shown), despite their different morphologies and growth rates. For each cell, we next determined whether it was infected, or we quantified its CME activity or cell-surface level of GM1 (cellular activities). We then used the population-determined properties as predictors to model the probability of infection, the activity of CME and the amount of GM1 on the cell surface. In each instance, models could be derived that had good fits (Fig. 1C and Supplementary Fig. 2). The model parameters demonstrated the existence of exten- sive and specific regulatory mechanisms between population context and virus infection, endocytosis and membrane lipid composition (Fig. 1D). Importantly, the models were able to predict accurately the heterogeneity patterns of cellular activities, solely based on a quanti- tative assessment of the population context and the state of indi- vidual cells (Fig. 1E). An interesting question that arises is to what extent these population- determined effects contribute to the total variation observed in a population. Strikingly, variance analysis (Supplementary Methods) demonstrated that variation determined by population context is a major component of the total variation observed. For rotavirus, SV40 and dengue virus infection, our measures of the local environment 1 Institute of Molecular Systems Biology, ETH Zurich (Swiss Federal Institute of Technology), Wolfgang Pauli-Strasse 16, CH-8093 Zurich, Switzerland. 2 Zurich PhD Program in Molecular Life Sciences, Zurich, Switzerland. Vol 461 | 24 September 2009 | doi:10.1038/nature08282 520 Macmillan Publishers Limited. All rights reserved ©2009

Vol 461 24 September 2009 LETTERS - Pelkmanslab · LETTERS Population context determines cell-to-cell variability in endocytosis and virus infection Berend Snijder1,2, Raphael Sacher1,2,

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Vol 461 24 September 2009 LETTERS - Pelkmanslab · LETTERS Population context determines cell-to-cell variability in endocytosis and virus infection Berend Snijder1,2, Raphael Sacher1,2,

LETTERS

Population context determines cell-to-cell variabilityin endocytosis and virus infectionBerend Snijder1,2, Raphael Sacher1,2, Pauli Ramo1, Eva-Maria Damm1, Prisca Liberali1 & Lucas Pelkmans1

Single-cell heterogeneity in cell populations arises from a combina-tion of intrinsic and extrinsic factors1–3. This heterogeneity hasbeen measured for gene transcription, phosphorylation, cell mor-phology and drug perturbations, and used to explain variousaspects of cellular physiology4–6. In all cases, however, the causesof heterogeneity were not studied. Here we analyse, for the firsttime, the heterogeneous patterns of related cellular activities,namely virus infection, endocytosis and membrane lipid composi-tion in adherent human cells. We reveal correlations with specificcellular states that are defined by the population context of a cell,and we derive probabilistic models that can explain and predictmost cellular heterogeneity of these activities, solely on the basisof each cell’s population context. We find that accounting forpopulation-determined heterogeneity is essential for interpret-ing differences between the activity levels of cell populations.Finally, we reveal that synergy between two molecular components,focal adhesion kinase and the sphingolipid GM1, enhances thepopulation-determined pattern of simian virus 40 (SV40) infection.Our findings provide an explanation for the origin of heterogeneitypatterns of cellular activities in adherent cell populations.

Virus infection can be considered the end result of the concertedaction of many cellular processes, including endocytosis and theestablishment of membrane lipid composition7,8. It is well acceptedthat these activities display heterogeneity within a population of cells,but the underlying causes of this heterogeneity are not known. Wecan therefore not predict how such activities behave in a populationof cells9.

To study the heterogeneity of these activities, we examined largepopulations of single monoclonal adherent human cells after threedays of growth. These were infected by rotavirus, dengue virus, mousehepatitis virus (MHV) or SV40. MHV uses clathrin-mediated endo-cytosis (CME) for infectious entry into the cell10, and SV40 binds tothe sphingolipid GM1 for host cell attachment and entry11,12. To ana-lyse CME, we allowed cells to internalize fluorescent transferrin13. Todetermine the amount of GM1, which is enriched in lipid rafts on thecell surface14, we exposed cells to fluorescent cholera toxin B15.

We observed that the cell-to-cell variation or heterogeneity ofthese activities displays specific patterns in both cancer- (HeLa andA431) (Fig. 1A) and non-cancer-derived (MCF10A) cells (see below).The patterns could not have been the result of physical constraints onthe accessibility of virus particles or fluorescent probes to particularcells, because different types of pattern were observed.

A potential source of single-cell heterogeneity is the deterministicinterplay between the phenotypic state of a cell and its activities2,16,mediated by sensors and signal transduction networks. Growing iso-genic adherent mammalian cells create cell islets and regions that aresparsely or densely populated, to which cells adapt their size, shapeand rate of proliferation17,18 (Supplementary Movie 1), and display

several discrete subpopulations19. Together, this implies that patternsof varying cellular states can arise from the adaptation of individualhuman cells to the particular ‘population context’ in which theyreside. The coupling between cell size and timing of meiosis in yeast20,between cell size and the determination of phage infection outcomein Escherichia coli21, and the self-assembly of prokaryotic cells intocomplex colony patterns22, indicate that such mechanisms alsooperate in unicellular organisms. We therefore hypothesized thatthe heterogeneity patterns that we observed are caused by regulatorymechanisms that couple the phenotypic state of a cell to intracellularprocesses on which virus infection depends, such as endocytosis andthe lipid composition of the cell surface.

To reveal the presence of such mechanisms, we used three data-driven modelling approaches on several single-cell measurementsobtained by computerized image analysis and supervised machinelearning from multiple large populations of cells. (See Fig. 1B for anoverview of the methods used, and Supplementary Methods fordetailed information.) We first measured for each cell the size ofthe population to which it belonged, its local cell density, its positionon a cell islet edge, its cell size, its mitotic state and its apoptotic state(microenvironment and cell state). By delineating how these para-meters interact with each other, we quantified how the characteristicsof a cell population are determined. This showed that each para-meter, in particular population size, local cell density, position ona cell islet edge, and cell size, represents a population-determinedproperty of an individual cell. The same interactions were observedin cancer cells (Supplementary Fig. 1), normal diploid cells (seebelow) and primary cells (not shown), despite their differentmorphologies and growth rates.

For each cell, we next determined whether it was infected, or wequantified its CME activity or cell-surface level of GM1 (cellularactivities). We then used the population-determined properties aspredictors to model the probability of infection, the activity of CMEand the amount of GM1 on the cell surface. In each instance, modelscould be derived that had good fits (Fig. 1C and SupplementaryFig. 2). The model parameters demonstrated the existence of exten-sive and specific regulatory mechanisms between population contextand virus infection, endocytosis and membrane lipid composition(Fig. 1D). Importantly, the models were able to predict accurately theheterogeneity patterns of cellular activities, solely based on a quanti-tative assessment of the population context and the state of indi-vidual cells (Fig. 1E).

An interesting question that arises is to what extent these population-determined effects contribute to the total variation observed in apopulation. Strikingly, variance analysis (Supplementary Methods)demonstrated that variation determined by population context is amajor component of the total variation observed. For rotavirus, SV40and dengue virus infection, our measures of the local environment

1Institute of Molecular Systems Biology, ETH Zurich (Swiss Federal Institute of Technology), Wolfgang Pauli-Strasse 16, CH-8093 Zurich, Switzerland. 2Zurich PhD Program inMolecular Life Sciences, Zurich, Switzerland.

Vol 461 | 24 September 2009 | doi:10.1038/nature08282

520 Macmillan Publishers Limited. All rights reserved©2009

Page 2: Vol 461 24 September 2009 LETTERS - Pelkmanslab · LETTERS Population context determines cell-to-cell variability in endocytosis and virus infection Berend Snijder1,2, Raphael Sacher1,2,

and cellular state account for at least 60%, 74% and 82% of therespective total variation for each activity (Fig. 2a). For CME andGM1 content on the cell surface, variation determined by populationcontext appeared to constitute at least 30% and 21% of the respectivetotal variation (Fig. 2a).

To understand better the variation unexplained by populationcontext, we analysed the variance in CME activity in groups of cellswith similar population-determined phenotypic properties. Thisrevealed that cells in certain population contexts displayed as muchas a 20-fold lower variation than randomly selected cells (inset Fig. 2a).When we analysed this variance relative to the mean CME activityin these groups of cells (coefficient of variation), we observed athreefold lower coefficient of variation in cells with a high local celldensity (Fig. 2b). This indicates that these cells have, besides a highCME activity, mechanisms in place that exert a tight control on thisactivity.

The finding that much of the variation in virus infection, endocy-tosis and lipid composition of the cell surface is deterministicallyestablished by the adaptation of cells to their population context revealsa fundamental problem in our current methods of studying differencesin these activities between cell populations. We illustrate this by com-paring the fraction of cells infected with dengue virus (Fig. 2c) and thesingle-cell distributions of CME activity (Fig. 2d) between unperturbedpopulations of different sizes. The activities display levels that are stati-stically different (respectively P 5 2.4 3 1026 and P , 10210) betweenthe different unperturbed populations. However, after normalizationof both activities with activities predicted by their respective models, nosignificant differences were found (P values of respectively 0.97 and0.75). This is the correct conclusion, as both populations were un-perturbed and only differed in their population context. A similarproblem exists with supervised and unsupervised clustering methods.Even though such methods correctly identify different phenotypic

Rot

aviru

s

0.01

0.03

0.05

0.07

0.08

0.10

MH

V

0.00

0.06

0.12

0.17

0.23

0.29

Measured infectionPredicted infection level Predicted CME

Measured CME

Predicted single-cellinfection

Den

gue

viru

s

0.06

0.07

0.09

0.11

0.13

SV

40

0.00

0.05

0.10

0.14

0.19

0.24

B

E

C

EDGE

Virus infectionEndocytosisLipid composition

Population size

Mitotic stateApoptotic state

Local cell densityCell size

Colony edge

Assays

Selected single-cell featuresMicroenvironment Cell state

Cellular activities

Regression modelling of activity

Bayesian network learning

Microenvironment

Cell state andmolecular markers

Cellular activity

Correct extrinsic heterogeneity

Xij

250 features per cell

F

E

D C A

E

GF

Predict activity

Graphical Gaussian modelling

Model micro-environment and cell state

A

B

CD

EF

G

H

1 2 3 4 5 6 7 8 9 10 11 12

B

APOP

POP.SIZE

DENSITYSIZE

MITOT

0

4

8

12

× 105 × 105 × 105 × 104× 106

–3–2–1012

Log 2 r

elat

ive

infe

ctio

nin

dex

SV40 infection

Local cell density (cells per 2 × 103 μm2)2 4 6 8 10

–1–0.5

00.5

11.5

Dengue virusinfection

Non

-ed

ge

Ed

ge

0

2

4

6

8

0

1

2

3

4

–2

0

2

4

Rotavirus infection

Population size (× 104)0.5 1.5 2 2.5

0

2

4

6

8

–2

–1

0

1

2

MHV infection

Local cell density (cells per 2 × 103 μm2)

Tota

l num

ber

of c

ells

2 4 6 8 10 00

4

8

12

0.3

0.4

0.5

0.6

0.7

CME activity

Local cell density (cells per 2 × 103 μm2)Log 10

CM

E a

ctiv

ity (a

.u.)

21 3 54 6 7

D

SV40

GM1

Denguevirus

Rotavirus

CME

MHV

Local cell density

Cell size

Cell islet edges

Population size

Mitotic state

Apoptotic state

–0.12

–0.11

–0.16

–0.08

3.10

0.05

0.12

–0.52

–0.04

0.00

–0.48

0.01

0.13

4.37

0.60

0.72

–6.65

–0.11

0.02

–0.74

–0.01

–0.03

–0.25

0.02

–0.14

ND

ND ND

ND

0.65

0.50

–0.04

0.19

0.90

0.39

–0.01

1st– +

2nd

3rd

R2 r

ank

Regression coefficients

Cell stateMicroenvironment

A

GM1

DV CME SV40

MHV

Model Measured

a

bd

e

c

Figure 1 | Modelling and predicting diverse activity patterns inheterogeneous human cell populations. A, GM1 levels on the plasmamembrane of A431 cells and dengue virus (DV) infection in HeLa cells arehigher in cells at islet edges (upper and lower left panels). MHV infection andCME (upper right and middle panels) are highest in HeLa cells in crowdedregions. SV40 (lower right panel) infects HeLa cells that are large and spreadout. Scale bars: GM1 and dengue virus, 50 mm; CME, MHV and SV40respectively 300, 250 and 150mm. B, Overview of the experimental,computational and statistical methods used in this study (see Methods andSupplementary Methods for details). C, Virus infection (n 5 9; grey

box-plots) and CME (black lines; average 6 single-cell s.d.) plotted againstpopulation-context parameters. Model fits are depicted in red. Right axesdisplay the number of cells measured (light-grey bars and lines).D, Regression coefficients between population-context parameters(columns) and cellular activities (rows), colour-coded according tocoefficient sign and top-three ranked explanatory power (see legend). ND,not determined. E, Model-predicted virus infection and CME patternsdisplayed on computer-generated reproductions of original images (leftpanels). Predicted and measured single-cell activity levels are shown (middleand right panels).

NATURE | Vol 461 | 24 September 2009 LETTERS

521 Macmillan Publishers Limited. All rights reserved©2009

Page 3: Vol 461 24 September 2009 LETTERS - Pelkmanslab · LETTERS Population context determines cell-to-cell variability in endocytosis and virus infection Berend Snijder1,2, Raphael Sacher1,2,

states (Supplementary Fig. 3), they will not reveal whether these pheno-typic states are determined by cellular adaptations to the populationcontext. Therefore, these methods will be unable to distinguishbetween a change in cellular activity that is a consequence of an alteredpopulation context and a change that is a consequence of a directperturbation.

Besides these fundamental implications, our analysis revealssuggestions for the mechanisms underlying population-determinedheterogeneity of virus infection (see ranking in Fig. 1d and Sup-plementary Fig. 4). The infection efficiency of rotavirus is stronglyincreased in cells that grow in sparsely populated areas. Indeed, rota-virus uses integrins as its receptor23, which are probably highlyexpressed on the surface of cells that grow sparsely24. MHV, on thecontrary, prefers to infect cells that grow at high local density, similarto where CME is most active. Dengue virus infection occurs almostexclusively in cells located on the edge of cell islets, indicating that itrelies on mechanisms highly active in polarized cells25.

The heterogeneity signature of SV40 infection demonstrated aparticular preference for large and spread-out cells in both cancer-and non-cancer-derived cell lines (see below). As these cells are pre-dominantly located on the edge of cell islets and in areas that aresparsely populated, they also have higher amounts of GM1 on thecell surface (Fig. 1D), the receptor for SV4011,12. In addition, wepreviously found that focal adhesion kinase (FAK or PTK2) isrequired for SV40 infection8, and regulatory links between the micro-environment of cells and FAK24, and between FAK and sphingolipidmembrane domains, have been reported26,27.

To test whether FAK and GM1 can be placed in a causal networkthat determines the heterogeneity pattern of SV40 infection, weapplied Bayesian network learning5 (Supplementary Methods) on

single-cell measurements of SV40 infection, amount of GM1 onthe cell surface, level of Y397-phosphorylated (activated) FAK andpopulation-determined phenotypic properties (Fig. 3a and Sup-plementary Fig. 5). We chose to use MCF10A cells to point out therelevance of these phenomena also for non-cancer-derived, normaldiploid cells. This identified a unique causal network that combinessingle-cell microenvironmental parameters with molecular com-ponents and virus infection (Fig. 3b). We found that cell density iscoupled to the regulation of cell size through GM1 cell surfacecontent, and FAK activation. The last two are also determinants ofSV40 infection, in addition to as yet unidentified factors that areregulated by cell density. We validated three causal interactions inthe Bayesian network. Exogenous addition of GM1 to MCF10A cellsincreased the levels of phosphorylated FAK and increased SV40 infec-tion. Additionally, knockdown of FAK by RNA interference stronglydecreased SV40 infection (Supplementary Fig. 7). Thus, by applyingquantitative data-driven modelling on a combination of micro-environmental, cellular and molecular parameters, we reveal mech-anistic insights into the establishment of cellular heterogeneity.

We next wondered why the heterogeneity of SV40 infection ismore deterministic (74% explained) than the GM1 levels and FAKactivity on which it depends (respectively 30% and 65% explained)(Fig. 3b). Interestingly, when we validated the causal interaction

a

c d

b

0

5

10

15

0

5

10

15

Variance Log10 CME (a.u.)

0

1

50 100 150 200 250

Sam

e co

ntex

tR

and

omiz

ed0

100

20

40

60

80

Dengue virus

SV40Rotavirus

CME

GM1

Var

ianc

e ex

pla

ined

by

pop

ulat

ion

cont

ext

(%)

Den

gue

viru

s-in

fect

ed c

ells

(%)

Nor

mal

ized

cou

nt

Nor

mal

ized

cou

nt

Cor

rect

ed d

engu

e vi

rus

infe

ctio

n(a

.u.) P = 0.97

P = 2.4 × 10–6

Sm

all p

opul

atio

ns(7

,295

± 1

,434

cel

ls)

Larg

e p

opul

atio

ns(1

4,91

8 ±

1,2

39 c

ells

)

After correctionBefore correction

10 30 500

1

0

1P < 10–10

Large population (14,995 cells)Small population (184 cells)

Log10 CME (a.u.)

Before correction

–20 0 20

P = 0.75

Corrected Log10 CME (a.u.)

After correction

0 6 12

0.2

0.25

0.3

0.35

0.4

Overall coefficient of variation

Local cell density (cells per 2 × 103 μm2)C

oeffi

cien

t of

var

iatio

n

20

25

30

35

40

2,600 cells

100 cells

Log10 CME (a.u.)

Figure 2 | Most variation observed in cellular activities is populationdetermined and must be accounted for. a, Population-determined variance(percentage of total) in the levels of virus infection, CME and GM1 content(Supplementary Methods). Variance of CME in groups of cells in similarpopulation contexts (insert; green) compared with randomized groups ofcells (blue). b, Coefficient of variation (s.d./mean) of CME in groups of cellswith similar population contexts plotted against local cell density(r2 5 20.57, P , 10210). Number of cells per group (dot size) and averageactivity level (colour) are indicated (see legend). c, Significantly differentdengue virus infection (P 5 2.4 3 1026) between unperturbed populationsof cells with different population properties (n 5 9). Model correctionabolishes this difference (P 5 0.97). d, Significantly different single-cell CMEactivities (P , 10210) between unperturbed populations of cells withdifferent population properties. Model correction abolishes this difference(P 5 0.75).

b c

Local celldensity

GM1

Edgecells

Population size

pFAK

SV40infection

(65% explained)

(74% explained)

(30% explained)

Cellsize

–0.21*

–0.49*

0.53*

0.87*

0.75*0.58*

0.45*

–0.30*

* Pairwise correlation coefficient

2 4 6 85

10

15

20

00

1,000

2,000

3,000

Exo

geno

us G

M1-

ind

uced

FA

K a

ctiv

atio

n (%

)

Local cell density (cells per 1.1 × 104 μm2)

Tota

l num

ber

of c

ells

aHigh GM1; SV40 infection

Low GM1; No SV40 infection

GM1GM1 SV40SV40DAPIDAPI pFAKpFAKDAPIDAPI

Figure 3 | SV40 infection and cell size are co-regulated by GM1 and activeFAK, depending on the local cell density. a, SV40 infection (red), GM1content (green) and Y397-phosphorylated FAK (enlargements; yellow) inMCF10A cells. Scale bar, 150mm. b, Bootstrapped Bayesian networklearning (Supplementary Fig. 6 and Supplementary Methods) reveals causallinks between population-context parameters, GM1 content, pFAK levelsand SV40 infection. Edges are drawn according to the sign of pairwise single-cell correlations. Variation in levels of GM1, pFAK and SV40 infectionexplained by population context are respectively 30%, 65% and 74%.c, Scatter plot of exogenous GM1-induced increase in pFAK (percentage ofnon-induced) against local cell density. Linear fit (red) and total number ofcells for each measurement (right axis, grey) are shown.

LETTERS NATURE | Vol 461 | 24 September 2009

522 Macmillan Publishers Limited. All rights reserved©2009

Page 4: Vol 461 24 September 2009 LETTERS - Pelkmanslab · LETTERS Population context determines cell-to-cell variability in endocytosis and virus infection Berend Snijder1,2, Raphael Sacher1,2,

between GM1 and FAK by exogenous addition of GM1, we observeda 2.5-fold stronger activation of FAK in sparse cells compared withdense cells (Fig. 3c). This indicates that GM1 and FAK synergize insparse cells, leading to the more deterministic downstream activity ofSV40 infection. Such synergistic mechanisms can partly cancel outthe intrinsic and uncorrelated noise of individual components,increasing the deterministic nature of complex activities at the cel-lular level2,28.

We have shown that much of the variation in virus infection,endocytosis and membrane lipid composition is determined by theadaptation of cells to their population context, and uncovered basicpredictive principles by which non-differentiated cells create com-plex patterns of activity. Similar mechanisms may determine theheterogeneity of other cellular processes. Perturbation screens, com-bined with quantitative modelling of single cells in their populationcontext, will further reveal the molecular networks that regulate het-erogeneity patterns in cell populations. The principles described heremost likely operate in all systems of collective cellular behaviour,from prokaryotic colonies to multicellular organisms.

METHODS SUMMARYAll human cell lines were maintained under standard tissue culture conditions. All

assays were performed in 96-well plates, and cells were imaged on automated

widefield cellWoRx microscopes (Applied Precision) or ImageXpress Micro

microscopes (Molecular Devices) with 310 or 320 magnification. Infection

assays were performed as described (Supplementary Information). CME was

measured using Alexa Fluor 488-conjugated transferrin, and GM1 levels were

visualized using cholera toxin B conjugated to Alexa Fluor 568 as described8.

Phosphorylated-FAK [Y397] was visualized by using standard indirect immuno-

fluorescence protocols. High-content single-cell image analysis was

performed using CellProfiler29 and additional image analysis algorithms written

specifically for this study (Supplementary Information). In short, cells were

identified by object detection on DAPI images, and these regions were typicallyexpanded to cover the cytoplasm. Shape features and features describing the

texture and intensities in all channels were extracted for these regions of interest.

Support vector machine learning was applied (P. Ramo, R. Sacher, B. Snijder,

B. Begemann and L. Pelkmans, submitted) for the classification of diverse cellular

phenotypes, including interphase, mitotic and apoptotic cellular states, as well as

infectious phenotypes and technical artefacts. Globally, these data were further

analysed using several statistical methods. First, graphical Gaussian modelling

quantified how a population shapes the property distributions of individual cells

and how individual properties influence each other, which was experimentally

validated by live cell imaging. Second, probit regression modelling revealed how

virus infection depends on population-determined properties of individual cells,

as did weighted linear regression for GM1 content and CME. Finally, bootstrapped

Bayesian network learning was applied to identify causal interactions between

properties of the microenvironment, FAK-phosphorylation, GM1 content and

SV40 infection in MCF10A cells. Selected inferred causal links were experimentally

validated by using GM1 addback and short interfering RNA (siRNA)-mediated

silencing of FAK as described8. A detailed description of the methods performed in

this study is provided in the Supplementary Information.

Received 20 March; accepted 10 July 2009.Published online 26 August 2009.

1. Elowitz, M. B., Levine, A. J., Siggia, E. D. & Swain, P. S. Stochastic gene expressionin a single cell. Science 297, 1183–1186 (2002).

2. Maheshri, N. & O’Shea, E. K. Living with noisy genes: how cells function reliablywith inherent variability in gene expression. Annu. Rev. Biophys. Biomol. Struct. 36,413–434 (2007).

3. Raj, A. & van Oudenaarden, A. Nature, nurture, or chance: stochastic geneexpression and its consequences. Cell 135, 216–226 (2008).

4. Sigal, A. et al. Variability and memory of protein levels in human cells. Nature 444,643–646 (2006).

5. Sachs, K., Perez, O., Pe’er, D., Lauffenburger, D. A. & Nolan, G. P. Causal protein-signaling networks derived from multiparameter single-cell data. Science 308,523–529 (2005).

6. Keren, K. et al. Mechanism of shape determination in motile cells. Nature 453,475–480 (2008).

7. Liberali, P., Ramo, P. & Pelkmans, L. Protein kinases: starting a molecular systemsview of endocytosis. Annu. Rev. Cell Dev. Biol. 24, 501–523 (2008).

8. Pelkmans, L. et al. Genome-wide analysis of human kinases in clathrin- andcaveolae/raft-mediated endocytosis. Nature 436, 78–86 (2005).

9. Wilkinson, D. J. Stochastic modelling for quantitative description ofheterogeneous biological systems. Nature Rev. Genet. 10, 122–133 (2009).

10. Eifart, P. et al. Role of endocytosis and low pH in murine hepatitis virus strain A59cell entry. J. Virol. 81, 10758–10768 (2007).

11. Neu, U., Woellner, K., Gauglitz, G. & Stehle, T. Structural basis of GM1 gangliosiderecognition by simian virus 40. Proc. Natl Acad. Sci. USA 105, 5219–5224 (2008).

12. Pelkmans, L. Secrets of caveolae- and lipid raft-mediated endocytosis revealed bymammalian viruses. Biochim. Biophys. Acta 1746, 295–304 (2005).

13. Conner, S. D. & Schmid, S. L. Regulated portals of entry into the cell. Nature 422,37–44 (2003).

14. Mayor, S. & Pagano, R. E. Pathways of clathrin-independent endocytosis. NatureRev. Mol. Cell Biol. 8, 603–612 (2007).

15. Holmgren, J., Lonnroth, I. & Svennerholm, L. Tissue receptor for cholera exotoxin:postulated structure from studies with GM1 ganglioside and related glycolipids.Infect. Immun. 8, 208–214 (1973).

16. Sacher, R., Stergiou, L. & Pelkmans, L. Lessons from genetics: interpreting complexphenotypes in RNAi screens. Curr. Opin. Cell Biol. 20, 483–489 (2008).

17. Eagle, H. & Levine, E. M. Growth regulatory effects of cellular interaction. Nature213, 1102–1106 (1967).

18. Castor, L. N. Flattening, movement and control of division of epithelial-like cells.J. Cell. Physiol. 75, 57–64 (1970).

19. Slack, M. D., Martinez, E. D., Wu, L. F. & Altschuler, S. J. Characterizingheterogeneous cellular responses to perturbations. Proc. Natl Acad. Sci. USA 105,19306–19311 (2008).

20. Nachman, I., Regev, A. & Ramanathan, S. Dissecting timing variability in yeastmeiosis. Cell 131, 544–556 (2007).

21. St-Pierre, F. & Endy, D. Determination of cell fate selection during phage lambdainfection. Proc. Natl Acad. Sci. USA 105, 20705–20710 (2008).

22. Ben-Jacob, E., Cohen, I. & Gutnick, D. L. Cooperative organization of bacterialcolonies: from genotype to morphotype. Annu. Rev. Microbiol. 52, 779–806(1998).

23. Lopez, S. & Arias, C. F. Multistep entry of rotavirus into cells: a Versaillesquedance. Trends Microbiol. 12, 271–278 (2004).

24. Geiger, B., Spatz, J. P. & Bershadsky, A. D. Environmental sensing through focaladhesions. Nature Rev. Mol. Cell Biol. 10, 21–33 (2009).

25. Neumann, A. K., Thompson, N. L. & Jacobson, K. Distribution and lateral mobilityof DC-SIGN on immature dendritic cells–implications for pathogen uptake. J. CellSci. 121, 634–643 (2008).

26. Iwabuchi, K. et al. Reconstitution of membranes simulating ‘‘glycosignalingdomain’’ and their susceptibility to Lyso-GM3. J. Biol. Chem. 275, 15174–15181(2000).

27. Palazzo, A. F., Eng, C. H., Schlaepfer, D. D., Marcantonio, E. E. & Gundersen, G. G.Localized stabilization of microtubules by integrin- and FAK-facilitated Rhosignaling. Science 303, 836–839 (2004).

28. Newman, J. R. & Weissman, J. S. Systems biology: many things from one. Nature444, 561–562 (2006).

29. Carpenter, A. E. et al. CellProfiler: image analysis software for identifying andquantifying cell phenotypes. Genome Biol. 7, R100 (2006).

Supplementary Information is linked to the online version of the paper atwww.nature.com/nature.

Acknowledgements We acknowledge H. Verheije and L. Burleigh for providingimages of MHV and dengue virus infection, G. Jurisic for providing primary cellsand help with experiments, and all members of the laboratory for comments on themanuscript. P.R. is supported by the European Molecular Biology Organisation andthe Human Frontiers Science Program, E.-M.D. by Oncosuisse and P.L. by theFederation of European Biochemical Societies. L.P. is supported by the ETH Zurich,SystemsX.ch, the Swiss National Science Foundation and the European Union.

Author Contributions L.P. supervised and conceived the project. R.S., B.S., E.-M.D.and P.L. performed experiments, B.S. and P.R. developed computational imageanalysis methods, B.S. performed all computational image analysis, B.S. and P.R.conceived the statistical analysis methods, B.S. performed all statistical analysis,L.P. and B.S. wrote the manuscript.

Author Information Reprints and permissions information is available atwww.nature.com/reprints. Correspondence and requests for materials should beaddressed to L.P. ([email protected]).

NATURE | Vol 461 | 24 September 2009 LETTERS

523 Macmillan Publishers Limited. All rights reserved©2009