1
LETTER No evidence for genetic assortative mating beyond that due to population stratification Domingue et al. (1) use genome-wide SNPs to show in non-Hispanic US whites that spouses are genetically more similar than random pairs of individuals. We argue that, although this reported result is descriptively true, the spousal genetic similarity can be explained by assortment on shared ancestry (i.e., population stratification) and thus does not reflect genetic assortative mating as inter- preted by Dominigue et al. This greatly affects the implications of the findings for un- derstanding assortative mating in humans. Genetic population stratification is a major driver of genetic spouse similarities as a con- sequence of ethnic or social homogamy and/or geographic proximity. Social homogamy oc- curs when individuals from similar social, ethnic, or demographic backgrounds are more likely to mate. Residential proximity is a strong predictor of who marries whom (2), and European ancestry within the United States is not independent from geographic location or social group (3). People of European ancestry show systematic genetic differences depending on where in Europe their ancestors originate from, as cataloged in the 1000 Genomes and HapMap projects (www.1000genomes.org). Excluding non-Caucasian individuals is not sufficient to account for population stratifi- cation (especially when only based on self- reports). The standard method to deal with (subtle) population stratification is to con- trol for the major dimensions of genetic variation [principle components (PCs)] (4). When Domingue et al. use this established method, the slight genetic similarity between mates disappeared (their SI Text and table S1). However, this result is not reported in the main textthe authors instead reported results of three alternative controls that did not completely eliminate spousal genetic similarity. These methods are nonstandard and, we argue, inadequate. The first method restricted analyses to individuals with less variability on the genetic PCs; this method will reduce but not eliminate heterogeneity in ancestry. Even within single European populations from geographically small regions, PCs reflecting the relatively small ancestry differences show significant spouse correlations (5). The second method was to control for geographical region of birth as a proxy for ethnicity. However, the census division used does not correspond geographically with the clustering of different European or global ancestries in the United States (3). The third method was to restrict analyses to SNPs the authors claim show very little evidence of population stratification. How- ever, using PCs obtained from the European populations in the 1000 Genomes project, Fig. 1 shows that that even the most conser- vatively restricted set of SNPs captures European ancestry differences just as well as the rest of the SNPs (both PCs showed cor- relations of >0.9 between the two SNP sets). Last, Domingue et al. replicate their find- ing in a geographically/genetically homoge- neouspopulation, without accounting for population stratification. Using a large sub- set of the same sample (n = 515 spouse pairs), we found that significant spousal ge- netic similarity is eliminated after correcting for two ancestry-informative PCs projected from the HapMap-3 dataset; the empirical P value changed from 0.001 to 0.343. In conclusion, we believe that Domingue et al. did not provide sufficient evidence for spousal genetic similarity beyond that due to population stratification. Abdel Abdellaoui a,1 , Karin J. H. Verweij a,b,1 , and Brendan P. Zietsch b,c,2 a Department of Biological Psychology, Vrije Universiteit Amsterdam, 1081 BT, Amsterdam, The Netherlands; b School of Psychology, University of Queensland, St. Lucia, Brisbane, QLD 4029, Australia; and c Genetic Epidemiology Laboratory, QIMR Berghofer Medical Research Institute, Herston, Brisbane, QLD 4006, Australia 1 Domingue BW, Fletcher J, Conley D, Boardman JD (2014) Genetic and educational assortative mating among US adults. Proc Natl Acad Sci USA 111(22):79968000. 2 Bossard JH (1932) Residential propinquity as a factor in marriage selection. Am J Sociol 38(2):219224. 3 Brittingham A, De La Cruz GP (2004) Ancestry: 2000 (US Department of Commerce, Economics and Statistics Administration, US Census Bureau, Washington, DC). 4 Price AL, et al. (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38(8): 904909. 5 Abdellaoui A, et al. (2013) Population structure, migration, and diversifying selection in the Netherlands. Eur J Hum Genet 21(11): 12771285. Author contributions: A.A., K.J.H.V., and B.P.Z. designed research; A.A. analyzed data; and A.A., K.J.H.V., and B.P.Z. wrotethe paper. The authors declare no conflict of interest. 1 A.A. and K.J.H.V. contributed equally to this work. 2 To whom correspondence should be addressed. Email: zietsch@ psy.uq.edu.au. 0.2 0.1 0.0 0.1 0.05 0.00 0.05 0.10 PC 2 PC 1 0.10 0.05 0.00 0.05 0.05 0.00 0.05 0.10 PC 2 PC 1 Iberian populations in Spain Toscan individuals CEPH individuals British individuals from England and Scotland HapMap Finnish individuals from Finland Fig. 1. The first two ancestry-informative PCs from the PC analysis (PCA) conducted on the European populations from the 1000 Genomes project using Eigenstrat (4). (Left) PCs from a PCA on 1,185,924 of the 1,250,013 ancestry SNPs that correlated significantly (α = 0.05) with any of the first five PCs from Domingue et al. (Right) PCs from the PCA on 433,450 of 457,201 SNPs that were not individually significantly associated with any of the first five PCs, showing very little evidence of population stratification according to Domingue et al. PC1 correlates 0.97 between the two SNP sets, and PC2 correlates 0.92. www.pnas.org/cgi/doi/10.1073/pnas.1410781111 PNAS | October 7, 2014 | vol. 111 | no. 40 | E4137 LETTER

No evidence for genetic assortative mating beyond that due to population stratification

Embed Size (px)

Citation preview

LETTER

No evidence for genetic assortative matingbeyond that due to population stratificationDomingue et al. (1) use genome-wide SNPsto show in non-Hispanic US whites thatspouses are genetically more similar thanrandom pairs of individuals. We argue that,although this reported result is descriptivelytrue, the spousal genetic similarity can beexplained by assortment on shared ancestry(i.e., population stratification) and thus doesnot reflect genetic assortative mating as inter-preted by Dominigue et al. This greatlyaffects the implications of the findings for un-derstanding assortative mating in humans.Genetic population stratification is a major

driver of genetic spouse similarities as a con-sequence of ethnic or social homogamy and/orgeographic proximity. Social homogamy oc-curs when individuals from similar social,ethnic, or demographic backgrounds aremore likely to mate. Residential proximityis a strong predictor of who marries whom(2), and European ancestry within the UnitedStates is not independent from geographiclocation or social group (3). People ofEuropean ancestry show systematic geneticdifferences depending on where in Europetheir ancestors originate from, as catalogedin the 1000 Genomes and HapMap projects(www.1000genomes.org).Excluding non-Caucasian individuals is not

sufficient to account for population stratifi-cation (especially when only based on self-reports). The standard method to deal with

(subtle) population stratification is to con-trol for the major dimensions of geneticvariation [principle components (PCs)] (4).When Domingue et al. use this establishedmethod, the slight genetic similarity betweenmates disappeared (their SI Text and tableS1). However, this result is not reported inthe main text—the authors instead reportedresults of three alternative controls that didnot completely eliminate spousal geneticsimilarity. These methods are nonstandardand, we argue, inadequate.The first method restricted analyses to

individuals with less variability on the geneticPCs; this method will reduce but not eliminateheterogeneity in ancestry. Even within singleEuropean populations from geographicallysmall regions, PCs reflecting the relativelysmall ancestry differences show significantspouse correlations (5).The second method was to control for

geographical region of birth as a proxy forethnicity. However, the census division useddoes not correspond geographically with theclustering of different European or globalancestries in the United States (3).The third method was to restrict analyses

to SNPs the authors claim show very littleevidence of population stratification. How-ever, using PCs obtained from the Europeanpopulations in the 1000 Genomes project,Fig. 1 shows that that even the most conser-

vatively restricted set of SNPs capturesEuropean ancestry differences just as well asthe rest of the SNPs (both PCs showed cor-relations of >0.9 between the two SNP sets).Last, Domingue et al. replicate their find-

ing in a “geographically/genetically homoge-neous” population, without accounting forpopulation stratification. Using a large sub-set of the same sample (n = 515 spousepairs), we found that significant spousal ge-netic similarity is eliminated after correctingfor two ancestry-informative PCs projectedfrom the HapMap-3 dataset; the empiricalP value changed from 0.001 to 0.343.In conclusion, we believe that Domingue

et al. did not provide sufficient evidence forspousal genetic similarity beyond that due topopulation stratification.

Abdel Abdellaouia,1, Karin J. H. Verweija,b,1,and Brendan P. Zietschb,c,2aDepartment of Biological Psychology, VrijeUniversiteit Amsterdam, 1081 BT,Amsterdam, The Netherlands; bSchool ofPsychology, University of Queensland, St.Lucia, Brisbane, QLD 4029, Australia; andcGenetic Epidemiology Laboratory, QIMRBerghofer Medical Research Institute, Herston,Brisbane, QLD 4006, Australia

1 Domingue BW, Fletcher J, Conley D, Boardman JD (2014) Genetic

and educational assortative mating among US adults. Proc Natl Acad

Sci USA 111(22):7996–8000.2 Bossard JH (1932) Residential propinquity as a factor in marriage

selection. Am J Sociol 38(2):219–224.3 Brittingham A, De La Cruz GP (2004) Ancestry: 2000 (US

Department of Commerce, Economics and Statistics Administration,

US Census Bureau, Washington, DC).4 Price AL, et al. (2006) Principal components analysis corrects for

stratification in genome-wide association studies. Nat Genet 38(8):

904–909.5 Abdellaoui A, et al. (2013) Population structure, migration, and

diversifying selection in the Netherlands. Eur J Hum Genet 21(11):

1277–1285.

Author contributions: A.A., K.J.H.V., and B.P.Z. designed research;

A.A. analyzed data; and A.A., K.J.H.V., and B.P.Z. wrote the paper.

The authors declare no conflict of interest.

1A.A. and K.J.H.V. contributed equally to this work.

2To whom correspondence should be addressed. Email: [email protected].

−0.2 −0.1 0.0 0.1

−0.0

50.

000.

050.

10

PC 2

PC

1

−0.10 −0.05 0.00 0.05

−0.0

50.

000.

050.

10

PC 2

PC

1

Iberian populations in SpainToscan individualsCEPH individualsBritish individuals from England and ScotlandHapMap Finnish individuals from Finland

Fig. 1. The first two ancestry-informative PCs from the PC analysis (PCA) conducted on the European populationsfrom the 1000 Genomes project using Eigenstrat (4). (Left) PCs from a PCA on 1,185,924 of the 1,250,013 ancestrySNPs that correlated significantly (α = 0.05) with any of the first five PCs from Domingue et al. (Right) PCs from thePCA on 433,450 of 457,201 SNPs that were not individually significantly associated with any of the first five PCs,showing very little evidence of population stratification according to Domingue et al. PC1 correlates 0.97 between thetwo SNP sets, and PC2 correlates 0.92.

www.pnas.org/cgi/doi/10.1073/pnas.1410781111 PNAS | October 7, 2014 | vol. 111 | no. 40 | E4137

LETT

ER