Abstract
As population structure can result in spurious associations, it has constrained the use of association studies in human and plant genetics. Association mapping, however, holds great promise if true signals of functional association can be separated from the vast number of false signals generated by population structure1,2. We have developed a unified mixed-model approach to account for multiple levels of relatedness simultaneously as detected by random genetic markers. We applied this new approach to two samples: a family-based sample of 14 human families, for quantitative gene expression dissection, and a sample of 277 diverse maize inbred lines with complex familial relationships and population structure, for quantitative trait dissection. Our method demonstrates improved control of both type I and type II error rates over other methods. As this new method crosses the boundary between family-based and structured association samples, it provides a powerful complement to currently available methods for association mapping.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout




Similar content being viewed by others
References
Lander, E.S. & Schork, N.J. Genetic dissection of complex traits. Science 265, 2037–2048 (1994).
Risch, N. & Merikangas, K. The future of genetic studies of complex human diseases. Science 273, 1516–1517 (1996).
Hartl, D.L. & Clark, A.G. Principles of Population Genetics (Sinauer, Sunderland, Massachusetts, 1997).
Hey, J. & Machado, C.A. The study of structured populations-new hope for a difficult and divided science. Nat. Rev. Genet. 4, 535–543 (2003).
Allison, D.B. Transmission-disequilibrium tests for quantitative traits. Am. J. Hum. Genet. 60, 676–690 (1997).
Fulker, D.W., Cherny, S.S., Sham, P.C. & Hewitt, J.K. Combined linkage and association analysis for quantitative traits. Am. J. Hum. Genet. 64, 259–267 (1999).
Abecasis, G.R., Cardon, L.R. & Cookson, W.O.C. A general test for association for quantitative traits in nuclear families. Am. J. Hum. Genet. 66, 279–292 (2000).
Cardon, L.R. A sib-pair regression model of linkage disequilibrium for quantitative traits. Hum. Hered. 50, 350–358 (2000).
Pritchard, J.K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Devlin, B. & Roeder, K. Genomic control for association studies. Biometrics 55, 997–1004 (1999).
Marchini, J., Cardon, L.R., Phillips, M.S. & Donnelly, P. The effects of human population structure on large genetic association studies. Nat. Genet. 36, 512–517 (2004).
Pritchard, J.K. & Donnelly, P. Case-control studies of association in structured or admixed populations. Theor. Popul. Biol. 60, 227–237 (2001).
Thornsberry, J.M. et al. Dwarf8 polymorphisms associate with variation in flowering time. Nat. Genet. 28, 286–289 (2001).
Morley, M. et al. Genetic analysis of genome-wide variation in human gene expression. Nature 430, 743–747 (2004).
Flint-Garcia, S.A. et al. Maize association population: a high resolution platform for QTL dissection. Plant J. 44, 1054–1064 (2005).
Henderson, C.R. Application of Linear Models in Animal Breeding (Univ. of Guelph, Ontario, 1984).
Kennedy, B.W., Quinton, M. & van Arendonk, J.A.M. Estimation of effects of single genes on quantitative trait. J. Anim. Sci. 70, 2000–2012 (1992).
George, A.W., Visscher, P.M. & Haley, C.S. Mapping quantitative trait loci in complex pedigrees: a two-step variance component approach. Genetics 156, 2081–2092 (2000).
Loiselle, B.A. et al. Spatial genetic structure of a tropical understory shrub, Psychotria officinalis (Rubiaceae). Am. J. Bot. 82, 1420–1425 (1995).
Ritland, K. Estimators for pairwise relatedness and individual inbreeding coefficients. Genet. Res. 67, 175–186 (1996).
Hirschhorn, J.N. & Daly, M.J. Genome-wide association studies for common diseases and complex traits. Nat. Rev. Genet. 6, 95–108 (2005).
Almasy, L. & Blangero, J. Multipoint quantitative-trait linkage analysis in general pedigrees. Am. J. Hum. Genet. 62, 1198–1211 (1998).
Wright, S.I. et al. The effects of artificial selection on the maize genome. Science 308, 1310–1314 (2005).
Gardiner, J. et al. Anchoring 9,371 maize expressed sequence tagged unigenes to the bacterial artificial chromosome contig map by two-dimensional overgo hybridization. Plant Physiol. 134, 1317–1326 (2004).
Jurinke, C., van den Boom, D., Cantor, C.R. & Koster, H. The use of MassARRAY technology for high throughput genotyping. Adv. Biochem. Eng. Biotechnol. 77, 57–74 (2002).
Long, A.D. & Langley, C.H. The power of association studies to detect the contribution of candidate genetic loci to variation in complex traits. Genome Res. 9, 720–731 (1999).
Abecasis, G.R. et al. Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat. Genet. 30, 97–101 (2002).
Zhang, W. et al. Impact of population structure, effective bottleneck time, and allele frequency on linkage disequilibrium maps. Proc. Natl. Acad. Sci. USA 101, 18075–18080 (2004).
Hardy, O.J. & Vekemans, X. SPAGeDi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol. Ecol. Notes 2, 618–620 (2002).
SAS Institute. SAS/STAT User's Guide Version 8 (SAS Institute, Cary, North Carolina, 1999).
Acknowledgements
We thank past and present members of the Buckler lab for their assistance in phenotypic data collection. We thank P.J. Bradbury, Z. Zhang, R.L. Quass and E.J. Pollak for their insights and discussion regarding the mixed model. We thank N. Stevens for technical editing of the manuscript. This work was supported by the US National Science Foundation and the USDA-ARS. Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the US Department of Agriculture.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Fig. 1
Model comparison with three additional human gene expression phenotypes. (PDF 85 kb)
Rights and permissions
About this article
Cite this article
Yu, J., Pressoir, G., Briggs, W. et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet 38, 203–208 (2006). https://doi.org/10.1038/ng1702
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/ng1702
This article is cited by
-
Genome-wide association study identifies genetic variants underlying footrot in Portuguese Merino sheep
BMC Genomics (2024)
-
Superior haplotypes of key drought-responsive genes reveal opportunities for the development of climate-resilient rice varieties
Communications Biology (2024)
-
Genome-wide association study and transcriptome analysis reveal natural variation of key genes regulation flowering time in rapeseed
Molecular Breeding (2024)
-
Multi-locus genome-wide association studies reveal the dynamic genetic architecture of flowering time in chrysanthemum
Plant Cell Reports (2024)
-
Identification of superior haplotypes for flowering time in pigeonpea through candidate gene-based association study of a diverse minicore collection
Plant Cell Reports (2024)