A saturated map of common genetic variants associated with human height

23andMe Research Team; VA Million Veteran Program; DiscovEHR (DiscovEHR and MyCode Community Health Initiative); eMERGE (Electronic Medical Records and Genomics Network); LifeLines Cohort Study; The PRACTICAL Consortium; Understanding Society Scientific Group

doi:10.1038/s41586-022-05275-y

A saturated map of common genetic variants associated with human height

23andMe Research Team, VA Million Veteran Program, DiscovEHR (DiscovEHR and MyCode Community Health Initiative), eMERGE (Electronic Medical Records and Genomics Network), LifeLines Cohort Study, The PRACTICAL Consortium, Understanding Society Scientific Group

Research output: Contribution to journal › Article › peer-review

133 Scopus citations

Abstract

Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40–50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes¹. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel²) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10–20% (14–24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.

Original language	English (US)
Pages (from-to)	704-712
Number of pages	9
Journal	Nature
Volume	610
Issue number	7933
DOIs	https://doi.org/10.1038/s41586-022-05275-y
State	Published - Oct 27 2022
Externally published	Yes

ASJC Scopus subject areas

General

Access to Document

10.1038/s41586-022-05275-y

Cite this

23andMe Research Team, VA Million Veteran Program, DiscovEHR (DiscovEHR and MyCode Community Health Initiative), eMERGE (Electronic Medical Records and Genomics Network), LifeLines Cohort Study, The PRACTICAL Consortium, & Understanding Society Scientific Group (2022). A saturated map of common genetic variants associated with human height. Nature, 610(7933), 704-712. https://doi.org/10.1038/s41586-022-05275-y

23andMe Research Team, VA Million Veteran Program, DiscovEHR (DiscovEHR and MyCode Community Health Initiative), eMERGE (Electronic Medical Records and Genomics Network), LifeLines Cohort Study, The PRACTICAL Consortium & Understanding Society Scientific Group 2022, 'A saturated map of common genetic variants associated with human height', Nature, vol. 610, no. 7933, pp. 704-712. https://doi.org/10.1038/s41586-022-05275-y

23andMe Research Team, VA Million Veteran Program, DiscovEHR (DiscovEHR and MyCode Community Health Initiative), eMERGE (Electronic Medical Records and Genomics Network), LifeLines Cohort Study, The PRACTICAL Consortium et al. A saturated map of common genetic variants associated with human height. Nature. 2022 Oct 27;610(7933):704-712. doi: 10.1038/s41586-022-05275-y

@article{210288b6bce64feb932f7da018c8d112,

title = "A saturated map of common genetic variants associated with human height",

abstract = "Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40–50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10–20% (14–24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.",

author = "{23andMe Research Team} and {VA Million Veteran Program} and {DiscovEHR (DiscovEHR and MyCode Community Health Initiative)} and {eMERGE (Electronic Medical Records and Genomics Network)} and {LifeLines Cohort Study} and {The PRACTICAL Consortium} and {Understanding Society Scientific Group} and Lo{\"i}c Yengo and Sailaja Vedantam and Eirini Marouli and Julia Sidorenko and Eric Bartell and Saori Sakaue and Marielisa Graff and Eliasen, {Anders U.} and Yunxuan Jiang and Sridharan Raghavan and Jenkai Miao and Arias, {Joshua D.} and Graham, {Sarah E.} and Mukamel, {Ronen E.} and Spracklen, {Cassandra N.} and Xianyong Yin and Chen, {Shyh Huei} and Teresa Ferreira and Highland, {Heather H.} and Yingjie Ji and Tugce Karaderi and Kuang Lin and Kreete L{\"u}ll and Malden, {Deborah E.} and Carolina Medina-Gomez and Moara Machado and Amy Moore and Sina R{\"u}eger and Xueling Sim and Scott Vrieze and Ahluwalia, {Tarunveer S.} and Masato Akiyama and Allison, {Matthew A.} and Marcus Alvarez and Andersen, {Mette K.} and Alireza Ani and Vivek Appadurai and Liubov Arbeeva and Seema Bhaskar and Bielak, {Lawrence F.} and Sailalitha Bollepalli and Bonnycastle, {Lori L.} and Jette Bork-Jensen and Bradfield, {Jonathan P.} and Yuki Bradford and Braund, {Peter S.} and Brody, {Jennifer A.} and Burgdorf, {Kristoffer S.} and Isasi, {Carmen R.} and Qibin Qi",

note = "Publisher Copyright: {\textcopyright} 2022, The Author(s).",

year = "2022",

month = oct,

day = "27",

doi = "10.1038/s41586-022-05275-y",

language = "English (US)",

volume = "610",

pages = "704--712",

journal = "Nature",

issn = "0028-0836",

publisher = "Nature Publishing Group",

number = "7933",

}

TY - JOUR

T1 - A saturated map of common genetic variants associated with human height

AU - 23andMe Research Team

AU - VA Million Veteran Program

AU - DiscovEHR (DiscovEHR and MyCode Community Health Initiative)

AU - eMERGE (Electronic Medical Records and Genomics Network)

AU - LifeLines Cohort Study

AU - The PRACTICAL Consortium

AU - Understanding Society Scientific Group

AU - Yengo, Loïc

AU - Vedantam, Sailaja

AU - Marouli, Eirini

AU - Sidorenko, Julia

AU - Bartell, Eric

AU - Sakaue, Saori

AU - Graff, Marielisa

AU - Eliasen, Anders U.

AU - Jiang, Yunxuan

AU - Raghavan, Sridharan

AU - Miao, Jenkai

AU - Arias, Joshua D.

AU - Graham, Sarah E.

AU - Mukamel, Ronen E.

AU - Spracklen, Cassandra N.

AU - Yin, Xianyong

AU - Chen, Shyh Huei

AU - Ferreira, Teresa

AU - Highland, Heather H.

AU - Ji, Yingjie

AU - Karaderi, Tugce

AU - Lin, Kuang

AU - Lüll, Kreete

AU - Malden, Deborah E.

AU - Medina-Gomez, Carolina

AU - Machado, Moara

AU - Moore, Amy

AU - Rüeger, Sina

AU - Sim, Xueling

AU - Vrieze, Scott

AU - Ahluwalia, Tarunveer S.

AU - Akiyama, Masato

AU - Allison, Matthew A.

AU - Alvarez, Marcus

AU - Andersen, Mette K.

AU - Ani, Alireza

AU - Appadurai, Vivek

AU - Arbeeva, Liubov

AU - Bhaskar, Seema

AU - Bielak, Lawrence F.

AU - Bollepalli, Sailalitha

AU - Bonnycastle, Lori L.

AU - Bork-Jensen, Jette

AU - Bradfield, Jonathan P.

AU - Bradford, Yuki

AU - Braund, Peter S.

AU - Brody, Jennifer A.

AU - Burgdorf, Kristoffer S.

AU - Isasi, Carmen R.

AU - Qi, Qibin

PY - 2022/10/27

Y1 - 2022/10/27

N2 - Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40–50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10–20% (14–24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.

AB - Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40–50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10–20% (14–24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.

UR - http://www.scopus.com/inward/record.url?scp=85139748621&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85139748621&partnerID=8YFLogxK

U2 - 10.1038/s41586-022-05275-y

DO - 10.1038/s41586-022-05275-y

M3 - Article

C2 - 36224396

AN - SCOPUS:85139748621

SN - 0028-0836

VL - 610

SP - 704

EP - 712

JO - Nature

JF - Nature

IS - 7933

ER -

A saturated map of common genetic variants associated with human height

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this