Heritability, genetic variation, and the number of risk SNPs effect on deep learning and polygenic risk scores AUC

Muhammad Muneeb, Samuel Feng, Andreas Henschel

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

1 Scopus citation

Abstract

Genotype-phenotype classification can be performed with several methods, including polygenic risk scores (PRS) and deep learning, each relying on a different computational technique. The performance of each method, measured by accuracy or area under the curve (AUC), varies with the genetic variation in the data. This article investigates how the performance of deep learning classifiers and polygenic risk scores for genotype-phenotype classification changes with heritability, genetic variation, and the number of risk SNPs, through extensive computation on 400 different datasets of 5000 people. Studying these variations helps to identify an optimal classifier for a dataset with a specific heritability and an expected score for a specific case/control classification. The AUC of the deep learning classifier decreases as heritability increases, whereas the AUC of polygenic risk scores improves. The machine-learning algorithm has low AUC for high genetic variation and high AUC for low genetic variation. PRS tools show the opposite behavior: their AUC is higher on datasets with high genetic variation than on datasets with low genetic variation. The article gives a basic template showing whether deep learning or PRS tools should be used, depending on the heritability and genetic variation of the dataset. All code segments are publicly available to generate datasets with different parameters and explore such patterns.
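To make the quantities in the abstract concrete, the following is a minimal sketch of how a polygenic risk score and its AUC could be computed on a simulated case/control dataset. It is not the authors' pipeline. The function names, the liability-threshold simulation, the 50% case rate, the minor allele frequency, and the SNP counts are all illustrative assumptions. The score also uses the simulated true effect sizes as weights, which is an oracle simplification; real PRS tools estimate weights from association statistics.

```python
import numpy as np


def simulate_dataset(n_people=5000, n_snps=200, n_risk=20,
                     heritability=0.5, maf=0.3, seed=0):
    """Simulate genotypes and case/control labels (hypothetical helper).

    Assumes a simple liability model:
    liability = sqrt(h2) * genetic + sqrt(1 - h2) * noise,
    with cases defined as the upper half of the liability distribution.
    """
    rng = np.random.default_rng(seed)
    # Minor-allele counts (0, 1, or 2) per person and SNP.
    genotypes = rng.binomial(2, maf, size=(n_people, n_snps)).astype(float)
    # Only the first n_risk SNPs carry a non-zero effect.
    effects = np.zeros(n_snps)
    effects[:n_risk] = rng.normal(size=n_risk)
    genetic = genotypes @ effects
    genetic = (genetic - genetic.mean()) / genetic.std()
    noise = rng.normal(size=n_people)
    liability = (np.sqrt(heritability) * genetic
                 + np.sqrt(1.0 - heritability) * noise)
    labels = (liability > np.median(liability)).astype(int)
    return genotypes, effects, labels


def polygenic_risk_score(genotypes, weights):
    """Weighted sum of allele counts per person."""
    return genotypes @ weights


def auc(scores, labels):
    """AUC from the Mann-Whitney U statistic, using average ranks for ties."""
    scores = np.asarray(scores, dtype=float).ravel()
    positive = np.asarray(labels).ravel().astype(bool)
    _, inverse, counts = np.unique(scores, return_inverse=True,
                                   return_counts=True)
    cumulative = np.cumsum(counts)
    # Average 1-based rank shared by every member of a tied group.
    ranks = (cumulative - (counts - 1) / 2.0)[inverse]
    n_pos = int(positive.sum())
    n_neg = positive.size - n_pos
    u_stat = ranks[positive].sum() - n_pos * (n_pos + 1) / 2.0
    return u_stat / (n_pos * n_neg)


if __name__ == "__main__":
    for h2 in (0.2, 0.5, 0.8):
        g, w, y = simulate_dataset(heritability=h2, seed=1)
        print(f"heritability={h2:.1f}  PRS AUC={auc(polygenic_risk_score(g, w), y):.3f}")
```

Running the loop with different `heritability`, `maf`, or `n_risk` values gives a rough feel for how the PRS AUC responds to each parameter. The sketch covers only the PRS side; a deep learning classifier would need its own training and held-out evaluation on the same simulated data.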

Original language: British English
Title of host publication: ICBBT 2022 - Proceedings of 2022 14th International Conference on Bioinformatics and Biomedical Technology
Pages: 65-71
Number of pages: 7
ISBN (Electronic): 9781450396387
State: Published - 27 May 2022
Event: 14th International Conference on Bioinformatics and Biomedical Technology, ICBBT 2022 - Xi'an, China
Duration: 27 May 2022 to 29 May 2022

Publication series

Name: ACM International Conference Proceeding Series

Conference

Conference: 14th International Conference on Bioinformatics and Biomedical Technology, ICBBT 2022
Country/Territory: China
City: Xi'an
Period: 27/05/22 to 29/05/22

Keywords

  • applied deep learning
  • genetic variation
  • heritability
  • polygenic risk scores
  • risk SNPs
