|
Modification of the Method for Calculating Polygenic Risks With Variation Graph
O. A. Kondratevaab, E. A. Karpulevitcha a Ivannikov Institute for System Programming of the RAS
b Lomonosov Moscow State University
Abstract:
Representation of the DNA sequence is possible in various ways. The variation graph is one of the most accurate methods that allows you to work with atypical areas and take into account all their diversity. Based on this data structure and the polygenic risk assessment method, a DNA interpretation system was built. As a result, a correlation coefficient was obtained between the path in the column responsible for a specific DNA sequence and the feature. We then compared it with a coefficient obtained by a similar method but using sequence representation using a reference genome. Such a comparison helped to evaluate the effectiveness of the representation in the form of a graph. After that, a modified method for calculating the polygenic score on the alignment data of the vg tool was built, which was also compared with existing methods. The modified method showed an improvement in the prediction of the trait.
Keywords:
graph, genome representation, variation graph, HISAT2, vg, minimap2, GGP, genomic graph pipeline, PRS, polygenic score, polygenic risk score
Citation:
O. A. Kondrateva, E. A. Karpulevitch, “Modification of the Method for Calculating Polygenic Risks With Variation Graph”, Proceedings of ISP RAS, 34:2 (2022), 191–200
Linking options:
https://www.mathnet.ru/eng/tisp687 https://www.mathnet.ru/eng/tisp/v34/i2/p191
|
Statistics & downloads: |
Abstract page: | 39 | Full-text PDF : | 35 |
|