|
Data mining
The method for analysis of expression data homogeneity based on the Student test
R. O. Aliev, N. M. Borisov NRC «Kurchatov Institute»
Abstract:
As early as in 2002, the need was declared for a public repository of experimental results for gene expression profiling. Since that time, several storage hubs for gene expression profiling data have been created, to enable profile analysis and comparison. This gene expression profiling may usually be performed using either mRNA microarray hybridization ornext-generation sequencing. However, all these big data may be heterogeneous, even if they were obtained for the same type of normal or pathologically altered organs and tissues, and have been investigated using the same experimental platform. In the current work, we have proposed a new method for analyzing the homogeneity of expression data based on the Student test. Using computational experiments, we have shown the advantage of our method in terms of computational speed for large datasets, and developed an approach to interpreting the results for the Student test application. Using a new method of data analysis, we have suggested a scheme for visualization of the overall picture of gene expression and comparison of expression profiles at different diseases and/or different stages of the same disease.
Key words:
gene expression, mRNA profiling, microraay hybridization, next-generation sequencing, transcriptome, big data, public data repositories, Student test, clustering.
Received 05.12.2017, Published 06.04.2018
Citation:
R. O. Aliev, N. M. Borisov, “The method for analysis of expression data homogeneity based on the Student test”, Mat. Biolog. Bioinform., 13:1 (2018), 50–67
Linking options:
https://www.mathnet.ru/eng/mbb327 https://www.mathnet.ru/eng/mbb/v13/i1/p50
|
Statistics & downloads: |
Abstract page: | 195 | Full-text PDF : | 57 | References: | 20 |
|