Abstract:
Currently various microarrays platforms containing nucleotides, proteins, peptides, glycans and other molecules are used in biomedical research. Number and density of immobilized molecules on microarrays are constantly increasing. Microarray data handling requires optimization of methods for their analysis. Peptide microarrays data analysis has certain characteristics that require non-conventional statistical methods. In this paper we present the results of antibody repertoire analysis in breast cancer patients sera utilizing microchips containing 330,000 peptides. We investigated methods for space dimension reduction such as projective methods and methods for selection of informative features. We have shown that method of projection to latent structures can detect an effective data dimension, reduce overfitting of the model and increase the quality of object recognition. Accuracy of the experimental results was assessed with the ROC-curve; the best quality was achieved with three latent structures without normalization and reduction of total numbers of peptides.
Key words:
microarrays, peptides, normalization, latent variables, clustering, ROC, method of projection to latent structures.
Citation:
D. S. Anisimov, S. V. Podlesnykh, E. A. Kolosova, D. N. Shcherbakov, V. D. Petrova, S. S. Johnston, A. F. Lazarev, N. M. Oskorbin, A. I. Shapoval, M. A. Ryazanov, “Projection to latent structures as a strategy for peptides microarray data analysis”, Mat. Biolog. Bioinform., 12:2 (2017), 435–445