Abstract:
The paper presents a new approach for phylotyping that can be potentially used for pure cultures and for mixed bacterial populations. It is based on the use of short unique nucleotide sequences (k-mers) that are present in the genomes of all strains of the same species and are absent in bacterial genomes of other taxonomic groups. We show that the number N of such sequences depends on the percentage bias towards A/T or G/C base pairs, increasing for genomes with approximately equal composition. We found that the largest contribution to the set of primarily unique sequences is given by 16–17-mers, while sigmoidal curves reflecting the dependence of N on the length of k-mers showed the maximum slope increment (ΔN/Δk) for k=17,18. Unique sequences of the length 16–18 bases can therefore be offered as potential markers. Comparing the sets of unique k-mers in the genomes of four Enterobacter strains, we estimated the level of their intraspecies stability and interspecies plasticity. As a result, we suggest discriminatory subsets as stencils for phylotyping, thereby increasing the list of genotyping markers with signatures of the new type.
The study was partially supported by the Russian Foundation for Basic Research (grants №16-04-01570 and №15-07-05783).
Received 25.11.2017, Published 19.12.2017
Document Type:
Article
UDC:
579:252
Language: English
Citation:
V. V. Panyukov, S. S. Kiselev, O. V. Alikina, N. N. Nazipova, O. N. Ozoline, “Short unique sequences in bacterial genomes as strain- and species-specific signatures”, Mat. Biolog. Bioinform., 12:2 (2017), 547–558
\Bibitem{PanKisAli17}
\by V.~V.~Panyukov, S.~S.~Kiselev, O.~V.~Alikina, N.~N.~Nazipova, O.~N.~Ozoline
\paper Short unique sequences in bacterial genomes as strain- and species-specific signatures
\jour Mat. Biolog. Bioinform.
\yr 2017
\vol 12
\issue 2
\pages 547--558
\mathnet{http://mi.mathnet.ru/mbb312}
\crossref{https://doi.org/10.17537/2017.12.547}
Linking options:
https://www.mathnet.ru/eng/mbb312
https://www.mathnet.ru/eng/mbb/v12/i2/p547
This publication is cited in the following 8 articles:
O. Y. Kiryanova, I. I. Kiryanov, B. R. Kuluev, R. R. Garafutdinov, A. V. Chemeris, I. M. Gubaydullin, “Multiplex in silico RAPD-analysis for genome barcoding”, Mat. Biolog. Bioinform., 17:2 (2022), 208–229
N.N. Nazipova, Proceedings of the International Conference “Mathematical Biology and Bioinformatics”, 9, Proceedings of the International Conference “Mathematical Biology and Bioinformatics”, 2022
M. Frolova, S. Yudin, V. Makarov, O. Glazunova, O. Alikina, N. Markelova, N. Kolzhetsov, T. Dzhelyadin, V. Shcherbakova, V. Trubitsyn, V. Panyukov, A. Zaitsev, S. Kiselev, K. Shavkunov, O. Ozoline, “Lacticaseibacillus paracasei: occurrence in the human gut microbiota and k-mer-based assessment of intraspecies diversity”, Life-Basel, 11:11 (2021), 1246
N. Markelova, O. Glazunova, O. Alikina, V. Panyukov, K. Shavkunov, O. Ozoline, “Suppression of Escherichia coli growth dynamics via RNAs secreted by competing bacteria”, Front. Mol. Biosci., 8 (2021), 609979
P. Beran, D. Stehlikova, S. P. Cohen, V. Curn, “KEC: unique sequence search by k-mer exclusion”, Bioinformatics, 37:19 (2021), 3349–3350
Panyukov V.V., Kiselev S.S., Ozoline O.N., “Unique K-Mers as Strain-Specific Barcodes For Phylogenetic Analysis and Natural Microbiome Profiling”, Int. J. Mol. Sci., 21:3 (2020), 944
O. V. Alikina, O. A. Glazunova, A. A. Bykov, S. S. Kiselev, M. N. Tutukina, K. S. Shavkunov, O. N. Ozoline, “A cohabiting bacterium alters the spectrum of short RNAs secreted by Escherichia coli”, FEMS Microbiol. Lett., 365:24 (2018), fny262
S.S. Kiselev, O.N. Ozoline, V.V. Panyukov, Proceedings of the International Conference “Mathematical Biology and Bioinformatics”, 7, Proceedings of the International Conference “Mathematical Biology and Bioinformatics”, 2018