Order and correlations in genomic DNA sequences. The spectral approach
V.V. Lobzin a
V.R. Chechetkin b
a Pushkov Institute of Terrestrial Magnetism, Ionosphere and Radio Wave Propagation, Russian Academy of Sciences, Kaluzhskoe shosse 4, Troitsk, Москва, 108840, Russian Federation
b State Scientific Center of the Russian Federation ‘Troitsk Institute for Innovation and Fusion Research’, Troitsk, Moscow Region, Russian Federation
The structural analysis of genomic DNA sequences is discussed in the framework of the spectral approach, which is sufficiently universal due to the reciprocal correspondence and mutual complementarity of Fourier transform length scales. The spectral characteristics of random sequences of the same nucleotide composition possess the property of self-averaging for relatively short sequences of length M \geqslant 100-300. Comparison with the characteristics of random sequences determines the statistical significance of the structural features observed. Apart from traditional applications to the search for hidden periodicities, spectral methods are also efficient in studying mutual correlations in DNA sequences. By combining spectra for structure factors and correlation functions, not only integral correlations can be estimated but also their origin identified. Using the structural spectral entropy approach, the regularity of a sequence can be quantitatively assessed. A brief introduction to the problem is also presented and other major methods of DNA sequence analysis described.