Enhanced graphic matrix analysis of nucleic acid and protein sequences.
- 1 December 1981
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 78 (12), 7665-7669
- https://doi.org/10.1073/pnas.78.12.7665
Abstract
The enhanced graphic matrix procedure analyzes nucleic acid and amino acid sequences for features of possible biological interest and reveals the spatial patterns of such features. When a sequence is compared to itself, the technique shows regions of self-complementarity, direct repeats, and palindromic subsequences. Comparison of two different sequences, exemplified by Ig κ L chain genes, using colored graphic matrices showed domains of similarity, regions of divergence, and features explainable by transpositions. Analysis of mouse constant domain Ig sequences revealed self-complementary regions that can be used to fold the molecule into a structure consistent with EM observations. Computer translation of nucleic acid sequences into all possible amino acid sequences, followed by graphic matrix analysis, provides a way to detect the most likely protein-encoding regions and can predict the correct reading frames in sequences in which splicing patterns are not defined. Application of this technique to regions of SV-40 and polyoma virus demonstrates the frames of translation and shows the agreement of sequences determined in separate laboratories with different virus isolates. The graphic matrix technique can also be used to assemble fragmentary sequences during determination, to display local variations in base composition, to detect distant evolutionary relationships, and to display intragenic variation in rates of evolution.
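The comparisons described in the abstract can be illustrated with a generic dot-matrix sketch in Python. This is not the authors' program: the function names `dot_matrix`, `reverse_complement`, and `render`, and the parameters `window` and `min_matches`, are assumptions introduced here for illustration. The sketch marks a cell when a short window of one sequence matches a window of the other, so that direct repeats appear as off-diagonal runs in a self-comparison. Comparing a nucleic acid sequence against its own reverse complement is one plausible way to expose self-complementary regions.

```python
# Hypothetical helper names; a generic dot-matrix sketch, not the paper's code.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def dot_matrix(seq_a, seq_b, window=1, min_matches=1):
    """Return the set of (i, j) window starts that pass the match filter.

    A cell (i, j) is kept when the windows seq_a[i:i+window] and
    seq_b[j:j+window] agree in at least `min_matches` positions.
    Both parameters are assumed tuning knobs for suppressing chance matches.
    """
    hits = set()
    for i in range(len(seq_a) - window + 1):
        for j in range(len(seq_b) - window + 1):
            matches = sum(
                1 for k in range(window) if seq_a[i + k] == seq_b[j + k]
            )
            if matches >= min_matches:
                hits.add((i, j))
    return hits


def render(hits, n_rows, n_cols):
    """Draw the matrix as text: '*' for a hit, '.' otherwise."""
    return "\n".join(
        "".join("*" if (i, j) in hits else "." for j in range(n_cols))
        for i in range(n_rows)
    )


# Self-comparison of a toy sequence containing a direct repeat of "GATTACA".
seq = "GATTACAGATTACA"
window = 4
hits = dot_matrix(seq, seq, window=window, min_matches=window)
size = len(seq) - window + 1
print(render(hits, size, size))

# Self-complementarity: compare the sequence with its reverse complement.
rc_hits = dot_matrix(seq, reverse_complement(seq), window=window, min_matches=3)
```

In the self-comparison, the main diagonal is the trivial identity match, and the parallel off-diagonal runs correspond to the direct repeat. Raising `window` or `min_matches` reduces background from chance matches at the cost of missing shorter or more divergent features.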