Protein Bioinformatics and Mixtures of Bivariate von Mises Distributions for Angular Data
- 16 November 2006
- journal article
- Published by Oxford University Press (OUP) in Biometrics
- Vol. 63 (2), 505-512
- https://doi.org/10.1111/j.1541-0420.2006.00682.x
Abstract
A fundamental problem in bioinformatics is to characterize the secondary structure of a protein, which has traditionally been carried out by examining a scatterplot (Ramachandran plot) of the conformational angles. We examine two natural bivariate von Mises distributions, referred to as the Sine and Cosine models, which have five parameters and, for concentrated data, tend to a bivariate normal distribution. These are analyzed and their main properties derived. Conditions on the parameters are established which result in bimodal behavior for the joint density and the marginal distribution, and we note an interesting situation in which the joint density is bimodal but the marginal distributions are unimodal. We carry out comparisons of the two models, and it is seen that the Cosine model may be preferred. Mixture distributions of the Cosine model are fitted to two representative protein datasets using the expectation maximization algorithm, which results in an objective partition of the scatterplot into a number of components. Our results are consistent with empirical observations; new insights are discussed.
This publication has 9 references indexed in Scilit:
- Bayesian Statistical Studies of the Ramachandran Distribution. Statistical Applications in Genetics and Molecular Biology, 2005
- Probabilistic model for two dependent circular variables. Biometrika, 2002
- Directional Statistics. Wiley Series in Probability and Statistics, 1999
- Statistical Analysis of Circular Data. Cambridge University Press (CUP), 1993
- A distribution for dependent unit vectors. Communications in Statistics - Theory and Methods, 1988
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features. Biopolymers, 1983
- Simple Approximations for the von Mises Concentration Statistic. Journal of the Royal Statistical Society Series C: Applied Statistics, 1978
- Stereochemistry of polypeptide chain configurations. Journal of Molecular Biology, 1963
- The computation of Fourier synthesis with a digital electronic calculating machine. Acta Crystallographica, 1952
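The abstract names the Sine and Cosine models and a mixture fit by expectation maximization but gives no formulas. The following is a minimal sketch of how these pieces might look numerically, under stated assumptions: the two log kernels use the five-parameter forms commonly quoted in the directional-statistics literature (means `mu`, `nu`; concentrations `k1`, `k2`; one coupling parameter `lam` or `k3`), the normalizing constant is approximated on a grid rather than by the closed-form series, and only the E-step of a Cosine mixture is shown. All function names and the example parameter values are hypothetical and are not taken from the article.

```python
import numpy as np

def sine_log_kernel(phi, psi, mu, nu, k1, k2, lam):
    """Unnormalized log density of the Sine model (assumed parameterization)."""
    return (k1 * np.cos(phi - mu) + k2 * np.cos(psi - nu)
            + lam * np.sin(phi - mu) * np.sin(psi - nu))

def cosine_log_kernel(phi, psi, mu, nu, k1, k2, k3):
    """Unnormalized log density of the Cosine model (assumed parameterization)."""
    return (k1 * np.cos(phi - mu) + k2 * np.cos(psi - nu)
            - k3 * np.cos(phi - mu - psi + nu))

def normalizing_constant(log_kernel, params, n=256):
    """Approximate the integral of exp(log_kernel) over the torus [-pi, pi)^2
    with a midpoint rule on an n-by-n grid."""
    step = 2.0 * np.pi / n
    mid = -np.pi + step * (np.arange(n) + 0.5)
    phi, psi = np.meshgrid(mid, mid, indexing="ij")
    logk = log_kernel(phi, psi, *params)
    shift = logk.max()  # subtract the maximum before exponentiating for stability
    return np.exp(shift) * np.exp(logk - shift).sum() * step ** 2

def responsibilities(angles, weights, components):
    """E-step of a Cosine-model mixture: posterior component probabilities.

    angles     : array-like of shape (m, 2) holding (phi, psi) in radians
    weights    : mixing proportions, one per component
    components : list of (mu, nu, k1, k2, k3) tuples
    """
    angles = np.asarray(angles, dtype=float)
    weighted = np.column_stack([
        w * np.exp(cosine_log_kernel(angles[:, 0], angles[:, 1], *p))
        / normalizing_constant(cosine_log_kernel, p)
        for w, p in zip(weights, components)
    ])
    return weighted / weighted.sum(axis=1, keepdims=True)

# Illustrative two-component mixture; the parameter values are made up.
example_components = [(-1.05, -0.79, 5.0, 5.0, 1.0), (-2.1, 2.4, 5.0, 5.0, 1.0)]
example_resp = responsibilities([[-1.0, -0.8], [-2.0, 2.5]],
                                [0.5, 0.5], example_components)
```

A full EM fit would alternate this E-step with an M-step that re-estimates the weights and the five parameters of each component, typically by numerical optimization; that step is omitted here. When the coupling parameter is zero, both kernels reduce to a product of two univariate von Mises kernels, which gives a simple check on the grid approximation.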