Phylogenetic reconstruction of vertebrate Hox cluster duplications

Abstract
In vertebrates and the cephalochordate, amphioxus, the closest vertebrate relative, Hox genes are linked in a single cluster. Accompanying the emergence of higher vertebrates, the Hox gene cluster duplicated in either a single step or multiple steps, resulting in the four-cluster state present in teleosts and tetrapods. Mammalian Hox clusters (designated A, B, C, and D) extend over 100 kb and are located on four different chromosomes. Reconstructing the history of the duplications and its relation to vertebrate evolution has been problematic due to the lack of alignable sequence information. In this study, the problem was approached by conducting a statistical analysis of sequences from the fibrillar-type collagens (I, II, III, and IV), genes closely linked to each Hox cluster which likely share the same duplication history as the Hox genes. We find statistical support for the hypothesis that the cluster duplication occurred as multiple distinct events and that the four-cluster situation arose by a three-step sequential process.