Adaptive Copy Number Evolution in Malaria Parasites

Abstract
Copy number polymorphism (CNP) is ubiquitous in eukaryotic genomes, but the degree to which this reflects the action of positive selection is poorly understood. The first gene in the Plasmodium folate biosynthesis pathway, GTP-cyclohydrolase I (gch1), shows extensive CNP. We provide compelling evidence that gch1 CNP is an adaptive consequence of selection by antifolate drugs, which target enzymes downstream in this pathway. (1) We compared gch1 CNP in parasites from Thailand (strong historical antifolate selection) with those from neighboring Laos (weak antifolate selection). Two percent of chromosomes had amplified copy number in Laos, while 72% carried multiple (2–11) copies in Thailand, and differentiation exceeded that observed at 73 synonymous SNPs. (2) We found five amplicon types containing one to greater than six genes and spanning 1 to >11 kb, consistent with parallel evolution and strong selection for this gene amplification. gch1 was the only gene occurring in all amplicons suggesting that this locus is the target of selection. (3) We observed reduced microsatellite variation and increased linkage disequilibrium (LD) in a 900-kb region flanking gch1 in parasites from Thailand, consistent with rapid recent spread of chromosomes carrying multiple copies of gch1. (4) We found that parasites bearing dhfr-164L, which causes high-level resistance to antifolate drugs, carry significantly (p = 0.00003) higher copy numbers of gch1 than parasites bearing 164I, indicating functional association between genes located on different chromosomes but linked in the same biochemical pathway. These results demonstrate that CNP at gch1 is adaptive and the associations with dhfr-164L strongly suggest a compensatory function. More generally, these data demonstrate how selection affects multiple enzymes in a single biochemical pathway, and suggest that investigation of structural variation may provide a fast-track to locating genes underlying adaptation. Recent comparative genomic hybridization studies have revealed extensive copy number variation in eukaryotic genomes. The first gene in the Plasmodium folate biosynthesis pathway, GTP-cyclohydrolase I (gch1), shows extensive copy number polymorphism (CNP). We provide compelling evidence that gch1 CNP is adaptive and most likely results from selection by antifolate drugs, which target enzymes downstream in this pathway. Gch1 CNP shows extreme geographical differentiation; hitchhiking reduces diversity and increases LD in flanking sequence, indicating recent rapid spread within Thailand, while amplicon structure reveals multiple origins and parallel evolution. Furthermore, strong association between elevated copy number and a critical mutation dhfr-I164L that underlies high-level antifolate resistance indicates functional linkage and fitness epistasis between genes on different chromosomes. These data reveal hidden complexity in the evolutionary response to antifolate treatment and demonstrate that analysis of structural variation can provide a fast-track to locating genes that underlie adaptation.