Spliced leader–based metatranscriptomic analyses lead to recognition of hidden genomic features in dinoflagellates

Abstract
Environmental transcriptomics (metatranscriptomics) for a specific lineage of eukaryotic microbes (e.g., Dinoflagellata) would be instrumental for unraveling the genetic mechanisms by which these microbes respond to the natural environment, but it has not been exploited because of technical difficulties. Using the recently discovered dinoflagellate mRNA-specific spliced leader as a selective primer, we constructed cDNA libraries (e-cDNAs) from one marine and two freshwater plankton assemblages. Small-scale sequencing of the e-cDNAs revealed functionally diverse transcriptomes proven to be of dinoflagellate origin. A set of dinoflagellate common genes and transcripts of dominant dinoflagellate species were identified. Further analyses of the dataset prompted us to delve into the existing, largely unannotated dinoflagellate EST datasets (DinoEST). Consequently, all four nucleosome core histones, two histone modification proteins, and a nucleosome assembly protein were detected, clearly indicating the presence of nucleosome-like machinery long thought not to exist in dinoflagellates. The isolation of rhodopsin from taxonomically and ecotypically diverse dinoflagellates and its structural similarity and phylogenetic affinity to xanthorhodopsin suggest a common genetic potential in dinoflagellates to use solar energy nonphotosynthetically. Furthermore, we found 55 cytoplasmic ribosomal proteins (RPs) from the e-cDNAs and 24 more from DinoEST, showing that the dinoflagellate phylum possesses all 79 eukaryotic RPs. Our results suggest that a sophisticated eukaryotic molecular machine operates in dinoflagellates that likely encodes many more unsuspected physiological capabilities and, meanwhile, demonstrate that unique spliced leaders are useful for profiling lineage-specific microbial transcriptomes in situ.