Diaspora is a multicopy envelope-less LTR retroelement that comprises at least 0.5% of the Glycine max genome (Laten 1998; Yano et al. 2005). Diaspora belongs to Athila clade, a lineage of possible retroviruses in plants (Wright and Voytas 2002), which is sibling of Tat clade, a lineage of plant LTR retrotransposons. Both lineages provide evidence of a shared ancestral lineage originally described as Class A by Wright and Voytas (1998). The genomic structure of Diaspora is not represented in current databases but a full-length consensus sequence of Diaspora was generated from short overlapping sequences (Yano et al. 2005).We have considered partially this information to classify the full-lenght Diaspora consensus, which is 11.7 Kb in size including LTRs of 2.5 Kb. The internal region of this element displays a Primer Binding Site (PBS) complementary to a tRNAAsp, a single ORF that encodes a gag-pol polyprotein, and, as other LTR retroelements, a Polypurine Tract (PPT) adjacent to the 3´LTR.
Figure not to scale. If present, long terminal repeats (LTRs) have been highlighted in blue. Amino acid motifs noted with lines indicate the conserved residues in each protein domain, abbreviations below mean:
|DU or DUT=dUTPase
|TAV or IBMP=transactivator/viroplasmin or inclusion body matrix protein
|PBS=primer binding site
|ATF=aphid transmission factor
|VAP=virion associated protein