Stephen C. Hardies, Yingping Zhao, Lanxin Zhou, Christopher P.D. Jacobs, Liping Wang, N. Carol Casavant, and Rhonda Greene-Till
Dept. of Biochemistry, Univ. of Texas Health Science Center in San Antonio, TX 78284
The phylogeny of LINE-1 families in Mus musculus domesticus and M. spretus has been extended to higher resolution. The recently reported active F-type LINE-1 from the reeler locus was found to belong to the previously known L1Md4 clade in its 3' sequence. The previously defined spretus LINE-1 families coalesce with the L1Md4/L1reeler lineage nearly half way down from its split with the active L1MdA2 subfamily. This implied that there should also be a spretus A2 clade and the L1Md4/L1reeler clade appear to have experienced the spretus/domesticus split at much different times. A new retroviral-like repetitive sequence, named MuERVC has been characterized in comparison to previously known retroviral-like families. Like LINE-1, each family appears to have been derived from a different nondefective ancestor, but unlike LINE-1 the MuERVC families appear to amplify after [should say concomitant with] loss of functional reading frames. Dispersal of MuERVC sequences appears to be more episodic than that of LINE-1 subfamilies. Examples are now known of both LINE-1 and MuERVC that recently crossed from spretus to domesticus within an introgressing haplotype. GM51847
Next to L1C105 is an endogenous retrovirus-like sequence named MuERVC. Downstream between two ancient LINE-1 inserts, we have identified a 500 bp stretch of unique sequence (the open box above). The MuERVC insert at the C105 locus predated the spretus/musculus speciation (assayed by the E PCR primer set). An identical sequence (1140 bp) of MuERVC was recovered from C57BL/6J at the C105 locus and from M. spretus (using the M PCR primer set). This mandates that the MuERVC-C105 sequence cotransferred with L1C105 within the last 50,000 yrs. [For any mouse sequence without conservative selection, two alleles will diverge at 2%/Myr since the time of the common ancestor to the alleles. This sequence is known to have no selective constraints, since the reading frame is disrupted. So identical sequence over 1140 bp in both domesticus and spretus MuERVC alleles requires an exchange in the last 50,000 yrs. irrespective of any other information.]
Alleles of unique sequence flanking MuERVC-C105 were recovered from a variety of musculus and spretus strains (using the U PCR primer set). The C57BL/6J allele is also spretus-like, although we are still looking for a perfect match.
Dendogram of unique sequence alleles at the C105 locus. Numbers of base substitutions are indicated on branches. The division at 1 Mya is the spretus/musculus speciation (the time when most spretus and musculus homologues shared their last common ancestor). Mice marked with an asterisk map to the wrong side of the division and are hence candidates for recipients of interspecific gene transfer. SMZ retains an ancestral polymorphism. Also shown are alleles detected by length polymorphism with the E primers and their transitions (in parentheses).
An independent interspecific transfer at the C105 locus of a domesticus allele into M. spretus is also detected (from C57BL/KsJ to SEG and an outbred spretus individual from Morocco) which is statistically much more clear (p =.007). By sequence, the transferred allele is of domesticus origin, so the unusual history of C57BL/KsJ is not an issue. Unlike the classic inbred strains, the two spretus recipients should be representative of the wild population. This second transfer supports the conclusion that these interspecific transfers happened in nature.
Background: Subfamilies are often reported for LINE-1, however these are often ambiguous as to whether they are different age groups on the same lineage or independent coexisting LINE-1 lineages. Coexisting lineages are useful to map because they reveal if different behaviors can simultaneously exist in the host-transposon relationship.
Legend. Each clade shown (other than those with question marks) have from 500-10,000 members. Those clades indicated with question marks are known only from 2-5 individual sequences. The number of diagnostic base substitutions assigned to each lineage is given. Triangles indicate tentative positions of peak output. The endpoints of the clades are scaled vertically based on degree of divergence among its youngest known members. The indicated speciation time is at 1 Mya.
The A2 lineage, carrying the original mouse prototypical LINE-1 named L1MdA2, is now recognized to have a sister family in spretus joining at the anticipated speciation time for musculus and spretus. L1MdA2 has a prototypical A-type promoter.
The L1MdZ lineage, previously unrecognized, similarly splits to musculus and spretus lineages at the anticipated speciation time. The promoter type in this lineage is unknown. It's unclear if the lineage presisted after an initial surge of output.
The L1reeler/L1Md4 lineage is now seen to have shared a common ancestor about 0.5 Myr after the anticipated speciation time. This is apparantly an example of gene exchange between the species that carried an active LINE-1 master and successfully populated the second species. On the spretus side, this lineage split to the previously characterized MS7024 clade and the aggressively amplifying MS475 clade. On the musculus side, this lineage became the previously known L1Md4 clade which is now known to include L1reeler. L1reeler is a spontaneous insert in recent times and carries an F-type promoter.
It will be of interest to determine when the promoter switch occurred. An attractive hypothesis is to identify promoter alterations with the generation of new lineages. This might reflect acquisition of expression in a new germline compartment as a means of escaping mechanisms restraining the older lineage.
Discussion
The existence of unrecognized additional lineages can easily cause a false indication of a surge in output. We are applying intensive sampling methods to develop a more confident description of each of these lineages.
See Casavant et al.
L1C105 is part of a spretus-derived haplotype.
LINE-1 Lineages in the Mouse.
Three different LINE-1 lineages are now characterized as they cross the musculus/spretus speciation.
Did the interspecific transfer establish a new stable lineage?
Horizontal transfer is often associated with transposons that tend to amplify impressively and then fail in the new host. LINE-1 is usually associated with stable maintenance in the host germline over at least 100 Myr time intervals. It is therefore of interest to see what happens after the unusual interspecific transfer of a LINE-1 lineage. So far it seems that the new lineage establashed and has persisted. There is some evidence of a preexisting lineage of the same type in M. domesticus which may have been displaced by the invading lineage.
When did the L1reeler lineage acquire the F-type promoter?
Are there Surges in LINE-1 Output?
L1 subfamilies are often discussed with the implication that they relate to episodes of increased output (like SINEs). The term "master" LINE-1 is often associated with LINE-1 subfamilies with the implication that a single locus underlies the elaboration of an episode of increased output. However, those fluctuations in output that have been documented last for an order of magnitude longer than the persistence of an individual active locus. A theory is developed at right to deal with surges of output as components of the host-transposon interaction that is tied to adaptations in the LINE-1 sequence as opposed to a particular locus. However, it is important to refine the observational basis of LINE-1 fluctuations in output to a finer time scale and to sort out those that are just driven by incidental changes in parameters such as mutation rate or host generation time.
![]() |
Through sampling of mouse species that split off during the amplification, it should be possible to gain a similar level of information about this amplification event as that exhibited in the LINE-1 system. Discussion
Selfish Transposons and Stability through Self-Restraint in Higher Eucaryotic Systems.