Originally dismissed as “junk” DNA, transposons, or transposable elements (TEs), are now recognized as widespread components of both prokaryotic and eukaryotic genomes that can move or replicate within them. TEs are broadly divided into two classes according to their transposition mechanism: DNA transposons and RNA transposons. DNA transposons use several distinct mechanisms and fall into three primary types: cut-and-paste, peel-and-paste, and self-synthesizing transposons. RNA transposons, also known as retrotransposons, instead move through an RNA intermediate. Retrotransposons make up 30-50% of mammalian genomes and are subdivided into three principal groups: long interspersed nuclear elements (LINEs), short interspersed nuclear elements (SINEs), and long terminal repeat (LTR) elements, which include endogenous retroviruses (ERVs).
TEs, notably DNA transposons, contribute substantially to genome evolution and genetic innovation. Because transposition can move large DNA segments, transposons are also valuable gene delivery tools for transgenesis and human gene therapy.
Transposon Lab is interested in the following topics:
1. Origins of eukaryotic DNA transposons
The ability of transposons to mediate horizontal gene transfer challenges the traditional view that genetic information is inherited only vertically. It allows genes to be shared across species, kingdoms, and even domains, and so shapes genetic diversity and evolution on a grand scale. Transposable elements, including DNA transposons, are instrumental in mediating genetic exchange between prokaryotes and eukaryotes. Despite the considerable diversity of eukaryotic DNA transposons, their origins remain largely obscure. Comparative genomic analysis of DNA transposon evolution in prokaryotes and eukaryotes may uncover evidence of transposon transfer between them and thereby illuminate the evolutionary origins of eukaryotic transposons.
2. Mining and engineering of targeting DNA transposons
Retroviruses and DNA transposons are frequently used as gene therapy delivery systems. Retroviral vectors rely on reverse transcriptase and integrate through a copy-and-paste mechanism, but the RNA intermediate and the reverse transcription step can make retroviral insertions unstable. Retroviral vectors are also difficult to manipulate, have a restricted payload capacity, and are costly.
In contrast, DNA transposons operate through a direct cut-and-paste mechanism, which gives more predictable and robust genomic integration. Researchers have developed potent gene delivery vectors from DNA transposons, such as ZB, PS, Sleeping Beauty, and piggyBac, for use in gene therapy, transgenesis, and mutagenesis. These vectors offer several benefits in gene therapy: low cost, substantial cargo capacity, stable integration, a diminished immune response, and ease of manipulation with a reduced risk of immunosuppression.
Nevertheless, gene therapy based on DNA transposons, like that based on retroviruses, carries the hazard of random transgene integration, which raises the risk of insertional mutagenesis. Such nonspecific integration can inadvertently alter gene function, including the regulation of oncogenes and tumor suppressor genes, and poses a serious safety challenge in gene therapy.
It is therefore imperative to advance gene delivery systems by discovering and engineering targeting DNA transposons with the aid of artificial intelligence (AI) and other technologies. Such improvements can reduce the danger of arbitrary integration and substantially improve the safety profile of DNA transposon-mediated gene therapy.
Fig.1. Structure of cut-and-paste transposons and their transposition mechanism. (A) The transposon is a mobile genetic element containing a transposase coding sequence (green box) flanked by terminal inverted repeats (TIRs; orange arrows on the left and right). (B) The transposase (green spheres) binds to its sites within the transposon TIRs (orange boxes). Excision takes place in a synaptic complex, and separates the transposon from the donor DNA (gray box). The excised element integrates into a target site in the target DNA (yellow box). This process generates target site duplications (TSDs, black boxes) which flank the newly integrated transposon. Most cut-and-paste transposons generate a transposon excision footprint in the donor DNA.
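The cut-and-paste cycle described in the Fig. 1 caption can be mimicked on plain strings. The following is a minimal Python sketch, not a model of any particular transposase: the function name `cut_and_paste`, the toy sequences, the 2-bp TSD length, and the optional `footprint` argument are all illustrative assumptions.

```python
def cut_and_paste(donor, te_start, te_end, target, site, tsd_len=2, footprint=""):
    """Toy cut-and-paste transposition on DNA strings (illustrative only).

    donor[te_start:te_end] is the transposon. It is excised from the donor,
    optionally leaving `footprint` behind, and inserted into `target` so that
    the tsd_len bases starting at `site` end up flanking it on both sides.
    """
    element = donor[te_start:te_end]
    # Excision: remove the element and leave the (assumed) footprint in the donor.
    donor_after = donor[:te_start] + footprint + donor[te_end:]
    # Integration: the short target sequence is duplicated on each side of the
    # inserted element, which produces the target site duplication (TSD).
    tsd = target[site:site + tsd_len]
    target_after = target[:site] + tsd + element + tsd + target[site + tsd_len:]
    return donor_after, target_after


# Hypothetical element with 4-bp terminal inverted repeats (CAGT ... ACTG).
element = "CAGTGGATCCACTG"
donor = "AAAA" + element + "TTTT"
target = "GGGTACCC"

donor_after, target_after = cut_and_paste(donor, 4, 4 + len(element), target, site=3)
print(donor_after)   # AAAATTTT
print(target_after)  # GGGTACAGTGGATCCACTGTACCC
```

In the output, the `TA` dinucleotide of the target appears on both sides of the inserted element, which corresponds to the TSDs in panel B. Passing a non-empty `footprint` shows the excision scar left in the donor.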
Fig.2. Experimental pipeline for the development of genetic tools based on active DNA transposons.
3. Mining and engineering of transposon-derived miniature RNA-guided endonucleases
The Cas9 and Cas12a (Cpf1) nucleases, which are integral to CRISPR/Cas gene editing systems, are large proteins of roughly 1,200-1,400 amino acids, which complicates their engineering, packaging, and delivery into cells. In addition, editing efficiency is often suboptimal, and the prevalence of off-target effects considerably constrains the utility of CRISPR/Cas-based technologies in gene therapy and bio-breeding.
Conversely, compact RNA-guided nucleases offer substantial advantages, including easier synthesis, improved stability, and more efficient genetic manipulation and cellular transfection. Research and development of miniaturized RNA-guided endonucleases, particularly those originating from transposons and enhanced through artificial intelligence (AI) technologies, are crucial for increasing the safety and effectiveness of gene editing tools for human gene therapy and the development of genetically modified organisms.
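The size argument can be made concrete with simple arithmetic on coding-sequence length. The sketch below is illustrative only: the 4,700-bp payload limit and the 400-residue compact nuclease are assumed figures that do not come from the text, and `cds_length_bp` is a hypothetical helper.

```python
def cds_length_bp(n_residues, include_stop=True):
    """Length in bp of the open reading frame encoding n_residues amino acids."""
    return 3 * (n_residues + (1 if include_stop else 0))


PAYLOAD_LIMIT_BP = 4700  # assumed vector capacity, chosen for illustration

nucleases = {
    "large nuclease, 1200 aa": 1200,
    "large nuclease, 1400 aa": 1400,
    "compact nuclease, 400 aa (assumed)": 400,
}

for name, n_residues in nucleases.items():
    cds = cds_length_bp(n_residues)
    spare = PAYLOAD_LIMIT_BP - cds
    print(f"{name}: CDS {cds} bp, {spare} bp left for promoter, guide RNA and other cargo")
```

Under these assumptions, a 1,400-residue nuclease leaves under 500 bp for everything else in the vector, while a 400-residue nuclease leaves about 3.5 kb.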
TnpB is a transposon-associated protein; transposons are mobile genetic elements capable of moving within and between genomes. One of its key functions is a nuclease activity that is linked to the transposition process. TnpB also processes its own mRNA into guide RNA, which matters for the following reasons:
- Self-Processing of mRNA: TnpB can process its own mRNA to produce guide RNAs. These guide RNAs are short RNA molecules that direct the TnpB nuclease to specific DNA sequences. The processing involves cleaving the mRNA to generate the guide RNA, which is then used to target the nuclease activity to the correct location in the genome.
- Guide RNA Function: The guide RNA forms a complex with TnpB and directs it to the specific DNA sequences that need to be cleaved. This targeting is crucial for the precise insertion or excision of transposons, ensuring that the transposition process does not cause excessive damage to the host genome.
- Nuclease Activity: Once directed by the guide RNA, TnpB cleaves the DNA at the target site. This cleavage is necessary for the transposon to be inserted into a new location or to be excised from its current location. The nuclease activity of TnpB is thus a critical component of the transposition machinery.
- Maintenance of Transposons: By processing its own mRNA into guide RNAs and utilizing its nuclease activity, TnpB ensures the efficient and accurate transposition of mobile genetic elements. This maintenance is vital for the survival and propagation of transposons within genomes, as it allows them to move and replicate without causing detrimental mutations or genomic instability.
In summary, TnpB’s ability to process its own mRNA into guide RNAs and its nuclease activity are essential for the transposition process. These functions enable TnpB to accurately target and cleave DNA, facilitating the movement and maintenance of transposons within genomes. This intricate mechanism highlights the sophisticated strategies that mobile genetic elements have evolved to ensure their persistence and propagation.
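The targeting step, in which a guide RNA directs the nuclease to matching DNA, can be sketched as a sequence search. This is a deliberately simplified Python illustration: the guide is written in the DNA alphabet, any flanking sequence motif the nuclease might require is ignored, and the names `find_target_sites` and `max_mismatches` are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_target_sites(genome, guide, max_mismatches=0):
    """List (position, strand, mismatches) for every window matching the guide.

    Both strands are scanned; positions are 0-based on the forward strand.
    """
    hits = []
    k = len(guide)
    for strand, query in (("+", guide), ("-", reverse_complement(guide))):
        for i in range(len(genome) - k + 1):
            mismatches = hamming(genome[i:i + k], query)
            if mismatches <= max_mismatches:
                hits.append((i, strand, mismatches))
    return sorted(hits)


guide = "GACCGTAA"  # toy 8-nt guide; real guides are longer
genome = "TT" + guide + "CCCC" + reverse_complement(guide) + "GG"
print(find_target_sites(genome, guide))
# [(2, '+', 0), (14, '-', 0)]
```

Raising `max_mismatches` lists near-matches as well, which is a crude way to look for potential off-target sites.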
4. RIP mining and RIP chip development in livestock
Because transposons can move within and between species, they generate abundant genomic structural variation, as other sources of mutation do. Structural variants produced by transposon insertions are known as transposon insertion polymorphisms (TIPs). Transposons are major components of many animal genomes, accounting for around 55% of the zebrafish genome, 35% of the frog genome, 45% of the silkworm genome, and 40% of the pig genome. Because inserted transposon fragments are usually large (generally more than 200 bp) and mostly contain functional elements such as promoters and enhancers, TIPs generally have stronger genetic effects than SNPs. A transposon insertion can change gene function and activity by introducing regulatory elements, altering splicing patterns, or changing epigenetic regulation, thereby causing phenotypic variation. Furthermore, because TIPs are mediated by endogenous transposases whereas SNPs arise from natural mutation, the mutation rate of TIPs (2.5 × 10⁻²) is higher than that of SNPs (1.0-1.8 × 10⁻⁸).
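To put the two quoted rates side by side, the short calculation below computes the fold difference between them. It assumes both figures are expressed in the same units, which the text does not state, so the result is only an order-of-magnitude illustration.

```python
tip_rate = 2.5e-2                      # TIP mutation rate quoted in the text
snp_low, snp_high = 1.0e-8, 1.8e-8     # SNP mutation rate range quoted in the text

# Fold difference at each end of the SNP range (assumes comparable units).
fold_max = tip_rate / snp_low
fold_min = tip_rate / snp_high

print(f"TIP rate is about {fold_min:.2e} to {fold_max:.2e} times the SNP rate")
```

On these numbers, the TIP rate is roughly a million-fold higher than the SNP rate.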
Transposons therefore not only drive genome divergence among species but also generate rich genetic diversity within species, playing an important role in the formation of species, subspecies, varieties, strains, and new traits. Polymorphisms formed by transposon insertions are thus important new molecular markers, and their use in evolutionary genetics offers a new perspective on the evolution of higher-organism genomes. TIP marker technology has also become an important tool in research on biodiversity, genetic evolution, and molecular breeding. Compared with traditional SNP markers, TIP markers offer a high mutation rate, good stability, large numbers, wide genomic distribution, strong genetic effects, simple and fast detection, and low development cost. Genome-wide mining of TIP markers and the development of TIP chips are therefore of high value in genetic breeding research such as QTL fine mapping and whole-genome selection. In domestic animals such as pigs, over 50% of genomic structural variations are generated by retrotransposon insertions, known as retrotransposon insertion polymorphisms (RIPs). RIP markers produced by SINE retrotransposons, the most widespread and polymorphic transposons in livestock genomes, have the greatest potential for development.
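When candidate insertion markers are screened for a chip, one common first step is to compute allele frequencies from presence/absence genotypes and drop loci that are nearly monomorphic. The Python sketch below illustrates that step under stated assumptions: genotypes are coded as copies of the insertion allele (0, 1, or 2), the locus names and genotype calls are invented, and the 0.1 minor-allele-frequency cutoff is a hypothetical threshold rather than the lab's criterion.

```python
def insertion_allele_frequency(genotypes):
    """Frequency of the insertion allele.

    genotypes: copies of the insertion allele per animal (0, 1 or 2);
    None marks a missing call and is skipped.
    """
    called = [g for g in genotypes if g is not None]
    if not called:
        raise ValueError("no called genotypes")
    return sum(called) / (2 * len(called))


def minor_allele_frequency(genotypes):
    """Frequency of the rarer of the two alleles (insertion or empty site)."""
    p = insertion_allele_frequency(genotypes)
    return min(p, 1 - p)


# Invented genotype calls for two hypothetical loci.
calls = {
    "RIP_locus_A": [0, 1, 2, 1, 0, 2, 1, 1],
    "RIP_locus_B": [0, 0, 0, 0, 0, 0, 1, None],
}

MIN_MAF = 0.1  # hypothetical cutoff for keeping a marker
kept = [locus for locus, g in calls.items() if minor_allele_frequency(g) >= MIN_MAF]
print(kept)  # ['RIP_locus_A']
```

Here `RIP_locus_B` is dropped because only one insertion allele is seen among 14 called alleles, so it carries little information for mapping or selection in this toy population.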