Bioinformatics and Transposable Elements by Matej Lexa
Bioinformatics, 2020
Motivation: Transposable elements (TEs) in eukaryotes often get inserted into one another, formin... more Motivation: Transposable elements (TEs) in eukaryotes often get inserted into one another, forming sequences that become a complex mixture of full-length elements and their fragments. The reconstruction of full-length elements and the order in which they have been inserted is important for genome and transposon evolution studies. However, the accumulation of mutations and genome rearrangements over evolutionary time makes this process error-prone and decreases the efficiency of software aiming to recover all nested full-length TEs. Results: We created software that uses a greedy recursive algorithm to mine increasingly fragmented copies of full-length LTR retrotransposons in assembled genomes and other sequence data. The software called TE-greedy-nester takes into account not only sequence similarity but also the structure of elements. This new tool was tested on a set of natural and synthetic sequences and its accuracy was compared to similar software. We found TE-greedy-nester to be superior in a number of parameters, namely computation time and full-length TE recovery in highly nested regions. Availability and implementation: http://gitlab.fi.muni.cz/lexa/nested Contact:
Bookmarks Related papers MentionsView impact
Bioinformatics, 2017
Motivation: G-quadruplexes (G4s) are one of the non-B DNA structures easily observed in vitro and... more Motivation: G-quadruplexes (G4s) are one of the non-B DNA structures easily observed in vitro and assumed to form in vivo. The latest experiments with G4-specific antibodies and G4-unwinding helicase mutants confirm this conjecture. These four-stranded structures have also been shown to influence a range of molecular processes in cells. As G4s are intensively studied, it is often desirable to screen DNA sequences and pinpoint the precise locations where they might form. Results: We describe and have tested a newly-developed Bioconductor package for identifying potential quadruplex-forming sequences (PQS). The package is easy-to-use, flexible and customizable. It allows for sequence searches that accommodate possible divergences from the optimal G4 base composition. A novel aspect of our research was the creation and training (parametrization) of an advanced scoring model which resulted in increased precision compared to similar tools. We demonstrate that the algorithm behind the searches has a 96% accuracy on 392 currently known and experimentally observed G4 structures. We also carried out searches against the recent G4-seq data to verify how well we can identify the structures detected by that technology. The correlation with pqsfinder predictions was 0.622, higher than the correlation 0.491 obtained with the second best G4Hunter.
Bookmarks Related papers MentionsView impact
BMC Genomics, 2018
Background: Many studies have shown that guanine-rich DNA sequences form quadruplex structures (G... more Background: Many studies have shown that guanine-rich DNA sequences form quadruplex structures (G4) in vitro but there is scarce evidence of guanine quadruplexes in vivo. The majority of potential quadruplex-forming sequences (PQS) are located in transposable elements (TEs), especially close to promoters within long terminal repeats of plant LTR retrotransposons.
Results: In order to test the potential effect of G4s on retrotransposon expression, we cloned the long terminal repeats of selected maize LTR retrotransposons upstream of the lacZ reporter gene and measured its transcription and translation in yeast. We found that G4s had an inhibitory effect on translation in vivo since "mutants" (where guanines were replaced by adenines in PQS) showed higher expression levels than wild-types. In parallel, we confirmed by circular dichroism measurements that the selected sequences can indeed adopt G4 conformation in vitro. Analysis of RNA-Seq of polyA RNA in maize seedlings grown in the presence of a G4-stabilizing ligand (NMM) showed both inhibitory as well as stimulatory effects on the transcription of LTR retrotransposons.
Conclusions: Our results demonstrate that quadruplex DNA located within long terminal repeats of LTR retrotransposons can be formed in vivo and that it plays a regulatory role in the LTR retrotransposon life-cycle, thus also affecting genome dynamics.
Bookmarks Related papers MentionsView impact
A significant part of eukaryotic genomes is formed by transposable elements (TEs) containing not ... more A significant part of eukaryotic genomes is formed by transposable elements (TEs) containing not only genes but also regulatory sequences. Some of the regulatory sequences located within TEs can form secondary structures like hairpins or three-stranded (triplex DNA) and four-stranded (quadruplex DNA) conformations. This review focuses on recent evidence showing that G-quadruplex-forming sequences in particular are often present in specific parts of TEs in plants and humans. We discuss the potential role of these structures in the TE life cycle as well as the impact of G-quadruplexes on replication, transcription, translation, chromatin status, and recombination. The aim of this review is to emphasize that TEs may serve as vehicles for the genomic spread of G-quadruplexes. These non-canonical DNA structures and their conformational switches may constitute another regulatory system that, together with small and long non-coding RNA molecules and proteins, contribute to the complex cellular network resulting in the large diversity of eukaryotes.
http://link.springer.com/article/10.1007/s10577-015-9491-7
Bookmarks Related papers MentionsView impact
BMC Genomics, 2014
Bookmarks Related papers MentionsView impact
Bioinformatics, 2003
Bookmarks Related papers MentionsView impact
We report the latest details on improvements made or planned for the VPCR simulation software cur... more We report the latest details on improvements made or planned for the VPCR simulation software currently accessible on the Internet (installed on servers in Padova and Brno). We describe the inner workings of the dynamic amplification simulation model, concentrating mostly on the time- and sensitivity-critical step of sequence matching. Replacement of BLAST by the more sensitive and faster PRIMEX similarity search program resulted in a marked improvement of PCR product predictions. Further speed improvements were acheived using hardware acceleration of approximate sequence matching. We report our theorethical computation time estimates and results of the first tests using the Arabidopis and human genome sequences as PCR templates.
Bookmarks Related papers MentionsView impact
Bioinformatics, 2001
Bookmarks Related papers MentionsView impact
Bookmarks Related papers MentionsView impact
Bookmarks Related papers MentionsView impact
PLoS ONE, 2013
Bookmarks Related papers MentionsView impact
Genetics, 2008
Bookmarks Related papers MentionsView impact
Bioinformatics, 2013
Bookmarks Related papers MentionsView impact
Bioinformatics, 2011
Bookmarks Related papers MentionsView impact
Bookmarks Related papers MentionsView impact
Bookmarks Related papers MentionsView impact
ABSTRACT Vyd. 1. Terminologický slovník
Bookmarks Related papers MentionsView impact
Bookmarks Related papers MentionsView impact
Bookmarks Related papers MentionsView impact
Bookmarks Related papers MentionsView impact
Uploads
Bioinformatics and Transposable Elements by Matej Lexa
Results: In order to test the potential effect of G4s on retrotransposon expression, we cloned the long terminal repeats of selected maize LTR retrotransposons upstream of the lacZ reporter gene and measured its transcription and translation in yeast. We found that G4s had an inhibitory effect on translation in vivo since "mutants" (where guanines were replaced by adenines in PQS) showed higher expression levels than wild-types. In parallel, we confirmed by circular dichroism measurements that the selected sequences can indeed adopt G4 conformation in vitro. Analysis of RNA-Seq of polyA RNA in maize seedlings grown in the presence of a G4-stabilizing ligand (NMM) showed both inhibitory as well as stimulatory effects on the transcription of LTR retrotransposons.
Conclusions: Our results demonstrate that quadruplex DNA located within long terminal repeats of LTR retrotransposons can be formed in vivo and that it plays a regulatory role in the LTR retrotransposon life-cycle, thus also affecting genome dynamics.
http://link.springer.com/article/10.1007/s10577-015-9491-7
Results: In order to test the potential effect of G4s on retrotransposon expression, we cloned the long terminal repeats of selected maize LTR retrotransposons upstream of the lacZ reporter gene and measured its transcription and translation in yeast. We found that G4s had an inhibitory effect on translation in vivo since "mutants" (where guanines were replaced by adenines in PQS) showed higher expression levels than wild-types. In parallel, we confirmed by circular dichroism measurements that the selected sequences can indeed adopt G4 conformation in vitro. Analysis of RNA-Seq of polyA RNA in maize seedlings grown in the presence of a G4-stabilizing ligand (NMM) showed both inhibitory as well as stimulatory effects on the transcription of LTR retrotransposons.
Conclusions: Our results demonstrate that quadruplex DNA located within long terminal repeats of LTR retrotransposons can be formed in vivo and that it plays a regulatory role in the LTR retrotransposon life-cycle, thus also affecting genome dynamics.
http://link.springer.com/article/10.1007/s10577-015-9491-7