CNCA aligns small annotated genomes - Laboratoire de Probabilités et Modèles Aléatoires
Journal article in BMC Bioinformatics. Year: 2023

Abstract

Background: Sequence alignment is a necessary first step in exploring the evolutionary history of sequences, and its quality is crucial. While studying the proximal origins of the SARS-CoV-2 coronavirus, we wanted to construct an alignment of genomes closely related to SARS-CoV-2 using both coding and non-coding sequences. To our knowledge, no existing tool can construct this type of alignment, which motivated the creation of CNCA.

Results: CNCA is a web tool that aligns annotated genomes from GenBank files. It generates a nucleotide alignment that is then updated based on the protein sequence alignment. The final nucleotide alignment matches the protein alignment and is guaranteed to contain no frameshift. CNCA was designed to align closely related small genome sequences of up to 50 kb (typically viruses) for which the gene order is conserved.

Conclusions: CNCA constructs multiple alignments of small genomes by integrating both coding and non-coding sequences. This preserves regions, such as non-coding regions, that conventional back-translation methods traditionally ignore.
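To make the frameshift guarantee concrete, here is a minimal Python sketch of the conventional back-translation step that the abstract refers to: each codon of a coding sequence is placed under its residue in a gapped protein alignment row, so every gap spans a whole codon. This is an illustration only, not CNCA's implementation. The function name `thread_codons`, the toy sequences, and the assumption that each coding sequence excludes its stop codon are all hypothetical.

```python
def thread_codons(aligned_protein: str, cds: str, gap: str = "-") -> str:
    """Place the codons of an ungapped CDS under a gapped protein alignment row.

    Assumption (hypothetical): `cds` holds exactly one codon per residue,
    i.e. the stop codon has already been removed.
    """
    n_residues = sum(1 for aa in aligned_protein if aa != gap)
    if len(cds) != 3 * n_residues:
        raise ValueError("CDS length must be three times the residue count")

    columns = []
    pos = 0  # index of the next unused nucleotide in the CDS
    for aa in aligned_protein:
        if aa == gap:
            # A protein gap becomes a three-nucleotide gap, preserving the frame.
            columns.append(gap * 3)
        else:
            columns.append(cds[pos:pos + 3])
            pos += 3
    return "".join(columns)


# Toy data, invented for illustration.
protein_rows = {"seqA": "MK-L", "seqB": "MKTL"}
coding_seqs = {"seqA": "ATGAAACTG", "seqB": "ATGAAGACCCTG"}

codon_alignment = {
    name: thread_codons(protein_rows[name], coding_seqs[name])
    for name in protein_rows
}
```

Because gaps are only ever inserted three nucleotides at a time, the rows stay in frame. Note that this step covers coding regions alone, which is the limitation the abstract says CNCA addresses by also aligning non-coding regions.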
Main file
s12859-024-05700-1.pdf (987.7 KB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-04268598 , version 1 (02-11-2023)
hal-04268598 , version 2 (10-06-2024)

Cite

Jean-Noël Lorenzi, François Graner, Virginie Courtier-Orgogozo, Guillaume Achaz. CNCA aligns small annotated genomes. BMC Bioinformatics, 2023, 25 (1), pp.89. ⟨10.1186/s12859-024-05700-1⟩. ⟨hal-04268598v2⟩