Unknown

Dataset Information

0

Parallelization of MAFFT for large-scale multiple sequence alignments.


ABSTRACT: Summary:We report an update for the MAFFT multiple sequence alignment program to enable parallel calculation of large numbers of sequences. The G-INS-1 option of MAFFT was recently reported to have higher accuracy than other methods for large data, but this method has been impractical for most large-scale analyses, due to the requirement of large computational resources. We introduce a scalable variant, G-large-INS-1, which has equivalent accuracy to G-INS-1 and is applicable to 50 000 or more sequences. Availability and implementation:This feature is available in MAFFT versions 7.355 or later at https://mafft.cbrc.jp/alignment/software/mpi.html. Supplementary information:Supplementary data are available at Bioinformatics online.

SUBMITTER: Nakamura T 

PROVIDER: S-EPMC6041967 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2905546 | biostudies-literature
| S-EPMC2764451 | biostudies-literature
| S-EPMC6894943 | biostudies-literature
| S-EPMC548345 | biostudies-literature
| S-EPMC6733932 | biostudies-literature
| S-EPMC1948021 | biostudies-literature
| S-EPMC7313716 | biostudies-literature
| S-EPMC3603318 | biostudies-literature
| S-EPMC5939968 | biostudies-literature