MergeAlign is a program that constructs a consensus multiple sequence alignment from multiple independent alignments. Using dynamic programming it efficiently combines individual multiple sequence alignments to generate a consensus that is maximally representative of all constituent alignments.

Using MergeAlign to combine multiple sequence alignments generated using different matrices of amino acid substitution produces multiple sequence alignments that are more robust and more accurate than alignments generated using only a single matrix of amino acid substitution. Phylogenetic trees inferred from these MergeAlign alignments have better topological support values, are better resolved and show increased consistency.

MergeAlign generates column support scores for each column in a multiple sequence alignment. When constituent alignments are generated using different models of amino acid substitution these support scores are related to alignment precision. MergeAlign can therefore be used to select accurately aligned data for downstream phylogenetic applications.

Download a command line executable version of MergeAlign below

Download “MergeAlign” MergeAlign.tar.gz – Downloaded 504 times – 3 KB

Download the 91 MergeAlign matrices below

Download “MergeAlign Matrices” mergealign_matrices.tar.gz – Downloaded 213 times – 49 KB

Align and Combine using MergeAlign

Enter a set of unaligned protein sequences in FASTA format and we will align them using MAFFT and 91 different models of amino acid substitution, then use MergeAlign to find the optimal consensus of all these alignments.

This may take a couple of minutes depending on alignment size and server load.


If you use MergeAlign in your work please cite:

Collingridge, P. and Kelly, S. (2012). MergeAlign: improving multiple sequence alignment performance by dynamic reconstruction of consensus multiple sequence alignments. BMC Bioinformatics.