. 2005 Feb;15(2):330–340. doi: 10.1101/gr.2821705

Table 2.

Significance test for differences in BAliBASE performance

	DIALIGN	CLUSTALW	MAFFT	T-Coffee	MUSCLE	ProbCons	ProbCons-ext
Align-M	−(0.61)	−8.2 × 10⁻⁶	−<10⁻¹⁰	−<10⁻¹⁰	−<10⁻¹⁰	−<10⁻¹⁰	−<10⁻¹⁰
DIALIGN		−1.9 × 10⁻⁵	−<10⁻¹⁰	−<10⁻¹⁰	−<10⁻¹⁰	−<10⁻¹⁰	−<10⁻¹⁰
CLUSTALW	+2.4 × 10⁻³		−1.0 × 10⁻³	−3.0 × 10⁻⁵	−4.9 × 10⁻⁸	−6.1 × 10⁻¹⁰	−<10⁻¹⁰
MAFFT	+1.2 × 10⁻⁹	+1.0 × 10⁻³		−(0.65)	−1.7 × 10⁻⁵	−2.6 × 10⁻⁹	−4.9 × 10⁻⁸
T-Coffee	+<10⁻¹⁰	+8.4 × 10⁻⁶	− (0.92)		−7.0 × 10⁻³	−1.5 × 10⁻⁶	−8.4 × 10⁻⁶
MUSCLE	+<10⁻¹⁰	+1.9 × 10⁻⁸	+9.6 × 10⁻⁶	+1.7 × 10⁻³		−3.0 × 10⁻³	−6.6 × 10⁻³
ProbCons	+<10⁻¹⁰	+<10⁻¹⁰	+1.6 × 10⁻⁷	+1.9 × 10⁻⁶	+0.012		+0.043
ProbCons-ext	+<10⁻¹⁰	+<10⁻¹⁰	+8.3 × 10⁻⁶	+3.2 × 10⁻⁵	+(0.092)	−(0.088)

Entries show the p-value indicating the significance of a difference in performance between two alignment methods as measured using a Friedman rank test. Nonitalicized values above the diagonal were calculated using SP scores on all alignments, whereas italicized values were computed using CS scores. (+) Method on the left had lower average rank (better performance); (−) Method on the left had higher average rank (worse performance); parentheses denote (nonsignificant) p-values >0.05.