A question I see quite often, in scientific forums, is how to determine whether the observed performance difference of two classifiers is statistically significant. Let us work through an example, as to how we can compute this statistical significance (click on the link below):
How to Compute the Statistical Significance of two Classifiers’ Performance Difference