Statistical determination of diagnostic species for site groups of unequal size