Cluster-C, an algorithm for the large-scale clustering of protein sequences based on the extraction of maximal cliques