CD-HIT Suite: a web server for clustering and comparing biological sequences
Top Cited Papers
Open Access
- 6 January 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 26 (5) , 680-682
- https://doi.org/10.1093/bioinformatics/btq003
Abstract
Summary: CD-HIT is a widely used program for clustering and comparing large biological sequence datasets. In order to further assist the CD-HIT users, we significantly improved this program with more functions and better accuracy, scalability and flexibility. Most importantly, we developed a new web server, CD-HIT Suite, for clustering a user-uploaded sequence dataset or comparing it to another dataset at different identity levels. Users can now interactively explore the clusters within web browsers. We also provide downloadable clusters for several public databases (NCBI NR, Swissprot and PDB) at different identity levels. Availability: Free access at http://cd-hit.org Contact:liwz@sdsc.edu Supplementary information:Supplementary data are available at Bioinformatics online.Keywords
This publication has 10 references indexed in Scilit:
- A core gut microbiome in obese and lean twinsNature, 2008
- SMART 6: recent updates and new developmentsNucleic Acids Research, 2008
- Probing Metagenomics by Rapid Cluster Analysis of Very Large DatasetsPLOS ONE, 2008
- Gene identification and protein classification in microbial metagenomic sequence data via incremental clusteringBMC Bioinformatics, 2008
- UniRef: comprehensive and non-redundant UniProt reference clustersBioinformatics, 2007
- The Sorcerer II Global Ocean Sampling Expedition: Expanding the Universe of Protein FamiliesPLoS Biology, 2007
- Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequencesBioinformatics, 2006
- Tolerating some redundancy significantly speeds up clustering of large protein databasesBioinformatics, 2002
- Clustering of highly homologous sequences to reduce the size of large protein databasesBioinformatics, 2001