Removing redundancy in SWISS-PROT and TrEMBL.

Abstract
SUMMARY: One of the distinguishing criteria of the SWISS-PROT protein sequence data bank is minimal redundancy. The introduction of TrEMBL as a supplementary database ensured the comprehensiveness of SWISS-PROT and TrEMBL but introduced some degree of redundancy. We developed a strategy to identify the redundancy present within and between SWISS-PROT and TrEMBL and its subsequent removal. AVAILABILITY: The tools mentioned in this paper are available on request.

This publication has 0 references indexed in Scilit: