SPI: a tool for incorporating gene expression data into a four-dimensional database of Caenorhabditis elegans embryogenesis

Abstract
Motivation: A comprehensive gene expression database is essential for computer modeling and simulation of biological phenomena, including development. Development is a four-dimensional (4D; 3D structure and time course) phenomenon. We are constructing a 4D database of gene expression for the early embryogenesis of the nematode Caenorhabditis elegans. As a framework of the 4D database, we have constructed computer graphics (CG), into which we will incorporate the expression data of a number of genes at the subcellular level. However, the assignment of 3D distribution of gene products (protein, mRNA), of embryos at various developmental stages, is both difficult and tedious. We need to automate this process. For this purpose, we developed a new system, named SPI after superimposing fluorescent confocal microscopic data onto a CG framework. Results: The scheme of this system comprises the following: (1) acquirement of serial sections (40 slices) of fluorescent confocal images of three colors (4′,6′-diamino-2-phenylindole (DAPI) for nuclei, indodicarbocyanine (Cy-3) for the internal marker, which is a germline-specific protein POS-1 and indocarbocyanine (Cy-5) for the gene product to be examined); (2) identification of several features of the stained embryos, such as contour, developmental stage and position of the internal marker; (3) selection of CG images of the corresponding stage for template matching; (4) superimposition of serial sections onto the CG; (5) assignment of the position of superimposed gene products. The Snakes algorithm identified the embryo contour. The detection accuracy of embryo contours was 92.1% when applied to 2- to 28-cell-stage embryos. The accuracy of the developmental stage prediction method was 81.2% for 2- to 8-cell-stage embryos. We manually judged only the later stage embryos because the accuracy for embryos at the later stages was unsatisfactory due to experimental noise effects. Finally, our system chose the optimal CG and performed the superposition and assignment of gene product distribution. We established an initial 4D gene expression database with 56 maternal gene products. Availability: This system is available at http://anti.lab.nig.ac.jp/spi/ and http://anti.lab.nig.ac.jp/4ddb/

This publication has 0 references indexed in Scilit: