Indiana University Bloomington

School of Informatics and Computing

Technical Report TR607:
A Data Management Architecture for Computational Biology

Yu Ma, Randall Bramley, Sun Kim
(Jan 2005), 7
Abstract: With the increasing availability of high-performance computer clusters, a key bottleneck in computational biology research is the handling of metadata: information about computational jobs, the locations of files, the staging of input and output data where it is needed, and application-specific data that allows the results of computations to be searched and queried. Examining the needs of PLATCOM, a gene similarity platform, yields general requirements for a data management architecture from which particular data management systems can be quickly composed. The prototype architecture is described, and its application to comparative genome analysis in PLATCOM is shown in detail.

