HOME RESEARCH PUBLICATIONS PEOPLE


Current Projects

In-Cloud metadata for eScience data (XMC Cat)

XMC Cat is a toolkit for capturing and storing metadata gathered during the scientific workflow execution. Its advantages include support for automatic creation metadata through curation plugins needed for subsequent discovery and use purposes. It provides search and browse capability through a client side toolkit and GUIs. XMCcat can be adapted to new scientific domains through configuration changes. That is, new code need not be written. It is currently in use in the LEAD Science Gateway.

Data Catalog

Data catalog is a toolkit used by the LEAD project to support the registration of data collections and to provide LEAD Metadata documents during scientific workflow execution. It crawls metadata from data repositories and provides a web service API as well as a command line interface for searching these metadata. It supports automatic conversion from THREDDS catalog to LEAD Metadata schema(LMS), and to Lucene index format.

LEAD

Linked Environments for Atmospheric Discovery (LEAD) is a collaboration between meteorologists, computer scientists, and educators to build a cyberinfrastructure that enables adaptive weather forecasting, forecasting that is responsive to and can focus on emerging local severe weather conditions. LEAD is funded through NSF.

Provenance Collection and Representation (Karma)

Karma is a domain independent tool for capturing and using provenance data gathered on scientific digital data generated by data-driven discovery, analysis and modeling. The project is currently funded by NSF SDCI and through funding by Lilly.

Calder Complex Events Processing

Calder researches complex events processing in data driven scientific computing. Current work investigates complex scientific use cases for events processing and programming models that integrate complex events processing and service oriented workflow systems. Earlier work investigated provenance for data streams. The work is funded through a DOE CAREER grant.


Archived Projects

dQUOB: dynamic Query Objects

dQUOB is a middleware system providing continuous evaluation of queries over time sequenced data through an SQL-like language interface. The dQUOB system has been applied to such diverse applications as a safety critical autonomous robotics simulation, and scientific software visualization for global atmospheric transport modeling.

Relational Grid Resources (RGR) Project

The Relational Grid Resource project explored the representation of grid resource data. It advanced the thinking about resource information (e.g., CPU speeds, amount of memory, I/O bandwidth, storage resources) by showing that existing solutions using LDAP were inadequate for storing resource information because resource information underwent rapid change. This work was funded through an NSF ITR grant.

Doppler Source

The Doppler Source project processed streaming data from the NEXRAD radar network comprising 120 operational WSR-88D Doppler radars in the continental United States. It automatically generated metadata on the fly and stored the data to a SQLserver database. The files themselves were stored to a tape archive, then retrieved using attributes gathered during metadata generation. This work forms the basis for some of the ideas in the XMCcat project. This work was funded by Microsoft.

Copyright 2008 Indiana University