Abstract
![]() Integrating Spatial and Tabular Extract Transform and Load (ETL) Processes Track: Health and Human Services Author(s): Terri Cohen, Julie Baitty The Health Resources and Services Administration (HRSA) Geospatial Data Warehouse is designed to support the HRSA mission: improving the availability of and access to quality health care for all. The HRSA Geospatial Data Warehouse provides both tabular and spatial information about HRSA programs and related health resources demographic data. This information is refreshed on a scheduled basis using Extract Transform and Load (ETL) processes developed using Informatica. We have extended the traditional data warehouse ETL to include address correction (via Trillium Software) and integrated Esri's ArcGIS geoprocessing model technology to geocode and load the data into the HRSA Geospatial Data Warehouse. The entire spatial and tabular refresh process for each HRSA data layer is driven through a single Informatica ETL mapping. This paper will detail this process and describe how the integration of these technologies has improved the data warehouse data refresh cycle from weeks to just a few days. Terri Cohen HHS/HRSA OIT 5600 Fishers Lane Room 10-30A Rockville , MD 20857 US Phone: 301-443-3144 Fax: 301-443-4414 E-mail: tcohen@hrsa.gov Julie Baitty HHS/HRSA OIT 5600 Fishers Lane Room 10-30A Rockville , MD 20857 US Phone: 301-443-6514 E-mail: jbaitty@hrsa.gov |