Abstract

back
   Back


Paper
Integrating Spatial and Tabular Extract Transform and Load (ETL) Processes
Track: Health and Human Services
Author(s): Terri Cohen, Julie Baitty

The Health Resources and Services Administration (HRSA) Geospatial Data Warehouse is designed to support the HRSA mission: improving the availability of and access to quality health care for all. The HRSA Geospatial Data Warehouse provides both tabular and spatial information about HRSA programs and related health resources demographic data. This information is refreshed on a scheduled basis using Extract Transform and Load (ETL) processes developed using Informatica. We have extended the traditional data warehouse ETL to include address correction (via Trillium Software) and integrated Esri's ArcGIS geoprocessing model technology to geocode and load the data into the HRSA Geospatial Data Warehouse. The entire spatial and tabular refresh process for each HRSA data layer is driven through a single Informatica ETL mapping. This paper will detail this process and describe how the integration of these technologies has improved the data warehouse data refresh cycle from weeks to just a few days.

Terri Cohen
HHS/HRSA
OIT
5600 Fishers Lane
Room 10-30A
Rockville , MD 20857
US
Phone: 301-443-3144
Fax: 301-443-4414
E-mail: tcohen@hrsa.gov

Julie Baitty
HHS/HRSA
OIT
5600 Fishers Lane
Room 10-30A
Rockville , MD 20857
US
Phone: 301-443-6514
E-mail: jbaitty@hrsa.gov

Contact Us | Privacy | Legal | Careers