Government organizations are experiencing more and more pressure to keep detailed and up-to-date planimetric and topographic basemaps, in order to support their functions effectively and efficiently under their tight budget constraints. To date, basemaps are updated yearly (at most), using new orthophotography and/or maintaining their parcel fabric. Many times this means starting from scratch over and over again in order to obtain a reasonable horizontal and vertical integration of the geographic features among layers. As accuracy needs increases, the volume of information is substantially higher, and so are the corresponding costs and delivery times.
Exploiting the power of the new ArcGIS architecture, this paper proposes a new concept, the “intelligent basemap”, where the quality and the integrity of the geometric and attribute base are assured. It also promotes easier and faster editing and updating, and extremely efficient support of derived geometry and metadata, and therefore, overcoming many of the challenges described above.
Because of the increasing acceptance of GIS technology within the IT industry, the changes in GIS base technology itself, the availability (or potential liability) of GIS mapping services through the Internet to massive audiences, changes in regulations, tight budgets and increasing demands from management to reduce operational costs, results in the need of having more detailed and updated maps than ever before. Such demand generates the need to “re-invent” the area of basemapping.
Every time the scale is increased, the amount of details grows significantly, meaning more resources and more time to keep them updated, with the correct attributes, the right topology, vertically and horizontally integrated with the other layers. Until now, the most common way to address these issues is to generate basemaps every few years, sharing the datasets, and spending a tremendous amount of time and manual labor.
Accelerating the pace of updated versions and augmenting the level of detail will require a new paradigm. Budgets and deadlines do not seem to escalate with those needs; hence having more of the same is not an optimal long-term solution. This paper offers a new approach based on Esri ArcObjects’ framework, coupled with new data acquisition methods and technologies, to address these challenges.
Nowadays most geographic data and metadata are considered to be static. Each layer is “freezed” in time with every new version of the dataset. The idea is to turn both data and metadata into dynamic and living entities, which become aware of their origin and that then can automatically adapt to changes.
This paper summarizes the concepts and technologies behind this transformation from snapshots to living, “intelligent” geographic data.
A basemap is a layer or set of layers that provide geometric support to other set of geographic features or layers, in 2D (planimetric) or 3D (topographic). For that reason it is desirable that basemaps possess the following two properties:
The most common layers being used as basemaps are ortho-corrected raster datasets and/or digitized mylar maps. Most of the other layers needed by an organization will be supported by and derived from basemaps; two examples could be administrative boundaries and landuse.
There is an increasing awareness about the importance of measuring the events related to human activities or natural phenomena in space and time, monitoring their changes and enforcing regulations. Consequently, data needs to be captured on time, more accurate and with greater detail. In order to satisfy those needs, basemaps themselves need to be up-to-date.
The challenges faced by the basemapping community to comply with these issues are diverse. To name a few:
Figure 1 shows a simple example. Based on aerial photography and ground control points (GCP), an organization plans to extract landuse, compile basic line features for reference, and geometrically adjust an existing demographics layer to perform a determined spatial analysis (growth capacity). Due to budget and time constraints, the aerial triangulation is computed and the USGS DEM is used for the final geometric correction. The transformed photographs become the basemap, the linear features are extracted accordingly, the landuse layer is created and after rubber-sheeting, demographics fits the basemap geometry.
Figure 1: This graph represents the workflow that computes a growth capacity map. Each box corresponds to the process that results in the dataset named in its label. The arrows indicate the dependency among datasets, reflecting their lineage. The evolution of that dependency is color-coded, from left to right. Two processes with the same color can be executed in parallel.
As the project is successful they want to repeat the experience the next year, wanting also to understand the evolution between the two years. New photography is contracted, and this time there is money to compute the orthos from the stereo pairs. What happens: the new landuse follows the new geometry, not the old; but the demographics layer remains with the old definition, but needs to be corrected to the new geometry in order to be able to perform the new analysis. The overlap required for the multitemporal study needs to match both years’ results. Typically all this work will be manual and costly.
In the next section, the same example is reviewed using a different approach, by having a new model in which derived layers can automatically follow the geometric changes undergone by its “dynamically linked” sources.
It is worthwhile to note that routes and regions introduced in former ArcInfo releases can be considered as early references of the concept presented here, strongly limited by the underlying technology available at the time. Having census tracts defined as regions over a network of centerlines, any modifications to the centerline geometry would result in the automatic alteration of the census tracts.
The metadata synchronizer present in ArcCatalog constitutes an important reference to our work. The synchronizer completes as much of the metadata as possible automatically, assuring that metadata is kept up-to-date with changes to the data set. We will take this concept further by providing a two-way synchronization between data and metadata.
From neural networks architectures [6] and learning rules, several ideas have been drawn about how knowledge is propagated among the different elements of an artificial neural net (ANN). Even though we are not using the ANN technology per se, many useful analogies have been exploited.
The quality of the attribute information is as crucial as the geometry, and it can be maintained using Esri geodatabase techniques described elsewhere [2], using domains, rules and so on. The main focus of this paper will be in geometric features, where one of the most critical challenges resides.
“Intelligent basemap” is herein defined as a geodatabase that incorporates behavior into their constituent layers such that:
A metaphor for the intelligent basemap is the conventional spreadsheet, like MS-Excel. Instead of storing fixed cell values and every time there is a change, get a calculator, make all the calculations externally, and manually edit the results, the formula of how to compute them automatically are kept, such that changes are taken care of on they own, leaving the spreadsheet in a consistent state at all times.
Although the idea is simple and straightforward, applying the same concept to geometry is far more complicated. However, object-oriented technology [1,4] and the ArcObjects framework [5] in particular make this possible.
The following benefits can be highlighted:
Equivalent ideas have been successfully applied in the field of Computer Aided Design/Computer Aided Manufacturing (CAD/CAM), where tens of thousands of 3D parts are designed and assembled parametrically, in such a way that if any of them is altered, the impact is reflected immediately in the whole model, providing instant feedback to the engineer designing a particular piece. Although the context is different and that technology cannot be utilized for mapping purposes, the principles and the benefits are comparable.
Having this new technology in place, how does it make the previous example different?
Figure 2. The two workflows represent the same work done with a difference of one year in time and with different accuracies. By computing a geometric transformation model only between the old to the new basemaps, vertical and multitemporal automatic integration of all other layers is guaranteed. Datasets marked with a asterisk (*) are the new data, while those unmarked represent the old data in the appropriate geometry, being its original or transformed.
The upper workflow in Figure 2 produces the same result as the one depicted in Figure 1, but it has been redesigned, such that the corrected demographics layer this time is computed from the linear features derived in turn from the orthos. Even though it will be more work the first time, if any of the sources changes, the demographics will be adjusted accordingly. For instance, if the USGS DEM with 30 meters postings is replaced by a more accurate DEM with 30 cm contours, the orthos can be recreated and a “geometric” forward propagation occur.
The lower workflow in Figure 2 represents the new job, where the new data is marked with an asterisk (*). Establishing an automatic correspondence between the new orthos and the old ones, it allows to transform every other pixel or vertex of every other layer. In the example, the linear features and the demographics layers have been transformed through this vector field to match the new geometry, instead of being rebuilt. Undoubtedly the most interesting thing is what happens if the linear features are captured again with higher accuracy. Then, the demographics layer will inherit the new accurate geometry automatically, and so will its descendants.
The next section, describes the technology that makes this possible.
Briefly some of the technologies that could make this vision possible are mentioned.
Even though this may not seem as strictly related to the concept of “intelligent basemaps”, it is in fact its most important component for two reasons. First, if the data is not captured and produced timely and with the required level of detail, nothing can be done later to fix it. Second, the data captured, remotely or in the field, are always the first links of the lineage chains (or graphs) for any other existing dataset in the geodatabase.
In the raster domain, Earthdata has a broad set of sensors, covering wide ranges in speed of acquisition and/or processing, in spectrum, in levels of detail and swath coverage.
On the vector domain, the ability to collect real-time GPS (or differential GPS) measurements in the field, connected to a PDA, and linked to the Internet via wireless protocols to an ArcIMS server, offers new opportunities for on-time, accurate basemap updating.
This concept requires to store for each object not only its latest computed coordinates, but also information about the formula or process that produced those coordinates.
In the case of raster layers, the rectification or the ortho-correction process that produces the geometric transformation map should be stored (i.e. a vector field associating each pixel of the original with the new position in the corrected image). In the case of a rectification, the transformation map is a polynomial with few coefficients, dependent on the ground control. Ortho-correction processes produced a more complex transformation map.
The simplest example is an image rectified through an affine transformation. If the GCP are measured more accurately, the coefficients change, and the image geometry can be updated accordingly. The new polynomial can now also be applied to all the derived layers, both raster and vector.
If the data is collected in vector form, it is more complicated, because it is necessary to keep track of all the sketch construction commands [3], parameters and constraints entered during an editing session. For instance, if a point coordinates are computed as the intersection of two circles, that information with the circle centers and radius are stored as part of the object. If one of those centers moves, the other point should follow.
Note that the lineage information until now was only kept in the metadata base as a textual description of the origin of the data, for visualization purposes only. Such unstructured information is not enough to automate conversion processes to this new technology, so it has to be done manually once. From that moment on, data and metadata become fully synchronized, and changes in one are reflected in the other immediately.
Whenever there is an increment in the level of geometric detail, both raster and vector, we need to be dealing with multiresolution algorithms to establish the correspondence, automatically or semi-automatically, between the same objects at the two scales.
In the example of Figure 2, in order to compute the geometric transformation model between the two sets of orthophotos, given that they have different scales, it is necessary to use appropriate algorithms that can perform automatic matching at different scales. Similarly, for vector datasets, new conflation tools are being developed following similar principles.
A new concept has been presented in the area of basemapping that can substantially reduce the efforts of maintaining and updating existing geographic databases. The integration of technologies, from new sensors for more effective and efficient data capture, to the object oriented paradigm and Esri ArcObjects, could produce a dramatic impact in the future of basemapping.
[1] Bahar, M., Object Technology Made Simple, Simple Software Publishing, 1996.
[2] McDonald, A., Building a Geodatabase, Esri Press, 2000.
[3] Shaner, J., Wrightsell, J., Editing in ArcMap, Esri Press, 2000.
[4] Taylor, D., Object Technology: A Manager’s Guide, Addison-Wesley, 1997.
[5] Zeiler, M.., Exploring ArcObjects, Esri Press, 2000.
[6] Hagan, M., Demuth, H., Bealle M., Demuth, B., Neural Network Design, Pws Pub Co, 1995.