A Call for an Open Spatial Data Infrastructure

In full disclosure, this is a soap-box issue of mine. I’ve long been a vocal advocate of open public data in the geospatial arena.

The “open” provides us all the opportunity to build common spatial data infrastructures so critical to addressing public, private, and broader societal needs. Here I express concern that even with the most open of data, we may yet be compounding vital problems regarding a critical goal of spatial data infrastructures: authoritative and consistent data. Consistency is key, in my humble opinion.

One need look no further than pleas such as Jonathan Feldman’s recent article “How To Fix The GIS Data Mess” to see how much consistent data, shared among all potential users, is needed and desired. In my own experience, beyond accuracy and unfettered access to geospatial data, consistency of those data among users is critical. When agencies and organizations rely on geospatial data for critical decision making and those data differ, the decisions based on those data will necessarily differ, notwithstanding the best intentions.

Is it emergency responders and non-profit agencies looking at different authoritative data sources to deploy rescue efforts to your pets and family members? Is it the construction crew, development company, city, and recreational group looking to different data sources when trails are cleared for that latest building project? Data consistency is vital – for public safety and for the public interest. Consistency (and with it I’m implying shared maintenance) is key to helping control costs.

I am a big fan of efforts such as OpenStreetMap (OSM) in democratizing geospatial data. This is an effort to be applauded. Clearly the sweeping early success of such an effort, particularly in those areas of the world where geospatial data are less public than in the US, demonstrates that people are ready and eager to create and support open data sources; I am myself. But I offer a word of caution as well… What do we do when other authoritative data that are open already exist? How do we determine which source is authoritative? How do we share maintenance? These questions remain largely unanswered.

Members of the National States’ Geographic Information Council (NSGIC) are working with public and private organizations at all levels to address these very questions.

In Indiana, for example, the community is working together to overcome institutional obstacles and build a statewide spatial data infrastructure that is open and consistent (see the Indiana Geographic Information Council). Local agencies are providing data publicly, such as street centerlines and parcel boundaries, and the state is integrating and publishing rather than duplicating those efforts. The state is contributing as well, not only through coordination and infrastructure, but also with statewide data sets such as aerial photography that make sense to maintain at a broader scale. And the effort doesn’t stop there. With university participation, those data are made public (view and download) through the IndianaMap. They are provided to federal agencies, such as the U.S. Census Bureau for map modernization. In recognition that not everyone comes to government sources for their decision-making, statewide aerial photography (2005) was shipped to Google and Microsoft to integrate into their map services.

Such a model holds out a glimmer of hope that statewide, national, and international spatial data infrastructures are not only possible, but also within reach. However, even with such open data, when the process is ill-defined and under-funded we may miss the target. How, for instance, will the IndianaMap data be incorporated into other open source efforts such as OSM? Even when all parties are willing, how might maintenance be addressed? These questions remain unanswered.

We must continue to strive for solutions that focus on process. Consistent data are key to the potential for geospatial data to solve problems at every scale, from the most local to the most global. While I agree any data may be viewed as better than no data at all, a preponderance of inconsistent data may prove no better with regard to vital issues. There are inherent problems when local data (cities and counties) differ from state data, which in turn differ from federal, private, non-profit, and open data. This is where a National Spatial Data Infrastructure is necessary.