Geographic Information Systems (GIS) are now commonplace within various commercial, governmental and academic settings. From their inception in the late 1960’s and early 1970’s, such systems have developed continually; migrating from the mainframe to the desktop and, more recently, to the Internet and the mobile device. The advent of component based GIS toolkits has also supported the broadening of the GIS audience and the increase in its uptake.
Simon Doyle and Martin Daly / Cadcorp
Within this process, users have traditionally adopted a proprietary or homogeneous approach, i.e. using a single GIS product and using that system’s proprietary file format as a de facto standard. Many factors have driven this approach, such as implementation, maintenance management and communication issues. Many of which are further exasperated through the potentially complex or expensive adoption of multiple systems and data formats. These factors have all previously restricted GIS interoperability i.e. the ability for the systems to communicate effectively - regardless of vendor, data format or operating system. The benefits of overcoming these limiting factors are many, not least that of distributing data and systems in an ‘open’ manner. This increased state of interoperability is being promoted through OpenGIS Consortium (OGC) initiatives and standards. This paper discusses the trend towards interoperability and uses examples of OGC software implementations carried out by Cadcorp Ltd, a UK GIS software house and OGC Technical Member.
IntroductionThe evolution of Geographic Information Systems is well documented (Dangermond 1991, Longley et al 2000). They are now commonplace within many sectors but are still not considered to be ‘mainstream’ in the same way as other software applications e.g. spreadsheets or word processing tools. It cannot be denied however, that GIS is heading towards this end, if the history of GIS is any indicator e.g. the migration from the mainframe to the desktop personal computer (PC) to the browser and the portable device. This growth is represented fiscally too, with GIS revenues accounting for circa US$1,047 million in the year 2000 (Daratech 2000,p3). There are now hundreds of software houses developing GIS applications based on their own or third party technologies. The advent of GIS toolkits or component based developer applications has, no doubt, helped fuel this trend. End users are also developing their own applications too, via macro or scripting languages and through the customisation of existing GIS. The upshot is that there are a large (and growing) number of systems and file formats in circulation.
There are, however, increased requirements for such systems to communicate with each other – both at system and data levels. In the United Kingdom, Central Government has established several ‘e-Government’ initiatives, such as the e-GIF (Government Interoperability Framework), in which Public Services are expected to interoperate and make better use of Information Technology (I.T.) within, across and beyond constituent organisations (Stationery Office 1999). This trend is not just restricted to the UK, it is evident in other economies too. It may also be seen in the private sector, where the recognised value of spatial information has increased through many channels. The rise in popularity and performance of the Internet Browser and associated services has also contributed to this growth. Such initiatives and demands are placing new pressures on existing GIS infrastructure. As such the importance of distributed or heterogeneous GIS will grow. The following paragraphs highlight the limitations of current practices and the potential of broader based GIS.
Proprietary GIS (Homogeneous model)The majority of GIS in operation today follow an homogeneous or proprietary approach. In the simplest form the software would be sourced from a single vendor and perhaps use a single operating system. Within this model, an organisation would also, typically, adopt an associated vendor file format as an internal standard – perhaps through necessity. Whilst there is nothing ‘wrong’ with adopting this approach, several limitations occur. Functionality is restricted to that of the incumbent system, data standards can be ruled by de facto standards which are difficult to share with other non-aligned software and dependency is placed upon specific service and software service providers. Many ‘common’ GIS have their own associated, proprietary formats which have served their parent systems for two decades or more e.g. SHP, MIF, DXF, all be it in a myopic way. These formats tend to require ancillary software and human processing so that data can be shared across systems. This is costly and time consuming as well as being open to error.
Whilst many datasets are available in proprietary formats, such formats are not ratified by any central or organising body. The reality of translating, importing or exporting such data between different systems is more often than not a harsh one. To accompany institutional limitations, there are also technical limitations attached to de facto data standards. For instance, some may have to store geometries, such as points, lines and polygons separately. Some may not be capable of storing or manipulating topology. Some may only be capable of storing two-dimensional data, and so on. Users are also dependent upon the authoring vendor to not alter (or drop) the format specification. There are advantages to this model with regard to dealing with a single supplier and system management and maintenance. Users need only learn about one product (range) and data can be moved between related systems without much effort. However, once an organisation is required to interact with another and between non-aligned systems the rigidity of this approach is exposed as a weakness.
Distributed GIS (Heterogeneous model)In an attempt to alleviate the restrictions imposed in the proprietary model (above), tools and technologies are being developed to encourage and enable distributed or heterogeneous GIS. The term heterogeneous refers to a computing environment in which a variety of software and hardware co-exist and interact. In this environment, users are not restricted to specific vendor systems or formats, nor are they necessarily aware of the diversity of the system from an end-use perspective. This, is interoperable GIS.
The Internet (World wide Web) is often cited as an heterogeneous environment due to the diverse range of hardware, software and data which construct its form. Within such an environment there are common metrics and protocols which bind these disparate entities together. In the case of the Internet, HTML (Hyper-text Mark-up Language) and TCP (Transmission Control Protocol) are the common metrics and are borne through Internet Browser applications. Within the GIS community the benefits of interoperability are growing in acceptance. The major benefit being the ability to distribute and then combine disparate data to produce new combinatorial datasets, regardless of the original format. Therefore promoting information exchange across organisations. An example of such a model is presented.
ExampleA data supplier hosts a variety of datasets across numerous servers. These databases are accessed directly by client applications across a network or through Internet Browsers. The clients communicate with the data servers through a common, de jure metric which allows a uniform supply of data to each client – regardless of their origin but dependent on the implementation of the same metric.
For this to work in practice, the emphasis is on the interface between systems. Since 1999 the OpenGIS Consortium (OGC) has been researching the feasibility of such heterogeneous systems. The aim of the OGC Web Mapping Testbed Phase 1 (WMT1) was to “increase users’ ability to access and overlay maps and earth images available form other vendor’s Web map servers. ” (www.opengis.org) . Within this and the subsequent OGC Testbeds, many data providers and curators have successfully combined disparate data and systems via the Internet.
Figure 1a. Cadcorp SIS – Spatial Information System user interface to connect to an OpenGIS Web Map Server as a client.
Figure 1b. Cadcorp SIS – Spatial Information System displaying an ECW file from a remote OpenGIS Web Map Server, as selected in Figure 1a.
To enable the practices described above, several standards have been designed and implemented. The OGC have several ratified interfaces, which 230+ organisations have been party to. These bodies include large GIS vendors and users who recognise the importance of interoperability. These interfaces standardise the semantics and the functionality delivered by any implementing software. Furthermore, software can be submitted for conformance testing by OGC. They also provide an open and stable framework for future application and service delivery. An important branch of this standardisation process has been the distribution of vector (and raster) data over the Internet. This has produced the Geography Mark-up Language (GML) which is used to encode real world features in a modified version of the W3C eXtensible Mark-up Language (XML). GML originated from OGC’s WMT1 project. Systems interpret this neutral information through OGC specified interfaces, ensuring standard delivery between systems and organisations. Like HTML and XML before it, GML will no doubt be used for many, as yet, unforeseen purposes.