Beruflich Dokumente
Kultur Dokumente
IN A THREE TEIR GOVERNMENT NAME NWANKWO JOEL C MATRIC NO 980405049 COURSE SVY 822 LECTURE DR OLUSINA
Contents
CHAPTER 1 .................................................................................................................................................... 3 INTRODUCTION Spatial Data Clearinghouse................................................................................................. 3 What is a Clearinghouse?.......................................................................................................................... 3 Common Goals for a Clearinghouse ...................................................................................................... 3 Accessing Data in a Clearinghouse Sample configuration ...................................................................... 3 What Defines a Site /Node? ...................................................................................................................... 4 Logical Grouping: .................................................................................................................................. 4 Spatial Data Cleaning and Validation .................................................................................................... 4 Preparing Related Tabular Data ................................................................................................................ 5 Language ............................................................................................................................................... 5 Geo-processing ..................................................................................................................................... 5 Spatial Extent ........................................................................................................................................ 5 Field Names........................................................................................................................................... 6 Study Objectives ....................................................................................................................................... 6 Initiatives at the Federal Level .................................................................................................................. 7 Chapter 2: The Prototype Spatial Data Clearinghouse ................................................................................. 8 State Controlled Geospatial Data Clearinghouse...................................................................................... 8 Content ................................................................................................................................................. 8 Technology ................................................................................................................................................ 8 Management............................................................................................................................................. 9 Policy ......................................................................................................................................................... 9 Describing Data Sets with Standard Metadata ........................................................................................... 10 Search and Retrieval ................................................................................................................................... 11 The Search Form ..................................................................................................................................... 11 Clearinghouse Implementation & Reaction ............................................................................................ 12 How the Clearinghouse can be constructed ........................................................................................... 13 The Common Gateway Interface ............................................................................................................ 13 Other Options for Constructing the Clearinghouse ................................................................................ 13 Advantages offered by free WAIS-sf: ...................................................................................................... 13
Advantages offered by a Relational Database ........................................................................................ 14 Isite ...................................................................................................................................................... 14 Other Uses of the Clearinghouse ............................................................................................................ 15 CHAPTER 3 .................................................................................................................................................. 17 KEY FACTORS IN MANAGEMENT OF A CLEARING HOUSE .......................................................................... 17 The Number of Data Suppliers ................................................................................................................ 17 The Type of Data Accessibility ................................................................................................................ 17 The Metadata-Standard Used ................................................................................................................. 17 The Number of Spatial Datasets ............................................................................................................. 17 The Most Recently Produced Dataset .................................................................................................... 18 The Number of Web References............................................................................................................. 18 The Number of Monthly Visitors ............................................................................................................ 18 The Frequency of Web Updates ............................................................................................................. 18 The Languages Used ............................................................................................................................... 18 The Use of Maps for Searching ............................................................................................................... 19 Registration-Only Access......................................................................................................................... 19 Summary ..................................................................................................................................................... 19 Responding to data requests from Clearinghouse users. ....................................................................... 23 CONCLUSIONS ............................................................................................................................................. 24
VI.
VII.
VIII.
IX.
X.
Each business feature ID value should be unique across the map layer. Often the business feature ID will be determined by the business program/area, e.g., each mine has a unique identifier. If so, this value should be used as the feature business identifier. In all cases the business feature identifier should be the ID field, unless it is not unique. In this case define a unique ID field and use the business identifier as the Name field or as an additional attribute in the dbf. Each map layer should have a spatial index created on the feature type field, and for selectable map layers on the business feature ID field, and the business feature Name field.
Language Consider whether or not you will need to translate the data and field names into another language to support clients. If so, you may also want to create a separate warehouse based on language. Geo-processing Perform any geo-processing on the map layer that might aid in its rendering such as merging or dissolving on features. For example, if you want to conduct value rendering on a map layer of, say, elevation, you may first want to dissolve all the features so that each elevation has a unique record rather than multiple records. Spatial Extent Consider the spatial extent of the data. For example, if the shape file extent extends beyond the boundaries of the warehouse, you should clip the shapes to fit.
Field Names As noted above, field names in the .dbf and any related tabular databases should be carefully considered for clarity. For numeric fields, ensure the unit of measure is given.
Study Objectives
This report covers a study on how a state office can serve as a focal point and clearing house for spatial data to both government agencies and other states. The lab report is centered on the idea of a data cooperative where many tiers of government starting from the state would work together to share data and expertise for individual and mutual needs. The development of the Clearinghouse network has been motivated by a desire to minimize the duplication of effort in the collection of expensive digital spatial data and to foster cooperative digital data collection activities. The Clearinghouse and associated metadata documentation of geographic holdings, promotes the availability, quality, and requirements for digital data through a searchable on-line system and provides a primary data dissemination mechanism to traditional and non-traditional spatial data users. Possible participants can come from all levels of government, the private sector, the academic community, utilities, and the not-for-profit sector. The project research activities can involve participants from across the state and many additional participants from the national GIS community. Presentations and articles in regional user group newsletters can be used to inform the GIS community about the project activities and to invite participation. Work can be done with the relevant state and national agencies /department to improve the ability of any individual or organization to identify the existence of relevant data sets, to enhance understanding of the value of geographic information systems, and to explore the need for coordination of efforts to use GIS and spatial data in New York State. The project team identified a set of deliverables that reflected the proposal submitted by DEC, and which, to the extent possible, also addressed some of the charges to the NYS Temporary GIS Council which was getting underway at the same time. The study is focused on two areas. The first was the development of a prototype designed to demonstrate the efficacy of an on-line clearinghouse of metadata and spatial data sets. The Clearinghouse would be available to public, private, academic, and non-profit users as a mechanism to share data. The federal metadata standard was adopted for use in the prototype Clearinghouse. In a parallel project activity, the federal standard for metadata was further analyzed for its usability to support data sharing. The second focus was to review the literature and work with the GIS community to gather data on the value of GIS as a decision making tool, to identify effective approaches to assessing costs and benefits, to identify barriers to sharing and coordination of GIS activities, and to gather information and recommendations from the community regarding the future coordination of GIS in New York State. Within this larger framework, the project team pursued three specific objectives:
1. Demonstrate the value of GIS by examining exemplary applications and existing evaluation approaches. 2. Identify barriers to sharing spatial data and explore potential solutions for overcoming those barriers. 3. Investigate some practical tools to support GIS coordination in New York State. This paper reports on the project efforts associated with the development of a web based prototype data repository as a mechanism to support GIS coordination in New York State. An overview of the larger environment within which the NYS Clearinghouse exists, the functionality of the Clearinghouse and the underlying technical structure are presented. Implementation alternatives and recommendations for implementation are also presented. (Please see Appendix A for a related products list.)
Content
The content of the state controlled Geospatial Data Clearinghouse is information about all sets of geospatial data. Information about data is known as metadata. Because this data deals with geospatial data then this metadata is called as geospatial metadata. Metadata required by the Geospatial Data Clearinghouse consists of organization metadata, collection metadata and inventory metadata. Organization metadata is metadata that consists of information about the data producer organization. Collection metadata is metadata that contains information about a collection of data. And finally the inventory metadata is metadata that refers to information about all collection data in detail. To develop metadata for the National Geospatial Data Clearinghouse, a content standard of metadata is necessary. The study model, in this case, has decided to adopt the Federal Geographic Data Committee metadata standard called 'Content Standard for Digital Geospatial Metadata', as the content standard of metadata. This adoption means that all data producers have to use this standard in developing their geospatial metadata. To implement this standard an application has to be developed to generate the geospatial metadata information and present it on the web.
Technology
The National Geospatial Data Clearinghouse consists of a number of metadata servers that are interconnected forming a network. These metadata servers belong to data producer organizations and are called network nodes. Within this network, there is an additional server that is operated as a network gateway that allows connections to other networks. This server is called metadata gateway server. This network is developed on the TCP/IP protocol and is connected to the internet. The connection to the internet is considered so that all metadata servers that are located in all parts of Indonesia can be connected into the network and all metadata users can access the network from internet. Metadata server consists of metadata server applications that are designed to generate metadata database and transmits the server to the internet. This metadata database is generated using Geo Profile as a reference of technical specification of metadata standard with Z39.50 protocol. The type of metadata information that was stored on the server is a collection metadata and or an inventory metadata. This metadata server should be developed and maintained by any institution that produces the geospatial data.
The metadata gateway server contains the registry of all node servers belong to this clearinghouse network system. In addition, this server also has the information of institutions metadata servers who register their servers. Besides, this gateway server consists of internet based applications that are designed to search and retrieve metadata information that was stored in metadata database on the node servers. This Search Engine is developed using Z39.50 protocol. Users can access metadata information on the node server through metadata gateway server on the internet. In relation to this concept, Indonesia has been developing an application to implement this metadata gateway which is called the DDSN. The database system adopted by the National Geospatial Data Clearinghouse is a distributed database system. All data producer should develop geospatial metadata databases under their responsibility. Each metadata database will be collected, stored and maintained in each data producer. The development of the geospatial metadata in each data producer should ensure that this comply with the content metadata standard adopted by the National Geospatial Data Clearinghouse. The use of protocol Z39.50 allows the National Geospatial Data Clearinghouse to be promoted into the world metadata community networks by making links to the regional and global clearinghouse. In addition, this protocol allows the implementation of the distributed database system. In this case the Geospatial Data Clearinghouse will be linked to the regional network called the Regional Spatial Data Infrastructure (RSDI), and to the global network called the National Spatial Data Infrastructure (NSDI) in which all other states are members. The Geospatial Data Clearinghouse then becomes an integrated national node to APSDI and GSDI clearinghouses.
Management
In principle, the Geospatial Data Clearinghouse is managed by the stakeholders consisting of all geospatial data producers and users in Indonesia. The management of the Geospatial Data Clearinghouse is implemented through the three organizations established for this purpose. These organizations are established by the stakeholders. These organizations consist of the Permanent Committee, Metadata Gateway and the Geospatial Metadata Centers. The Permanent Committee deals with activities in directing, administering controlling, and monitoring the existence of the Geospatial Data Clearinghouse. The National Metadata Gateway is an inter-institution body that is established to develop, maintain and operate the l Metadata. The Geospatial Metadata Centers are a unit in data producer institutions that is established to develop, maintain and operate a metadata server within each institution.
Policy
The Geospatial Data Clearinghouse will be served as a metadata gateway for all geospatial metadata produced in the country. This means that all metadata users can only access national geospatial metadata through this focal Geospatial Data Clearinghouse. To improve service for the users, the National Geospatial Data Clearinghouse will be linked to the RSDI and NSDI clearinghouses and become an integrated node of these clearinghouses.
The Spatial Data Clearinghouse will be developed in concert with the Federal initiatives to facilitate the exchange of spatial data among members of the national GIS community. The Spatial Data Clearinghouse will be available on the Internet and is designed to increase the value of spatial data through sharing. As part of this national effort, the Clearinghouse provides a mechanism for potential users of spatial data to determine whether data sets they need are already available or under development. This means of improving access to and sharing of spatial data has the potential to lower the cost and greatly increase the use of these data through out the nation. The Clearinghouse is unique among state Clearinghouses for two primary reasons. First, it is a statewide resource available to support the sharing of spatial data for all sectors. State agencies, local governments, non-profits, academia, utilities, and the private sector may all use the Clearinghouse as a mechanism for sharing their spatial data as well as for identifying useful spatial data. Second, the Clearinghouse is the only state clearinghouses which offer spatially searchable standard metadata for the three tiers of government. The primary purpose of the Clearinghouse is to allow producers of geographic data to describe what data sets they have available and to allow users of Geographic Information Systems to find the data sets they need. Once the appropriate data set is located, the system provides information on how to obtain the data files, including an option for immediate online transfer of the files using ftp. Users who access the Clearinghouse may search for available data and review detailed descriptions of the data. Once a data set of interest is identified, information is provided on how to obtain the data files. For some data sets, an option for immediate on-line transfer of the files is available using standard file transfer protocol (ftp). The URL for the prototype Spatial Data Clearinghouse is: http://www.xxxxx.html. Figure 1 shows the World Wide Web home page for the Clearinghouse.
I.
II.
III.
Standard metadata provide a common language for GIS users to describe data sets. Since one goal is to facilitate the exchange of data sets among a large number of independent organizations, a common language for describing the data sets is needed. Standard metadata help ensure that the data sets will be described thoroughly. The metadata template identifies mandatory and suggested fields; consequently, it serves as a guideline to those describing the data sets. The metadata can serve as an important tool for an organizations internal documentation as well as for data set exchange. Standard metadata offer the opportunity for automation. Once the metadata are standardized, software can be developed for creating, collecting, and searching the metadata on the Internet.
II.
III.
General Query (optional): Words listed in this input box are matched against the entire metadata document. Desired words can simply be listed in the input box, separated by spaces, or more complex Boolean searches can be constructed using AND, OR, and parentheses. Righthand truncation is also permissible with the use of an asterisk, so that sch* will find all documents which contain words beginning with "sch."
Is freely available; Is easy to learn and implement; It can search records in many different formats, such as text, HTML, gif, etc.
Isite
The Clearinghouse for Network Information Discovery and Retrieval (CNIDR) has developed software called Isite (http://vinca.cnidr.org/software/Isite/Isite.html ), designed to incorporate some of the benefits offered by both free WAIS-sf and relational databases. Isite uses the Z39.50 protocol for communications between the client and server. On the server side, it includes its own text indexer and search engine. Alternatively, it includes an application program interface (API) which can be used to perform commercial database searches. Spatial Data Clearinghouse Technical Information Environment The Spatial Data Clearinghouse exists as part of a larger spatial data clearinghouse environment. The National Spatial Data Clearinghouse (NSDC), an initiative of the Federal Geographic Data Committee, is a network of virtual and physical repositories of spatial data available over the Internet. Hardware The Clearinghouse can be implemented within CTG on a DECstation 5000 running Ultrix 4.2, with 10 gigabyte of storage and 8 gigabytes of memory. Contents Forty-nine spatial data sets are described in the Clearinghouse with metadata which conform to the Federal Standard for Digital Geospatial Metadata. The Federal Metadata Standard was adopted as the standard for the Clearinghouse. Organizations interested in submitting data were provided with a packet containing information about the standard and a template of the standard on disk with embedded html code. The template was provided to support the collection of the metadata. In addition to providing
metadata some users provided both the data set and a image of the area represented by the data set. Structure The figure on the following page represents the structure of the prototype Clearinghouse
Recommendations for Implementation The prototype Clearinghouse is a central site for storing metadata and some spatial data sets. A centrally managed site can offer savings, but we also believe that data management must remain in the hands of the data owners. Although it may be appropriate for a central site to contain metadata, the data sets themselves do not need to be stored at the same site. The size and quantity of the data sets may preclude central site storage. Information on existing Spatial Data Clearinghouse yields the following recommendations.
Fully automate as many aspects of the process as practical including metadata creation, editing, and collection; updating of all Clearinghouse web pages as metadata is added or changed; and data set transfer through the use of ftp and online order forms. Centralize the home page and technical support for the Clearinghouse. The Clearinghouse application and its technical administration can be managed most efficiently from a single site, avoiding the need for duplicative development and maintenance by multiple organizations. A central site also ensures a uniform presentation of the metadata and provides a principal forum for information exchange. Data set owners, who know the data best, should create and manage their own metadata. Metadata creation, update, and deletion needs to be the responsibility of each independent organization. The development of automated tools will make the independent management of metadata possible.
Organizations should house their own data sets and create their own ftp sites. The metadata can contain a link to the corresponding ftp site so that a simple click of the transfer option initiates the downloading of the file. Because the data location makes no difference to the user, several organizations can work together to form a cooperative ftp site. Clearinghouse efforts should be leveraged by incorporating additional information and functionality for the GIS community. Making the site a one-stop shopping center for GISrelated activities in a nation will improve upon the utility of the Clearinghouse, increase visits to the site, and ensure more widespread participation in its ongoing development.
spatial datasets together. 10 clearinghouses have more than 1000 datasets described (Australia, Austria, Canada, the Czech Republic, Japan, Mexico, South Africa, Switzerland, Uruguay, and the U.S.).
written in English (in addition, five of these are written in Arabic, Chinese, Greek, Japanese and Korean script). 29 clearinghouses use only their home language. These language problems reduce the accessibility to data (for English-speaking people).
Registration-Only Access
This characteristic describes the management and possible limitations of use. Before accessing the data, users must register themselves by entering personal details. This characteristic could have a negative impact on accessibility. For eight national clearinghouses, the user is required to register to access metadata or data (Canada, El Salvador, Finland, Hungary, Malaysia, Singapore, Spain, Canada, and Uruguay).
Summary
Information technologists can apply the environmentalists concept of reduce, reuse, and recycle to improve on the management of data. Members of this project sought to maximize the benefits of developing spatial data by reducing redundant data creation efforts, re-using existing expensive data sets, and recycling spatial data information. The recognition that organizational units can benefit from cooperative data development and exchange is part of an ongoing trend. It began when functional departments within organizations began to see themselves, not as separate entities with independent informational needs, but rather as part of an integrated whole which should work together to create an integrated information base. This trend progressed further as organizations realized the benefits of reaching out beyond their organizational walls, and began exchanging information with their economic partners, such as suppliers and customers. The participants in this project have extended the concept of information sharing one step further and have demonstrated the economic benefits of cooperatively developing and exchanging data among organizations which are essentially independent, but which have overlapping informational needs. The Internet-based Clearinghouse has proven to be a practical tool for fostering the desired cooperation and achieving the desired benefits. It is being used today with the tools currently available and will most certainly evolve into a more powerful instrument. Discussions are underway to transfer the Clearinghouse to a state agency, such as the State Library, for permanent operations. The focal Spatial Data Clearinghouse exists as part of a larger spatial data clearinghouse environment. The National Spatial Data Clearinghouse (NSDC), an initiative of the Federal Geographic Data Committee, is a network of virtual and physical repositories of spatial data available over the Internet. Hardware
The Clearinghouse can be implemented within CTG on a DECstation 5000 running Ultrix 4.2, with 10 gigabyte of storage and 8 gigabytes of memory. Contents Forty-nine spatial data sets are described in the Clearinghouse with metadata which conform to the Federal Standard for Digital Geospatial Metadata. The Federal Metadata Standard was adopted as the standard for the Clearinghouse. Organizations interested in submitting data were provided with a packet containing information about the standard and a template of the standard on disk with embedded html code. The template was provided to support the collection of the metadata. In addition to providing metadata some users provided both the data set and an image of the area represented by the data set. Structure The figure on the following page represents the structure of the prototype Clearinghouse.
1. Every geospatial data producer shall provide metadata for each of its data holdings. 2. Government, through the lead agency and in consultation with the NGDI Committee, shall establish electronic geospatial metadata catalogue and Clearinghouses in NGDI node agencies in partnership with those agencies
3. A custodian of a fundamental dataset must, not later than 30 days after updating, furnish all updates of the base dataset to the clearinghouse; the clearinghouse shall in turn inform the custodian(s) of the derivative dataset(s) within 7 days, in order to ensure synchronous maintenance of the fundamental and derivative datasets. There are also unique costs brought about by the requirement for a National Geospatial Data Clearinghouse. These Clearinghouse costs include: Internet connectivity and system administration. Server hardware purchase and maintenance. Metadata creation for new and existing data sets. Maintenance of inactive data sets. Metadata distribution.
CONCLUSIONS
Since 1994, the number of national clearinghouses has steadily increased to a total of 59. Looking at the trend of implementation, countries can expect to see additional national clearinghouses established. In fact, building clearinghouses is a global activity (with the exception of Africa and the Middle East (as well as Australasia and Oceania)). Most existing clearinghouses are established in Europe, Southeast Asia, and North and South America. The main initiatives for establishment come from Anglo-Saxon countries, such as the U.S., South Africa, and Australia. The U.S., in particular, which is supported by the FGDC, has stimulated many (American continent) countries to build a clearinghouse. However, 124 countries have still not shown any initiative to build one. There are several reasons for this. For example, a country may not have appropriate network architecture or there may be institutional bottlenecks for implementation. The differences in content, use, and management between the clearinghouses are broad. An example of such broad difference in content is the total number of accessible datasets described in a clearinghouse. In the U.S. clearinghouse, this number is 10 times as high as the total number of all 24 European clearinghouses. The reason for such difference is due to each countrys unique historical, institutional, economic, legal, technical, and cultural setting. Especially in Europe, there are great contrasts in the number of datasets, suppliers, visitors, Web references, and frequency of Web updates, probably as a result of the high institutional, economic, legal, technological, and cultural diversity within this region. However, similarities between clearinghouses do exist (for example, the type of data accessibility and the metadata-standard used). The most applied metadata-standard is the FGDC. However, looking to the numerous projects to apply the ISO standard, it is likely that ISO19115 will be the most applied standard in the future. This international consensus standard reflects FGDC, CEN, and other inputs. It provides detail that goes beyond FGDC and CEN metadata, including special coverage of raster and imagery information. Currently, there are several initiatives to create implementable subsets and extensions of ISO19115 so that conversion of FGDC-support tools and implementations to meet ISO conformance requirements are facilitated (Federal Geographic Data Committee Metadata Staff Coordinator 2001). Looking to the average number of data suppliers, Web references, and visitors, we can conclude that national clearinghouses are a popular facility to distribute and access spatial data. Finally, in the future, it is highly probable that many national clearinghouses will give access to spatial data itself and provide complementary services such as online mapping. However, a concern could be the low frequency of Web updates of several clearinghouses due to poor management. Therefore, special attention has to be given to keep clearinghouse managers motivated for having a well-managed clearinghouse. Based on the 12 characteristics used, we can conclude that Australia, Canada, Portugal, and the U.S. have the best existing national clearinghouses. Additionally, this Web survey shows that not
only the richest countries have good clearinghouses. Examples of relatively poorer countries with suitable national clearinghouse are El Salvador, Nicaragua, and Uruguay. Based on the above research, for all countries, it seems that one of the keys for successful clearinghouse implementation is high political support and interest by means of funding and long-term strategy.