Beruflich Dokumente
Kultur Dokumente
Helen Freimark
(Specials)
Andreas Wehrle
(Content)
Content
1. Geodata Structures and Data Models ...................................................................................................... 2 1.1. Geodata Structures and Data Models ............................................................................................... 3 1.1.1. Definition of Geodata Structures and Data Models .................................................................. 3 1.2. Rasterdata Structures ........................................................................................................................ 6 1.2.1. Rasterdata Structures ................................................................................................................. 6 1.3. Vectordata Structures ...................................................................................................................... 10 1.3.1. Structures ................................................................................................................................. 10 1.3.2. Special types ............................................................................................................................ 13 1.4. Vector Data vs. Raster Data ........................................................................................................... 16 1.4.1. Differences ............................................................................................................................... 16 1.4.2. Different use of the data in GIS .............................................................................................. 16 1.5. Data attributes ................................................................................................................................. 18 1.5.1. Data attributes .......................................................................................................................... 18 1.6. Data symbology .............................................................................................................................. 20 1.6.1. Use of symbology .................................................................................................................... 20 1.6.2. Get existing symbology ........................................................................................................... 21 1.7. Metadata .......................................................................................................................................... 22 1.7.1. About Metadata ....................................................................................................................... 22 1.7.2. Metadata Standards .................................................................................................................. 23 1.7.3. Metadata in GeoVITe .............................................................................................................. 25 1.8. Summary ......................................................................................................................................... 27 1.9. Recommended Reading .................................................................................................................. 28 1.10. Glossary ........................................................................................................................................ 29 1.11. Bibliography ................................................................................................................................. 31
This lesson provides an introduction to the various geodata structures and related data models. After providing definitions for data structures and data models, it lists the most common geodata structures, their properties, examples, use cases and some products that rely on a certain data structure. Furthermore we introduce the meaning and use case of metadata and what metadata we deliver with the geodata we provide. Learn about Geodata Structures and Geodata Models Learn about Rasterdata Structures Learn about Vectordata Structures Learn about Differences and Use Case of Raster- and Vectordata Learn about Topology Data Structures Learn about Non Graphical Attributes (Thematic Data) Learn about Symbolization Learn about Metadata
A geodata model is an abstract, artificially created mapping of a part of the real world relevant to a geoinformatics project. The goal of
geodata modeling is to map the relevant conditions and processes in the real world to geodata structures. A data model not only describes the content, properties and data structures, but also rules and relations between the entities of a data model.
2
As geodata structure we can define the logical, internal data organization of our geographic information, the means of representing a real-
life entity inside a geodata model. Data structures should enable data storage and data management, as well as quick retrieval of the data. Unique identifier, links, relationships and dependencies help to build consistent and normalized data structures and enable links within the dataset or to external data sources.
As geodata structures we can define the logical, internal data organization of our geographic information, the means of representing a real-life entity inside a geodata model. Data structures should enable data storage and data management, as well as quick retrieval of the data. Unique identifier, links, relationships and dependencies help to build consistent and normalized 5 data structures and enable links within the dataset or to external data sources. The efficiency of a GISystem directly relates to the usage of efficient data structures. It usually comes down to a trade-off between optimized data storage (small file sizes) and quick data access. Completely normalized and compressed data structures are not always the most efficient ones.
Data Normalization is the process of removing redundancy in data sets through dividing the data sets in to relations, linked through
identifiers. The result of a normalization process not only leads to more efficient data storage (smaller files), but also facilitates geodata updating. One distinguishes five different normal forms (NF1 to NF5) with various levels of redundancy removal.
As Geodata Update we can define the process of data appending or the replacement of existing data to reflect changes in the world or the
model the data is derived from. Special data models must be taken into account for temporal GIS functions. Unfortunately these temporal GIS functions are still experimental and not yet part of commercial GIS systems.
Following is a list of attributes that are relevant when dealing with raster data
Disadvantages of Raster Data Structures Non-adaptive data structure tends to generate huge files, depending on resolution cell arrangement is usually random and does not respect natural borders limited interactivity and more primitive analysis algorithms
Raster data structures are commonly used in the following scenarios: Usage Scenarios of Raster Data Structures Photos Photogrammetry and Remote Sensing Scanned Images of Maps Terrain Modeling Landcover Analysis Hydrologic Modeling and Analysis General GIS surface Modeling and Analysis for continuous surfaces
1.3.1. Structures
Vectordata represents geographic objects with the basic elements points, lines and areas, also called polygons. Every element in a vector modell is described mathematically and bases on points that are defined by cartesian coordinates. The ordered accumulation of points results in an object that may be a point, a line or a polygon. A line is based on two points; a start and an end point. According to this, a polygon is a closed accumulation of line elements. Complex elements, such as curves, splines or circles are definded exactly by means of these base elements.
Vectordata not only contains the geometry 10 of a point; depending on the model, it can also include topology
11
or neighborhood relations, e.g. areas next to a line or start and end point of a line. Different type of models allow the storage of vector data: Spaghetti model The spaghetti model is a simple model that stores the data in an unstructured way. This model just stores the name of every object, followed by the coordinates the object is composed of. Since the objects are not related to each other, no topological information is included and the consistency can not be verified. The spaghetti data model has a simple structure, every object is described independently of the others. The same coordinates may appear several times, therefore it needs large amounts of storage space. On the other hand, an advantage of this data structure is the possibility to modify every object without affecting the others.
10 11
In GIS, the geometry describes the form and situation of an object, but not the relation to other objects, as it does the . The topology defines the situtation and arrangement of geometrical objects. The metrical relations are irrelevant, just the relation between
the objects is important. A topologic map shows only the logical connections of objects and not the exact situation or dimension. For example, a bus plan ist a typical topologic map; it shows the connections between various points (bus-stops), but not the exact position.
10
Topological model Another model is the topological model. As new elements, this model introduces nodes and edges. A node is a distinguished point that connects one or several arcs. An edge is a line composed by a start and an end node. Every object is composed by a less complex object. For example, a polygon is the composition of several arcs which are defined earlier. The advantage of this model is the topological information it is containing: every object includes information about the elements it is related to. Since every geometrical is only stored once, there is no redundancy.
Simple Features The Open Geospatial Consortium 12 (OGC) defiines a standard that specifies the digital storage of geographical data (point, line, etc.) with spatial attributes as well as non-spatial attributes. This specification, called Simple Features also includes spatial operators that can be used to generate new geometries based on existing geometries.
12
The Open Geospatial Consortium (OGC) is an international organisation with more than 300 governmental, non-profit, research and
commercial member organisations. Its goal is the development and implementation of standards for geospatial contents and services.
11
12
13
Networks Another strength of vectordata is the ability of defining logical networks. Lines can be directed and for this reason, flows of any type may be calculated. For example, it is possible to calcualte water currents with by defining the direction of the flow, capacities, taps, etc. Using the proper tools and a well-defined subelements, it is also able evaluate errors in the network construction. Another application areas are supply management, pipeline network, route calculations, etc.
14
Network Model
Advantages of Vector Data Structures Small amount of data Easy to update Logical data structure Attributes are combined with objects Preserves quality after interactivity (e.g. scaling) More sophisticated in spatial analysis
Disadvantages of Vector Data Structures Continous data is not represented effectively Spatial analysis and filtering within polygons is impossible Needs a lot of manual editing to get good quality It always introduces hard boundaries Unable to model uncertainty or
Usage Scenarios of Vector Data Structures CAD, technical drawings Street or river networks, cadastral maps Network analysis Cartography
15
1.4.1. Differences
Vector data sets and raster data sets are both important in a GIS. Every one has its strenths, therefore, it's counterproductive to use just one form of these datasets. An important difference between these two data sets is noticeable in the visualisation of the data set. The graphical representation of raster or vector data results in raster graphics 14 or vector graphics 15. In a raster data set, the attributes are associated with a raster cell, therefore, the appearance is only changeable by modificating the color value of the raster cell. In comparison, vector data stores the information associated with the object, the appearance may be modified easily by adding graphical components to every object. In a GIS the difference between a raster or a vector data set could look like this:
Visualised as rasterdata
Visualised as vectordata
14
Raster graphics is the combination of raster data with graphical attributes. It is only variable in the color of the raster cell. Usually, it Vector graphics is the combination of vector data with graphical attributes. Various attributes can be modified, a polygon may vary in
its outline color and thickness, hatching, etc. Common formats are .svg, .dxf, .shp, .pdf.
16
Orthophoto
Relief
Crop Evapotranspiration
Nautical Chart
(Massachusetts GIS)
(Massachusetts GIS)
(Massachusetts GIS)
(Massachusetts GIS)
Highway
Housedistricts
Schools
Latitud/Longitud Grid
(Massachusetts GIS)
(Massachusetts GIS)
(Massachusetts GIS)
(Massachusetts GIS)
Vector data may be converted easily into raster data; common software normally integrate this feature. In contrast, the conversion from raster to vector data is still solved unsatisfactorily; manual editing is mostly necessary.
17
By the means of these attributes, queries, analyses and the visualisation of spatial data is possible. On a road network, a concrete task could be the evaluation of every highway that was built before the year 1980. Datasets are usually separated by thematic content. As spatial data normally isn't free of charge, it is essential to get just the needed data and therefore, the data has to be separated thematically. Swisstopo 16, what disposes of lots of thematic data, offers the datasets separated by thematic levels. On the scale of the 1:25'000, Swisstopo offers the data by the following themes: Road network Railway network Other traffic Hydrological network Primary surfaces Buildings Hedges and trees Facilities Single objects
Every theme contains important information about various objects. E.g. the theme buildings contains the following information about every building that it is containing: ObjectId: Unique key to identify every object ObjectOrigin: Origin of the data YearOfChange: Year of the most recent amendment ObjectVal: Type of building, e.g. church, station, castle, glasshouse, etc.
16
Swisstopo is the competence centre of the Swiss Confederation responsible for geographical reference data and all products derived
from them. If offers a variety of spatial data, in raster form as well as vector form.
18
19
20
18
A Web Map Service (WMS) produces maps of geospatial information dynamically from geographic information. It is defined by the
OGC.
21
1.7. Metadata
This unit explains the term metadata, and shows the use of metadata and possibilities to standardise it.
Metadata exist also about geodata. The more complete metadata exist about a geodata, the easier is it to find a concrete dataset. The following attributes are typical metadata about geodata Year of data acquisition Datasource
19
Metadata describes other data ("data about data") by defining attributes such as year of creation, author, included area, origin, etc. It
22
23
24
25
26
1.8. Summary
In this lesson we learned a few things about data structures and data models of a GIS. We know the differences between raster and vector data and in which situation which datatype is appropriate. We also know, that thematic data is essential in a GIS and that the data attributes may vary depending on the theme. We are also aware of the importance of metadata in order to catalogue and find the datasets.
27
BILL, R.., 1999. Grundlagen der Geo-Informationssysteme 1. 4. Heidelberg: Herbert Wichmann Verlag.
Especially chapters 1 "Einfhrung in GIS" and 4 "Erfassung raumbezogener Daten" (german)
28
1.10. Glossary
data aquisition: As geodata aquisition we define the collection and recording of geodata for further processing. The process of data aquisition includes the recording of geometry (spatial information), date and time (temporal information) and any non-graphical related attributes (thematic information). data normalization: Data Normalization is the process of removing redundancy in data sets through dividing the data sets in to relations, linked through identifiers. The result of a normalization process not only leads to more efficient data storage (smaller files), but also facilitates geodata updating. One distinguishes five different normal forms (NF1 to NF5) with various levels of redundancy removal. geodata: As Geodata we can define every dataset that has a spatial aspect or component. Synonyms are "spatial data", "geographic data", "geographic data sets" or "GIS data". The syllable "Geo" implies that the dataset has a spatial component that allows to georeference the described phenomena to a location or region on the earth. geodata model: A geodata model is an abstract, artificially created mapping of a part of the real world relevant to a geoinformatics project. The goal of geodata modeling is to map the relevant conditions and processes in the real world to geodata structures. A data model not only describes the content, properties and data structures, but also rules and relations between the entities of a data model. geodata structure: As geodata structure we can define the logical, internal data organization of our geographic information, the means of representing a real-life entity inside a geodata model. Data structures should enable data storage and data management, as well as quick retrieval of the data. Unique identifier, links, relationships and dependencies help to build consistent and normalized data structures and enable links within the dataset or to external data sources. geodata update: As Geodata Update we can define the process of data appending or the replacement of existing data to reflect changes in the world or the model the data is derived from. Special data models must be taken into account for temporal GIS functions. Unfortunately these temporal GIS functions are still experimental and not yet part of commercial GIS systems. geometry: In GIS, the geometry describes the form and situation of an object, but not the relation to other objects, as it does the topology. GIS: As a Geo Information System (GIS) we can define a computer-aided system for geographic data management, modeling, analysis, simulation and presentation. A GIS is an organized collection of computer hardware, software, geodata and skilled operators. More powerful GISoftware usually utilizes modern database technology or builds on spatial databases. metadata: Metadata describes other data ("data about data") by defining attributes such as year of creation, author, included area, origin, etc. It helps to identify and select the proper product. Open Geospatial Consortium:
29
30
1.11. Bibliography
BILL, R., ZEHNER, M. L., 2001. Lexikon der Geoinformatik. Heidelberg: Herbert Wichmann Verlag. BILL, R.., 1999. Grundlagen der Geo-Informationssysteme 1. 4. Heidelberg: Herbert Wichmann Verlag. BILL, R.., 1999. Grundlagen der Geo-Informationssysteme 2. 2. Heidelberg: Herbert Wichmann Verlag. Fiona Ellis. Introduction to GIS - vector based GIS [online]. Available from: http:// www.sli.unimelb.edu.au/gisweb/GISModule/GIST_Vector.htm [Accessed 2006-06-27]. GDF Hannover. An introduction to the practical use of the Free Geographical Information System GRASS 6.0 [online]. Available from: http://www.gdf-hannover.de/lit_html/grass60_v1.2_en/node8.html [Accessed 2006-06-27]. Massachusetts GIS. Download free data [online]. Available from: http://www.mass.gov/mgis/dwnimgs.htm [Accessed 2006-08-08]. Michael Gertz. Spatial Databases [online]. Available from: http://dbis.ucdavis.edu/twiki/pub/ SpatialDB/SpatialDBHandouts/02-concepts2.pdf [Accessed 2006-08-07]. Orion S. Lawlor. Supporting Dynamic Parallel Object Arrays [online]. Available from: http:// charm.cs.uiuc.edu/papers/ArrayMigISCOPE01.www/ [Accessed 2006-06-13]. Rick Sethi. Conceptual Data Model, Example for an ER diagram [online]. Available from: http:// www.sethi.org/data-pirates/report/flowcharts/cdm.gif [Accessed 2006-01-05]. Swisstopo. Sample Data Vector 25 Landscape Model [online]. Available from: http:// www.swisstopo.ch/en/download/testdata/landscape/vec25 [Accessed 2006-08-09].
31