I propose that many errors may arise from the incompatibility of different datasets. What can we do in this case? I suggest two approaches to solving this problem (you can find these articles on my ResearchGate page):
Eppelbaum, L., Eppelbaum, V. and Ben-Avraham, Z., 2003. Formalization and estimation of integrated geological investigations: Informational Approach. Geoinformatics, 14(3), 233-240.
Technological advances are delivering increasing amounts of geographic information and greater geometric precision in data collection. However, using these data may imply inadequate scales and forms of representation.
To resolve this problem with "geo big data", it is necessary to find ways to improve the management of metadata (open data), to use standard data formats (OGC), and even to carry out generalization with open-source libraries or algorithms able to generalize and adapt different data sets to the uses proposed by each user and each place (a very complex task...); see the short sketch after the reference below.
As an example, I propose a contribution to the study of precise measurement of changes in the occupation area of endemic and rare plant species, using methods of spatial sampling, scale change and cartographic generalisation to support sustainable urban planning:
Zaragozi, B., Gimenez, P., Navarro, J.T., Dong, P. and Ramón, A., 2012. Development of free and open-source GIS software for cartographic generalisation and occupancy area calculations. Ecological Informatics, 8, 48-54.
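To make the "standard data formats (OGC)" point a little more concrete, here is a minimal sketch (assuming GeoPandas is installed; the GeoPackage file name is hypothetical) of the kind of metadata checks and open-source generalisation steps such formats make straightforward:

```python
# A minimal sketch (assuming GeoPandas is installed; "species_plots.gpkg" is a
# hypothetical file name) of checking compatibility metadata and applying a
# simple open-source generalisation step.
import geopandas as gpd

gdf = gpd.read_file("species_plots.gpkg")   # an OGC GeoPackage

# Metadata that determines whether two data sets can be combined sensibly:
print(gdf.crs)            # coordinate reference system
print(gdf.total_bounds)   # spatial extent (minx, miny, maxx, maxy)

# A simple generalisation step (Douglas-Peucker, 10 map-unit tolerance):
gdf["geometry"] = gdf.geometry.simplify(tolerance=10, preserve_topology=True)
```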
Spatial data sets are produced at multiple scales to satisfy different requirements. We normally expect higher accuracy and a higher level of detail (LoD) from larger-scale (or higher-resolution) data sets. If heterogeneous data sets are combined in this respect, the results will be uncertain, for example in spatial analysis, in both geometric and semantic terms. For further information you may refer to the relevant books at the following links.
Data sets compiled at different spatial resolutions have different levels of spatial precision. If you compare, say, two rasters with different resolutions, you cannot be certain that what the finer raster describes in a given pixel actually relates to what the coarser raster describes in the same spot, because the coarser raster's values come from a different support (i.e., the particular size, shape and orientation of the sampling units). The coarser pixel might have a value that is heavily influenced by a statistical outlier that actually occurs elsewhere in the pixel rather than in the area where it overlaps with the finer pixel.
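To illustrate the support problem, here is a minimal sketch (NumPy only; the grid sizes, the 3x3 aggregation factor and the outlier value are invented for illustration):

```python
# A fine raster and a coarse raster covering the same area, where the coarse
# pixel is measured on a larger support and gets dominated by an outlier.
import numpy as np

rng = np.random.default_rng(42)

# "Fine" raster: 9x9 pixels with values around 10.
fine = rng.normal(loc=10.0, scale=0.5, size=(9, 9))

# A statistical outlier in one corner of the top-left 3x3 block.
fine[0, 2] = 100.0

# "Coarse" raster: each pixel averages a 3x3 block of fine pixels,
# i.e. it is measured on a different (larger) support.
coarse = fine.reshape(3, 3, 3, 3).mean(axis=(1, 3))

# The fine pixel at (1, 1) looks ordinary, but the coarse pixel covering it
# is inflated by the outlier that sits elsewhere inside the same block.
print("fine value at (1, 1):     ", round(float(fine[1, 1]), 2))   # ~10
print("coarse value covering it: ", round(float(coarse[0, 0]), 2)) # ~20
```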
Similar issues exist with vector data. They are compiled to different levels of spatial detail, which still translates into a spatial resolution (e.g., how close together, on average, the vertices in the data are). A given river will be a complex line if modeled at high resolution, and considerably simpler and straighter at low resolution. Both lines model the same real-world river, but they have very different lengths, sinuosities, etc., and different levels of uncertainty/error. Thus, if you measure something related to river length (e.g., a water discharge rate) from the high-resolution line and relate it to something from the low-resolution line (e.g., a historical discharge rate measured from older, coarser data), you will be comparing numbers whose error brackets may differ widely. The margin of error of one, for example, might be much bigger than the resolution, let alone the margin of error, of the other!
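A rough illustration with a synthetic line (assuming Shapely and NumPy are installed; the sine-wave "river" and the simplification tolerance are arbitrary choices):

```python
# The same feature stored at two levels of detail has very different lengths.
import numpy as np
from shapely.geometry import LineString

# A meandering river digitised densely (high-resolution capture).
t = np.linspace(0, 10, 500)
river_fine = LineString(np.column_stack([t, np.sin(3 * t)]))

# The same river as a coarser map might store it (Douglas-Peucker simplification).
river_coarse = river_fine.simplify(0.5)

print("vertices:", len(river_fine.coords), "->", len(river_coarse.coords))
print("length:  ", round(river_fine.length, 2), "->", round(river_coarse.length, 2))
# Anything derived from length (sinuosity, a discharge relationship, ...)
# inherits this scale-dependent difference.
```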
In general, the appropriate procedure when you must use data across different levels of resolution is to generalize the higher-resolution data to the level of detail of the coarsest dataset you must use, even though exactly how to do that is not always straightforward.
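For rasters, one common way to do that generalisation is to block-aggregate the finer grid to the coarser support before any comparison. A minimal sketch (NumPy only; the grid sizes, the factor of 3 and the mean as aggregation rule are assumptions, and a majority rule would suit categorical data better):

```python
# Generalise the finer raster to the coarser support, then compare like with like.
import numpy as np

def aggregate_to_coarse(fine: np.ndarray, factor: int) -> np.ndarray:
    """Block-average a fine raster so it matches a grid `factor` times coarser."""
    rows, cols = fine.shape
    assert rows % factor == 0 and cols % factor == 0, "the grids must nest"
    return fine.reshape(rows // factor, factor, cols // factor, factor).mean(axis=(1, 3))

rng = np.random.default_rng(0)
fine = rng.normal(10.0, 1.0, size=(12, 12))   # e.g. a 10 m resolution raster
coarse = rng.normal(10.0, 1.0, size=(4, 4))   # e.g. a 30 m resolution raster

fine_generalised = aggregate_to_coarse(fine, factor=3)  # now on the 30 m support
difference = fine_generalised - coarse                  # a like-for-like comparison
print(difference.shape)                                 # (4, 4)
```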