-->

Cluster and Outlier Analysis (Anselin Local Morans I) Arcgis

Cluster and Outlier Analysis (Anselin Local Morans I) arcgis


You will first examine the data generally to determine if median household income is clustered or dispersed in Middlesex County, then you will run local statistical tests to see where clusters exist.

1. Open ArcMap and a blank map.
2. Add the data layer, “medianhousing” to the map from your folder on the desktop.

In the following exercises you will use the default setting for spatial weights in each tool, however you also have the option of creating a spatial weights matrix using this tool: Spatial Statistics > Modeling Spatial Relationships > Generate Spatial Weights Matrix. If you are familiar with network analysis you can construct a spatial weights matrix based on your road network using the Generate Network Spatial Weights tool.

Calculating General Clustering

You will first run the Getis-Ord General G statistic to see if areas with a high or low median income are clustered.

1. Open the toolbox if is it not already open by clicking Geoprocessing > ArcToolbox.
2. Expand Spatial Statistics Tools.
3. From the Analyzing Patterns tools, select High/Low Clustering (Getis Ord General G).
4. For the input feature class, enter “medianhousing”. For the input field, select “Median_val”.
5. Check the box next to Generate Report.
6. Keep the Conceptualization and Distance methods as the default. Feel free to experiment with other conceptualizations after you run this analysis.
7. Click Ok.


ArcMap warns you that it used the distance of 12597.9 meters for the distance of the search threshold. By default, ArcMap will use the minimum distance to insure all features have at least one neighbor. You can change this distance to make the analysis more appropriate for your project. The larger the value, the more neighbors that will be considered during the calculation.

8. Once the test has finished running, click Geoprocessing > Results (in the top menu bar)
9. Expand Current Session and double click Report File: GeneralG_Results.html.


Q: What is the G score? What is the Z score? What does this tell you?
A: Because the Z score is highly significant, the median household income in each tract is significantly more clustered than a random pattern.

Next you will run a spatial autocorrelation test to see if the general pattern of features is clustered or dispersed (as opposed to clustering specifically of high or low values).

10. Select Spatial Autocorrelation from the Analyzing Patterns menu and input the same information as you did for the General G test.
11. Check the box for “generate report”
12. Once the test has finished running, expand the Spatial Autocorrelation category under Current Session in the results window and double click on the report, MoransI_Results2.html.
13. Double click on the HTML report.



Q: What are the results? What does this tell you?
A: The median housing values are clustered at a significance level of p<.001. This means that there is clustering of similar values, but the test does not tell us which values are clustered (positive or negative).

Calculating Local Statistics

While the General G and Spatial Autocorrelation tests told you something about the overall pattern of median housing values in Middlesex County, you will now use local statistical tests to learn more about where clusters are located.

Local Moran’s I


1. Expand the Mapping Clusters tool set, choose Cluster and Outlier Analysis (Anselin Local Morans I).
2. Use the same Input Feature Class and Input Field as you did for the two previous tests.
3. For the Output Feature Class, click on the folder icon and save the file to your desktop folder, using a name that makes sense to you.
4. Click Ok.


A map showing the types of significant relationships is added to your map.


Q: Are there any clusters or outliers present?
A: You will notice a large cluster of low-low values, a large cluster of high-high values, and a polygon with a high-low relationship.

HH indicates that the relationship is High-High. These tracts are surrounded by tracts with high median housing values and their values are significantly higher than their neighbors. This indicates that although there are high median housing values in this area in general, there are significantly higher values in these census tracts. The inverse is true for the Low-Low relationships. For the tract that has a significant High-Low relationship with its neighbors, median housing values are high in this tract and it is surrounded by tracts with significantly lower housing values.

5. Right click on this layer name in the table of contents and select Open Attribute Table. Scroll through the table and notice the fields for Z-score and P-value. If you were interested, you could map these values as well.


TAG

عن الكاتب :

ليست هناك تعليقات

إرسال تعليق

الاسم

بريد إلكتروني *

رسالة *