I'll start off by saying I am not a mathematician, nor have I studied maths for many years, so please keep the answers simple :)
I am working with data that has a geographical context. So assume I have data about an area 10km by 10km. My data is split into 1km by 1mk tiles, so I have 100 rows of data. 1 row for each km square in my area.
Values in my data include the following, to name a few:
Number of buildings
Number of addresses
Number of postcodes
% ground area covered by buildings in the km square
I am creating some cloropleth maps to give an overview of the whole area. I am symbolising the data using standard deviation so I can show which km squares are above and below the mean.
This works well for things such as building counts, so I know if a single km square is above or below the mean for the whole 100km area.
My question is, am I able to calculate the standard deviation for my % values? i.e. % ground area covered by buildings, % ground area covered by roads etc... Will this give me a sensible result?
Or would it be best to calculate the standard deviation on the actual area covered by these features in each km tile?