Archaeological sites in Teesdale

The last set of exercises again involve County Durham, and will use data in two text files, tees_coords.txt and tees_details.txt, which constitute a collection of archaeological sites from the Teesdale District of County Durham. These will have to be turned into a shapefile using the method we used to make a shapefile from the locations of stone circles in Cumbria.

These can be displayed along with the map of the county districts distpoly.shp, and the following shapefiles will add other useful information:

Rivers.shp contains rivers of various sizes. It can be displayed satisfactorily in two ways, either using a graduated symbol in the legend, with Fc as the classification field (use increasingly thick lines for codes 6223, 6224, 6225, then a thin line for 6255, which is the code for lakes), or using a graduated colour, again with Fc as the classification field, and the blue colour ramp. Roads.shp needs a set of unique values, again based on the Fc field, which will distinguish between A, B, and minor roads (codes 3001, 3002, 3004).

Looking at the sites on their own, it's immediately clear that their distribution isn't smooth across the district. There's significant grouping and a strong tendency towards linearity in some places. When the rivers and roads are displayed as well, it's again clear that many of the sites are placed close to features of one or another of these themes, and this seems logical; both rivers and roads tend to follow river valleys, which also have attractions for settlement of various kinds.

Displaying the archaeological sites on top of the elevation data (which can usefully be shown as graduated colours, with min elevation as the classification field, and one of the terrain elevation colour ramps), there is an even more striking relationship between the locations of sites and the nature of the terrain. The highest areas have few or no sites, and there's an apparent division between prehistoric sites occurring on higher ground and medieval ones at lower elevations.

These immediate impressions may be a valuable stimulus to the formation of hypotheses, but one of the most useful functions of GIS software is that it gives us the ability to quantify the relationships which we think we see. Ultimately our hypotheses should be subjected to statistical testing, a subject outside the scope of this document. Here we will concentrate on investigating the basic spatial relationships between features of different kinds in the landscape, and it must be borne in mind that it is the spatial relationships which we are looking at. To take as an example the possibility that there is a difference in distribution of prehistoric and medieval sites with respect to elevation, if we decide in the end that there is a real relationship between elevation and the age of a site, it could be caused by a number of factors - perhaps there was a preference in the earlier periods for locating activities at a higher level, or perhaps later settlement was more dense at a lower level, or agricultural exploitation more intensive, so that prehistoric sites which did exist have been destroyed or concealed by later activity. GIS analysis may reveal spatial relationships, but it can't explain them.

We have already looked at simple analysis of spatial relationships, using the Select by Theme function, but we will now investigate another important GIS facility, the creation of buffers.

Buffers are polygons created around the selected features of a Theme, which can be used to select features of other Themes which fall within them. In this way they provide a facility similar to the Are Within Distance Of option used with Theme - Select by Theme, but they can be more complex than this. Buffers can be created as:

Buffering is an operation which can place considerable demand on your computer system. For larger datasets, as the operation starts you will be given information about how many features and vertexes are being buffered, along with a warning 'May take a long time'. Perhaps no two users of a GIS will agree about the definition of 'a long time', but for complex buffering operations you should certainly be prepared to wait for 10-15 minutes, and perhaps much longer, depending on the capacity of your system. As the operation proceeds a report will appear at the bottom right of the ArcView window, telling you how much (in percentage terms) has been completed, along with a blue bar which gradually lengthens, and there is a STOP button which allows you to abort if necessary.

Before you can create buffers, the map and distance units (both metres) must be set using View - Properties. If you don't do this the buffer function will be unobtainable.

For the first exercise we'll buffer the river shapefile, so add this to a View if you haven't already, then

At the end you will have in the View a new and complex set of polygons which outline those areas which are within 500m of any river in the area. If you display the archaeological sites over this you will get some impression of how may fall within the zone, but it's obviously much more useful to use another spatial analysis tool to extract the sites so that they can be examined more easily.

Activate the theme containing the archaeological information, choose Theme - select by Theme, and opt to Select features of active themes that Intersect the selected features of [name of your buffer theme]. The points representing sites within the buffer zone will turn yellow, and if you open the theme's table the corresponding records will be highlighted in yellow. Click on the name of the Period field, then use Field - sort Ascending, which will sort all the records into the order of their period, with the selected records still mixed up with the others. Now use Table - Promote, and the selected records will move to the top, but still sorted by period. This is one simple way to obtain a quantification of sites which are within 500m of a river, and if you just count the number of sites of each period, you should find that there are:

This operation could be repeated after sorting on the basis of Type, and promoting the selected records again, which would tell you, amongst other things, that in the buffer zone there are 8 chapels, 7 churches, 20 enclosures, 16 settlements, 6 shielings, and so on. It is, however, a rather laborious process, and there is a better way to do it.

First of all, convert the selected features of the theme containing the archaeological data to a new shapefile, then open its Attribute Table. With the Table active, use File - Export, which allows the data in the table to be saved in another format. In the export dialogue box choose the delimited text option, which will create a file of plain text with commas as the field delimiters. This file can then be read by almost any other software package, and if you open it in Excel (or another spreadsheet) you can use the Pivot Table facility to automatically generate a table showing the counts of period or type without the time-consuming necessity of manual counting.

Here are figures for the numbers of sites of different periods which are within 500m of rivers, roads, and modern settlements, but converted to percentages of the total number of sites of each period; there are very few sites of the early medieval period, and only one modern one, so as ever caution is required when looking at the figures.

  1p 2r 3em 4m 5pm 6mo 7u
<500m from settlement 22 30 36 53 47 0 27
<500m from river 56 78 45 58 76 0 69
<500m from a road 53 81 55 84 76 100 73

The following charts show the same information in graphical form.

 

Graph showing the percentage of the total number of sites that are situated within 500 m of a river at different periods

Graph showing the percentage of the total number of sites that are situatedwithin 500 m of a road at different periods

Some things stand out immediately, such as the relatively small percentage of prehistoric and Roman sites which are close to a modern settlement, or the relatively large percentage of Roman and medieval sites which are close to a road. This figure of 500m is arbitrary, though, and can only be a starting point to further investigations. What would happen if the buffer zone were reduced to 100m, for example? Which site types are closest to each of these landscape features?

The terrain model data can also be used to analyse the distribution of the archaeological sites. Without any analytical work at all, it looks as if there is some bias in distribution of site periods with respect to elevation, and again quantifying the relationship is fairly easy:

The results should be:

  1p 2r 3em 4m 5pm 6mo 7u
Under 300m 40 17 4 86 16 0 15
Over 300m 58 10 7 7 1 1 11

 

which confirms the instant impression that medieval sites are predominantly confined to the lower elevations. Why this should be is a question which requires the application of archaeological thought rather than GIS analysis.