Cite this as: Hall, K. and Loy, M. 2025 Collecting Data in the Conservation Lab for Network Visualisation, Internet Archaeology 69. https://doi.org/10.11141/ia.69.8
In 2023 and 2024, conservators at the East Crete Study Center of the Institute for Aegean Prehistory (INSTAP-SCEC) worked to prepare material from Minoan multi-chambered communal tombs for publication. The material came from a series of tombs excavated by Nota Dimopoulou between 1999 and 2002 on the hill of Trypiti at Poros, which is the Neopalatial cemetery of the harbour town of the palace of Knossos. Deposits in the tombs represented multiple episodes of mortuary activity. In addition, all the tombs had been visited and partially looted in later periods, with only a small number of primary burials and the large pit deposits found undisturbed.
Some of the important small finds and complete vessels (Dimopoulou-Rethemiotaki 2004) had been studied; however the large amount of broken pottery had not, and there was much information still to be collected. During conservation of the ceramics, stratigraphic data relating to the use of the tombs was recorded.
This data was investigated by the authors using software for network visualisation and analysis. The method, using data from the Laonikou tomb excavated in 2002 (Dimopoulou and Krzyszkowska 2019, 47-48), is presented here.
The use of network analytical techniques is now quite commonplace in archaeology, with a particular proliferation in the number of studies that have been published in recent years (Brughmans 2012; Brughmans and Peeples 2023, especially for the below sub-division of network methods with further references). Network methods handle, process, and visualise relationalities between different entities (i.e. a multivariate analysis), where the entities considered might variously be one or a combination of objects, people, places, styles, production techniques, ideas or other such affordances. Networks have been used successfully to track optimal paths for crossing a landscape, to generate patterns of intervisibility, or to interrogate abstract socio-cultural phenomena; less commonly, material-culture based networks have been used, where ties are visualised either between contexts or places on the basis of similar assemblages (Bernardini 2007; Mills et al. 2013; Iacono 2016), or between objects themselves with respect to shared styles, shapes, functional groups or otherwise (Loy 2024). What such studies generally have in common is that they operate at an inter-site or regional level, mainly working on the basis of synthesising huge datasets with an interest to making further substantive conclusions about a site following the close of a project (Peeples and Haas 2013; Blake 2014). What we illustrate in this paper is a network analysis that is used as an interpretive tool in an ongoing excavation, visualising data at the site level, to interpret specific archaeological rather than broader historical questions.
The authors chose to use network analysis for this particular case study because of the method's flexibility. The problem, as outlined below, demanded a form of multivariate calculation, and network analysis offered the benefit of organising multivariate data gathered in the lab into a spatial format. This permitted us to visualise fragmentation patterns and the dispersal of individual pottery vessels across the investigated tomb. In other words, network methods enabled the authors to analyse both object biographies and the temporality of the site within a single analytical framework: the fragments discussed below were considered both in their abstract context relational to one another and also in their primary field context.
For the tomb discussed, well preserved vessels were catalogued and mapped during excavation. In addition, more than a hundred bags of sherds were collected, each of these containing a pottery 'lot' pertaining to a particular zone in the tomb. At INSTAP-SCEC, working in the large outdoor space, the conservators organised the pottery sherds and began carefully sorting through them, gradually piecing together vessels from the mass of fragments (Figure 1).
Eventually, more than a hundred vessels were created, forming a large assembly of jugs and cups, i.e. vessels for pouring and drinking. Many of these vessels were completed from fragments scattered through different 'lots' and even through different chambers of the tomb. The conservators kept careful track of this by labelling all sherds removed from pottery groups with tiny strips of blue painters' tape labelled with fine permanent marker (Figure 2). When the new vessels were catalogued, these details were transcribed into the database for ceramic finds. Vessels were reconstructed using an adhesive of Paraloid B72 in acetone for the cups, and 3:1 Paraloid B48N:Paraloid B72 in acetone for the jugs.
The question was then; how to use this data? Clearly, it is a window on human activities in the tomb. But how were we to make sense of the columns of data representing many-to-many relationships between the sherds?
The information collected during conservation was stored in an Excel spreadsheet [xlsx]. The first column showed the ID number of the pottery vessel: 72 pottery vessels were catalogued with non-sequential IDs, the lowest being 30 and the highest being 376. (Vessels from a single location were not included in the dataset). The second column designated the 'lot' (or excavation unit) where a particular vessel was first identified (Figure 3). In the following 16 columns, further lots were listed where further sherds were found and added to the given vessel. Some pottery vessels (16) had sherds originating in only one other lot additional to the initial discovery lot, and only 20 had vessels with sherds spread across more than five lots.
The dataset also included (see second tab of the spreadsheet), a list of all 93 excavation lots that had yielded pottery (Figure 4). The general location of these lots was described according to five categories (translated from Greek): entrance fill, antechamber, north chamber, central chamber, south chamber.
The task set by the conservator was to visualise all the information concerning the pottery and lot joins in a graphical format. The pots and lots should be shown together on the same diagram, but easily distinguishable.
The neatest solution proposed was to use a bimodal network visualisation, as a diagram of this type allows that:
The following procedure was undertaken to transform the dataset supplied into a format readable in the graphs and networks visualisation platform Cytoscape. All steps can be performed manually in Excel by practitioners with no particular training in the Digital Humanities, but R code is also supplied here to enable exact reproducibility of all steps described.
Cytoscape is a popular open-source software for network analysis for beginners, and several online tutorials are available. For basic analysis, two spreadsheets are needed. The first is a Node Table which lists node attributes. In our example, these attributes are the pot catalogue number and the names of the pottery lots, and their location in the tomb. The second spreadsheet required is an Edges Table, in our example a simple list in two columns of every link between a pottery vessel and an excavation lot.
The columns of the core Excel dataset were renamed to reduce confusions in processing the dataset e.g. 'ID' was re-categorised as 'Pot_ID'. It was decided that it was not important in the present visualisation to indicate the original lot from which a vessel's sherds were first located, so this column ('Context') and all subsequent columns listing lot names ('Join 1', 'Join 2' etc.) were simply renamed together in sequence 'Lot_1', 'Lot_2', 'Lot_3', etc. (Figure 5).
new_column_names <- c("Pot_ID", paste0("Lot_", 1:17))
colnames(data) <-new_column_names
The second tab of the Excel spreadsheet, the list of all the excavation lots, was used to create the Node Table. Cytoscape requires that nodes are imported from spreadsheets with sequential integer key ID numbers. A column was therefore created that combined 'Pot_ID' and 'Lot_ID' as 'Node_ID'. In order that each entity had a unique number, 'Lot_IDs' had to be renumbered starting from 74 (i.e. the next number available after the 73 pottery vessels) (Figure 6).
This code uses the package 'readxl'
joins_data <- read_excel(file_path, sheet = "all joins")
pottery_lots_data <- read_excel(file_path, sheet = "pottery lots")
nodes_data <- add_row(
nodes_data,
ID = 0,
Pot = 0,
Lot = 0,
Label = "Null",
NodeID = 0,
Location = "0"
)
joins_labels <- paste("Pot", joins_data[[1]], sep = " ")
nodes_data <- rbind(nodes_data, data.frame(
ID = NA,
Pot = 1,
Lot = 0,
Label = joins_labels,
NodeID = NA,
Location = 0,
stringsAsFactors = FALSE
))
lot_labels <- pottery_lots_data[[2]]
lot_locations <- pottery_lots_data[[3]]
nodes_data <- rbind(nodes_data, data.frame(
ID = NA,
Pot = 0,
Lot = 1,
Label = lot_labels,
NodeID = NA,
Location = lot_locations,
stringsAsFactors = FALSE
))
nodes_data$ID <- c(0, seq(1, nrow(nodes_data) - 1))
nodes_data$NodeID <- nodes_data$ID
It was also required for visualisation in Cytoscape to express the joins between vessels and lots as a list of paired nodes, where nodes were expressed by their 'Node_ID'. To prepare this information, within the core Excel dataset the names of lots needed to be transformed to the relevant 'Node_ID(= 'Lot_ID'). Using the All Lots/Node Table tab as a reference point, this re-mapping can be performed either manually, or using a VLOOKUP function within Excel (Figure 7).
=VLOOKUP($'CoreDataset'.B2,$'AllLots'.B2:C96,2,0)
Finally, the matrix of numbered joins is required as a simple list of paired nodes, i.e. the Edges table. While the neatest solution is in R, it is also possible to create this list with an IF function. For all 73 rows of data, an IF function should be run to search for a non-null value in the corresponding column (Figure 8), and to paste in the value into a new sheet which will become the Edges Table:
[for the source node] =IF($'DatasetRemapped'.B2="",,$'DatasetRemapped'.A2)
[for the target node] =IF($'DatasetRemapped'.B2="",,$'DatasetRemapped'.B2)
from row 2 all the way down to row 74.
This search needed to be repeated 17 times for all 73 pottery vessels, to catch all possible joins in the field (i.e. for all 73 rows in column B, C, D, all the way through to column R). Where the IF formula did not find a match (i.e. where there was no join between a vessel and a lot), a value of '0' was pasted for both the source and target columns. These 'double 0' returns can then be stripped out of the dataset to leave a clean Edges Table, in this case 297 joins.
edge_creation_data <- joins_data
edge_creation_data[[1]] <- ifelse(!is.na(edge_creation_data[[1]]),
paste("Pot", edge_creation_data[[1]]),
edge_creation_data[[1]])
block <- edge_creation_data[1:73, 1:18]
lookup <- setNames(nodes_data$NodeID, as.character(nodes_data$Label))
block_id <- apply(block, 2, function(col) unname(lookup[as.character(col)]))
# block_id is a matrix with NodeIDs where labels matched, NA otherwise
edges <- do.call(rbind, lapply(seq_len(nrow(block_id)), function(i) {
ids <- unique(na.omit(as.integer(block_id[i, ])))
if (length(ids) < 2) return(NULL)
prs <- t(combn(ids, 2))
# normalise so Source < Target to de-duplicate undirected edges
cbind(pmin(prs[,1], prs[,2]), pmax(prs[,1], prs[,2]))
}))
edges_data <- as.data.frame(unique(edges))
names(edges_data) <- c("Source", "Target")
row.names(edges_data) <- NULL
Both the Node and Edges Table are importable as CSVs into Cytoscape. In our visualisation, using 'Fill Colour' in the Style menu defined the colour of the nodes (lots were coloured grey, and pots, yellow). In the same menu, using 'Label' displayed the name of a node drawn directly from the attribute list (Figure 9). An image annotation was then added to the Cytoscape session (using the tomb plan drawn by Pepi Stefanaki) and the nodes were then arranged manually in their correct positions. The image was then exported as a JPEG. A small key was added using Photoshop (Figure 10).
The visualisation shows that most vessels were found in the north chamber of the tomb. More than 50% of these vessels were comprised of sherds only from the north chamber; perhaps these vessels are in their original place of deposition. Post-depositional disturbances in the tomb are also mapped, with sherds from separate chambers of the tomb joining together. For the Laonikou tomb, the network diagram prompted further interrogation of the data and the excavation notebooks.
Obviously, the diagram presented in Figure 10 is only one way of imaging this dataset. Once cataloguing is complete and each vessel has been dated and classified, other network diagrams can be created and analysed, or other forms of visualisation used to answer different questions. In this way, we hope to contribute to the understanding of the formation processes of the deposits. We also present this as an example of the process of investigative conservation, in which the collaboration of conservators and archaeologists can provide new insights into material processes.
We have since used this methodology in the ongoing study of two other chamber tombs from the series. Each had a pit into the floor of a chamber, filled with broken pottery and other small finds. We used vertical sections of the stratigraphy in the pits together with network visualisation to illustrate depositional events in these pits.
The innovation in this study has been to combine field and lab data for enriching an understanding of the archaeological context. In addition, by using a reproducible framework, it has been possible to quickly enhance the field analysis whenever new information became available from the conservation lab.
At a practical level, where the spatial visualisation indicates a high concentration of pottery fragments for which there are many joins between pieces, a high area of disturbance with much fragmentation can be interpreted. In the case of the Laonikou tomb, such an area is to be found in the north-east sector of the tomb, in excavation plots Θ8, Θ9, Ι8, Ι9, Κ8 and Κ9. This area functioned as a niche (Dimopoulou and Krzyszkowska 2019, 78) where piles of bones and small finds were concentrated. Furthermore, something can be said of the vessels that are widely distributed over multiple excavation units, such as finely decorated (and precisely dateable) cups 72, 95, 96, 209 and 214; the visualisation perhaps allows us to begin to reconstruct primary placement of key artifacts despite the damage done by later looting.
By contrast, at a conceptual level, network analysis offers a new dimension on investigating the object biography. While the location of a pottery fragment within this tomb might simply show its final position, looking at the relationships between a sherd and its neighbouring pieces can give some indication of where it came from and how it was deposited. Considering how the pieces fragmented offers deeper insight into the tomb's use-history.
Methodologically, this paper has presented a use of network analysis that adds to the growing arsenal of material-culture based approaches. Our motivation for using multivariate methods was to enhance our understanding of the excavation process, and our analysis was focused between only a few contexts within a single site. We present our use-case as a very practical example of problem-solving, demonstrating the flexibility of multi-scalar network methods, more usually applied to broader interpretive and substantive historical problems.
The authors would like to thank Dr. Nota Dimopoulou for permission to experiment with the dataset.
No conflict of interest was reported by the authors
Internet Archaeology is an open access journal based in the Department of Archaeology, University of York. Except where otherwise noted, content from this work may be used under the terms of the Creative Commons Attribution 3.0 (CC BY) Unported licence, which permits unrestricted use, distribution, and reproduction in any medium, provided that attribution to the author(s), the title of the work, the Internet Archaeology journal and the relevant URL/DOI are given.
Terms and Conditions | Legal Statements | Privacy Policy | Cookies Policy | Citing Internet Archaeology
Internet Archaeology content is preserved for the long term with the Archaeology Data Service. Help sustain and support open access publication by donating to our Open Access Archaeology Fund.