Mini journal logo  Home Issue Contents All Issues

Open Archaeology, Open Source? Collaborative practices in an emerging community of archaeological software engineersOpen DataOpen Materials

Zachary Batist and Joe Roe

Cite this as: Batist, Z. and Roe, J. 2024 Open Archaeology, Open Source? Collaborative practices in an emerging community of archaeological software engineers, Internet Archaeology 67. https://doi.org/10.11141/ia.67.13

Summary

Surveying the first quarter-century of computer applications in archaeology, Scollar (1999) lamented that the field relied almost exclusively on 'hand-me-down' tools repurposed from other disciplines. Twenty-five years later, this is no longer the case: computational archaeologists often find themselves practising the dual roles of data analyst and research software engineer (Baxter et al. 2012; Schmidt and Marwick 2020), developing and applying new tools that are tailored specifically to archaeological problems and archaeological methods. Though this trend can be traced to the very earliest days of the field (Cowgill 1967), its most recent manifestation is distinguished by its apparent embrace of practices from free and open-source software. Most prominently, since around 2015, there has been a rapid uptake of workflow tools designed for open-source development communities, such as the version control system git and associated online source code management platforms (e.g. GitHub, GitLab). These tools facilitate collaboration among developers and users of open source software using patterns that can diverge quite radically from conventional scholarly norms (Tennant et al. 2020).

In this article, we investigate modes of collaboration in this emerging community of practice using 'open-archaeo', a curated list of archaeological software, and data on the activity of associated GitHub repositories and users. We conduct an exploratory quantitative analysis to characterise the nature and intensity of these collaborations and map the collaborative networks that emerge from them. We document uneven adoption of open source collaborative practices beyond the basic use of git as a version control system and GitHub to host source code. Most projects do make use of collaborative features and, through shared contributions, we can trace a collaborative network that includes the majority of archaeologists active on GitHub. However, a majority of repositories have 1-3 contributors, with only a few projects distinguished by an active and diverse developer base. Direct collaboration on code or other repository content - as opposed to the more passive, social media-style interaction that GitHub supports – remains very limited. In other words, there is little evidence that archaeologists ' adoption of open-source tools (git and GitHub) has been accompanied by the decentralised, participatory forms of collaboration that characterise other open-source communities. On the contrary, our results indicate that research software engineering in archaeology remains largely embedded in conventional professional norms and organisational structures of academia.

Zachary BatistORCID logo
https://zackbatist.info

Corresponding author: Joe RoeORCID logo
Institute of Archaeological Sciences, University of Bern

Full text

Figure 1: Growth of open archaeology projects on GitHub.

Figure 2: Lifespan of open archaeology repositories.

Figure 3: Lifespan and commit rate of open archaeology repositories.

Figure 4: Box plot showing use of GitHub collaboration features in open archaeology repositories.

Figure 5: Distribution of contributions in multi-contributor open archaeology repositories.

Figure 6: Graph of open archaeology repositories and users connected by contributions.

Figure 7: Graph of open archaeology repositories connected by common contributors.

Figure 8: Graph of open archaeology users connected by contributions to common repositories.

Figure 9: Repository centrality by age and length.

Figure 10: Mean repository centrality by category and platform.

Figure 11: User centrality by total number of contributions.

Adema, J. and Moore, S. 2021 'Scaling small; or how to envision new relationalities for knowledge production',Westminster Papers in Communication and Culture 16(1). https://doi.org/10.16997/wpcc.918

Atici, L., Kansa, S.W., Lev-Tov, J. and Kansa, E.C. 2013 'Other people's data: a demonstration of the imperative of publishing primary data, Journal of Archaeological Method and Theory 20(4), 663. https://doi.org/10.1007/s10816-012-9132-9

Balter, B. 2015 'Open Source License Usage on GitHub.com', The GitHub Blog, March 10, 2015. https://github.blog/2015-03-09-open-source-license-usage-on-github-com/

Batist, Z. 2023 Archaeological Data Work as Continuous and Collaborative Practice, PhD thesis, University of Toronto. https://hdl.handle.net/1807/130306

Batist, Z. and Roe, J. 2023 'Open-Archaeo: a resource for documenting archaeological software development practices', Journal of Open Archaeology Data 11. https://doi.org/10.5334/joad.111

Baxter, R., Chue Hong, N., Gorissen, D., Hetherington, J. and Todorov I. 2012 'The research software engineer' in Digital Research Conference, Oxford 2012, Oxford. 1-3. https://www.research.ed.ac.uk/en/publications/the-research-software-engineer

Beck, A. and Neylon, C. 2012 'A vision for open archaeology', World Archaeology 44(4), 479-97. https://doi.org/10.1080/00438243.2012.737581

Bennett, E.A. 2021 ;Open Science From a Qualitative, Feminist Perspective: Epistemological Dogmas and a Call for Critical Examination;, Psychology of Women Quarterly 45(4), 448-456. https://doi.org/10.1177/03616843211036460

Brembs, B., Lenardic, A., Murray-Rust, P., Chan, L. and Irawan, D.E. 2023 'Mastodon over Mammon: towards publicly owned scholarly knowledge', Royal Society Open Science 10(7), 230207. https://doi.org/10.1098/rsos.230207

Carver, J.C., Weber, N., Ram, K., Gesing, S. and Katz, D.S. 2022 'A survey of the state of the practice for research software in the United States', PeerJ Computer Science, 8:e963. https://doi.org/10.7717/peerj-cs.963

Coleman, E.G. 2012 Coding Freedom: The Ethics and Aesthetics of Hacking, Princeton University Press. https://doi.org/10.1515/9781400845293

Cornelius-Bell, A. and Bell, P. 2021 'The academic precariat post-COVID-19', Fast Capitalism 18(1). https://doi.org/10.32855/fcapital.202101.001

Cowgill, G.L. 1967 'Computer applications in archaeology' in AFIPS '67 (Fall): Proceedings of the November 14-16, 1967, fall joint computer conference, New York: Association for Computing Machinery. 331-37. https://doi.org/10.1145/1465611.1465654

Dorta-González, P., González-Betancor, S.M. and Dorta-González, M.I. 2021 'To what extent is researchers' data-sharing motivated by formal mechanisms of recognition and Credit?', Scientometrics 126(3), 2209-25. https://doi.org/10.1007/s11192-021-03869-3

Ducke, B. 2012 'Natives of a connected world: free and open source software in archaeology', World Archaeology 44(4), 571-79. https://doi.org/10.1080/00438243.2012.743259

Ducke, B. 2013 'Reproducible data analysis and the open source paradigm in archaeology' in A. Bevan and M. Lake (eds) Computational Approaches to Archaeological Spaces, Walnut Creek, CA: Left Coast Press. 315-26.

Ducke, B. 2015 'Free and open source software in commercial and academic archaeology' in A.T. Wilson and B. Edwards (eds) Open Source Archaeology: Ethics and Practice, Warsaw, Poland: De Gruyter Open. https://doi.org/10.1515/9783110440171-008

Dusollier, S. 2007 'Open source and copyleft: authorship reconsidered?' in W.T. Gallagher (ed) Intellectual Property, London, UK: Routledge, 563-78. https://www.taylorfrancis.com/chapters/edit/10.4324/9781315252148-24/open-source-copyleft-authorship-reconsidered-severine-dusollier

Escamilla, E., Klein, M., Cooper, T., Rampin, V., Weigle, M.C. and Nelson. M.L. 2022 'The rise of GitHub in scholarly publications' in G. Silvello, O. Corcho, P. Manghi, G. Maria Di Nunzio, K. Golub, N. Ferro and A. Poggi (eds) Linking Theory and Practice of Digital Libraries, Lecture Notes in Computer Science, Cham: Springer International Publishing. 187-200. https://doi.org/10.1007/978-3-031-16802-4_15

Escamilla, E., Salsabil, L., Klein, M., Wu, J., Weigle, M.C. and Nelson, M.L. 2023 'It's not just GitHub: identifying data and software sources included in publications' in O. Alonso, H. Cousijn, G. Silvello, M. Marrero, C. Teixeira Lopes and S. Marchesin (eds) Linking Theory and Practice of Digital Libraries, Lecture Notes in Computer Science, Cham: Springer Nature Switzerland. 195-206. https://doi.org/10.1007/978-3-031-43849-3_17

Faniel, I., Kansa, E.C., Whitcher Kansa, S., Barrera-Gomez, J. and Yakel, E. 2013 'The challenges of digging data: a study of context in archaeological data reuse' in Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, New York: ACM. 295-304. https://doi.org/10.1145/2467696.2467712

Girvan, M. and Newman, M.E.J. 2002 'Community structure in social and biological networks', Proceedings of the National Academy of Sciences 99(12), 7821-26. https://doi.org/10.1073/pnas.122653799

Hacıgüzeller, P., Taylor, J.S. and Perry, S. 2021 'On the emerging supremacy of structured digital data in archaeology: a preliminary assessment of information, knowledge and wisdom left behind', Open Archaeology 7(1), 1709-30. https://doi.org/10.1515/opar-2020-0220

Hippel, E. von and Krogh, G. von 2003 'Open source software and the "private-collective" innovation model: issues for organization science', Organization Science 14(2), 209-23. https://doi.org/10.1287/orsc.14.2.209.14992

Howison, J. and Herbsleb, J.D. 2013 'Incentives and integration in scientific software production' in Proceedings of the 2013 Conference on Computer Supported Cooperative Work. CSCW 13, New York, NY, USA: Association for Computing Machinery, 459-70. https://doi.org/10.1145/2441776.2441828

Huggett, J. 2012 'Lost in information? Ways of knowing and modes of representation in e-archaeology', World Archaeology 44(4), 538-52. https://doi.org/10.1080/00438243.2012.736274

Huggett, J. 2018 'Reuse remix recycle: repurposing archaeological digital data', Advances in Archaeological Practice 6(2), 93-104. https://doi.org/10.1017/aap.2018.1

Huggett, J. 2022 'Data legacies, epistemic anxieties, and digital imaginaries in archaeology', Digital 2(2), 267-95. https://doi.org/10.3390/digital2020016

Kansa, E.C. 2012 'Openness and archaeology's information ecosystem', World Archaeology 44(4). https://doi.org/10.1080/00438243.2012.737575

Kansa, E.C. 2022 'On infrastructure, accountability, and governance in digital archaeology' in K. Garstki (ed) Critical Archaeology in the Digital Age: Proceedings of the 12th IEMA Visiting Scholar's Conference, Los Angeles: Cotsen Institute of Archaeology Press. 141-52. https://escholarship.org/uc/item/0vh9t9jq#page=156

Kansa, E.C., Whitcher Kansa, S. and Arbuckle, B. 2014 'Publishing and pushing: mixing models for communicating research data in archaeology', International Journal of Digital Curation 9(1), 57-70. https://doi.org/10.2218/ijdc.v9i1.301

Kelty, C.M. 2008 Two Bits: The Cultural Significance of Free Software, Duke University Press.

Khan, S., Hirsch, J.S. and Zeltzer-Zubida, O. 2024 'A dataset without a code book: ethnography and open science', Frontiers in Sociology 9, 1308029. https://doi.org/10.3389/fsoc.2024.1308029

Kim, M. 2007 'The Creative Commons and copyright protection in the digital era: uses of Creative Commons Licenses', Journal of Computer-Mediated Communication 13(1), 187-209. https://doi.org/10.1111/j.1083-6101.2007.00392.x

Kintigh, K.W., Altschul, J.H., Kinzig, A.P., Limp, W.F., Michener, W.K., Sabloff, J.A., Hackett, E.J., Kohler, T.A., Ludäscher, B. and Lynch, C.A. 2015 'Cultural dynamics, deep time, and data: planning cyberinfrastructure investments for archaeology', Advances in Archaeological Practice 3(1), 1-15. https://doi.org/10.7183/2326-3768.3.1.1

Kling, R., McKim, G. and King, A. 2003 'A bit more to it: scholarly communication forums as socio-technical interaction network', Journal of the American Society for Information Science and Technology 54(1), 47-67. https://doi.org/10.1002/asi.10154

Lai, J., Lortie, C.J., Muenchen, R.A., Yang, J. and Ma, K. 2019 'Evaluating the popularity of R in ecology', Ecosphere 10(1), e02567. https://doi.org/10.1002/ecs2.2567

Lake, M. 2012 'Open archaeology', World Archaeology 44(4), 471-78. https://doi.org/10.1080/00438243.2012.748521

Leonelli, S. 2023 Philosophy of Open Science, 1st edition, Elements in the Philosophy of Science, Cambridge University Press. https://doi.org/10.1017/9781009416368

Limp, W., Kansa, F.E. and Kansa, S. 2011 'Web 2.0 and beyond, or on the Web nobody knows you're an archaeologist', Archaeology 2, 265-80. https://escholarship.org/uc/item/1r6137tb#page=281

Marwick, B., d Alpoim Guedes, J., Barton, C.M., Bates, L.A., Baxter, M., Bevan, A., Bollwerk, E.A., Bocinsky, R.K., Brughmans, T., Carter, A.K. et al. 2017 'Open science in archaeology', SAA Archaeological Record 17(4), 8-14. https://eprints.gla.ac.uk/148887/

Milliken, G., Nguyễn, S. and Steeves, V. 2021 'A behavioral approach to understanding the git experience' in Proceedings of the 54th Hawaii International Conference on System Sciences, Kauai, HI. 7239-7248. https://hdl.handle.net/10125/71493

Mirowski, P. 2018 'The future(s) of open science', Social Studies of Science 48(2), 171-203. https://doi.org/10.1177/0306312718772086

Morgan, C. 2015 'Punk, DIY, and anarchy in archaeological thought and practice', AP: Online Journal in Public Archaeology 5, 123-46. https://doi.org/10.23914/ap.v5i0.67

Morgan, C. and Eve, S. 2012 'DIY and digital archaeology: what are you doing to participate?', World Archaeology 44(4), 521-37. https://doi.org/10.1080/00438243.2012.741810

Nguyễn, S. and Rampin, V. 2022 'Who writes scholarly code?', International Journal of Digital Curation 17(1). https://doi.org/10.2218/ijdc.v17i1.839

O'Neil, M. 2009 Cyberchiefs: Autonomy and Authority in Online Tribes, London, UK: Pluto Press.

Open Knowledge Foundation 2015 Open Definition 2.1. https://opendefinition.org/od/2.1/en/

Open Source Initiative 2007 The Open Source definition. https://opensource.org/osd/

Opitz, R., Strawhacker, C., Buckland, P., Cothren, J., Dawson, T., Dugmore, A., Hambrecht, G. et al. 2021 'A lockpick's guide to dataARC: designing infrastructures and building communities to enable transdisciplinary research', Internet Archaeology 56. https://doi.org/10.11141/ia.56.15

Pownall, M., Azevedo, F., König, L.M., Slack, H.R., Evans, T.R., Flack, Z., Grinschgl, S. et al. 2023 'Teaching open and reproducible scholarship: a critical review of the evidence base for current pedagogical methods and their outcomes', Royal Society Open Science 10(5), 221255. https://doi.org/10.1098/rsos.221255

R Core Team 2023 R: A language and environment for statistical computing, Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/

Ratto, M. 2003 'Re-working by the Linux Kernel developers', FLOSShub, Department of Communication, University of California, San Diego. https://flosshub.org/sites/flosshub.org/files/ratto.pdf

Ratto, M. 2007 'A practice-based model of access for science. Linux Kernel development and shared digital resources', Science & Technology Studies 20(1), 73-105. https://doi.org/10.23987/sts.55220

Raymond, E. 1999 'The cathedral and the bazaar', Knowledge, Technology & Policy 12(3), 23-49. https://doi.org/10.1007/s12130-999-1026-0

Roe, J. and Batist, Z. 2024 'Zackbatist/Openarchaeo-collaboration: v1.0', Zenodo. https://doi.org/10.5281/zenodo.8393043

Roosevelt, C.H., Cobb, P., Moss, E., Olson, B.R. and Ünlüsoy, S. 2015 'Excavation is destruction digitization: advances in archaeological practice', Journal of Field Archaeology 40(3), 325-46. https://doi.org/10.1179/2042458215Y.0000000004

Saunders, J.L. 2022 'Decentralized infrastructure for (neuro)science' arXiv, 2209.07493 (cs) https://doi.org/10.48550/ARXIV.2209.07493

Schmidt, S.C. and Marwick, B. 2020 'Tool-driven revolutions in archaeological science', Journal of Computer Applications in Archaeology 3(1), 18-32. https://doi.org/10.5334/jcaa.29

Scollar, I. 1999 '25 Years of computer applications in archaeology' in L. Dingwall, S. Exon, V. Gaffney, S. Laflin and M. van Leusen (eds) Archaeology in the Age of the Internet, Oxford: Archaeopress. 5-10. https://proceedings.caaconference.org/paper/02_scollar_caa_1997/

Sobotkova, A. 2018 'Sociotechnical obstacles to archaeological data reuse', Advances in Archaeological Practice 6(2), 117-24. https://doi.org/10.1017/aap.2017.37

Tennant, J., Agarwal, R., Baždarić, K., Brassard, D., Crick, T., Dunleavy, D.J., Evans, T.R. et al. 2020 'A tale of two Opens: intersections between free and open source software and open scholarship', SocArXiv, 6 March 2020. https://doi.org/10.31235/osf.io/2kxq8

Tukey, J.W. 1977 Exploratory Data Analysis, Reading, MA: Addison-Wesley Publishing Company. http://theta.edu.pl/wp-content/uploads/2012/10/exploratorydataanalysis_tukey.pdf

Whallon, R. 1972 'The computer in archaeology: a critical survey', Computers and the Humanities 7(1), 29-45. https://doi.org/10.1007/BF02403759

Internet Archaeology is an open access journal based in the Department of Archaeology, University of York. Except where otherwise noted, content from this work may be used under the terms of the Creative Commons Attribution 3.0 (CC BY) Unported licence, which permits unrestricted use, distribution, and reproduction in any medium, provided that attribution to the author(s), the title of the work, the Internet Archaeology journal and the relevant URL/DOI are given.

Terms and Conditions | Legal Statements | Privacy Policy | Cookies Policy | Citing Internet Archaeology

Internet Archaeology content is preserved for the long term with the Archaeology Data Service. Help sustain and support open access publication by donating to our Open Access Archaeology Fund.