Figures
Abstract
Organic residues are a rich source of biomolecular information on ancient diets. In particular, foodcrusts, charred residues on ceramics, are commonly analysed for their lipid content and to a lesser extent protein in order to identify foods, culinary practices and material culture use in past populations. However, the composition of foodcrusts and the factors behind their formation are not well understood. Here we analyse proteomic data (available via ProteomeXchange with identifier PXD059930) from foodcrusts made using a series of mixtures of protein- (salmon flesh), lipid- (beef fat) and carbohydrate-rich (beetroot) foods to investigate the relationship between the biomolecular composition of the input and the recovered signal using conventional methods applied to archaeological material. Additionally, using 3D modelling we quantify the volume of foodcrust generated by different ingredient combinations The results highlight biases in the data obtained in the analyses of organic residues both in terms of identified resources reflecting the cooked foodstuffs, e.g., an overrepresentation of fish proteins, as well as with regards to the abundance of foodcrust, for example mixtures of only salmon and beef fat resulted in relatively small amounts of foodcrust, and suggest caution in interpreting the composition of residues formed from complex mixtures of foodstuffs.
Citation: Dekker JAA, Hagan R, Collins M, Hendy J (2025) An unclean slate, discrepancies between food input and recovered protein signal from experimental foodcrusts. PLoS One 20(8): e0330195. https://doi.org/10.1371/journal.pone.0330195
Editor: Raven Garvey, University of Michigan, UNITED STATES OF AMERICA
Received: March 3, 2025; Accepted: July 28, 2025; Published: August 22, 2025
Copyright: © 2025 Dekker et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: A full submission of the acquired mass spectrometry data has been made to the ProteomeXchange Consortium via the PRIDE 64 partner repository with the dataset identifier PXD059930 and https://doi.org/10.6019/PXD059930. Additional metadata of the MetaMorpheus database search and the 3D model object files of the experimental ceramic vessels used in the cooking experiment can be found on Zenodo under the following DOI: https://doi.org/10.5281/zenodo.15874520.
Funding: This project was made possible thanks to funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 956351. The funders had no role in study design, data collection, analysis, decision to publish or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Food residues on or in ceramic vessels are a popular target for studies on ancient diets and foodways. Their potential for revealing components of the diets of past people is demonstrated by studies on the adoption of pottery by hunter-gatherers [1] and continuity in diet under political regime change [2]. Organic residues on archaeological vessels can manifest in different ways, including calcified deposits [3], charred foodcrusts (hereafter foodcrusts) [4] and organic residues entrapped within the ceramic matrix [5], and more rarely, organic remains preserved on the surface of vessels through arid, frozen or waterlogged conditions [6–8]. These three different types of residues are formed by different processes and are affected differently by diagenesis. Of the three types of residue, we focus on foodcrusts in this study. Foodcrusts have so far been mostly analysed for their lipid content (including fats, oils and waxes). Lipid analysis is robust and has been successfully applied to materials from a wide range of preservation conditions spanning the globe and stretching back as far as at least 12 000 years ago [2,9–11]. For example, the study of lipids has been successful in studying the emergence and spread of the use of dairy products [12–14] as well as the continuity in the widespread use of marine resources through the Mesolithic into the Neolithic [15–18]. However, there are still areas for improvement. The results of compound-specific isotope analysis can be confounded by the mixture of multiple resources and the results are likely biased towards lipid-rich foodstuffs compared to lipid-poor foods, such as most plants [19].
The presence of plants in foodcrusts is best investigated using SEM (scanning electron microscopy) [20,21], revealing, for example, the presence of acorns in ceramic vessels in the Neolithic Netherlands [22]. Animal remains, although difficult to taxonomically resolve, can also be observed using SEM [20].
In theory, the disadvantages of both lipid analysis and SEM could be circumvented by protein analysis. As proteins are encoded by DNA, they inherently carry phylogenetic information, although the taxonomic specificity of a protein varies greatly between proteins. Beyond their use for taxonomic identification, some proteins can be used to identify the tissues that were cooked, as some proteins are abundant in or exclusive to specific tissues. For example, the frequently observed protein beta-lactoglobulin (BLG) is specific to dairy [23–26], and is not synthesised by humans at all. Additionally, as proteomic taxonomic identification is based on the presence of diagnostic peptide sequences, protein analysis should be able to distinguish between the different components of a food mixture. However, the application of protein analysis to foodcrusts is in its infancy [4,7,27,28] While some success has been observed [4,29], biases and challenges in this method have been noted [12]. For example, the cooking and burning process of foodcrust formation likely impacts the numbers of food proteins, which may or may not be taxonomically diagnostic. Furthermore, in the case of analysing mixtures of ingredients, it is unknown whether the foodcrust preserves an accurate reflection of the input food proteins. Protein analysis will of course favour protein-rich foods, but other foodstuffs less enriched in protein, such as plants, still contain characteristic proteins and have been identified in organic residues [3]. Whether there are additional factors beyond the initial protein content determining the likelihood of a particular protein’s survival is unclear. A recent study on protein incorporation and survival in the ceramic matrix and foodcrust tested the influence of several physico-chemical properties, but beyond a slight favour towards hydrophobic peptides, there were no clear universal trends in protein survival [30]. Beyond these general limitations, not much is known about the challenges of applying protein analysis to organic residues nor about how the theoretical advantages translate into reality.
This study will focus on foodcrusts and aims to clarify the strengths and limitations of protein analysis using experimental foodcrusts made of a mixture of protein-, lipid- and carbohydrate-rich foods. Specifically, we will investigate how mixtures of different foodstuffs might influence the volume of foodcrust formed and the recoverable proteome. In particular, we will examine the degree to which the ‘extractome’, the suite of recovered proteins [31], reflects the original input food mixture, in order to highlight the potential biases involved in the identification of foodstuffs in archaeological material. Additionally, this study aims to map how repeated cooking events and burial influence the extractome’s composition and the volume of generated foodcrust, adding to our understanding of the diagenesis of proteins in charred organic residues.
In order to maximise the relevance of the experiment’s results for archaeological samples the choice of foodstuffs is vital. As this experiment is part of a larger project focused on Mesolithic foodways in Denmark, we selected foodstuffs that would be similar to those utilised by late Danish hunter-gatherers. Secondly, as the three commonly used methods, i.e., lipid analysis, proteomics and SEM, to study organic residues all favour a different class of biomolecule we decided to select one lipid-rich, one protein-rich and one carbohydrate-rich foodstuff to mix. Using foodstuffs with vastly different nutritional compositions facilitates visualising the biases towards either carbohydrates, proteins or lipids. Therefore, care was taken to ensure that the relative proportions of the wet weights of the carbohydrate-rich, protein-rich and lipid-rich foodstuffs matched as closely as possible to the ratio of carbohydrates:proteins:lipids in the total mixture (e.g., a mixture consisting of 20% of the protein-rich food would also contain 20% proteins). Perfect symmetry between the ratios of wet weights and the molecular categories was not possible, especially not when limited to foodstuffs likely available in Mesolithic Denmark, but the closest match was obtained using salmon, beef fat and beetroot (SI 1). However, we acknowledge that this combination of foodstuffs underrepresents the carbohydrate component in particular.
Beetroot was chosen as the carbohydrate-rich food due to the prevalence of its wild ancestor Beta vulgaris maritima at Mesolithic sites such as Tybrind Vig and Møllegabet II [32,33], as well as the fact that it has a relatively well-annotated reference proteome. Protein identification is heavily dependent on the quality and coverage of reference proteomes. Low-quality reference proteomes only contain a fraction of the total proteome of a species and any proteins not included cannot be identified. To highlight the magnitude of this issue, the median number of proteins listed on UniProt [34] for 29 plant species found at Mesolithic northern European sites [32,35–38] is 120 proteins (SI 2), in contrast to the 7679 available for Beta vulgaris or the 130 673 for Triticum aestivum (wheat), a high-quality reference proteome.
We selected beef fat and salmon because the exploitation of aurochs (Bos primigenius) and salmon (Salmo salar) is well attested at a number of Danish late Mesolithic sites [36,39–42]. Additionally, salmon has a well-annotated reference proteome covering 90% of its total proteome. Available reference data for aurochs is much more limited, but due to the short evolutionary distance between them, domestic cattle (Bos taurus), which has a 97% complete reference proteome, was used.
By varying these three ingredients in regular increments we are able to establish the presence of any biases favouring or disadvantaging the recovery of proteins from any of the foods. In turn, this new insight on the presence of biases will facilitate the interpretation of proteins identified in archaeological foodcrusts and thus our understanding of past foodways. The analyses presented here demonstrate that, as expected, there is a bias towards the recovery of proteins from protein-rich foods and that it is likely that not all ingredients can be identified from a food mixture.
Materials
The ceramic vessels used in the cooking experiments were purchased from Potted History, a company specialising in historical ceramic replicas. The specifications of the ceramics are in line with the pottery used in the experiments by Bondetti et al. [43]: they were hand thrown with the ‘standard red’ clay (Al2O3, 22.78; Fe2O3, 7.37; CaO, 0.57; MgO, 0.86; K2O, 1.6; and Na2O, 0.1), had a diameter of ~10 cm, a wall thickness of ~0.5 cm and were fired at 700 °C. The foodstuffs used in the experiments were all obtained from local commercial vendors; the beetroot was purchased from Morrisons, the beef fat from M&K Quality Butchers, York, and the salmon was purchased online from LondonGrocery.net (https://londongrocery.net/). As what is often sold as ‘salmon’ can actually belong to a variety of species and even genera, the purchased specimens were examined for morphological characteristics [44] to ensure that they were in fact Salmo salar.
Methods
Cooking experimental protocol
In order to test how different combinations of carbohydrate-, protein- and lipid-rich foodstuffs influence foodcrust formation it was decided to mix the three selected foodstuffs, beetroot, salmon and beef fat, in all possible combinations using 20% increments, which results in 21 sample conditions (Fig 1). To each of these unique combinations one ceramic vessel was assigned. Additionally, we included one ceramic vessel as a negative control, which would only be filled with water during cooking. No samples were taken of the negative control for proteomic analysis, as indeed no foodcrust formed on the water filled vessel. Instead, an extraction blank was included in the protein extraction process to serve as negative control.
Each red node indicates an experimental vessel.
Before the first cooking event, all pots were soaked in tap water at room temperature for 15 minutes in order to prevent cracking when heated. The vessels were then filled with the ingredients according to their experimental conditions to a total mass of 100 g. Additionally, 100 mL of distilled water was added to each vessel. The filled vessels were wrapped in aluminium foil, leaving the top open, and placed for an hour in an oven (Binder) preheated to 270°C. The aluminium foil was intended to hold the pots and their contents in place in the event of the ceramic fracturing, as pilot tests had shown this to be a possibility. The cooking temperature of 270°C was selected based on previous research that found that APAAs (ω-(O-alkylphenyl) alkanoic acids) often found in archaeological foodcrusts, formed after heating for one hour at 270°C [43]. Indirect heating using a convection oven was chosen in order to ensure a stable and even temperature for all vessels and throughout the cooking event. Direct heating is likely more representative of cooking during Mesolithic times, but introduces substantial variability in the precise temperatures the food is exposed to both during a cooking event as well as between different cooking events and between different vessels within the same cooking event. As we expected temperature to play an important role in the formation of foodcrust, it was preferred to keep it as stable as possible.
After cooking for one hour, the vessels were removed from the oven and allowed to cool down at room temperature. The cooled foodstuffs were then removed from the vessels as we were interested in multiple cooking events. Care was taken to only remove the foodstuffs and to leave the foodcrusts intact. Charred remains that adhered to the ceramic wall, but were macroscopically still recognisable as food, were removed as well. Fig 2 provides some examples of foodcrusts that were formed after the final cooking event.
The number denotes the experimental of each vessel and corresponds to Fig 1.
After removing the food, the vessels were upturned for roughly an hour to allow any liquid to drip out. Any fat that consolidated on a vessel’s rim during this period was not considered foodcrust and removed. For subsequent cooking episodes, the vessels were no longer soaked in water. Instead, the vessels were immediately placed in the oven after filling them with their ingredients and 100 mL water. In total nine cooking events were performed for each pot and foodcrust samples were taken after the first, fifth and ninth cooking episodes. Foodcrust could form on the rim, body and bottom of the vessel depending on the food input, but the rim of the vessel was the only location that always contained sufficient foodcrust for sampling, and hence foodcrust was consistently sampled from the rim. Where possible samples were still taken from the bottom and body of the vessel.
After the ninth cooking episode, the vessels were buried with their foodcrusts in the York Experimental Archaeological Research (YEAR) Centre for 52 days at a depth of 50 cm. The sediment the vessels were buried in was sandy and the soil pH of the same area was previously measured to be 7.16 [43]. For the months of August and September, during which the burial experiment was carried out, the Met Office National Climate Information Centre reported a mean monthly air temperature of 16.9 °C and 13.5 °C for North England respectively [45]. The vessels were wrapped loosely in Galvanised wire netting (13 mm mesh size) to prevent bioturbation from any larger creatures. After the burial period had passed, the vessels were excavated and a foodcrust sample was taken from each vessel. All samples were stored at −20°C until protein extraction.
Protein extraction
Foodcrust protein analysis was performed according to a variant of the SP3 protocol [46] developed by Dr. Virginia Harvey and Dr. Jessica Hendy [47]. In short, 150 μL 6 M guanidine hydrochloride (GuHCl) was added to 20 mg foodcrust, vortexed, centrifuged and incubated at 65°C for one hour. Subsequently, the samples were centrifuged and the foodcrust pellets were discarded. 15 μL of a 100 mM TCEP (Tris(2-carboxyethyl)phosphine) and CAA (2-Chloroacetamide) solution was added to the supernatant, homogenised, centrifuged and incubated at 99°C for 10 minutes. After the incubation, the samples were allowed to cool in room temperature conditions, after which 500 μg of magnetic hydrophilic and hydrophobic beads (Cytiva) and 175 μL of 100% (v/v) ethanol were added to each sample. The samples were then incubated again at 24°C, 1000 RPM for 5 minutes. Protein filtration using the beads was performed by placing the samples on a magnetic rack and allowing the beads to migrate for 2 minutes. The liquid in the samples was then removed until only the beads remained. Samples were taken off the rack and were rinsed with 80% ethanol. This process was repeated three times, the first time 500 μL ethanol was used for the rinsing, the second time 300 μL and the third rinse was done with 200 μL. After the final rinse, the beads were resuspended in 100 μL 50 mM ammonium bicarbonate (ABC). This solution was incubated at 37°C with 750 RPM for 3 minutes, after which the samples were placed back on the magnetic rack. In-solution digestion was performed by adding 1 μL of a 0.2 μg/μL trypsin solution to the samples and incubating them at 37°C, 750 RPM for 18 hours.
After digestion, the samples were centrifuged and placed back on the magnetic rack. Beads and suspended proteins were separated by transferring the solution to another set of Eppendorf tubes. 10 μL of 5% (v/v) trifluoroacetic acid (TFA) was added to the samples to halt digestion. As a last step, the peptides were desalted with C18 ZipTips. The ZipTips were primed twice with a 100 μL 50% acetonitrile (ACN) and 0.1% TFA solution and washed twice with 0.1% TFA. Then the peptides were aspirated from the sample and the ZipTips were washed again twice with 0.1% TFA before eluting the peptides in 50% ACN and 0.1% TFA.
Protein mass spectrometry analysis
The eluted peptides were analysed using an mClass nanoflow UPLC (Ultra Performance Liquid Chromatography) system (Waters) coupled to an Orbitrap Fusion Tribrid mass spectrometer (Thermo Scientific) following the same protocol as published elsewhere [48]. In short, the UPLC was equipped with a nanoEaze M/Z Symmetry 100 Å C18, 5 μm trap column (180 μm x 20 mm, Waters) and a PepMap, 2 μm, 100 Å, C18 EasyNano nanocapillary column (75 μm x 500 mm, Thermo Scientific). Chromatographic separation of the peptides was performed by first washing the peptides from the trap column onto the capillary column with 0.05% trifluoroacetic acid aqueous solvent at a flow rate of 15 μL/min. After five minutes the flow was switched to the capillary column, which was first washed for seven minutes with solvent A (0.1% FA (formic acid)) and 3–10% solvent B (0.1% FA in acetonitrile). The concentration of solution B in the mixture was steadily increased over time. After these initial seven minutes, the concentration of solvent B was increased to 35% over 30 minutes and lastly to 99% over five minutes. Finally, the column was washed with 99% solvent B for four minutes.
As the peptides were eluted off the column they were introduced into the mass spectrometer, which was equipped with an EasyNano ionisation source (Thermo Scientific). Xcalibur (version 4.0, Thermo Scientific) was used to acquire ESI-MS and MS2 spectra both in positive mode. The following instrument source settings were used: ion spray voltage, 1,900 V; sweep gas, 0 Arb; ion transfer tube temperature; 275°C. MS1 spectra were acquired in the Orbitrap with: 120,000 resolution, scan range: m/z 375−1,500; AGC target, 4e5; max fill time, 100 ms. Acquisition was data dependent (DDA) and performed in topN mode selecting the 12 most intense precursors with charge states >1. Easy-IC was used for internal calibration. Dynamic exclusion was performed for 50 s post precursor selection and a minimum threshold for fragmentation was set at 5e3. MS2 spectra were acquired in the Orbitrap with: 30,000 resolution, max fill time, 100 ms., HCD; activation energy: 32 NCE. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE 64 partner repository with the dataset identifier PXD059930 and 10.6019/PXD059930.
Protein data analysis
The resulting raw files were analysed with MetaMorpheus [49] using the following settings: files were calibrated with a product mass tolerance of 25 ppm and a precursor tolerance of 15 ppm. A G-PTM-D (Global-Post Translational Modification Discovery) task was performed in order to identify the most common post-translational modifications in the samples. The following PTM groups were included in the G-PTM-D task: common biological (34 PTMs), common artefact (10 PTMs), metal (16 PTMs) and trypsin digested (2 PTMs). The precise PTMs that were included can be found in SI 3. To quantify PTM presence occupancy, that is the fraction of a protein that displays a given PTM at a specific position in the protein sequence [50], was calculated automatically at protein level by MetaMorpheus. For the search task a product mass tolerance of 0.5 Da and precursor mass tolerance of 3 ppm was maintained. The maximum number of missed cleavages was 2 and the minimum peptide length was set to 7. Only fully tryptic peptides were accepted.
Using these settings the samples were searched against a custom database consisting of reference proteomes of Beta vulgaris, Chenopodium quinoa, Spinacia oleracea, Salmo salar and Bos taurus, which were downloaded from the Uniprot protein database [34]. In addition to the proteomes of the species used in the cooking experiments, it was also decided to include quinoa (Chenopodium quinoa) and spinach (Spinacia oleracea), because of the limited coverage of the Beta vulgaris reference proteome. We feared that our analysis would bias against Beta vulgaris and therefore decided to include the two species with the largest reference proteomes from the Amaranthaceae family, which includes Beta vulgaris. All five proteomes mentioned above were downloaded from UniProt on 03-03-2023 [34]. Lastly, we included MetaMorpheus’ inbuilt database of contaminants, cRAP [51], in our search. The exact task files and protein database used in this analysis are included in SI 3.
The MetaMorpheus protein identifications were then filtered following the recommendations for palaeoproteomic identifications [52], rejecting all proteins which were present in the blanks or which were represented by less than two unique peptides.
Statistical analysis was performed in R v4.1.1 [53] using the Rstudio environment v2024.4.0.735 [54].
Foodcrust quantification
In order to investigate the influence of ingredient composition on the amount of foodcrust generated after cooking we quantified the volume of foodcrust using 3D scanning. A 3D scan of each ceramic vessel was acquired before the first cooking event and after the ninth using an Artec Space Spider (Artec 3D). 3D model construction and further analysis of the 3D models were all performed in Artec Studio 16 Professional v16.0.8.2. Volume measurements were obtained by combining the scans for each vessel and time point using the ‘Sharp fusion’ function. Then the volume of the entire 3D model was calculated using the in-built volume calculation function. The foodcrust volume was calculated by subtracting the before-cooking 3D model volume from the after-cooking 3D model volume.
Vessels 1, 7, 11 and 21 were damaged during the cooking process and some fragments had detached from the ceramic wall. To prevent the loss of ceramic fragments from influencing the foodcrust quantification the ceramic wall from which the fragment had detached were deleted from both the before and after model. The resulting hole in the 3D models was closed using the ‘Fix holes’ function. Irregularities in the 3D model resulting from this intervention were resolved using the ‘Small-object filter’ and ‘Hole filling’ ‘Postprocessing’ functions. Foodcrust volume calculations could then be obtained in the same way as for the undamaged experiments.
Results
A total list of all proteins recovered from the samples can be found in SI 4. There appeared to be substantial differences in the number of proteins that could be identified between the different experimental conditions and the number of cooking events for a particular ingredient combination (Fig 3). Additionally, most samples show large differences in the contribution of the different ingredients to the total amount of recovered proteins. Some samples, 6.5, 7.1, 12.1, 12.9, 15.1 and 21.1, also yielded proteins from a species not included in their mixture (Table 1).
PE indicates the post-excavation sample. Beta vulgaris, Chenopodium quinoa and Spinacia oleracea are grouped together as ‘plant proteins’. Each square represents an identified protein.
Fig 3 also shows that for most experimental conditions the number of protein identifications decreases as the number of cooking events increases, which fits with the expected increase in protein damage due to extended heating. To further explore this pattern we plotted the distribution of protein identifications per sample point (Fig 4). For each sample we also calculated the protein input in grams based on the ingredients’ protein content [55]. As expected the number of proteins identified decreases after the first sampling point and is lowest after the burial, but some of the samples obtained after the ninth cooking event contain more proteins than any sample taken after the fifth event. To test if these differences are statistically significant, a Kruskal Wallis test [56] was performed (p = 4.183e-8), which indeed suggests that there are significant differences between the number of proteins recovered after the different cooking events. Post hoc analysis using Dunn’s test [57,58] shows that there are significant differences between cooking events 1 and 5 (p = 5.46*10−5), 1 and 9 (p = 1.08*10−3), 1 and PE (5.78*10−7) and lastly between events 9 and PE (p = 0.0497). The seeming difference in protein recovery between events 5 and 9 is therefore not statistically significant.
In general, the number of identified proteins decreases with increasing cooking events, with two vessels acting as outliers.
To further investigate the relationship between the observed proteome abundance and the cooked ingredients we processed the identified peptides in the following way: first, we filtered the identified peptides to only include those that were observed in the single-species samples (i.e., experiment numbers 1, 6 and 21). As the number of identified peptides, let alone shared-peptides, decreases rapidly with increased cooking, we only included samples from the first cooking event. For each protein in each sample, we then selected the three peptides with the highest intensities (Fig 5) and summed their intensities per sample, giving a summed intensity as a proxy of their protein abundance. Using these summed intensities as the response variable we created a linear regression model with the nutritional composition as predictors. In the first model we split the fat content of each sample by saturated, mono-unsaturated and poly-unsaturated fatty acids, while in the second model, we grouped these three fatty acid subcategories together. Both models showed a significant relationship (R2 = 0.7765, F(2,18) = 35.75), the total protein (p = 1.16e-6) and carbohydrate (p = 4.84e-6) content, in particular, showed a significant positive correlation with the observed sum of intensities.
The y-axis shows the log 10 of intensity. For the ingredient combination of each experiment number see Fig 1.
The same approach also allowed us to identify the proteins that were most abundant across our samples after the first cooking event (Table 2). Of the ten most intense proteins, eight belong to Salmo salar and the remaining two are from Bos taurus. No plant proteins feature among the most intense proteins. Interestingly, L-lactate dehydrogenase, number nine in the top ten, was only identified in a single sample (vessel 6), which was filled with 100% salmon. Additionally, Salmo salar collagen alpha-1(I) is listed twice in the top ten with two different accession numbers. An alignment of the complete sequences of these two proteins via Clustal [59] shows a 93.68% similarity in sequence between the two, indicating that there are a number of differences in the sequences. The UniProt entries of the two proteins indicate that they are both automatically predicted from genomic data, albeit from different gene identifiers [34](accessed on 08-07-2025).
The quantification strategy described above is limited by the severe cooking- and burial-induced degradation of the proteins. The number of proteins that survive all cooking events and the burial is limited across all conditions of mixed foods (Fig 3). Out of the total 623 proteins that were identified across all the cooking experiments, 13 are present in samples from both the first, fifth, ninth cooking event and post-burial (Table 3). Remarkably, of these 13 persistent proteins, three belong to plant species, while none of the B. taurus proteins were found in samples from each cooking event. An overview of all identified proteins and the number of samples they were recovered in per cooking event (SI 5) shows that there are three additional salmon proteins that were recovered from the post-excavation samples but were not observed in samples from earlier time points. Two, isocitrate dehydrogenase (B5DGS2) and aspartate aminotransferase (B5X142), were not observed in the sample taken after the fifth cooking event, while myosin-7B (A0A1S3QIK3) was not detected after the ninth cooking event. Additionally, some proteins appear to occur consistently and frequently throughout the experiment until the burial, after which they disappear. A notable example is two collagen alpha-1(I)-like entries, A0A1S3Q7E3 and A0A1S3SM10, which are found in 9–11 samples after the first, fifth and ninth cooking event, but in none after burial.
The above results highlight the challenges in linking the abundance of the identified proteins with the species used in the cooking, which was one of the prime aims of this study. Especially, as of the most intense proteins after the first cooking event (Table 2), only alpha-1,4 glucan phosphorylase (B5DG55) and myosin regulatory light chain 2 (Q7ZZN0) could be recovered in samples from all cooking events, highlighting that there are substantial differences in the impact of degradation on particular proteins. Due to the differences in chemical properties between proteins, such variation is not wholly unexpected. Recently, hydrophobicity (refs) has been gaining attention as a potentially important factor in protein survival. Therefore, we calculated the hydrophobicity of the observed fraction of our identified proteins (Fig 6) using the whole residue values as determined by Wimley and White [60].
Each dot represents a single protein and is coloured by its taxonomic origin. Hydrophobicity was calculated using amino acid hydrophobicity values following Wimley and White [60].
It is clear that the overwhelming majority of identified proteins are hydrophilic, or at least the part of the protein sequence that we identify is hydrophilic. This applies to all samples and across all cooking events, even though recent studies suggest that hydrophobicity may aid protein survival [30, 61]. Consequently, the strong bias towards hydrophobic parts of the protein sequences may be more indicative of biases in the protein extraction process rather than resistance against degradation during periods of heating and burial.
Apart from degrading proteins to the point they can no longer be detected, it is well known that processes such as heating can lead to PTMs forming, although the precise effects are not well understood [62, 63]. Our G-PTM-D search identified 43 different PTMs (SI 5), some of which were near ubiquitous, while others were extremely rare. The PTM profile of the identified PTMs on the most intense proteins from the first cooking event seems fairly consistent across all cooking events (Fig 7). Some changes in the average occupancy of PTMs can be observed, such as the increase in formylation on K and deamidation in hemoglobin subunit beta
Only PTMs that had a minimum average occupancy of 10% were included.
(P02070) and beta-enolase (A0A1S2X522), yet simultaneously N deamidation seems to decrease in L-lactate dehydrogenase (A0A1S3MJX7) and one of the salmon collagen alpha-1(I) chain-like proteins (A0A1S3SM10). Similarly, phosphorylation on S and T seems to increase in bovine collagen alpha-1(I) chain P02453, but decreases in the other salmon collagen alpha-1(I) chain-like protein (A0A1S3Q7E3).
To test if there was a clearer pattern in PTM occupancy in the larger dataset, we tested for differences between the cooking events and the occupancy using the Kruskal Wallis test for each of the 43 identified PTMs. In case the Kruskal Wallis test indicated a significant difference (p < 0.05), a post-hoc Dunn’s test was performed to identify between what cooking events the occupancy differed. Statistically significant differences were found between:
- Hydroxylation on P (ꭓ2 = 23.72, p = 2.858e-5), significant differences (p < 0.05) between all events, except between cooking event 5 and 9.
- Deamidation on N (ꭓ2 = 28.582, p = 2.742e-6), significant differences between cooking events 1 and 5 and 1 and 9.
- Sodium on E (ꭓ2 = 10.612, p = 0.01402), only differs significantly between cooking events 1 and 5.
- Potassium on E (ꭓ2 = 6.4568, p = 0.03962), significantly decreases between cooking events 1 and 9.
- Acetylation on K (ꭓ2 = 10.939, p = 0.004214), significant difference between cooking events 5 and 9.
Although these five PTMs are statistically significantly different between cooking events, the lack of data for some cooking events for in particular acetylation on K, potassium on E and to a lesser extent sodium on E raise some questions regarding the robustness of these findings (Fig 8). Similarly, hydroxylation on P seems to consistently display a high occupancy, but was hardly detected in the samples from after the burial.
Boxplots show the second and third quartile, as well as the median of the distribution.
Of secondary importance was to identify the tissue types based on the identified proteins. One theoretical advantage of proteins over other biomolecules is that they may reflect particular tissue-types. In order to achieve this we used STRING (version 12.0) [64,65] to create protein networks for each of the three species and downloaded the tissue expression analysis, which compares the genes associated with the identified proteins to a database (TISSUE) of gene expression profiles per tissue type [66]. The output of this analysis is a list of tissue types, gene names and various metrics regarding the robustness of the tissue association. We filtered this list of tissue associations to only include tissues with a false discovery rate below 0.01 and a minimum number of 35 supporting proteins. The remaining tissue-gene associations were linked back to the list of proteins identified per sample (Fig 9). Unfortunately, the TISSUE database did not include any data for the observed plant proteins, so it was not possible to associate any particular tissues with the plant proteins.
The tissue associations from detected salmon and beef proteins are mostly unspecific, e.g., “animal” or “whole body”. Several associations with heart-related tissues, such as the heart itself, but also the right atrium and ventricles, were observed. One could argue to group these together as one large cluster associated with the cardiovascular system. We chose to keep them separate, as it highlights a disadvantage of this kind of tissue annotation, namely the unspecificity of the observed proteins. The fact that the average number rounded down of associated tissues per gene in our dataset is 5 ± 2 illustrates well how generic most of these genes are. This also explains why tissues such as the liver and various glands are listed in Fig 6 as well. Despite the lack of a clear muscle-associated proteome, Fig 9 highlights the difficulty in distinguishing between raw fats and muscle based on gene expression annotations, as well as showcases the breadth of tissue associations that can be expected from a foodcrust derived from these tissues.
Foodcrust accumulation showed sizeable variation between samples (Fig 10). A linear regression model of the percentage of the ingredients and the foodcrust volume yielded no statistically significant relationships and had an adjusted R2 value of 0.16, showing that the relative abundance of the three ingredients could only explain a small portion of the foodcrust volume variability. A linear regression model of the nutritional content showed a similar pattern. The overall model had a R2 value of 0.16 and neither the absolute input of proteins, carbohydrates or fats yielded a significant relationship.
The input ingredient composition of each of the experiments is shown by the pie chart on top of the bars. Increasing the proportion of one ingredient over another does not seem to have influenced foodcrust formation.
Discussion
Protein recovery
Our results show large variations in both the number of recovered proteins as well as the relative taxonomic composition of the recovered proteome between the different samples. However, the only significant relationship that we could observe was a decrease in the number of proteins after more cooking events. Such a decrease in protein recovery is not unexpected as it is well known that proteins are vulnerable to high temperatures [67,68] and a decrease in protein recovery from fresh foodstuffs to foodcrust has been observed in previous studies [30]. In fact, one of the aims of cooking is to make food more easily digestible by opening up closed protein structures [63, 69]. Unfortunately, denatured proteins are more vulnerable to degradation [70,71], which bodes ill for protein extraction from archaeological vessels. Although the decrease in the number of recovered proteins with increased cooking events demonstrates the negative effects of prolonged heating, the generally low number of proteins found after the first cooking event indicates that even limited exposure to heat can already induce severe protein degradation.
Heating is also known to induce additional PTMs, yet this pattern was not clearly replicated in our data. Five PTMs showed statistically significant differences between the cooking events, but the limited number of observations for sodium on E, potassium on E and acetylation on K call into question the meaningfulness of these statistical relationships. In contrast, the increasing trend in occupancy of N deamidation with more advanced cooking events is supported by a fairly sizeable set of observations. Deamidation has been linked to protein degradation in archaeological samples [72–74] and an increase in deamidation after increasing cooking duration and burial is in line with our expectations. Although we could observe no statistically significant change in the Q deamidation.
However, as this study does not include unheated samples, it may be that the main effect of heating on PTMs occurs during initial heating rather than prolonged heating after partial charring has already taken place. Instead, the key message of Fig 6 is rather the diversity and high occupancy of many of the PTMs. Open searches, like a G-PTM-D, are not standard practice in palaeoproteomics, and in most studies a select number of PTMs, such as deamidation (NQ) and oxidation (M) are included. Our results show that foodcrusts contain many more PTMs and by calculating the occupancy rather than the percentage of modified residues we can show that the occurrence of PTMs at particular places in the protein sequence is consistent. Although our results do not allow us to conclude that these PTMs are the result of heating, protein identification from foodcrusts may be more limited if a large number of PTMs are not included.
Apart from the heating-induced damage, biases in the incorporation of proteins in the foodcrusts and extraction efficiency may have contributed to the limited size of the extracted proteome. For example, Fig 6 shows that in the case of nearly all recovered proteins, the part of the sequence that we are able to recover is hydrophobic. Additionally, the fact that the standard deviation is roughly equal to the mean number of detected proteins for cooking events 5, 9 and post-excavation, plus the large variation in how different experimental conditions respond to increased cooking in our opinion suggests that there might be considerable variation in protein preservation across the foodcrust in one pot. The degree of spatial variation in foodcrust protein content might best be tested with a spectroscopy-based method, such as NIR [75]. A better understanding of the amount of spatial variation as well as the scale at which it occurs could be of great benefit in maximising the proteomic data obtained from these degraded samples.
Furthermore, recent studies highlight that the current state of proteomic bioinformatic analysis only manages to assign a protein identification to a fraction of the extracted compounds [76,77]. It may be that future proteomic bioinformatic developments will significantly improve our ability to identify proteins in foodcrust samples, yet for now, we were only able to identify a small set of proteins, which for some samples do not represent all the taxa used in the cooking mixture.
All these issues may have impacted our ability to extract and identify proteins from our experimental samples, but ancient foodcrusts may have been exposed to higher temperatures, for longer periods of time and more often. Nevertheless, previous studies have shown that it is possible to extract proteins from archaeological foodcrusts, but the number of recovered proteins is often in the single digits [4,29]. Considering the limitations of recovering proteins from foodcrusts, it is important to evaluate how well we can succeed in using the limited number of found proteins to identify the taxa and tissues used during the cooking process. Especially in comparison to lipid analysis, which is the most common method for analysing pottery use [78].
Disentangling mixtures & taxonomic representativeness
One of the main theoretical advantages of proteomics over lipid analysis is the ability to distinguish the different resources in a mixture. As interpretations of resource use in lipid analysis are often based on compound-specific isotope ratios, samples with values that do not fall in the established range of the references are more challenging to interpret. Lipid biomarkers could help with this problem, but they are relatively limited, both in terms of the number of taxa with established biomarkers as well as the taxonomic specificity of the biomarkers [19,43,79,80]. Alternatively, one can apply a mixing model to calculate what combination of foodstuffs could explain the observed values. Mixing models are known to work well in estimating the proportion of a limited number of resources with distinct isotope ratios but struggle more with more complex mixtures of a larger number of foodstuffs. Additionally, species with very similar ecological niches will be nearly impossible to distinguish isotopically [81].
In theory, proteomic analysis does not suffer from these limitations. Mixing of multiple foodstuffs should not impact our ability to identify them and proteins can allow for up to species-level identifications [82], although this is dependent on the type of recovered proteins and their coverage. Although these theoretical advantages of proteomics might lead to high expectations regarding the performance of proteomic analysis of foodcrust, the aforementioned issues in recovering proteins should temper them. To illustrate this point, 25 out of the 84 samples analysed in this study yielded no proteins at all. 32 samples yielded proteins from all species that were cooked in them. In 18 samples no proteins of one of the cooked species could be found and in 4 samples two species were missing. Lastly, in five samples proteins were found that belonged to a species that was not part of the mixture cooked in that pot (Table 1). The peptides that were uniquely associated with these proteins were run through BLAST [83] to verify their taxonomic identification. The most specific taxon represented by at least two peptides was accepted as the new taxonomic identification (Table 4). This authentication step indicated that the plant proteins in samples 15.1 and 21.1 were shared up to the level of Eukaryota and could thus well derive from the ingredients cooked in the vessels. Although the BLAST taxonomic identifications are more specific for some of the remaining proteins than the others, it does appear that in samples 6.5, 7.1, 12.1 and 12.9 there are proteins present from a species that was not used in the cooking.
There are several explanations for these ‘lost’ proteins, which occur in 5 out of 84 samples. It could be that the contamination occurred during the cooking experiment or during sampling, although sampling equipment was cleaned with ethanol after each sample. Contamination could have occurred during protein extraction, but all proteins present in the blank were not included in further analysis. Another potential explanation is machine carry over. The fish-specific peptides detected in samples 6.1, 12.1, 12.9 were all also detected in the samples run before them. It may have happened that proteins from a previous sample remained stuck on the column and were only eluted with the next sample. Carry over seems a less likely explanation for the B. taurus protein in sample 6.5, as it was the first sample in its batch. Instead, it may be that this protein, A0A3Q1LL35, is background contamination. It is listed as an uncharacterised protein and the taxon assigned by BLAST, Boreoeutheria, encompasses most of the placental mammals. Consequently, this protein could have derived from any number of common contaminants that could have been introduced at any stage of the experiment.
As with the overall number of proteins, the representativeness of the extracted proteome decreases substantially with additional cooking episodes. After the first cooking event 5 of the 21 samples were missing one of the input foods, but after burying the samples 6 out of 7 samples were missing proteins from one or two of the cooked foods. Interestingly, despite 3 of 7 samples yielding protein after burial containing beef fat, none yielded Bos taurus proteins. To check if there was a systematic bias in recovery against any particular species we calculated the recovery rate of each species by dividing the number of samples they were found in by the number of samples that the species was cooked in (Table 5). The aforementioned 6 samples with proteins from species that were not part of the mixture were excluded from the recovery rate calculation for that particular species. The protein abundance of each species was not taken into account in calculating the recovery rate due to the previously discussed challenges in quantifying protein abundance for degraded samples.
Table 5 shows clear differences in recovery rate between the foodstuffs. In general, beetroot was the most challenging to recover, which is in line with expectations considering the low protein content of beetroot. Vice versa, the fact that salmon has the highest recovery rate is in line with it having the highest protein content. Additionally, although in general there is a pattern of lower numbers of proteins and decreased recovery rate with an increased number of cooking events, the recovery rate for beetroot and beef is higher for cooking event 9 than 5. However, the Kruskal Wallis and Dunn’s test showed that cooking events 5 and 9 were the only two groups without a statistically significant difference in protein numbers. The lack of a statistically significant difference may be explained by the addition of fresh food at the start of each cooking event. It may be that it is this ‘freshest’ food incorporated into the foodcrust that is yielding us proteins.
Secondly, Table 5 highlights the sudden disappearance of Bos taurus proteins after burial. It is unclear what causes the lack of Bos taurus protein preservation. It could be that the proteins in beef fat (which includes vimentin, creatine kinase B-type and heat shock protein beta-1) are simply more vulnerable to degradation due to their structure. Although Fig 5 shows that the Bos taurus proteins identified are associated with largely the same tissues as the Salmo salar proteins. Alternatively, the vulnerability to degradation may be due to the structure of the foodcrust and how fat-rich foods are incorporated in them rather than the structure of the proteins. There is some reason to suspect other factors might also be at play. A recent study performed a similar experiment, generating and burying foodcrusts made using red deer (Cervus elaphus), salmon (Salmo salar) and sweet chestnut (Castanea sativa) [30]. Although our experiment did not use Cervus elaphus, it may be compared to our vessels with Bos taurus. The two species are closely enough related that most of their proteins should be similar, if not their sequences. Of their top 5 most abundant deer proteins in the foodcrust after cooking by peptide count, only fructose-bisphosphate aldolase (A0A3S5ZPB0) was detected in our samples and only 2 samples from the first cooking event. Two of the other common deer proteins, troponin T3 and myosin-1 were not directly detected in our samples, but myosin light chain 1/3 (A0JNJ5) and troponin I2 (F6QIC1) were detected, although again in a limited number of samples. As for the most abundant proteins after cooking, only myoglobin was detected in our samples, but bovine myoglobin is the 7th most frequently recovered bovine protein in our samples. However, it was not found in any samples after burial.
There is more overlap in the Salmo salar proteins between our study and Evans et al. (2024). All five most abundant Salmo salar proteins from their after-cooking samples were observed in our samples, although they do not occur in the top 10 of most frequently recovered Salmo salar proteins in our samples. As for the most abundant proteins after burial, four out of five were detected in our samples, but only one, myosin heavy chain, fast skeletal muscle-like (A0A1S3QIK3), in one of our after-burial samples.
The recovery of sweet chestnut and beetroot cannot be compared as easily as Cervus elaphus and Bos taurus, but it is noteworthy that they recover chestnut proteins less frequently than Salmo salar or Cervus elaphus proteins in their buried samples, whereas we do find plant proteins, but no Bos taurus proteins in our buried samples. Of course, their experiment used deer meat, while we used beef fat and it may be that this changes the preservation potential of the proteins contained within the tissue. Regardless of what prevented us from recovering Bos taurus proteins, it is clear that, especially for buried samples, the recovered proteome might not reflect all the foodstuffs cooked in the vessel and that there is substantial variability in our ability to obtain proteins from buried samples.
Compared to other proteomic analyses of organic residues, the results presented here differ somewhat in the diversity of proteins identified. Previous cooking experiments (without burial) on protein extraction from ceramics found mostly myosin, collagen and haemoglobin type proteins [84, 85]. Myosins and collagens have also been found in the limited number of studies on archaeological foodcrust [29], as well as parvalbumin and vitellogenin in one specific study [4]. Calcified residues have also yielded myosin and haemoglobin proteins [86], but in general this type of residue seems to more frequently yield dairy proteins [26, 28]. Collagen, myosin and haemoglobin also feature among the most intense proteins we observed after the first cooking event (Table 2), but only myosin persisted through all cooking events and burial (Table 3). Instead, most of the other proteins listed in Table 3 are not commonly reported from organic residues. Data from other types of organic residues, such as from the ceramic matrix or calcified residues are not necessarily directly comparable to foodcrusts. Although these materials also contain dietary proteins, each of them has their own unique properties and processes of protein incorporation. Nonetheless, based on previous studies it seems that for meat-based foodstuffs proteins such as collagens, myosin and haemoglobins form a ‘core’ proteome that can be relatively frequently recovered.
Foodcrust abundance
Our and previous results [30] show that the protein contents of foodcrusts do not ‘fairly’ represent all the foodstuffs prepared in the vessel. Biases in protein recovery for particular foodstuffs were expected to a certain extent, but Fig 10 suggests that there are also differences in the quantity of foodcrust that particular foodstuffs generate. Fig 10 shows that there is not one particular food that is driving the formation of more foodcrust. Overall, the beetroot-salmon mixtures produce the most foodcrust, followed by beetroot-salmon-beef fat mixtures. Beetroot-beef fat or salmon-beef fat seems to produce the least foodcrust, suggesting that the presence of such a fat-rich food is not necessarily a driver of foodcrust formation. Indeed, nowadays in contact frying it is common to add a lipid-rich substance, such as oil or butter, to prevent food from sticking to cookware. This common practice is supported by studies showing that increasing amounts of oil reduces the adhesion of food to containers [87, 88]. Lipid-based coatings have even been proposed for pharmaceutical containers to prevent nonspecific binding of the active components of drugs to container walls [89]. Beyond food content, experimental studies on organic residue formation, or ‘fouling’ [90], show that the material characteristics (e.g., surface roughness) have an substantial effect on the non-stick properties of cookware, as well as the ease of cleaning [88, 91].
The key finding of Fig 10 is perhaps to highlight the importance of carbohydrate and protein-rich foodstuffs in generating foodcrust in the first place, while fat-rich foods seem to play a lesser role. Yet the most common method of analysing these organic residues has so far been lipid analysis, which may bias our interpretation of the processed resources towards fat-rich foodstuffs. Such a variable pattern in foodcrust formation in our dataset emphasises the value of replicates, which we, unfortunately, are not able to provide. Nonetheless, we argue that these results are a valuable addition to our current understanding of the formation of organic residues, especially because as far as we are aware there has been no absolute quantification of the abundance of foodcrust formed up to this date.
Additionally, although we could not obtain quantitative measurements of the volume of foodcrust forming on different parts of the pot (e.g., rim, body or bottom), it is clear from Fig 2 and the 3D scans that foodcrusts predominantly, and in the case of some vessels only, at the rim of the vessel. In particular, foodcrusts seem to form mostly at the interface between the water level and the ceramic, which seems to be in line with observations on the formation of ‘fouling’ in industrial dairy processing contexts [92]. A potential explanation for the proclivity of foodcrust formation at this position might be the steep gradient in temperature. Most of the food during the cooking events was submerged in water, which cannot exceed its boiling point of 100 °C, whereas the ceramic itself will be closer to the 270 °C the experiment was conducted at. As the waterline receded through evaporation more of the food would suddenly be exposed to the higher temperatures, potentially prompting the formation of foodcrust. This applies to both pieces of the solid foodstuffs, as well as any soluble components.
Conclusion
This experiment had three core aims: firstly, to determine the relationship between the relative ratios of input foods and the proportions between the species of the recovered proteins. Secondly, to measure the impact of repeated cooking and burial on protein preservation and lastly how the different ingredient combinations impacted the amount of foodcrust that formed.
The data shows clearly that the relative abundance of recovered proteins does not mirror the actual input of foodstuffs. Most identified proteins belonged to Salmo salar, followed by Bos taurus and lastly Beta vulgaris. This pattern applied to samples taken after cooking events 1, 5 and 9. However, in the samples taken after the burial no Bos taurus proteins could be recovered, although a small number of Beta vulgaris proteins were identifiable. These results highlight that not only do food input and recovered proteins not necessarily align, the relative abundance of taxa may change due to increased degradation.
Secondly, the number of recovered proteins decreased significantly from the first to the fifth cooking event, plateaued from the fifth to the ninth and decreased again from the ninth event until after burial.
Lastly, no statistically significant predictor of the foodcrust volume could be identified, but the mixtures that formed the most foodcrust in terms of volume were beetroot-salmon, followed by a mixture of all three ingredients, suggesting that lipid-rich foods are not as important for the volume of foodcrust.
Supporting information
S1 Data. Nutritional composition of the used foods.
https://doi.org/10.1371/journal.pone.0330195.s001
(CSV)
S2 Data. Overview of animal and plant species identified at Danish Ertebølle sites and the availability of public protein reference data for these species.
https://doi.org/10.1371/journal.pone.0330195.s002
(XLSX)
S3 Data. Metadata of the MetaMorpheus database search.
https://doi.org/10.1371/journal.pone.0330195.s003
(ZIP)
S5 Data. Overview of the number of the number of foodcrusts per cooking event a protein was identified in.
https://doi.org/10.1371/journal.pone.0330195.s005
(XLSX)
S6 Data. Overview of the identified PTMs, both per protein and summarised per cooking event.
https://doi.org/10.1371/journal.pone.0330195.s006
(XLSX)
Acknowledgments
We thank the Bioscience Technology Facility and Chemistry Department at the University of York for mass spectrometry access and support, in particular we would like to thank Dr. Adam Dowle for his support in running the samples and feedback on the manuscript. The instrumentation is part of the York Centre of Excellence in Mass Spectrometry. The centre was created thanks to a major capital investment through Science City York, supported by Yorkshire Forward with funds from the Northern Way Initiative and subsequent support from EPSRC (EP/K039660/1; EP/M028127/1).
We would like to thank the YEAR Centre for their permission to use the Centre’s grounds for the burial experiment and in particular Dr. Gareth Perry for his assistance in the burial and excavation of the experimental vessels.
For the morphological taxonomic verification of the Salmo salar specimens, we would like to thank Dr. Katrien Dierickx and for his assistance in figuring out Sankey diagrams we thank Dr. Jakob Hansen. Additionally, we would like to thank Dr. Claire Koenig for her advice on protein quantification.
Additionally, our gratitude goes to Patrick Gibbs and James Osborn from Heritage360, University of York, for access to their 3D scanning equipment and assistance in the acquisition and analysis of the 3D data.
Lastly, we would like to thank the two anonymous reviewers for their constructive comments, which helped improve the manuscript.
References
- 1. Oras E, Lucquin A, Lõugas L, Tõrv M, Kriiska A, Craig OE. The adoption of pottery by north-east European hunter-gatherers: Evidence from lipid residue analysis. J Archaeol Sc. 2017;78:112–9.
- 2. Lundy J, Drieu L, Orecchioni P, Meo A, Aniceti V, Fiorentino G, et al. Cuisine in transition? Organic residue analysis of domestic containers from 9th-14th century Sicily. R Soc Open Sci. 2023;10(3):221305. pmid:36908986
- 3. Hendy J, Colonese AC, Franz I, Fernandes R, Fischer R, Orton D, et al. Ancient proteins from ceramic vessels at Çatalhöyük West reveal the hidden cuisine of early farmers. Nat Commun. 2018;9(1):4064. pmid:30283003
- 4. Shevchenko A, Schuhmann A, Thomas H, Wetzel G. Fine Endmesolithic fish caviar meal discovered by proteomics in foodcrusts from archaeological site Friesack 4 (Brandenburg, Germany). PLoS One. 2018;13(11):e0206483. pmid:30485287
- 5. Chowdhury MP, Campbell S, Buckley M. Proteomic analysis of archaeological ceramics from Tell Khaiber, southern Iraq. J Archaeol Sci. 2021;132:105414.
- 6. Cronin T, Downey L, Synnott C, McSweeney P, Kelly EP, Cahill M, et al. Composition of ancient Irish bog butter. International Dairy Journal. 2007;17(9):1011–20.
- 7. Greco E, El-Aguizy O, Ali MF, Foti S, Cunsolo V, Saletti R, et al. Proteomic Analyses on an Ancient Egyptian Cheese and Biomolecular Evidence of Brucellosis. Anal Chem. 2018;90(16):9673–6. pmid:30044608
- 8. Yang Y, Shevchenko A, Knaust A, Abuduresule I, Li W, Hu X, et al. Proteomics evidence for kefir dairy in Early Bronze Age China. J Archaeol Sci. 2014;45:178–86.
- 9. Craig OE, Saul H, Lucquin A, Nishida Y, Taché K, Clarke L, et al. Earliest evidence for the use of pottery. Nature. 2013;496(7445):351–4. pmid:23575637
- 10. Admiraal M, Lucquin A, Drieu L, Casale S, Jordan PD, Craig OE. Leftovers: The presence of manufacture‐derived aquatic lipids in Alaskan pottery. Archaeometry. 2019;62(2):346–61.
- 11. Admiraal M, Colonese AC, Milheira RG, da Rocha Bandeira D, Demathe A, Pereira Dos Santos AM, et al. Chemical analysis of pottery reveals the transition from a maritime to a plant-based economy in pre-colonial coastal Brazil. Sci Rep. 2023;13(1):16771. pmid:37798303
- 12. Copley MS, Berstan R, Dudd SN, Docherty G, Mukherjee AJ, Straker V, et al. Direct chemical evidence for widespread dairying in prehistoric Britain. Proc Natl Acad Sci U S A. 2003;100(4):1524–9. pmid:12574520
- 13. Copley MS, Berstan R, Dudd SN, Aillaud S, Mukherjee AJ, Straker V, et al. Processing of milk products in pottery vessels through British prehistory. Antiquity. 2005;79(306):895–908.
- 14. Craig OE, Chapman J, Heron C, Willis LH, Bartosiewicz L, Taylor G, et al. Did the first farmers of central and eastern Europe produce dairy foods? Antiquity. 2005;79(306):882–94.
- 15. Craig OE, Steele VJ, Fischer A, Hartz S, Andersen SH, Donohoe P, et al. Ancient lipids reveal continuity in culinary practices across the transition to agriculture in Northern Europe. Proc Natl Acad Sci U S A. 2011;108(44):17910–5. pmid:22025697
- 16. Bondetti M, Lucquin A, Savel’ev NA, Weber AW, Craig OE, Jordan PD. Resource processing, early pottery and the emergence of Kitoi culture in Cis-Baikal: Insights from lipid residue analysis of an Early Neolithic ceramic assemblage from the Gorelyi Les habitation site, Eastern Siberia. Archaeol Res Asia. 2020;24:100225.
- 17. Courel B, Robson HK, Lucquin A, Dolbunova E, Oras E, Adamczak K, et al. Organic residue analysis shows sub-regional patterns in the use of pottery by Northern European hunter–gatherers. R Soc open sci. 2020;7(4):192016.
- 18. Lucquin A, Robson HK, Oras E, Lundy J, Moretti G, González Carretero L, et al. The impact of farming on prehistoric culinary practices throughout Northern Europe. Proc Natl Acad Sci U S A. 2023;120(43):e2310138120. pmid:37844237
- 19. Colonese AC, Hendy J, Lucquin A, Speller CF, Collins MJ, Carrer F, et al. New criteria for the molecular identification of cereal grains associated with archaeological artefacts. Sci Rep. 2017;7(1):6633. pmid:28747692
- 20. Courel B, Meadows J, Carretero LG, Lucquin A, McLaughlin R, Bondetti M, et al. The use of early pottery by hunter-gatherers of the Eastern European forest-steppe. Quaternary Science Reviews. 2021;269:107143.
- 21. Kubiak-Martens L, Oudemans TFM, Brozio JP, Filipović D, Müller J, Kirleis W. Transformation of cereal grains: Botanical and chemical analysis of food residues encrusted on pottery from the Funnel Beaker settlement of Oldenburg LA 77, northern Germany. PLoS One. 2024;19(1):e0296986. pmid:38241419
- 22. Kubiak-Martens L, Brinkkemper O, Oudemans TFM. What’s for dinner? Processed food in the coastal area of the northern Netherlands in the Late Neolithic. Veget Hist Archaeobot. 2014;24(1):47–62.
- 23. Warinner C, Hendy J, Speller C, Cappellini E, Fischer R, Trachsel C, et al. Direct evidence of milk consumption from ancient human dental calculus. Sci Rep. 2014;4:7104. pmid:25429530
- 24. Charlton S, Ramsøe A, Collins M, Craig OE, Fischer R, Alexander M, et al. New insights into Neolithic milk consumption through proteomic analysis of dental calculus. Archaeol Anthropol Sci. 2019;11(11):6183–96.
- 25. Bleasdale M, Richter KK, Janzen A, Brown S, Scott A, Zech J, et al. Ancient proteins provide evidence of dairy consumption in eastern Africa. Nat Commun. 2021;12(1):632. pmid:33504791
- 26. Evans M, Lundy J, Lucquin A, Hagan R, Kowalski Ł, Wilczyńki J, et al. Detection of dairy products from multiple taxa in Late Neolithic pottery from Poland: an integrated biomolecular approach. R Soc Open Sci. 2023;10(3):230124. pmid:36938542
- 27. Solazzo C, Fitzhugh WW, Rolando C, Tokarski C. Identification of protein remains in archaeological potsherds by proteomics. Anal Chem. 2008;80(12):4590–7. pmid:18494502
- 28. Tanasi D, Cucina A, Cunsolo V, Saletti R, Di Francesco A, Greco E, et al. Paleoproteomic profiling of organic residues on prehistoric pottery from Malta. Amino Acids. 2021;53(2):295–312. pmid:33582869
- 29. Lyu N, Yan L, Wang T, Lin L, Rao H, Yang Y. Characterization of pottery foodcrusts through lipid and proteomic analyses: A case study from the Xiawan site in Yixing city, East China. J Archaeol Sci. 2024;161:105902.
- 30. Evans M, Hagan R, Boyd OJ, Bondetti M, Craig OE, Collins MJ, et al. The impact of cooking and burial on proteins: a characterisation of experimental foodcrusts and ceramics. R Soc Open Sci. 2024;11(9):240610. pmid:39416716
- 31. Schroeter ER, DeHart CJ, Schweitzer MH, Thomas PM, Kelleher NL. Bone protein “extractomics”: comparing the efficiency of bone protein extractions of Gallus gallus in tandem mass spectrometry, with an eye towards paleoproteomics. PeerJ. 2016;4:e2603. pmid:27812413
- 32. Kubiak-Martens L. The plant food component of the diet at the late Mesolithic (Ertebolle) settlement at Tybrind Vig, Denmark. Veget Hist Archaebot. 1999;8(1–2):117–27.
- 33.
Mason S. Archaeobotanical analysis-Møllegabet II. In: Skaarup J, Grøn, editors. Møllegabet II, A submerged Mesolithic settlement in southern Denmark. Archaeopress. 2004:122–43.
- 34. UniProt Consortium. UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 2023;51(D1):D523–31. pmid:36408920
- 35. Kubiak-Martens L. New evidence for the use of root foods in pre-agrarian subsistence recovered from the late Mesolithic site at Halsskov, Denmark. Veget Hist Archaeobot. 2002;11(1–2):23–32.
- 36. Price DT, Gebauer AB, Hede US, Larsen CS, Noe-Nygaard N, Mason SLR, et al. Smakkerup Huse: A Mesolithic Settlement in NW Zealand, Denmark. J Field Archaeol. 2001;28(1–2):45–67.
- 37. Holst D. Hazelnut economy of early Holocene hunter–gatherers: a case study from Mesolithic Duvensee, northern Germany. J Archaeol Sci. 2010;37(11):2871–80.
- 38.
Robinson DE, Harild JA. Archaeobotany of an early Ertebölle (late Mesolithic) site at Halsskov, Zealand, Denmark. In: Mason SLR, Hather J, editors. Hunter-Gatherer Archaeobotany: Perspectives from the Northern Temperate Zone. London, England: Routledge. 2009.
- 39. Richter J, Noe-Nygaard N. A Late Mesolithic Hunting Station at Agernæs, Fyn, Denmark. ACAR. 2003;74(1):1–32.
- 40. Astrup PM, Benjamin J, Stankiewicz F, Woo K, McCarthy J, Wiseman C, et al. A drowned Mesolithic shell midden complex at Hjarnø Vesterhoved, Denmark and its wider significance. Quaternary Science Reviews. 2021;258:106854.
- 41.
Enghoff I. Regionality and biotope exploitation in Danish Ertebølle and adjoining periods. Det Kongelige Danske Videnskabernes Selskab. 2011.
- 42. Price TD, Ritchie K, Gron KJ, Gebauer AB, Nielsen J. Asnæs Havnemark: a late Mesolithic Ertebølle coastal site in western Sjælland, Denmark. Danish J Archaeology. 2018;7(2):255–76.
- 43. Bondetti M, Scott E, Courel B, Lucquin A, Shoda S, Lundy J, et al. Investigating the formation and diagnostic value of ω-(o-alkylphenyl)alkanoic acids in ancient pottery. Archaeometry. 2021;63(3):594–608. pmid:34219747
- 44.
Page LM, B M B. Peterson Field Guide to Freshwater Fishes of North America North of Mexico. Houghton Mifflin Harcourt. 2011.
- 45.
Met Office National Climate Information Centre. Areal values from HadUK-Grid 1km gridded climate data from land surface network. 2025. https://www.metoffice.gov.uk/pub/data/weather/uk/climate/datasets/Tmean/date/England_N.txt
- 46. Cleland TP. Human Bone Paleoproteomics Utilizing the Single-Pot, Solid-Phase-Enhanced Sample Preparation Method to Maximize Detected Proteins and Reduce Humics. J Proteome Res. 2018;17(11):3976–83. pmid:30336043
- 47.
Harvey VL, Hendy J. SP3 protocol optimised for foodcrust protein extraction from archaeological cooking vessels. https://www.protocols.io/view/sp3-protocol-optimised-for-foodcrust-protein-extra-dyge7tte.pdf 2025.
- 48. Dekker J, Larson T, Tzvetkov J, Harvey VL, Dowle A, Hagan R, et al. Spatial analysis of the ancient proteome of archeological teeth using mass spectrometry imaging. Rapid Commun Mass Spectrom. 2023;37(8):e9486. pmid:36735645
- 49. Solntsev SK, Shortreed MR, Frey BL, Smith LM. Enhanced Global Post-translational Modification Discovery with MetaMorpheus. J Proteome Res. 2018;17(5):1844–51. pmid:29578715
- 50. Prus G, Hoegl A, Weinert BT, Choudhary C. Analysis and Interpretation of Protein Post-Translational Modification Site Stoichiometry. Trends Biochem Sci. 2019;44(11):943–60. pmid:31296352
- 51.
The Global Proteome Machine Organization. cRAP protein sequences. The Global Proteome Machine. https://www.thegpm.org/crap/. 2012.
- 52. Hendy J, Welker F, Demarchi B, Speller C, Warinner C, Collins MJ. A guide to ancient protein studies. Nat Ecol Evol. 2018;2(5):791–9. pmid:29581591
- 53.
R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. 2021.
- 54.
Posit team. RStudio: Integrated Development Environment for R. Boston, MA: Posit Software, PBC. 2024.
- 55.
Public Health England. McCance and Widdowson’s The Composition of Foods Integrated Dataset 2021. 2021. Available from: https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/971021/McCance_and_Widdowsons_Composition_of_Foods_integrated_dataset_2021.pdf
- 56. Kruskal WH, Wallis WA. Use of Ranks in One-Criterion Variance Analysis. J Am Stat Assoc. 1952;47(260):583–621.
- 57. Dunn OJ. Multiple Comparisons among Means. J Am Stat Assoc. 1961;56(293):52–64.
- 58.
Dinno A. dunn.test: Dunn’s Test of Multiple Comparisons Using Rank Sums. Available from: https://rdrr.io/cran/dunn.test/man/dunn.test.html. 2017.
- 59. Sievers F, Higgins DG. Clustal Omega for making accurate alignments of many protein sequences. Protein Sci. 2018;27: 135–145.
- 60. Wimley WC, White SH. Experimentally determined hydrophobicity scale for proteins at membrane interfaces. Nat Struct Biol. 1996;3(10):842–8. pmid:8836100
- 61. Morton-Hayward AL, Anderson RP, Saupe EE, Larson G, Cosmidis JG. Human brains preserve in diverse environments for at least 12 000 years. Proc Biol Sci. 2024;291(2019):20232606. pmid:38503334
- 62. Yu T-Y, Morton JD, Clerens S, Dyer JM. Proteomic investigation of protein profile changes and amino acid residue-level modification in cooked lamb longissimus thoracis et lumborum: The effect of roasting. Meat Sci. 2016;119:80–8. pmid:27150797
- 63. Yu T-Y, Morton JD, Clerens S, Dyer JM. Cooking-Induced Protein Modifications in Meat. Compr Rev Food Sci Food Saf. 2017;16(1):141–59. pmid:33371543
- 64. Snel B, Lehmann G, Bork P, Huynen MA. STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene. Nucleic Acids Res. 2000;28(18):3442–4. pmid:10982861
- 65. Szklarczyk D, Kirsch R, Koutrouli M, Nastou K, Mehryary F, Hachilif R, et al. The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 2023;51(D1):D638–46. pmid:36370105
- 66. Palasca O, Santos A, Stolte C, Gorodkin J, Jensen LJ. TISSUES 2.0: an integrative web resource on mammalian tissue expression. Database (Oxford). 2018;2018:bay003. pmid:29617745
- 67. Ortner DJ, vonEndt DW, Robinson MS. The Effect of Temperature on Protein Decay in Bone: Its Significance in Nitrogen Dating of Archaeological Specimens. Am antiq. 1972;37(4):514–20.
- 68. Collins MJ, Nielsen–Marsh CM, Hiller J, Smith CI, Roberts JP, Prigodich RV, et al. The survival of organic matter in bone: a review. Archaeometry. 2002;44(3):383–94.
- 69. Bax M-L, Aubry L, Ferreira C, Daudin J-D, Gatellier P, Rémond D, et al. Cooking temperature is a key determinant of in vitro meat protein digestion rate: investigation of underlying mechanisms. J Agric Food Chem. 2012;60(10):2569–76. pmid:22335241
- 70. Roberts SJ, Smith CI, Millard A, Collins MJ. The taphonomy of cooked bone: characterizing boiling and its physico–chemical effects. Archaeometry. 2002;44(3):485–94.
- 71. Zielbauer BI, Franz J, Viezens B, Vilgis TA. Physical Aspects of Meat Cooking: Time Dependent Thermal Protein Denaturation and Water Loss. Food Biophysics. 2015;11(1):34–42.
- 72. Ramsøe A, van Heekeren V, Ponce P, Fischer R, Barnes I, Speller C, et al. DeamiDATE 1.0: Site-specific deamidation as a tool to assess authenticity of members of ancient proteomes. J Archaeol Sci. 2020;115:105080.
- 73. Pal Chowdhury M, Buckley M. Trends in deamidation across archaeological bones, ceramics and dental calculus. Methods. 2022;200:67–79. pmid:34450289
- 74. Pal Chowdhury M, Wogelius R, Manning PL, Metz L, Slimak L, Buckley M. Collagen deamidation in archaeological bone as an assessment for relative decay rates. Archaeometry. 2019;61(6):1382–98.
- 75. Maduro Dias CSAM, Nunes HP, Melo TMMV, Rosa HJD, Silva CCG, Borba AES. Application of Near Infrared Reflectance (NIR) spectroscopy to predict the moisture, protein, and fat content of beef for gourmet hamburger preparation. Livestock Science. 2021;254:104772.
- 76. Chiang Y, Welker F, Collins MJ. Spectra without stories: reporting 94% dark and unidentified ancient proteomes. Open Res Eur. 2024;4:71. pmid:38903702
- 77. Rodriguez Palomo I, Nair B, Chiang Y, Dekker J, Dartigues B, Mackie M, et al. Benchmarking the identification of a single degraded protein to explore optimal search strategies for ancient proteins. Peer Community Journal. 2024;4.
- 78.
Cubas M, Becher J, Chiang Y, Antonius Dekker JA, Di Muro A, Doliente JE, et al. Ceramics: Organic Residue Analysis. Encyclopedia of Archaeology (Second Edition). Elsevier. 2024:398–405. doi: https://doi.org/10.1016/b978-0-323-90799-6.00047-1
- 79. Whelton HL, Hammann S, Cramp LJE, Dunne J, Roffet-Salque M, Evershed RP. A call for caution in the analysis of lipids and other small biomolecules from archaeological contexts. J Archaeol Sci. 2021;132:105397.
- 80. Drieu L, Rageot M, Wales N, Stern B, Lundy J, Zerrer M, et al. Is it possible to identify ancient wine production using biomolecular approaches?. STAR: Science & Technology of Archaeological Research. 2020;6(1):16–29.
- 81. Cheung C, Szpak P. Interpreting Past Human Diets Using Stable Isotope Mixing Models. J Archaeol Method Theory. 2020;28(4):1106–42.
- 82. Rüther PL, Husic IM, Bangsgaard P, Gregersen KM, Pantmann P, Carvalho M, et al. SPIN enables high throughput species identification of archaeological bone by proteomics. Nat Commun. 2022;13(1):2458. pmid:35513387
- 83. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215: 403–410. doi:https://doi.org/doi:10.1016/S0022-2836(05)80360-284.
- 84. Barker A, Venables B, Stevens SM Jr, Seeley KW, Wang P, Wolverton S. An Optimized Approach for Protein Residue Extraction and Identification from Ceramics After Cooking. J Archaeol Method Theory. 2011;19(3):407–39.
- 85. Pal Chowdhury M, Makarewicz C, Piezonka H, Buckley M. Novel Deep Eutectic Solvent-Based Protein Extraction Method for Pottery Residues and Archeological Implications. J Proteome Res. 2022;21(11):2619–34. pmid:36268809
- 86. Hendy J, Warinner C, Bouwman A, Collins MJ, Fiddyment S, Fischer R, et al. Proteomic evidence of dietary sources in ancient dental calculus. Proc Biol Sci. 2018;285(1883):20180977. pmid:30051838
- 87. Magens OM, Liu Y, Hofmans JFA, Nelissen JA, Ian Wilson D. Adhesion and cleaning of foods with complex structure: Effect of oil content and fluoropolymer coating characteristics on the detachment of cake from baking surfaces. Journal of Food Engineering. 2017;197:48–59.
- 88.
Ashokkumar S, Thomsen BR, Hinke J, Møller P, Adler-Nissen J. Cleanability evaluation of different surfaces by fouling from contact frying of foods. 2010. Available from: https://www.ceb.cam.ac.uk/system/files/documents/45-ashokkumar-fcfp2010.pdf
- 89. Ma GJ, Yoon BK, Sut TN, Yoo KY, Lee SH, Jeon W, et al. Lipid coating technology: A potential solution to address the problem of sticky containers and vanishing drugs. VIEW. 2021;3(3).
- 90. Goode KR, Asteriadou K, Robbins PT, Fryer PJ. Fouling and Cleaning Studies in the Food and Beverage Industry Classified by Cleaning Type. Comp Rev Food Sci Food Safe. 2013;12(2):121–43.
- 91. Ashokkumar S, Adler-Nissen J. Evaluating non-stick properties of different surface materials for contact frying. Journal of Food Engineering. 2011;105(3):537–44.
- 92. Saget M, de Almeida CF, Fierro V, Celzard A, Delaplace G, Thomy V, et al. A critical review on surface modifications mitigating dairy fouling. Compr Rev Food Sci Food Saf. 2021;20(5):4324–66. pmid:34250733