Article Text

Evaluation of different stool extraction methods for metabolomics measurements in human faecal samples
  1. Vanessa Erben1,2,
  2. Gernot Poschet3,
  3. Petra Schrotz-King1 and
  4. Hermann Brenner1,4,5
  1. 1Division of Preventive Oncology, National Center of Tumor Diseases, Heidelberg, Germany
  2. 2Medical Faculty Heidelberg, University Heidelberg, Heidelberg, Germany
  3. 3Centre for Organismal Studies (COS), Heidelberg University, Heidelberg, Germany
  4. 4Division of Clinical Epidemiology and Aging Research, German Cancer Research Centre, Heidelberg, Germany
  5. 5German Cancer Consortium (DKTK), Heidelberg, Germany
  1. Correspondence to Petra Schrotz-King, Division of Preventive Oncology, National Center of Tumor Diseases, Heidelberg 69120, Germany; petra.schrotz-king{at}


Background Metabolomics analysis of human stool samples is of great interest for a broad range of applications in biomedical research including early detection of colorectal neoplasms. However, due to the complexity of metabolites there is no consensus on how to process samples for stool metabolomics measurements to obtain a broad coverage of hydrophilic and hydrophobic substances.

Methods We used frozen stool samples (50 mg) from healthy study participants. Stool samples were processed after thawing using eight different processing protocols and different solvents (solvents such as phosphate-buffered saline, isopropanol, methanol, ethanol, acetonitrile and solvent mixtures with or without following evaporation and concentration steps). Metabolites were measured afterwards using the MxP Quant 500 kit (Biocrates). The best performing protocol was subsequently applied to compare stool samples of participants with different dietary habits.

Results In this study, we were able to determine up to 340 metabolites of various chemical classes extracted from stool samples of healthy study participants with eight different protocols. Polar metabolites such as amino acids could be measured with each method while other metabolite classes, particular lipid species (better with isopropanol and ethanol or methanol following a drying step), are more dependent on the solvent or combination of solvents used. Only a small number of triglycerides or acylcarnitines were detected in human faeces. Extraction efficiency was higher for protocols using isopropanol (131 metabolites>limit of detection (LOD)) or those using ethanol or methanol and methyl tert-butyl ether (MTBE) including an evaporation and concentration step (303 and 342 metabolites>LOD, respectively) than for other protocols. We detected significant faecal metabolite differences between vegetarians, semivegetarians and non-vegetarians.

Conclusion For the evaluation of metabolites in faecal samples, we found protocols using solvents like isopropanol and those using ethanol or methanol, and MTBE including an evaporation and concentration step to be superior regarding the number of detected metabolites of different chemical classes over others tested in this study.

  • dietary patterns

Data availability statement

All data relevant to the study are included in the article or uploaded as supplementary information.

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See:

Statistics from

What this paper adds

  • A broad range of metabolites can be extracted in human stool samples with 8 different methodologies.

  • Polar metabolites were measured equally well with all protocols applied.

  • Other metabolites are highly dependent on the extraction method.

  • Extraction methods using (1) isopropanol or (2) ethanol or methanol and methyl tert-butyl ether including an evaporation and concentration step seem to be preferable (yielding the highest number and a broad range of metabolites).

  • These methods should be considered in order to get comparable results across different studies.


Metabolomics analysis of human stool samples is of great interest for a broad range of applications in biomedical research including early detection of colorectal cancer (CRC) as a non-invasive alternative to colonoscopy. Metabolomics might be promising for this purpose as metabolites are closely related to the phenotype and depict current metabolic processes happening in an organism. Stool is directly associated with the gut and may reflect changes in metabolism very early through its transit in the gut.1

Faecal mass consists to a great proportion of water and bacteria but also food components or metabolites.2 Faecal samples show great variability in their material content and characteristics, which makes it difficult to standardise the process from collection to processing and analysis including the analytical platform.3 The latter also defines the sensitivity of the analysis and the type of analytes available for analyses. Apart from the aforementioned differences of stool samples, faecal metabolomics might be affected by different confounding factors such as diet, the host and microbial metabolism. Metabolic changes in stool might be derived directly from the development of cancer or precancerous cells (host metabolism) or from a change in the gut microbiota (microbial metabolism) which both result in a distinct metabolic phenotype that might be characteristic for the disease.1 The microbiota itself was found to be differently present in patients with colorectal neoplasms or CRC compared with healthy individuals.4 5 Moreover, it was shown that diet has a clear effect on the microbial composition and metabolism.6 The metabolic profile of CRC or its precursors may help in the understanding of disease development, progression and early detection.7

Some studies have already found faecal metabolomics biomarkers for early detection of CRC but metabolite selection strongly varied1 8–10 and as different studies were using different processing methods, no direct comparison is possible. There is no consensus how to process stool samples for metabolomics measurements to get reliable and reproducible results.11–14 A review by Deda et al focused on the existing various stool preparation protocols and found the metabolites to be dependent on the extraction method.11 In this study, we used the MxP Quant 500 Kit (Biocrates Life Sciences AG, Innsbruck, Austria) to determine and quantify a very broad range of metabolite classes in human faeces. In total, we used eight different stool preparation protocols to assess the best coverage for stool metabolite profiles. The protocol yielding the highest multitude of extracted metabolites was used to analyse and compare additional stool samples of healthy study participants with different dietary habits.

Subjects and methods

Study design

The GEKKO study (Gebt dem Krebs keine Chance—Onkocheck) is an ongoing study in southwest Germany including people participating in screening colonoscopy (Arm A) or with diagnosed primary cancer (Arm B). The study was approved by the ethics committees of the Medical Faculty Heidelberg and of the physicians’ boards of Baden-Württemberg and Rhineland Palatinate. The GEKKO study Arm A was designed to evaluate novel early detection markers of CRC. People undergoing a screening colonoscopy at a gastroenterology practice in or around Heidelberg, Germany, who are over the age of 30 years, with no history of CRC, no inflammatory bowel disease, no colonoscopy within the last 5 years and speaking and understanding the German language are invited to participate.

After written informed consent was received, participants are asked to fill in a questionnaire regarding lifestyle and demographic data and to provide blood, stool, saliva and urine samples for research purposes prior to colonoscopy. Biosamples are processed and then stored at −80°C until needed. Colonoscopy reports are provided from the physicians to the study centre. Participants are grouped according to their most advanced finding at colonoscopy. From the individuals with no polyps or any findings at colonoscopy, those with incomplete colonoscopy (coecum not reached) or poor bowel preparation were excluded. For this analysis, three participants of the GEKKO study Arm A were selected between the age of 50 and 79 years with no polyps or any findings at colonoscopy to test eight different stool preparation protocols and to define the best analytical outcome.

In a further step, stool samples from additional healthy GEKKO participants (n=18) of approximately the same age (50–65 years) with different dietary habits (vegetarians, semivegetarians and non-vegetarians) were processed with the protocol that performed best with respect to numbers of detected metabolites and sample handling, and results were compared between groups. Information on dietary habits of the study participants was extracted from the questionnaire. Vegetarians were defined as never eating meat, processed meat and poultry. Semivegetarians were defined as eating meat, processed meat and poultry less than once a week. Other participants reporting to consume either meat or processed meat or poultry more often were categorised as non-vegetarians.15

Sample collection and handling

Native stool samples were collected by the participants at home from a normal bowel movement prior to bowel preparation for colonoscopy with standard stool collection tubes provided with a small spoon for collecting the stool. The stool samples were then directly frozen by the participants at −20°C at home. The participants were asked to document date and time of sampling and the storage temperature. The stool samples were taken by the participant in a freeze-cool transport container and in an isolated envelope to the gastroenterologists’ practices, directly frozen again at −20°C and within the week of receipt delivered by a transport service on dry ice to the GEKKO study laboratory at the National Center for Tumor Diseases in Heidelberg, where they were immediately frozen at −80°C.

Processing of the samples

For this analysis, we used native frozen stool samples from three individuals of the GEKKO study, testing eight different stool processing protocols, each in triplicate, which results in a total of 72 measurements. A frozen stool sample was collected by the participant into a standard collection tube with a plastic spoon, which takes on average 1 g of stool or even more. The amount can vary though, in spite of detailed visual and written instructions, depending on how much the participant loads onto and how often the participant fills the spoon. One frozen stool sample per participant was cut in a plastic petri dish on dry ice with a scalpel into 24 pieces of 50 mg frozen stool, each transferred into a separate vial and weighed on a precision scale. For this procedure one complete stool sample was used up in most cases per participant. Stool samples continued to be frozen until processing. The eight processing protocols were then tested in triplicates using the 24 stool samples prepared for each participant. It was not possible to thaw and homogenise the stool samples for aliquoting due to the viscous consistency of stool, which makes it difficult to handle, and the fast degradation process of its components upon- thawing. The processing protocols were adapted in part from previously published work (protocol 1,16 protocol 317 and protocol 718 and from the current recommended standard operating procedure provided from Biocrates protocol 519). Details of the protocols are shown in table 1. After processing, the samples were frozen again at −80°C until further processing via MxP Quant 500 Kit (Biocrates Life Sciences AG, Innsbruck, Austria).

Table 1

Overview on the tested stool protocols

Liquid sample extracts were processed according to the vendor’s instructions for human plasma samples.20 Samples dried after extraction had to be resolved before measurement (for protocols 6 and 7). Therefore, 50 µL of 100% isopropanol were added into the vial and the mixture was vortexed for 3 min at room temperature. Additionally, 50 µL of 30% isopropanol were added and again vortexed for 3 min at room temperature. Short centrifugation (5 s) separated the solid substances from the liquid phase which was further used for analysis.

In brief, 10 µL of the sample were pipetted on a 96-well plate containing internal standards, dried under a nitrogen stream using a positive pressure manifold (Waters, Germany) and 50 µL 5% phenyl isothiocyanate solution were added to each well to derivatise amino acids and biogenic amines. After 1 hour incubation time at room temperature, the plate was dried again. To extract the metabolites 300 µL 5 mM ammonium acetate in methanol were pipetted to each filter and incubated for 30 min. The extract was eluted into a new 96-well plate using positive pressure, diluted according to the vendor’s guidelines and measured via liquid chromatography mass spectrometry/mass spectrometry (LC-MS/MS) or flow injection analysis (FIA-MS/MS) analyses.

For metabolite measurements, a QTRAP6500+ (Sciex, Germany) MS/MS system with electrospray ionisation source connected to an UPLC I-class Plus (Waters, Germany) chromatography system was used.21 22

The LC column, the conditions for LC separation and FIA analyses as well as individual multiple reaction monitoring (MRM) parameters for each metabolite and respective internal standards were provided by the vendor of the kit (Biocrates). Quality controls, standards, blanks and zero samples were run throughout the analytical measurement of the samples. For quantification either a seven point calibration curve or one point calibration was used depending on the metabolite class according to the manual. Data were generated using the Analyst (Sciex) software suite and transferred to the software MetIDQ (version Oxygen; Biocrates Life Sciences AG, Innsbruck, Austria) for further processing of the data.

In total, 630 metabolites can be measured via this kit. The limit of detection (LOD) for each compound is defined as three times the background noise. The lower limit of quantitation (LLOQ) is at least 10 times the background noise. At LLOQ measured metabolite concentrations can be considered as reliable. Data are normalised with a tissue factor for quantification under the following assumption: 1 mg frozen tissue equals 1 µL tissue or stool.23 Tissue and faeces are comparable as wet tissue has a water content of approximately 75% which is in the range of wet stool.24 A specific dilution factor (given by the tissue/stool to solvent ratio) was applied to the calculated concentration values (µM) to achieve comparable absolute concentrations and correct for slight differences in sample weight and the individual extraction volume.25 Concentrations are given in pmol of the metabolite/mg stool mass.

All metabolites were identified and quantified using isotopically labelled internal standards (3 hours, 13C, 15N) and MRM using optimised MS conditions as provided by Biocrates. Metabolite annotation was done via retention time and compound-specific MRMs. The following metabolites are measurable using the MxP Quant 500 Kit: 1 alkaloid (trigonelline), 1 amine oxide (trimethylamine N-oxide), 20 amino acids, 30 other amino acid-related metabolites, 14 bile acids, 9 biogenic amines, hexoses (including glucose), 7 carboxylic acids, 1 cresol (p-cresol sulfate), 12 fatty acids, 4 hormone and related metabolites, 4 indoles and derivatives, 2 nucleobases and related molecules, 1 metabolite from the group of vitamins and cofactors, 40 acylcarnitines, 14 lysophosphatidylcholines, 76 phosphatidylcholines, 15 sphingomyelins, 28 ceramides, 8 dihydroceramides, 19 hexosylceramides, 9 dihexosylceramides, 6 trihexosylceramides, 22 cholesteryl esters, 44 diglycerides and 242 triglycerides. All the related isobaric and isomeric lipid species can be measured but cannot be distinguished by this method.

Statistical analyses

We measured 630 metabolites and calculated a range of sums and ratios of metabolites indicating metabolic pathways and syntheses using the MetIDQ software. The so-called ‘metabolism indicators’ are sums and ratios calculated from the measured metabolites and are indicative for a specific pathway or synthesis happening in the body. Metabolism indicators have been described previously in blood samples but not in stool samples.

Metabolites with the mean below the LOD were excluded and described as not measured and metabolism indicators with more than half of the values below the LOD were marked as below LOD in the following. We assessed the number of metabolites and their respective chemical classes for each processing method. Additionally, we calculated means and intersample variabilities between the three stool samples of each participant. We used the estimated values below the LOD to avoid imputation in the data for the calculations.20 For each protocol, variability of concentrations measured in the three stool aliquots from same participant was quantified by the mean coefficient of variation (‘mean intraindividual CV’) among the three participants. The distribution of mean intraindividual CVs (in %) are displayed in a bar chart.

We described the study population that was used to apply the best protocol and used analysis of variance (ANOVA) to detect differences in metabolite concentrations between people with different dietary habits (vegetarians, semivegetarians, non-vegetarians). We assessed the false discovery rate (FDR) to account for over-optimistic results and remove false-positives.

A p value <0.05 (two-sided testing) was considered to indicate statistical significance. Statistical analyses were conducted using SAS Enterprise Guide V.7.1 (SAS Institute Inc, Cary, North C, USA).


Extraction of metabolites using different stool processing protocols

We measured metabolites with the MxP Quant 500 kit using eight different protocols for stool processing from three healthy participants of the GEKKO study (free of neoplasms). We were able to extract metabolites with each protocol but the number of detectable metabolites varied (table 2, figure 1). The highest number of metabolites using (1) a simple single-step extraction protocol was detected using isopropanol (protocol 2) and even more compounds could be detected using (2) a multistep protocol when we dried the liquid extracts after extraction and reconstituted the samples in a small volume to increase metabolite concentrations (protocols 6 and 7).

Figure 1

Number of metabolites that can be assessed by each protocol and in total by chemical class for the LC-MS (A) and FIA analysis (B) separately (dashed lines indicate those below the limit of detection).

Table 2

Number of metabolites that were measured with each protocol according to chemical class

The evaluated solvents differ in their polarity and therefore in their extraction efficacy to solve metabolites of the different chemical classes studied. With the MxP Quant 500 kit, 630 metabolites can be determined of 14 classes of small molecules and 12 lipid classes. In addition, a range of sum and ratios that describe certain pathways and syntheses in the organism is calculated via the MetIDQ software from the obtained data. Despite differences in absolute concentration polar metabolites were well detectable with each extraction method. Amino acids and amino acid-related products were detected with concentrations above LOD in the analysed stool samples by all evaluated extraction protocols. Concentrations for amine oxides or carbohydrates and related products were always below the LOD. Large differences in extraction efficiency were seen for polar-lipids such as glycerophospholipids and glycosylceramides which were better detectable in samples extracted with isopropanol or method 6 or 7. Major differences were observed for non-polar lipids such as for triglycerides as none of the compounds were above the LOD using phosphate-buffered saline (PBS) for preparation whereas more than 50 could be measured using isopropanol or ethanol or methanol in combination with concentrating the liquid extract.

Variability between different stool aliquots from the same participant

We calculated variability between different stool aliquots of the same individual for those metabolites that were detectable above the LOD for each investigated stool processing protocol separately. Online supplemental tables 1 and 2 show means and mean intraindividual CVs for each metabolite assessed at least once above the LOD and for each of the sums and ratios that describe certain pathways and syntheses in the organism, respectively. Online supplemental table 3 and figure 2 provide overviews on the distribution of the mean intraindividual CVs for individual metabolites. Intraindividual variability was in general very high in our analyses. With all protocols, <12% of the metabolites showed mean intraindividual CVs below 15%.

Supplemental material

Figure 2

Distribution of mean intra-individual coefficients of variation reflecting variability of measurements in multiple stool aliquots from the same individuals.

Metabolic differences in stool samples comparing vegetarians and non-vegetarians

Protocol 6 was favourable in terms of sample handling and measured numbers of metabolites. Therefore, we further measured 18 stool samples according to this protocol from participants with different dietary habits (table 3). We analysed stool samples from 18 vegetarians, semivegetarians and non-vegetarians and found a range of metabolites that were significantly different (table 4). After FDR adjustments, 46 metabolites (except PC ae C44:4) remain statistically significant different but less strongly pronounced. Most of the metabolites that distinguished the dietary habits were from lipid classes such as ceramides and phosphatidylcholines. Some metabolism indicators were also found to be different between vegetarians, semivegetarians and non-vegetarians. Most of the statistical significant ceramides were higher abundant in non-vegetarians and the sum of ceramides was increasing from vegetarians to non-vegetarians.

Table 3

Population characteristics for the GEKKO participants analysed with protocol 6 by dietary habits

Table 4

Metabolite concentrations (and SD) and differences between vegetarians, semivegetarians and non-vegetarians


There is no consensus so far how stool samples should be prepared for comparable, standardised metabolomics measurements.11 14 We were able to extract a broad range of metabolites with each of the eight defined protocols tested within this study. However, some methods should be preferred over others in regards to the solvents used dependent on the aim of the study and if a broad or a specific metabolic coverage is aimed for in a particular stool sample analysis. In this study, the largest numbers of metabolites could be measured after extraction with isopropanol and ethanol or methanol following a drying step. We have seen differences in the measured metabolite concentration dependent on the extraction method.

We observed that the stool processing methods differ, are not interchangeable and that metabolite extraction efficiency varies regarding the number and class of metabolites. The metabolomics panels found by various research groups looking into metabolomics stool sample analysis differ in metabolite composition which might be caused, among other reasons, by the different stool processing methods. Studies that have focused on metabolomics in stool samples used either PBS/D2O buffer,9 10 acetonitrile,8 methanol26 or methanol/water mixture1 for metabolite extraction and each buffer/solvent results in specific biomarker panel for the specific reagent applied. In this analysis, we used PBS, acetonitrile, methanol and additionally other more complex biphasic (polar/apolar) solvent combinations for metabolite extraction. We achieved different metabolite coverage for different extraction methods which shows why different studies are even more diverse and hard to compare.

The principal procedure is similar for all protocols. A solvent or mixture of solvents is added to the thawed stool samples and this mixture is homogenised and centrifuged so that either non-dried supernatant or reconstituted concentrated extracts can be used for metabolite analysis.14 All samples were stored under the same condition at −80°C prior to MS analysis. To our knowledge, no study so far has investigated stability of metabolites in stool extracts compared with dried extracts. However, we were able to extract a broad range of metabolites with each procedure but the concentrations and the type of metabolites extracted, differ.

A total of up to 630 metabolites can be measured and various sums and ratios can be calculated. The typical number of metabolites that was measured with this kit in human stool by the vendor is 117 using an ethanol phosphate buffer-based protocol.27 When we used a protocol based on PBS, we were able to measure only 88 compounds. Evaluating different methods, we were able to extract and quantify up to 340 metabolites from human stool samples. Amino acids, amino acid-related metabolites, bile acids, fatty acids, nucleobases and related metabolites among others can be reliably measured with almost all solvents. In contrast, the numbers of acylcarnitines, glycerophospholipids or triglycerides are low in human stool which was also found by Wolf et al.27 Moreover, polar and non-polar lipids are more dependent on the extraction method than the metabolites measured with LC-MS/MS. These metabolites were better extracted using isopropanol or methanol or ethanol with a concentration step.

Protocol 1 is the only method that did not use an organic solvent. Therefore, this extraction does not inactivate enzymes nor removes proteins. The latter might block the LC columns and lead to variation in the measurements. Still PBS is the most common used solvent for metabolite extraction of stool solvent.11 However, solid phase extraction was used during downstream sample preparation to remove proteins and other matrix components.

In agreement with previously reported differences between faecal metabolomics of omnivores, vegetarians and vegans,28 we observed metabolic differences between vegetarians and non-vegetarians. In particular, we found major differences between stool samples from vegetarians and non-vegetarians in the lipid classes. Non-vegetarians were shown to have higher intake of total fat compared with vegetarians.29 The significant differences in amount of metabolites of lipid classes in non-vegetarians compared with vegetarians found in our study might reflect the difference in fat intake. In meateaters, higher blood levels of glycerophospholipids or sphingolipids were found compared with vegetarians or vegans as the most important sources for those metabolites are animal products.30 Furthermore, it was found by different studies that meat intake is associated with the Trimethylamine N-oxide (TMAO) metabolism as meat and meat products are rich in substances needed for the synthesis of TMAO.31 In contrast, we did not find any differences in faecal TMAO. Other studies found higher amounts of amino acid metabolites and bile acids excretion in urine from meateaters as they have higher intake of proteins compared with people with high vegetable intake.32 In stool samples, we did not find different amounts of amino acids or bile acids between vegetarians and non-vegetarians.

There are some limitations of this study. First, various solvents have different abilities to dissolve and extract the metabolites of different classes and therefore there will be different results for each sample processing methodology which are not comparable. There is a very broad range of chemical classes in metabolomics and not all metabolites can be extracted equally well with the same methods. Second, stool composition varies greatly depending on antibiotic use,33 diet or the water content that can be in a range from 63% to 85%.14 This might have introduced great variability. Drying the original stool samples taken out of the freezer before processing offers the advantage of referring to similar actual weights, avoiding bias due to difference in the water content among different samples.13 It is difficult though to imagine study participants to dry their stool samples and then freeze them in small amounts with equal weights for standardisation and avoiding freeze–thaw cycles. One would also need to collect more wet weight stool to get enough dry weight stool for a standardised metabolomics analyses. In addition, it is still an obstacle for many people to collect their stool samples for study purposes, freeze them at home and finally transport them in a freezing device to their physician. There is a need for a standardised protocol for stool sampling to account for the interperson variability of water content (for normalisation of metabolites) in the stool samples. Stable internal standards need to be defined that can be detected in stool together with parameters of food consumption.

In addition, large heterogeneity of stool composition even within a single bowel movement of the same person requires careful attention. In our study, we aimed to quantify such heterogeneity by assessing variability of measurements in different stool aliquots taken from the same bowel movement from each participant. The large mean intraindividual coefficients of variation most likely primarily reflect heterogeneity of stool composition and point to the need of homogenising stool or combining measurements from multiple stool aliquots taken from different parts of a bowel movement to reduce the impact of such intra-individual variation.

Other points are contaminations and variations introduced into the stool samples that can result from toilet water or from urine, which we tried to minimise using a stool collection aid.34 Even though a broad range of up to 630 metabolites were measured, other important metabolites such as short-chain fatty acid, important products from the microbiome,35 cannot be assessed and are not included in this kit. If these specific chemical classes are of interest, one has to use other kits or suitable analytical methods. Further limitations are the small samples size and inclusion of healthy individuals only.

A major strength of this study is that we have tested various metabolite extraction protocols on the same analytical platform. Stool samples are very promising in metabolomics research for CRC as they directly represent the microbial activities and the cellular environment in the gut.7 The stool samples were only frozen once which should ensure good metabolite stability: composition might locally differ since the 50 mg were cut-off as frozen biomass from the total stool sample without thawing and mixing the complete stool sample.

In conclusion, we found a broad range of metabolites measurable in human stool samples. Some chemical classes (polar metabolites) can be measured equally well with all protocols, whereas others are highly dependent on the extraction method. The extraction methods using (1) isopropanol or (2) ethanol or methanol and methyl tert-butyl ether including drying of the supernatant seem to be preferable over others for further metabolomics analyses regarding the number of detectable metabolites and should be recommended if a great number of metabolites shall be measured in large cohorts with good sample handling efficiency. To our knowledge, this is the first study in stool metabolomics comparing eight different protocols for metabolite extraction with a novel highly standardised and quality controlled, quantitative and reproducible assay. Furthermore, we were evaluating one methodology, the for our purpose most favourable protocol, to compare metabolites composition of stool samples of participants with different dietary habits.

There is urgent need for a consensus on standard procedures for and optimised processing of the samples for stool metabolomics and for quantitative and reproducible assays to get comparable results across different studies and laboratories.

Data availability statement

All data relevant to the study are included in the article or uploaded as supplementary information.

Ethics statements


We gratefully acknowledge all the participants of the GEKKO study, the excellent cooperation and support of the participating partner practices and clinics, as mentioned on our website (, the commitment of their physicians and staff who make this study possible with their constant efforts in prevention and early detection of cancer. We thank the GEKKO Team at the National Center for Tumor Diseases (NCT) for outstanding engagement: Dr med Ulrike Bussas, Susanne Jakob, Maria Kuschel, Birgit Brandstetter, Dr med Marie-Luise Groß, Dr Anton Gies for recruiting study participants, acquiring biosamples, data collection and documentation; Ursula Klos, Sabine Serick, Anna Beierle, Rosa Orihuela Vicente and many part-time students for processing and biobanking the samples in the laboratory. We thank Astrid Zimmermann, Elena Voge, Dr Utz Benscheid, Folke Thormann, Anja Wolf, Rahel Bauer, Markus Eckstein, Friedemann Ringwald and Claudia El-Idrissi for their contribution in data collection, monitoring and documentation. Dr Alexia Arnold, Ronja Sienknecht, Madeleine Brandt and Miriam Eckers for administrative and organisational support; and the Bundesfreiwillige from the Internationaler Bund (Bildungszentrum Heidelberg) for being interested and supportive young people.


Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.


  • Contributors HB planned and designed the GEKKO study. VE, PS-K and HB designed this specific analysis. GP designed the protocols. VE and GP processed the samples. HB and PS-K conducted the study. VE analysed the data. VE and HB drafted the manuscript. All authors critically reviewed the manuscript and approved the final draft.

  • Funding This trial is conducted in the context of the NCT Early Detection and Prevention Programme (POC, NCT 3.0), funded by the German Federal Ministry of Education and Research, the German Cancer Aid, The German Cancer Research Center (GCRC/DKFZ) and the University Clinic Heidelberg (UKHD). There is no external sponsor. This funding source has no role in the study design, and has no role in data collection, data analysis and interpretation, or decision to submit results for presentation or publication.

  • Competing interests The authors disclose no potential conflict of interests. We received the MxP® Quant 500 Kit (Biocrates Life Sciences AG, Innsbruck, Austria) used in this study free of charge and thank Biocrates for their kind support.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.