Background

More than 800 tropical fruits have been described, but only a small number are widely consumed [1]. Geographically, tropical fruits have their origin in the tropics of Cancer and Capricorn in the north and south of the equator covering most of the tropical and subtropical areas of Asia, Africa, Central America, South America, the Caribbean, and Oceania. In comparison to the temperate fruits such as apple, pears, and berries, tropical fruits are gaining terrain in global production and trade due to a growing demand from consumers. As an example, the production of avocados in the top producing country, Mexico, rose 1900% between 1961 and 2017 [2]. Moreover, mango availability was reported to increase from 1 kg per capita in 2007 to 1.8 kg per capita in 2017 in the USA, and from 0.4 kg to 0.7 kg in the European Union [3].

Several tropical fruits have been studied for their effect in human health due to their content in specific bioactive compounds. Of these, pomegranate has received much attention in recent years. A recent meta-analysis of 8 randomized controlled trials (RCT) (n = 574) showed that the intake of pomegranate juice reduces systolic (− 4.96, 95% CI − 7.67 to − 2.25 mmHg, P < 0.001) and diastolic blood pressure (− 2.01, 95% CI − 3.71 to− 0.31, P = 0.021) [4]. The effects of pomegranate on human health, including LDL-cholesterol, triglycerides, and glucose reduction, have been ascribed to its content in ellagitannins and to urolithins, their gut microbiota metabolites [5, 6].

Unlike pomegranate, information regarding the health effects of other tropical fruits, including the widely consumed banana, is remarkably scarce. To tackle this, well-conducted human studies with randomized controlled designs and prospective cohort studies with accurate dietary assessment will be needed.

Food Frequency questionnaires (FFQs) are widely used to assess fruit intake, but they have several disadvantages. It has been shown that the intake of fruits and vegetables (F&V) is often over-reported when assessed by food frequency questionnaires, due to social desirability bias [7]. Moreover, dietary questionnaires rarely evaluate the fruit consumption down to specific foods and varieties, and even in detailed FFQs, not much attention is given to tropical fruits, because of their diversity and low average intake in most studied populations compared to other fruits. In Western countries, except for banana and avocado, tropical fruits are often not considered in the FFQs or they are integrated in a more or less heterogeneous food group. For example, watermelon and melon are constantly listed together [8, 9] hindering the assessment of their respective intakes. In some producing countries such as Singapore, Mexico, and Brazil, FFQs consider tropical fruits including guava, kiwi, papaya, dragonfruit, watermelon, dates, and persimmon [10,11,12], although in an inconsistent manner. One could argue that the well-designed dietary instruments may suffice to obtain accurate intake information of tropical fruits. However, self-reporting methods will always be subject to imprecision associated to recall errors, difficulty to assess portion size, and social desirability bias. Furthermore, listing all tropical fruit individually would significantly expand the questionnaires and increase participant burden. The use of dietary biomarkers, whose associated errors are independent from that of questionnaire instruments, has the potential to assist in accurately determining dietary intake, and hence better unravel the associations between diet and human health.

BFIs originate from compounds that are specific for a given food or food group, and become bioavailable after their consumption [13]. Plasma vitamin C and carotenoids increase after tropical fruit intake and are biomarkers commonly used to assess the total intake of F&V [14]. However, their use is not without limitations. The bioavailability of carotenoids fluctuates according to the alongside diet or food preparation, e.g., simultaneous intake of fat enhances their absorption [15, 16]. Furthermore, plasma vitamin C concentration exhibits saturation kinetics, meaning that its collinearity with dietary intake stops at a certain point, especially in well-nourished subjects where baseline concentrations are observed at ~ 60 μmol/L [17]. Plasma vitamin C is further challenged by several external factors including pre-analytical conditions such as sample handling, temperature, and conditions of storage [18].

The Food Biomarkers Alliance (FoodBAll JPI-project; www.foodmetabolome.org), in an effort to collate the most comprehensive panel of BFIs as possible for over 140 commonly consumed foods, has conducted a series of reviews of the literature according to the guidelines established by the consortium [19]. The obtained BFIs are evaluated according to eight defined criteria to qualify their applicability as BFIs in further nutritional and health-related research [20]. The vision is to develop analytical methods that could quantify in a cost-effective way a battery of dietary biomarkers to validate or complement dietary questionnaires. Within the FoodBall initiative, the objectives of the present work are as follows: (1) retrieve from an extensive literature search the compounds currently used as BFIs for tropical fruits and those which are specific enough to represent putative BFIs and (2) outline the available knowledge and provide the current level of validation of the collated candidate BFIs for tropical fruits.

Methods

Selection of food groups

For the present review, 16 tropical fruits including the most consumed in the world according to FAO production and trading data were selected [3]. This review assesses the following fruits: dessert banana (Musa acuminata), mango (Mangifera indica), pineapple (Ananas comosus), papaya (Carica papaya), avocado (Persea Americana), pomegranate (Punica granatum), kiwifruit (Actinidia), lychee (Litchi chinensis), persimmon (Diasporos kaki), guava (Psidium guava), passion fruit (Passiflora edulis), acerola (Malpighia emarginata), dragon fruit (Hylocereus undatus), coconut (Cocos nucifera), watermelon (Citrullus vulgaris), muskmelon (Cucumis melo), and date (Phoenix dactylifera).

Search for relevant BFIs research papers

An extensive literature search was carried out to collect all available information on the already used or new putative BFIs for the selected fruits. The BFIRev protocol (Food Intake Biomarker Reviews) elaborated with the guidance of the PRISMA statement (Preferred Reporting Items for Systematic reviews and Meta-Analyses) and described in Pratico et. al. [19] was followed. Briefly, a primary search was performed in the three literature search databases, Scopus, PubMed central, and Web of Science, with the name of the specific fruit and its botanical genus, i.e., (Banana OR Musa*) OR (Mango OR Mangifera* OR Pineapple OR Ananas* OR Papaya OR Carica* OR Avocado OR Persea* OR Pomegranate OR Punica* OR Kiwi* OR Actinidia* OR Lychee OR Lichee OR Litchi* OR Persimmon OR Diospyros* OR Guava OR Psidium* OR Passion fruit OR Passiflora* OR Acerola OR Malpighia* OR Pitaya OR Pitahaya OR Dragon fruit OR Hylocereus* OR Coconut OR Cocus* OR Watermelon OR Citrullus* OR Muskmelon OR Melon OR Cantaloupe OR Cucumis* OR Phœnix dactylifera) along with the common keywords: AND (urine OR plasma OR serum OR excretion OR blood) AND (Human* OR men OR women OR patient* OR volunteer* OR participant*) AND (Biomarker* OR marker* OR metabolite* OR Biokinetics OR Biotransformation OR Pharmacokinetics OR bioavailability OR ADME) AND (Intake OR meal OR diet OR ingestion OR administration OR consumption OR eating OR drink*). Keywords were used in the fields [Topic], [All fields], and [Article Title/Abstract/Keywords] for Web of Science, PubMed, and Scopus, respectively. All searches were carried out in March 2016 and updated in January 2019. A last search in April 2020 resulted in no new putative biomarkers. Only papers in the English language were considered eligible, and no restriction on the date of publication was applied. Articles showing results of human intervention studies (randomized controlled trials, single-dose, short-term or long-term studies) or observational studies (cohort, case-control, cross-sectional studies) were considered eligible. After removal of duplicates, a first selection of papers was performed according to the relevance of the abstract and title. Full texts were obtained for the selected articles and further assessed for eligibility according to their relevance for the identification of BFIs for all tropical fruits. Some of the publications found in the reference list of the selected articles were also included at this stage. Furthermore, for those tropical fruits for which no information in human studies was available, we carried out an additional search in food composition databases including, Dictionary of Food Compounds, FooDB and KNApSAck [21,22,23], and used the text-mining tools, Polysearch2 [24] and Coremine [25] medical online database, to identify specific compounds for each fruit that may be further investigated as putative BFIs [26].

Characterization of candidate BFIs

For each putative biomarker identified, a secondary search allowed to retrieve relevant information to assess its specificity, pharmacokinetics, dose-response relationship, robustness, and reliability, as well as the quality of its method(s) of analysis, in order to qualify its use as BFI according to the validating scheme established by Dragsted et al. [20].

The name of the putative biomarkers and their synonyms were queried in the literature search databases along with AND (biomarker* OR marker* OR metabolite* OR biokinetics OR biotransformation OR pharmacokinetics OR bioavailability OR ADME). Additionally, the compounds were searched manually in the online databases HMDB [27], FooDB [22], Phenol-Explorer [28], Dictionary of Food Compounds [21], Duke’s phytochemical and ethnobotanical databases [29], eBASIS [30], KNApSack [23], and PhytoHub [31] to determine all the possible dietary or metabolic origins of the candidate BFIs.

The list of candidate BFIs was reviewed and agreed upon all authors.

Application of validation criteria

According to the method of Dragsted et al. [20], a validation assessment procedure was applied on the candidate BFIs to assess their current status of validation and identify the missing information for full validation. The validation scheme is based on eight questions that encompass biological and chemical aspects: plausibility, dose-response, time-response, robustness, reliability, stability, analytical performance, and reproducibility (Table S1).

Results and discussion

A flowchart indicating the literature search and the review process is shown in Fig. 1. After removal of duplicates, the literature search yielded 1235 publications for the tropical fruits. Of these, 40 articles reported relevant information on putative biomarkers of intake. Five additional articles were selected from the reference list of the assessed publications or from the secondary search. A summary of the retained literature and the list of the specific candidate BFIs for tropical fruits are presented in Table 1 and Fig. 2. The level of validation of the candidate BFIs is shown in Table 2. For an exhaustive presentation of the results, a full list of all retained and non-retained compounds is given in Table S2, with the main reasons for inclusion or exclusion and the corresponding references.

Fig. 1
figure 1

Flow diagram of study selection according to the BFIRev procedure

Table 1 List of publications reporting candidate biomarkers for banana, watermelon, and avocado consumption
Fig. 2
figure 2

Overview of the literature evaluation process of the candidate biomarkers for banana and avocado, and the candidate biomarker for watermelon

Table 2 Overview of the validation criteria for candidate intake biomarkers for banana and tropical fruitsa

Banana biomarkers

Banana, which is the second most-produced fruit in the world with 113 million tons and is among the three most consumed fruits in Europe [56, 57], certainly deserves specific attention for identifying an intake biomarker. The metabolite 5-hydroxyindole acetic acid (HIAA) was among the most studied compounds in association with banana intake. Of the 23 publications reviewed for this section, 10 reported the impact of banana intake on HIAA in urine (Table 1). HIAA originates from the catabolism of serotonin by the monoamine-oxidase (MAO, EC 1.4.3.4) and aldehyde dehydrogenase (ALDH, EC 1.2.1.3) enzymes. Besides its endogenous origin, serotonin can be provided in substantial amounts by several foods. Banana has a high serotonin content (15 ± 2.4 μg/g FW) along with nuts (87–398 μg/g), pineapple (17 ± 5.1 μg/g FW), plantain (30 ± 7.5 μg/g FW), kiwi fruit (5.8 ± 0.9 μg/g FW), plums (4.7 ± 0.8 μg/g FW), and tomato (3.2 ± 0.6 μg/g FW) [32]. The urinary excretion of HIAA increased by ~ 2–26-fold after banana intakes ranging from 50 to 1000 g [32,33,34,35,36,37,38,39,40, 49], and was comparable to that induced by an oral dose of 10 mg of serotonin [33]. Along with its sulfated conjugate, HIAA was identified as a highly discriminant compound (VIP > 2; R2 = 0.89, Q2 = 0.732) in a randomized, controlled crossover meal study using untargeted metabolomics to discover BFIs for banana [49]. The important contribution of banana intake to HIAA excretion level was also fortuitously observed in a large study aiming at diagnosing neuroblastoma in Japanese 18-month children. Neuroblastoma is a catecholamine-producing tumor, and an elevated excretion of the dopamine metabolite homovanillic acid was used as a diagnosis biomarker. Among the 103 cases identified with elevated homovanillic acid, 50 were false-positive cases caused by regular intake of banana as weaning food, which was associated with elevated excretion levels of both homovanillic acid and HIAA [37].

While serotonin is not specific to banana, its consumption produces a higher elevation of HIAA in urine compared to other serotonin-rich foods, including tomato, pineapple, and kiwi [38, 39]. However, nuts intake constitutes a probable confounder for the use of HIAA as BFI for banana. Feldman et al. reported a higher excretion of HIAA after the ingestion of 100 g of walnuts (7–59 mg/24 h) compared to 100 g of banana (4.8–15 mg/24 h) [32]. Recent studies using untargeted metabolomics proposed HIAA in combination with other metabolites as a candidate biomarker for nuts [58, 59]. Another potential confounder could be fresh tomato, due to its high level of consumption in many populations, but not its processed forms (juices, canned tomato or ketchup) as their serotonin content is much lower (< 0.2 μg/g FW) than in fresh tomato (6.4 μg/g FW) [60]. The robustness of urinary HIAA is thus not fully established. It was not among the metabolites discriminating low and high consumers of banana from non-consumers in a cross-sectional study with 78 subjects [49]. More studies with non-controlled dietary background and examination of the confounding effect of nuts and tomato are needed. While no data are available regarding dose-response, the time-response is well documented for HIAA, with a urinary excretion that peaks 2–4 h after banana intake and returns to baseline levels 8–12 h after consumption [33, 35, 36, 38, 49].

In addition to urine, plasma and serum concentrations of HIAA also increase after banana intake, as reported in four studies [41, 42, 47, 48]. Tohmola et al. measured serum HIAA in response to 35 different foods and showed that walnuts (102 g) produced the highest elevation of HIAA (1.8 μmol/L), followed by banana (~ 0.6 μmol/L for 300 g ingested), pineapple (0.46 μmol/L for 330 g) and tomato (0.39 μmol/L for 362 g) [41]. Plasma concentrations of HIAA peaked 2 h after the intake of all foods and returned to baseline levels 6 h after the intake of tomato and pineapple and 24 h after the ingestion of banana and walnuts. The calculated half-life of HIAA in the circulation was of 1.3 h [41].

Wang et al. conducted a cross-sectional analysis that correlated consumption, using FFQs, of 91 food items or food groups to the non-fasting serum metabolome of 1369 post-menopausal women (Table 1). Herein, HIAA was significantly correlated to banana consumption (|r| = 0.21, P < 0.0001), although dopamine metabolites showed a better ability to predict banana intake in the same study [48].

Regarding the analytical aspects of the validation scheme for BFIs, quantification of HIAA by HPLC-MS in urine and serum samples is well established [61, 62]. Acidification of urine to a pH between 2 and 5 is recommended for HIAA conservation [63, 64]. At pH 4, HIAA was shown to be stable in urine for up to 30 months at − 20 °C, for 3 months at 4 °C, and for 3 weeks at room temperature [63]. The validation criteria fulfilled so far for HIAA as a biomarker of banana intake are presented in Table 2.

Besides HIAA, the response of dopamine metabolites to banana intake has been widely studied, mostly in plasma and serum [42,43,44, 46,47,48], with one publication on urine [49] (Table 1). Banana has a high dopamine content (73 ± 24 μg/g) [38] compared to other dietary sources such as avocado (4 μg/g), orange, cocoa powder, and tomato (< 1 μg/g) [65]. In humans, dopamine is degraded into different metabolites by several pathways [66]. Dopamine-sulfate is the main circulating form originating from both dietary and endogenous dopamine [67, 68]. 3-Methoxytyramine results from the metabolism of dopamine through its methylation by the enzyme catechol-O-methyl transferase (EC 2.1.1.6) [66]; 3-methoxytyramine is further conjugated to its sulfate by the enterocytes or the liver [69].

Dopamine derivatives have been reported as highly distinctive metabolites for banana intake on various occasions. In three intervention studies, banana intake was studied for its effect on the physical endurance of male cyclists, and an untargeted metabolomic analysis of serum samples was included. Herein, dopamine, dopamine-3-O-sulfate, dopamine-4-sulfate, and 3-methoxytyramine-sulfate were detected in plasma as highly discriminant metabolites for the banana intervention [42, 44, 47]. Other authors examined plasma dopamine responses using targeted methods. In a small pilot study, the addition of one banana to a complex meal significantly increased dopamine levels in plasma 1 h after intake from 5.8 ng/mL at baseline to 92.5 ng/mL [43]. A case report about an anorexic adolescent who consumed > 20 bananas/day for 26 months mentioned a 20-fold increase over the normal range of plasma dopamine, which corrected towards normality when the patient resumed the restrictive food ingestion [46]. The latter shows that the intake of banana can increase plasma dopamine to extremely high levels over an extended period of time.

Furthermore, the study by Wang et al. revealed dopamine-3-O-sulfate (r = 0.33, P < 0.0001; AUC = 0.76), dopamine-4-sulfate (r = 0.34, P < 0.0001; AUC = 0.74), and 3-methoxytyramine sulfate (r = 0.22, P < 0.0001; AUC = 0.70) as serum metabolites significantly correlated to banana intake with a good ability to distinguish high and low consumers of banana [48]. These observations support the interest of these three metabolites as candidate BFIs for banana intake in plasma as well as in urine. Their level of validation is discussed below and summarized in Table 2.

In urine samples from a randomized controlled, cross-over meal study, dopamine sulfate was observed as a highly discriminant metabolite (VIP > 2; R2 = 0.89, Q2 = 0.732) for banana consumption, and seven-fold higher intensity (P < 0.001) after the intake of the fruit than control [49]. Additionally, it was observed in higher intensity in the urine of high consumers of banana compared to non-consumers (3-fold higher; P = 6.7 × 10−3) in a cross-sectional study on a German population, thus supporting its robustness [49]. Dopamine sulfate was a key component of the combinations of metabolites which performed best to predict banana intake, while its performance as a unique biomarker was lower [49].

Regarding the validation criteria, no clear information is available on the dose-response after banana intake for dopamine metabolites in plasma or urine. The time-response in plasma showed an elevation 1–2 h after the intake of banana, persisting up to 8 h after consumption [43, 70] (Table 2). Urinary excretion of dopamine metabolites peaked 2–4 h after the intake and continued up to 12 h [70]. As for the chemical aspects of the validation, mild acidification of urine samples may be used to avoid catecholamine-oxidation and deconjugation [71]. Sample storage at − 80 °C should be preferred for catecholamine preservation [72]. Liquid chromatographic methods with different detectors, including mass spectrometry, fluorometric, and electrochemical detection, exist for the determination of dopamine, 3-methoxytyramine and its conjugates in plasma and urine [67, 73, 74]. However, conjugated standards of dopamine or 3-methoxytyramine and other compounds are not commercially available for quantification, which may complicate their use as BFIs.

Another metabolite associated with banana intake is salsolinol. This compound is synthesized in banana by the condensation of dopamine and acetaldehyde (Pictet-Spengler reaction) [75]. It is also an endogenous metabolite [76]. Salsolinol concentration in banana pulp is of 6.3 × 10−2 μg/g and it is present at lower concentration in other sources including beer (0.5 × 10−2 to 1.3 × 10−2 μg/g) and French wine (3.3 × 10−3 to 4.9 × 10−3 μg/g ) [77,78,79]. Salsolinol is also highly present in cocoa powder (25 μg/g) and chocolate (19 μg/g) [80]. There are two existing isomers of salsolinol, the (R) isomer that is proposed to be mainly of endogenous origin and the (S) enantiomer that may be more susceptible to diet. Strolin-Benedetti et al. reported that in 24 h urine samples of 6 subjects before consumption of banana, the concentration of (S)-salsolinol was close to zero and increased after the intake of dried banana. In contrast, the (R) isomer increased after the intake of the fruit but was already present prior consumption [45]. Lee et al. studied the influence of banana intake on blood concentration of salsolinol in healthy subjects. They showed that after the intake of one banana fruit, the concentration of salsolinol increased from 0.16 ± 0.12 to 5.8 ± 7.6 ng/mL and from 0.23 ± 0.16 to 6.6 ± 8.7 ng/mL, for the (S)- and (R)-forms, respectively, 1.5 h post-consumption [43]. This shows that the consumption of banana causes a detectable increase of salsolinol in plasma. The kinetics of this compound was studied only in rats, after gavage with 10 μg of salsolinol and 3 g of banana. In both interventions, salsolinol increased in plasma 1 h after intake and cleared 14 h post consumption [43].

Salsolinol sulfate was observed as the most distinctive urinary metabolite with a 46-fold higher intensity after a single dose of banana compared to a control meal in 12 healthy subjects (VIP > 2; R2 = 0.89, Q2 = 0.732; P < 0.001) [49]. In the KarMeN cross-sectional study, this metabolite was also detected in higher intensity in the urine samples of high-consumers compared to non-consumers (10-fold higher in high consumers, P = 2 × 10−2). However, further analysis showed that salsolinol sulfate alone was not specific enough to predict consumers and non-consumers of banana in this population (misclassification rate > 30%). The robustness of plasma and urine salsolinol as a biomarker of banana intake is possibly challenged by the intake of chocolate, and/or alcohol as their consumption was shown to markedly elevate the levels of salsolinol in biofluids [81]. Intervention studies comparing the response of salsolinol or its conjugates to the intake of different salsolinol-containing foods are needed to further clarify the potential use of salsolinol as a candidate BFI for banana, possibly as a component of a multi-marker.

Among the assessed studies, the one by Vázquez-Manjarrez et al. was specially designed for the discovery and validation of banana biomarkers. Herein, urine samples from two different study designs were analyzed, a controlled, cross-over meal study with 240 g of banana (n = 12) and a cross-sectional study (KarMeN) (n = 78) with high (126–378 g of banana), low (47.3–94.5 g of banana), and non-consumers of banana using an untargeted multi-platform metabolomics approach [49]. Thirty-three banana-derived metabolites were identified in urine in the meal study, including HIAA, dopamine, and salsolinol metabolites, which have been discussed above, and other highly distinctive metabolites such as methoxyeugenol glucuronide (VIP > 2, R2 = 0.89, Q2 = 0.732; 26-fold higher in banana than control, P = 1.3 × 10−10), 2-isopropylmalic acid (VIP > 2, R2 = 0.89, Q2 = 0.732; 9-fold higher in banana than control P = 1.4 × 10−7), N-acetyldopamine sulfate (VIP > 2, R2 = 0.89, Q2 = 0.732; 4-fold higher in banana than control P = 1.1 × 10−6), 6-hydroxy-1-methyl-1,2,3,4-tetrahydro-β-carboline sulfate (6-OH-MTβC sulfate) (VIP > 2, R2 = 0.89, Q2 = 0.732; 4-fold higher in banana than control, P = 2.3 × 10−2). The cross-sectional KarMeN study showed that methoxyeugenol glucuronide (P = 4.70 × 10−5), dopamine sulfate (P = 6.7 × 10−3), salsolinol sulfate (P = 2 × 10−2), 6-OH-MTβC sulfate (P = 3.6 × 10−2), and xanthurenic acid (P = 6.7 × 10−3) had significant higher intensities, ranging from 6- to 25-fold change, in the urine of high-consumers of banana compared to non-consumers, while only methoxyeugenol glucuronide was significantly higher (15-fold, P = 5.20 × 10−5) in low-consumers compared to non-consumers [49]. None of the putative biomarkers was robust enough when used alone, but the authors demonstrated that dopamine sulfate and methoxyeugenol glucuronide was the best parsimonious combination to detect the intake of banana in high consumers (AUC = 0.92, ER = 0.11) and low consumers (AUC = 0.87, ER = 0.19) [49]. The origin of methoxyeugenol glucuronide was attributed to the metabolism of the characteristic banana aroma methoxyeugenol and elemicin [82]. Methoxyeugenol and elemicin were also reported in nutmeg and star anise [82, 83], respectively, which represent marginal confounders. Regarding dose-response, the intensity of the combined BFIs showed a linear trend with the amount of banana consumed in the intervention study and that reported in 24 h recalls by the high and low consumers in the cross-sectional study. However, quantitative studies are required to fully validate the proposed combined BFI in this critical aspect (Table 2). The metabolite 6-OH-MTβC sulfate was also frequently present in the best combinations of biomarkers reflecting high banana intake in the KarMeN study. The 6-OH-MTβC derives from the condensation of serotonin with acetaldehyde and is present in banana in higher concentrations (1.87 mg/kg) than in other fruits, including tomato (0.71 mg/kg), kiwi (0.31 mg/kg), and pineapple (0.62 mg/kg) [84]. Dark chocolate contains higher levels of 6-OH-MTβC (2.64 mg/kg) [85] than banana and thus would constitute a potential confounder.

Melatonin metabolites have been associated with banana consumption in other selected publications. While melatonin is produced endogenously in the pineal gland, the intake of melatonin-rich foods such as banana, pineapple, cherries, and orange substantially elevate the concentration of melatonin and 6-sulfatoxymelatonin, its main metabolite, in urine and serum [86, 87]. Other dietary sources of melatonin are walnuts, mango, papaya, grapes, wine, and olive oil. The use of melatonin and its main metabolite as a candidate BFI for banana or any other fruit is also hindered by its fluctuation during the daytime [88] and by the fact that melatonin is sold over-the-counter in different countries as a supplement to adjust sleeping disorders.

Many other metabolites have been identified in biofluids after the intake of banana but have less relevance as candidate BFIs (Table S2). 3,4-Dihydroxyphenyl-acetic acid, homovanillic acid, 3-hydroxytyrosol, and 5,6-dihydroxyindole were observed as part of the “dopamine signature” of banana [36, 38, 49]. However, 3,4-dihydroxyphenyl acetic acid and homovanillic acid originate from the microbial metabolism of polyphenols, which are widely distributed in plant-based foods [89, 90]. 3-Hydroxytyrosol in urine may better reflect olive-oil consumption [59]. Plasma 2,3-dihydroxy-isovalerate, 4-guanidinobutanoate, ferulic acid 4-sulfate, and 4-acetylphenol sulfate were associated with banana intake in controlled intervention studies with banana [42, 47]. However, 2,3-dihydroxy-isovalerate is an intermediate in the biosynthesis of valine, leucine, and isoleucine while 4-guanidinobutanoate participates in the metabolic pathway of arginine. Ferulic-acid-4 sulfate is a metabolite of ferulic acid and caffeic acid, which are widely distributed in fruits, cereals, and coffee [28, 30]. 4-Acetylphenol has been detected in different medicinal herbs and is a flavoring and aroma ingredient in roasted coffee, beer, mango, and cranberry [21]. Interestingly 2-isopropylmalic acid, eugenol sulfate, and fructose were observed by different groups after banana intake [42, 47, 49]. Fructose is a common carbohydrate present in a wide range of foods, which excludes its use as a BFI of banana. 2-Isopropylmalic acid has been identified as a key compound for organoleptic characteristics of melon [91] and is present in other vegetables and fruits including lettuce and tomato juice [92, 93]. The fact that this metabolite was recovered in biofluids following consumption of different vegetal sources including pear, banana, and peas [42, 47, 49, 94] suggests its possible utility as putative BFI for the general intake of this food group. Eugenol sulfate results from the phase II metabolism of the aroma compound eugenol which is present in banana and other highly consumed fruits such as apple, apricot, and cherry as well as spices including black pepper and clove [21]. The microbiota metabolites of tryptophan, indolepropionate, and indoleacetate have been associated with the intake of banana [47, 95] but also with the consumption of other fruits including apple and pear [95]. Xanthurenic acid, which was found to be increased in urine after banana intake [49], originates from the kynurenine pathway of tryptophan degradation [96]. Whereas banana has a higher content (10 mg/100 g) of tryptophan than other fruits (e.g., apple 3 mg/100 g) [97], other dietary sources very rich in tryptophan, e.g., cheese and fermented products would act as strong confounders [98].

Late urinary excretion of al N-methyl-2-pyridione-5-carboxylic acid, likely a microbial metabolite of trigonelline, was observed by Vázquez-Manjarrez et al. [49], but the presence of trigonelline in coffee excludes its use as BFI for banana. Metabolites from carboxylic acids, saccharides, and amino acids were also reported in the urine. Among them, sinapic acid-sulfate, hydrocinnamic acid sulfate, vanillic acid, mevalonic acid, azelaic acid, 1,5 anhydrosorbitol, fructose, glyceraldehyde, and 2-ethyl-3-hydroxyl propionic acid were found to be increased in urine samples in studies within a controlled setting but not under free-living conditions [49].

In conclusion, we have identified several candidate BFIs for banana intake including metabolites of catecholamines, indolamines, and specific aroma compounds (Fig. 2). While these metabolites are candidate BFIs for banana, they are challenged when considered individually by their endogenous presence in the human body or in other foods. However, the combined use of several candidate BFIs, namely dopamine sulfate, salsolinol sulfate, 6-OH-MTβC sulfate, and methoxyeugenol glucuronide in urine and HIAA along with 3-methoxytyramine sulfate in plasma, offers a more robust alternative to determine exposure to banana in free-living conditions. These candidate biomarkers can reflect banana intake qualitatively (absence/presence of consumption) but dose-response studies must be conducted to push them forward as quantitative BFIs.

Papaya and watermelon biomarkers

Much less information has been found on candidate BFIs for tropical fruits other than banana. Some carotenoids have been proposed as biomarkers of tropical fruit intake. In particular, lycopene, β-carotene and β-cryptoxanthin have been associated with papaya and watermelon intake, while lutein and β-carotene have been associated with intake of papaya and mango [99,100,101]. Tropical fruits have been studied as sources of pro-vitamin A carotenoids for many developing countries with a high occurrence of vitamin A deficiency, and the literature reflects this interest. Carotenes are present in too many foods to be envisaged as biomarkers for specific tropical fruits. β-Cryptoxanthin was shown to increase in plasma after acute intake of papaya [102] and has also been correlated with habitual intake of tropical fruits in a study with 159 Costa-Rican adolescents [101]. Papaya was the best food predictor of plasma β-cryptoxanthin in this study. Yet, although papaya is indeed one of the richest sources of β-cryptoxanthin, this compound can also originate from other orange- or red-colored foods such as orange, mango, apricot, sweet peppers, and pumpkin, as well as from spices and herbs including paprika, dill, and basil [30, 103]. In the Nurse’s Health study conducted in a US population with low consumption of papaya, orange juice was actually reported to be the major determinant of plasma β-cryptoxanthin [104].

Watermelon, and to a lesser extent papaya, contains a substantial amount of lycopene in a highly bioavailable form [102]. Watermelon has a lycopene content ranging from 4.8 to 13.5 mg/100 g fresh weight representing a content equivalent to that measured in most tomato cultivars [103]. However, processed forms of tomato such as tomato sauce are even more concentrated. Considering the higher consumption of tomato products compared to watermelon in most populations, blood concentration of lycopene is more likely to reflect the intake of tomato and derived products than that of watermelon.

Although the common dietary carotenoids lack specificity for tropical fruit intake, it cannot be ruled out that less studied carotenoids among the dozens identified in tropical fruits, for example, cryptoxanthin-5,8-epoxide or sapotexanthin may arise as more specific candidate biomarkers in future metabolomic studies [105].

Aside from carotenoids, the literature search revealed citrulline as a possible biomarker of watermelon intake. Interestingly, the name of this amino acid was derived from the latin name of watermelon, Citrullus vulgaris, from which it was first isolated. A case study reported a ten times elevation of citrulline plasma concentration in a young girl who was consuming large quantities of watermelon, and this was confirmed in a dietary intervention on six healthy adult volunteers [50]. Plasma citrulline peaked 1 h after watermelon intake (593 μM), was still elevated at 8 h and returned to the normal range (20–40 μM) at 24 h. Other authors have observed L-citrulline responses in plasma following watermelon intake. A 16-day supplementation with 300 mL/day of a watermelon concentrated juice providing 3.4 g of citrulline/day resulted in 3.5–4.6 fold higher plasma concentrations of citrulline compared to control or apple juice interventions [51]. A transitory increase in plasma L-citrulline occurred for 20 subjects who consumed 0.2 g/kg of watermelon puree every 15 min during a physical challenge, with a return to baseline levels 3–5 h after the end of the supplementation [52]. Collins et al. did not observe any significant change in L-citrulline plasma concentration following the ingestion of watermelon juice, up to 6 cups/day for 3 weeks [53]. However, measurements were made on 12 h fasting plasma samples, which concurs with the short-term responses observed by other authors. Citrulline also has an endogenous origin, synthetized from glutamine or arginine in the enterocytes [106]. It is a key intermediate of the urea cycle in the liver and is readily converted to arginine and then nitric oxide in the kidney and vascular endothelium. Citrulline plasma concentration reflects the equilibrium between its synthesis in the intestine and its conversion to arginine in the kidney. Low or high citrulline status has been proposed as a biomarker of intestinal dysfunction in various intestinal pathologies as well as markers of acute kidney failure and of some inborn errors of the urea cycle [106]. It is worth noting that after watermelon intake plasma arginine was also modestly elevated, whereas venous ammonia, plasma glutamine, and other amino acids remained in the normal range, which does not correspond to the metabolic patterns classically observed with urea cycle, renal or intestinal disorders.

In conclusion, the concentration of citrulline in plasma and possibly in urine could be further explored as candidate biomarker for assessing watermelon intake in healthy individuals (Fig. 2). Regarding papaya, there are no putative BFIs reported in the literature yet. To further discover candidate BFIs for both fruits, controlled intervention trials with different doses of the fruits and applying an untargeted metabolomics approach may be performed.

Avocado biomarkers

Avocado (Persea Americana) is unique in its low content in starch and sugars and high content in monounsaturated fatty acids 71% [107]. Using a semi-targeted GCxGC-MS sugar profiling method, Mack et al. analyzed 24 h urine samples of 301 volunteers of the KarMeN cross-sectional study to search associations between excreted sugar metabolites and the dietary information obtained with 24-dietary recalls [54]. From the Spearman rank correlation analyses with the entire population, two seven-carbon carbohydrates, perseitol (ρ = 0.33, p values < 0.0001) and mannoheptulose (ρ = 0.2702, p values < 0.0001) were significantly associated with avocado intake. Further correlation analyses including only avocado consumers (n = 9) and non-consumers (n = 18) confirmed the strong association of perseitol ρ = 0.871 and mannoheptulose ρ = 0.778 with avocado consumption [54].

The authors analyzed perseitol and mannoheptulose in 75 foodstuffs to determine their plausibility and specificity as candidate BFIs for avocado. Perseitol was only observed in avocado while mannoheptulose was more concentrated in avocado than in the other fruits and vegetables where it was detected (carrot, blueberry, fig, and tomato) [54].

Wamelink et al. have developed a HPLC-QqQ-MS/MS method of analysis for the seven-carbon carbohydrates and validated it with a small human intervention on 3 subjects consuming avocado (75–200 g) [55]. Urine excretion of perseitol and mannoheptulose but not of other seven-carbon carbohydrates were elevated between 3 and 12 h after avocado intake. A high interindividual variation was observed that may be due to a different content of C7 carbohydrates in avocados. Of note, in subjects with transaldolase deficiency, a recently discovered pentose phosphate pathway alteration, a subtle elevation of mannoheptulose and perseitol in urine can be observed along with other C7 carbohydrates and polyols [55]. This will deserve further attention to determine whether intrinsic factors can affect the excretion of the candidate biomarkers and jeopardize their use.

In conclusion, the limited data available suggest that perseitol and mannoheptulose may be considered as short-term candidate BFIs for avocado. They have been validated for plausibility, and partially for robustness and time-response (Table 2, Fig. 2). However, human studies with larger sample sizes are needed to further validate these candidate BFIs and assess dose-response, reliability, and performance of their quantification method(s).

Date biomarkers

Date fruit is a highly nutritious fruit that contains about 70% of carbohydrates, as well as dietary fibers and a wide range of micronutrients and secondary metabolites [108, 109]. The first and only study that aimed to identify metabolic changes occurring in blood after consumption of date fruit was conducted by Mathew et al. [110]. They applied an untargeted metabolomics approach based on LC-MS on blood samples from 21 subjects who underwent three dietary interventions: (1) a control glucose drink, (2) ten date fruits of the Deglet Nour variety, and (3) ten date fruits of the Khlas variety. A total of 28 molecules were found to significantly increase (Bonferroni corrected p value < 0.05) over 2 h after the date fruit challenges, among which 20 were identified. Several phenolic acid metabolites were found to be discriminant, including ferulic acid 4-sulfate, caffeic acid-sulfate, 4-vinylguaiacol sulfate, and vanillic alcohol sulfate. This was consistent with the presence of 3,4,5-trimethoxycinnamate, 3,4-dimethoxycinammic acid, 4-hydroxycinnamate, dihydroferulic acid, ferulic acid, and isoferulic acid in the date fruit. However, those phenolic metabolites can derive from many fruits and vegetables and are therefore of little interest as candidate biomarkers for date fruit. In the same study, 2-isopropylmalic acid was observed as a distinctive metabolite for recent date intake. As already mentioned, this compound is also recovered in biofluids after intake of peas or different fruits including banana, tomato, and pear [42, 49, 93, 94]. Some carboxyethylated amino acids were detected in plasma after the intake of both varieties of dates, including 1-carboxyethylisoleucine, 1-carboxyethylleucine, 1-carboxyethyltyrosine, and 1-carboxyethylvaline. These compounds, considered to be advanced glycation products, are formed by non-enzymatic reactions between reducing sugars and proteins [111, 112]. They are most likely formed in the dates during drying, but they might also be formed endogenously after intake of dates as a consequence of their high content of fructose [112].

Based the scarce information available on human metabolites observed after intake of date fruit, it is not yet possible to identify a candidate BFI. Some more specific caffeoylshikimic acids, e.g., dactylifric acid (3-O-caffeoylshikimic acid), neodactylifric acid (5-O-caffeoylshikimic acid), and isodactylifric acid (4- O-caffeoylshikimic acid) resulting from enzymatic browning of date fruit [29, 113] and xanthoxylin [114] may be of interest as putative biomarkers but their metabolic fate in humans remain to be elucidated.

Pomegranate biomarkers

Pomegranate is mainly consumed fresh or in the form of juice. It is characterized by its content of rare ellagitannins, including punicalin and punicalagin A and B. Thus, it comes as no surprise that most of the retrieved literature focused on urolithins. Urolithins are dibenzo-[b,d] pyran-6-one derivatives produced from ellagic acid and ellagitannins by the gut microbiota [115, 116] and to which various beneficial effects on human health have been attributed [117, 118].

After consumption of pomegranate, urolithins appear in plasma 6–8 h after exposure and persist in plasma and urine up to 48–72 h [116, 119,120,121,122,123,124,125] (Table S2). It is likely that the primary precursor of urolithins from pomegranate is punicalagin, which constitutes the major ellagitannin present in this fruit.

Two main factors limit the usefulness of urolithins as BFIs for pomegranate. Firstly, their precursors, namely ellagitannins and ellagic acid, are distributed in other highly consumed foods, including berries and walnuts. The estimated dietary contribution of total ellagitannins as assessed by 3-day dietary records and chromatographic analysis revealed five primary dietary sources of ellagitannins: strawberries, blackberries, walnuts, pomegranate juice, and preserved foods [123]. Of these foods, berries accounted for almost half (42%) of the estimated intake of ellagitannins, followed by pomegranate (27.6%), walnuts (26.7%), and strawberry jam (4.1%). Tea infusions (Camelia Sinensis) also contain ellagitannins and ellagic acid [126]. The high frequency of consumption of tea, berries, and walnuts makes them potential confounders for the use of urolithins as BFIs for pomegranate. The second limiting factor is the high inter-individual variation in the production of urolithins caused by variable gut microbiota functionality [127]. From the analysis of urine samples of subjects involved in different dietary interventions with walnuts, pomegranate, or berries, the group of Tomás-Barberan and others reported the occurrence of three ellagitannin-metabolizing phenotypes: metabotype A, metabotype B, and metabotype 0 [115,116,117, 124, 127,128,129,130,131]. Metabotype A subjects, representing 25–80% of the tested populations, produce solely urolithin A conjugates. Metabotype B found in 10–50% of the volunteers produce isourolithin A and urolithin B besides urolithin A. Lastly, subjects with metabotype 0, less than 10% of the population, do not produce any of these urolithins.

In conclusion, urolithins are not promising candidate BFIs for pomegranate and additional work has to be done to identify specific BFIs that would apply for all subjects regardless of their metabotype. As no other specific pomegranate component has been highlighted so far, the best approach to identify putative BFIs for pomegranate is to conduct an explorative intervention study with untargeted metabolomic profiling of the collected biofluids and later validate the robustness of the discovered candidates in a population in free-living conditions.

Mango biomarkers

Mango is a rich source of phytochemicals including diverse polyphenols such as catechins, quercetin, kaempferol, rhamnetin, anthocyanins, gallic acid, ellagic acid, protocatechuic acid, and benzoic acid, which are all widely distributed in other fruits [132, 133]. Gallic acid has been found as the most abundant in mango pulp [134] along with different polymers of the same metabolite in the form of gallotannins. Barnes et al. studied the bioavailability of galloyl derivatives after consumption of mango (Keitt variety). Eleven healthy volunteers underwent a dietary intervention with 400 g of mango daily for 10 days. Plasma and urine samples were collected on the first and last days of intervention. While no response was observed in plasma, seven metabolites of gallic acid and pyrogallol derivatives were detected in urine [134]. The gallic acid metabolites 4-methylgallic acid and 4-O-methylgallic acid-3-O-sulfate accounted for 43–54% of the total administered gallic acid and showed a rapid urinary excretion peaking at 6 h and returning to baseline 8–12 h after the intake of mango [134]. The other five metabolites were pyrogallol conjugates that originate from the microbial decarboxylation of gallic acid, namely isoforms of O-methylpyrogallol-O-sulfate, pyrogallol-O-sulfate, and deoxypyrogallol-O-sulfate [135]. These five metabolites had a urinary excretion peaking at 12 h after the consumption of mango. After 10 days with 400 g/day of mango, pyrogallol-O-sulfate and deoxypyrogallol-O-sulfate were the most prevalent metabolites with a significantly increased urinary excretion (> 60%, P < 0.05).

Quirós-Sauceda et al. reported consistent results on pyrogallol metabolites in hydrolyzed plasma and urine samples of 12 male volunteers after the intake of a different variety of mango (Ataulfo) [133]. Five compounds, i.e., gallic acid, chlorogenic acid, protocatechuic acid, ferulic acid, and genistic acid, all unspecific for mango intake, were detected in hydrolyzed plasma 2–4 h after the intake of either 500 g of mango pulp or 721 g of mango juice. In urine, gallic acid, pyrogallol, chlorogenic acid, vanillic acid, p-coumaric acid, ferulic acid, and sinapic acid were detected [133]. Most metabolites showed a rapid urinary excretion while pyrogallol was observed over 8–24 h after the intake of mango in both forms.

None of the aforementioned phenolic metabolites may be considered as candidate BFIs for mango due their low specificity for this fruit. Pyrogallol and its conjugates have been detected in human biofluids following the intake of other fruit products, such as grape juice and berries [136, 137]. Mangiferin, a xanthonoid, has been isolated from the pulp of different mango varieties including the Keitt and Ataulfo used in the studies discussed in this section [132]. While its presence and specificity for mango makes it plausible candidate BFI for mango, no information is available yet on the bioavailability of this compound. Its combination with metabolites of pyrogallol may be further tested as a candidate multi-marker for mango.

Potential biomarkers for other tropical fruits

In spite of the frequent consumption of kiwi, muskmelon, pineapple, guava, and persimmon, there is a remarkable paucity of information on their metabolites in human studies, and even fewer data are available for the less common tropical fruits acerola, litchi, pitaya (dragon fruit), and coconut.

Abdul et al. examined the metabolic profiling of three varieties of kiwi including the most consumed Hayward cultivar (Actinidia deliciosa), the mini kiwi (Actindia arguta), and the less known Bidan (Actinidia eriantha) [138]. Only non-specific compounds were detected, including phenylalanine, tyrosine, arginine, citric acid, glutamine-hydroxy-L-proline, 4-aminobutyrate (GABA), glutamate, glutamine, quinic acid, actinic acid, shikimate, mannose, syringic acid, and afzelechin [22, 138].

Muskmelon, or cantaloupe, is a widely consumed fruit in different countries. Besides their content in β-cryptoxanthins, other interesting phytochemicals reported in this fruit are cucurbitin and cucurbitacin E and B [21, 22, 139] (Figure S1). Little is known about the bioavailability of these compounds after consumption of muskmelon, and they have also been reported in other fruits of the cucurbitacae family, such as cucumber or pumpkin [22, 24]

Regarding guava, while no publications were retrieved that report specific metabolites following its intake, some specific terpenes have been described in the fruit. Qin et al. reported the presence of guadial, guajadial, 4,5-diepipsidial, psiguadial, and two novel meroterpenoids, psiguajavadial A and B [140]. Although studies on the bioavailability and pharmacokinetics of terpenes in humans are scarce, evidence indicating their bioavailability after the intake of other fruits exists [141, 142]. Thus, these metabolites might constitute putative biomarkers for the intake of guava (Figure S1). Moreover, the presence of mongolicain-A, an ellagitannin, has been reported solely in guava [22, 29] advocating for a potential role as a putative BFI for this fruit. However, as discussed previously its bioavailability in humans in not documented and it may be metabolized into non-specific compounds like urolithins with a high interindividual variation.

Pineapple belongs to the Ananas cosmosus species. Like in other tropical fruits, melatonin is present in pineapple and its consumption has been shown to produce an elevation of 6-sulfatoxymelatonin in urine of healthy adults [143]. However, as discussed in the “Banana biomarkers” section this metabolite is not relevant as a BFI for either pineapple or any other fruit due to its natural abundance in many fruits and in supplements. Ananasic acid, a specific triterpenoid reported in pineapple [21, 22, 144], may be further studied for its plausibility as a putative BFI for pineapple.

Leucodelphinidin-3-glucoside [145], kakidiol, and methyl-phaeophorbide A and B are listed in different food databases for persimmon [21,22,23] (Figure S1). Information on the bioavailability of these compounds after human consumption of persimmon is not available. Finally, no specific phytochemicals were identified for litchi, pitaya (dragon fruit), coconut, or acerola in our review process.

Conclusion

In spite of the important presence of tropical fruits in the human diet, there has been very little interest so far for biomarkers of their intake, and except for banana, no studies have been specifically designed for identifying such biomarkers. For banana consumption, dopamine sulfate, salsolinol sulfate, 6-OH-MTβC sulfate, methoxyeugenol-glucuronide, HIAA, and 3-methoxytyramine sulfate are candidate biomarkers. The combination of some of these individual compounds may strengthen their robustness to overcome the exposure to potential confounders, as demonstrated for the combination of methoxyeugenol glucuronide and dopamine sulfate in the urine. However, dose-response studies are needed to validate the latter combination as an accurate quantitative BFI for banana. Citrulline is a candidate BFI for watermelon but its robustness, especially given its endogenous formation in the gut epithelium, has to be assessed in population studies. Perseitol and mannoheptulose are promising candidate BFIs for avocado but again, larger population studies are required to document their robustness and reliability. A few human intervention studies with mango, papaya, pomegranate, and dates have been performed, but no candidate BFIs were identified so far for these fruits. Specific phytochemicals have been described in some tropical fruits including mango, pineapple, guava, and persimmon, but there is virtually no information regarding their bioavailability and interest as putative BFIs.

The way forward for discovering BFIs for tropical fruits is certainly the application of untargeted metabolomics, where no a priori hypothesis of candidate biomarkers is required. Complementary studies with different study design, such as controlled cross-over trials with different doses of the fruits, and cross-sectional studies with comprehensive dietary information will have to be used to document the criteria of the validation scheme for BFIs.

In parallel, data collection and sharing in online resources on the phytochemical composition of tropical fruits and the bioavailability of these phytochemicals should facilitate the identification of specific phytochemical metabolites in metabolomic profiles that may constitute new candidate BFIs.