A yeast metabolome-based model for an ecotoxicological approach in the management of lignocellulosic ethanol stillage

Lignocellulosic bioethanol production results in huge amounts of stillage, a potentially polluting by-product. Stillage, rich in heavy metals and, mainly, inhibitors, requires specific toxicity studies to be adequately managed. To this purpose, we applied an FTIR ecotoxicological bioassay to evaluate the toxicity of lignocellulosic stillage. Two weak acids and furans, most frequently found in lignocellulosic stillage, have been tested in different mixtures against three Saccharomyces cerevisiae strains. The metabolomic reaction of the test microbes and the mortality induced at various levels of inhibitor concentration showed that the strains are representative of three different types of response. Furthermore, the relationship between concentrations and FTIR synthetic stress indexes has been studied, with the aim of defining a model able to predict the concentrations of inhibitors in stillage, resulting in an optimized predictive model for all the strains. This approach represents a promising tool to support the ecotoxicological management of lignocellulosic stillage.


Introduction
which displays a great pollution potential. For each litre of ethanol manufactured, about 20 l of stillage can be produced, with a chemical oxygen demand (COD) of about 100 g l 21 [1][2][3]. A medium-sized ethanol plant, making a million litres of fuel per year, generates stillage with a pollution level equivalent to the sewage of a small city [4]. Stillage obtained from the distillation of fermented mash is quite hot (70 -808C), deep brown and has high levels of organic materials and solids. Nevertheless, the pollution potential of the effluent depends on feedstocks, facility and methods applied to recover the alcohols [1,3]. Additional environmental concerns on stillage would be provided by high N or sulfate content, colour, heavy metal concentrations and the presence of organic pollutants [1,3].
Whereas the industrial-scale conversion of sugars and starchy crops into ethanol is a mature technology [5 -8], large-scale production from lignocellulosic biomass has limited applications. Nevertheless, efforts are ongoing to improve economics and to move cellulose-to-biofuels conversion into production [9 -13]. In contrast to sugar-and starch-rich materials, the great availability of lignocellulosic biomass means that ethanol processing from lignocellulosic biomass could replace a major portion of fossil fuels [11,12]. However, due considerations for treatment and utilization of the stillage are essential to qualify lignocellulosic ethanol as a sustainable 'green energy' route [14].
Commonly, the traits of stillage from cellulosic materials are comparable to those of conventional substrates (i.e. sugarcane or corn) and, therefore, the ways so far developed to treat and use stillage from conventional feedstocks could also be valid for lignocellulosic materials.
However, the higher levels of heavy metals together with the occurrence of unusual inhibitory compounds associated with the feedstock pre-treatment process, such as weak acids, furans and phenolic molecules [12,[15][16][17], call for ecotoxicological studies to carefully assess their toxic effects before being treated and/or disposed. Figure 1 exemplifies the main stillage treatment and valorization technologies. In order to recover and remove suspended solids containing yeast and other materials, physical/mechanical separations can be implemented. In the case of a corn-to-ethanol process, the separated solids can be dried and sold as a high-value animal feed called dried distillers grains (DDGs) [6,18].
After mechanical treatment, a cluster of technologies is available including single cell protein, membrane separation, evaporation, enzyme(s) production and anaerobic digestion (figure 1) [1,3,4,15,19 -21]. In the case of lignocellulosic stillage, occurrence and levels of inhibitory compounds will greatly influence stillage valorization technologies [1,22]. In specific circumstances, proper dilution rates and/or flow of different valorization strategies will be useful to reduce the inhibitor content to safety levels, in order to convert stillage into valuable products. Thus, towards the widespread implementation of lignocellulosic ethanol plants, safe stillage management procedures will be of great impact and new ecotoxicological insights are needed.
Only a few ecotoxicological studies have taken into consideration the problem related to the toxicity of the disposed stillages, most of them focused on the genotoxicity assessment on higher eukaryotes [23][24][25].
In this perspective, this study proposes a yeast metabolome-based assay to evaluate the levels of lignocellulosic inhibitors commonly found in stillage. Lower eukaryotes present a number of advantages such as easy manipulation and possibility of testing simultaneously several different experimental conditions on millions of cells in different cell cycle phases. For this purpose, three Saccharomyces cerevisiae strains, with different tolerance to inhibitors, have been chosen as biological sensors, with the possibility to extend the results of this bioassay also to higher eukaryotes [26]. Fourier transform infrared (FTIR) spectroscopy was employed to detect the reaction of test organisms to different mixtures of inhibitors as a paradigm of different lignocellulosic stillage compositions [9,11,12,27,28]. This procedure has already been successfully applied for the assessment of stressinduced cell status in response to various stressors [29,30]. Finally, an appropriate modelling has been developed in order to predict the concentration of these compounds in stillages. This ecotoxicological approach, coupling the above-mentioned advantages related to the use of microorganisms as biological sensors to the high efficiency and low cost of FTIR, is proposed as a tool for the correct management of this environmentally unfriendly waste.

Cultures and growth conditions
FTIR-based bioassay was performed with three S. cerevisiae strains, Fp84, Fm17 and DSM70449, selected among 160 yeast strains already described for their tolerance to lignocellulosic inhibitory compounds [17,27]. Distribution analysis of the inhibitors tolerance was obtained by using a boxand-whiskers plot.

Stressing agents
Weak acids and furans obtained from Sigma (www.sigmaaldrich.com) were used at increasing levels in distilled sterile water. Each dose of inhibitory compound was reported as three relative concentrations (RCs, low, medium and high) according to the low, medium and high inhibitor levels usually found in lignocellulosic stillage [12,17,27,28]. The low, medium and high RCs consisted of (mM): 30, 60 and 120 for acetic acid; 13, 27 and 53 for formic acid; 7, 14 and 29 for furfural; and 7, 15 and 30 for 5-(hydroxymethyl)furfural (HMF), respectively.
Inhibitors were also formulated into binary, ternary and quaternary mixtures obtained with increasing concentrations of weak acids or furans. pH values of each solution of single acid ranged from 2.3 to 2.7. In the case of furan preparations, pH values were about 6.5. pH values of the binary and ternary mixtures ranged between 2.3 and 2.7; meanwhile quaternary inhibitors mixtures had the pH values of 2.6, 2.5, 2.4, 2.2 for RC L , RC M and RC H .

Ecotoxicological analysis
An FTIR bioassay was used to assess the toxicity of inhibitory compounds both as single and as binary, ternary and quaternary mixtures. Each cell suspension was centrifuged (3 min at 5300g), washed twice with distilled sterile water and re-suspended with proper volumes of distilled water to obtain a final concentration of 2.5 Â 10 8 cell ml 21 . Inhibitors were added in order to obtain the RCs reported in the Stressing agents section. The control (RC 0 , no inhibitors) was obtained by re-suspending cells in distilled sterile water. All tests were performed in triplicate. Tubes were incubated for 1 h at 258C in a shaking incubator set at 50 r.p.m. After the incubation, cells were prepared for FTIR experiments as described by Corte and colleagues [31]. The biocidal activity tests were carried out in parallel to compare the metabolomic changes with the loss of viability induced by inhibitors. A 100 ml volume of each cell suspension prepared for the FTIR analysis was serially diluted to determine the viable cell counting, in triplicate, on YPDA with chloramphenicol (0.5 g l 21 ). The biocidal effect of the tested compounds was highlighted as cell mortality induced at different concentrations. The cell mortality (M) was calculated as M ¼ (1 2 Cv/Ct) Â 100, where Cv is the number of viable cells in the tested sample and Ct the number of viable cells in the control suspension.

FTIR analysis
Each culture sample was analysed in triplicate. For each strain, 105 ml volume was sampled for three independent FTIR readings (35 ml each, according to the technique suggested by Manfait and colleagues [30]). FTIR measurements were performed in transmission mode. All spectra were recorded in the range between 4000 and 400 cm 21 with a TENSOR 27 FTIR spectrometer, equipped with HTS-XT accessory for rapid automation of the analysis (BRUKER Optics GmbH, Ettlingen, Germany). Spectral resolution was set at 4 cm 21 , sampling 256 scans per sample to obtain high quality spectra (signal to noise ratio values greater than 4000 within the 2100-1900 cm 21 interval). The software OPUS version 6.5 (BRUKER Optics GmbH, Ettlingen, Germany) was used to assess the quality test, subtract the interference of atmospheric CO 2 and water vapour, correct baseline (rubberband method with 64 points) and to apply vector normalization to the whole spectra. Spectral data matrices were exported as ASCII text from Opus 6.5 and analysed using the 'R' script MSA (metabolomic spectral analysis) according to Cardinali and colleagues [32]. Six synthetic stress indexes (SI) were calculated as normalizations of the Euclidean distances between the response spectra of cells under stress and those of cells maintained in water, according to the procedure proposed by Corte and co-workers [29]. Briefly, Euclidean distances among different conditions were calculated using the vectors of the corresponding intensities. Since the five selected spectral regions have different lengths, the normalization was carried out by dividing the Euclidean distance value by the ratio between the number of points of the whole spectrum and the number of points of each specific spectral region. One SI is related to the entire spectrum (global stress index, GSI) while the other five are related to the five different spectral regions defined as follows: fatty acids (W1) from 3000 to 2800 cm 21 , amides (W2) from 1800 to 1500 cm 21 , mixed region (W3) from 1500 to 1200 cm 21 , carbohydrates (W4) from 1200 to 900 cm 21 and typing region (W5) from 900 to 700 cm 21 [30]. The typing region was not considered in this analysis because its response did not appear correlated with the specific stressing conditions tested.

Modelling and statistical analyses
Regression analysis, modelling and statistics were performed using mathematical and statistical functions under MS Excel and 'R' (www.cran.org), respectively. R base package and 'R' script MSA [32] has been used to calculate SI. According to the distribution of results, parametric and/or non-parametric statistics were applied in MS Excel to test hypotheses on the means and/or the medians, respectively. The a value was set to 0.05 and the level of significant difference to p , 0.05. Pearson linear regression analysis was applied to test the best fitting between RC and FTIR data (MS Excel). Statistical significance of the general models was assessed by Student t-Test (MS Excel).

Results and discussion
3.1. Inhibitor tolerance assessed in robust yeast strains Three S. cerevisiae yeast, namely Fm17, Fp84 and DSM70449, were chosen as test organism among 160 strains previously assessed for inhibitor tolerance when grown in complex or minimal broths with increasing concentrations of inhibitory compounds both as single and as quaternary mixtures. Inhibitor tolerance was calculated as relative growth (OD value, %) by assessing the growth in the medium with and without the inhibitory compounds (acetic acid, formic acid, furfural and HMF). Three increasing RC (low, medium and high) have been tested (figure 2) and the position of the three strains evaluated in this study (Fm17, Fp84 and DSM70449) is reported for the tested inhibitor concentrations.
The distribution analysis of single inhibitor tolerance showed a great variability with ranges becoming larger at higher inhibitor concentrations, mainly for the acetic and formic acid ( figure 2a). Quaternary mixtures caused a growth decrease proportional to their concentrations without variability range increase (figure 2b) otherwise observed after single inhibitor exposure (figure 2a). Moreover, the highest relative concentration of the quaternary mixtures was lethal for all the strains (data not shown).
Overall, S. cerevisiae Fm17 is always positioned at the top of single and combined distributions, whereas DSM70449 was placed at the bottom and Fp84 close to the average. The three strains were previously characterized on the basis of their stress response to these four inhibitors exhibiting specific metabolomic reactions in agreement to their tolerance levels [33] and could be useful as biological sensors of the stress induced by lignocellulosic inhibitors. In order to define whether the conclusions reached by Favaro and colleagues [33] are representative of all the combinations of inhibitors potentially present in lignocellulosic stillage, we tested the stressing effect exerted by single compounds, binary, ternary and quaternary mixtures on these test organisms. To this purpose, each strain has been exposed to three different concentrations (RC L , RC M and RC H ) and evaluated in terms of mortality and metabolomic response synthesized as GSI. As RC H was lethal in all the combinations, only data related to RC L and RC M are reported in figure 3.
Each strain exhibited a specific pattern of mortality. Specifically, acetic acid and furans induced the lowest biocidal effect whereas high mortality was observed in most of the combinations containing formic acid.
In general, the higher mixtures concentration induced the higher mortality levels in all tested microorganisms. S. cerevisiae DSM70449 was confirmed to be the most sensitive strain (figure 3e,f), while the Fp84 and the Fm17 showed intermediate and high inhibitor tolerance, respectively (figure 3a-d).
The picture depicted by coupling the metabolomic reaction (GSI) with the mortality induced by the inhibitory compounds at concentration non-saturating mortality (RC L ) confirmed that S. cerevisiae Fp84, Fm17 and DSM70449 are indeed representative of three different types of response to the inhibitory compounds most commonly present in bioethanol plants.
Specifically, S. cerevisiae Fm17 presents the traits characteristic of a resistant strain, in which a low mortality is accompanied by a low metabolomic reaction. Vice versa, the strain DSM70449 displays the highest mortality and the lowest GSI values, typical of a sensitive organism which is not able to react to the strong and immediate action exerted by a toxic compound [34]. Finally, S. cerevisiae Fp84 shows low mortality values together with a relatively high metabolomic response, indicating an intermediate tolerance to these stressing agents.

Primary models
The outcome of the mortality and GSI, determined with the FTIR bioassay, confirmed that the three strains, chosen on the basis of relative growth rates, are also representative of three different styles of response to all the combinations of the stressing agents tested. This piece of evidence led us to concentrate exclusively on the response to the quaternary cocktail, as the most complex stressing condition and one of the more frequently encountered in bioethanol plants and related stillage byproducts. In order to define a model to predict the concentrations of inhibitors on the basis of the FTIR spectrum of cells under stress, we studied the relationship between RC and the SIs in different spectral regions or the spectrum as a whole (GSI). This operation was carried out for the three strains separately. The regression analyses showed that a second degree correlation curve described this relationship quite well (R 2 0.99, 0.96 and 0.83 for Fp84, DSM70449 and Fm17, respectively). According to these analyses, we formulated first independent primary models for W1, W2, W3, W4 and the whole spectrum, as reported in   The degree of variability between replicas throughout the FTIR spectra ranged around 2.5 Â 10 22 . Black dots represent mortality values as the mean of three replicates with the relative standard error always being less than 5% (not reported).

General model
After defining the primary models, a general model was determined for each strain by assigning a specific weight to each primary model equation, and therefore to the contribution of each spectral region, as reported in where RC s indicates the relative concentration estimated by each biosensor, while w1, w2, w3, w4 and wS are the specific weights assigned to each primary model. Two different models were evaluated, the first based on the response of the entire spectrum (wS) while the second on that of W1, W2, W3 and W4 regions, separately. This model proved to be effective in the prediction of RCs only using the Fp84 and DSM70449 as sensors. In fact, t-test analysis showed no significant differences between the observed and the Table 1. Parameters obtained for the construction of the primary model equations. W1, W2, W3 and W4 represent, respectively, the fatty acids region (3000-2800 cm 21 ), the amides region (1800 -1500 cm 21 ), the mixed region from 1500 to 1200 cm 21 and the carbohydrates one from 1200 to 900 cm 21 Table 2. Relative concentration predicted for each biosensor of low, medium and high RCs of quaternary inhibitory mixtures. Upper and lower sections report data obtained by assigning an entire weight to the primary model of the whole spectrum or a specific weight to each primary model equation for W1, W2, W3 and W4 regions, respectively. w1, w2, w3, w4 and wS represent the specific weights assigned to the primary models equations W1, W2, W3, W4 and whole spectrum, respectively. biosensor  The best RCs prediction was obtained for the Fp84 strain on the basis of the sole response of the mixed region (w3 ¼ 1) as well as for the resistant strain Fm17 on that of the fatty acids (w1 ¼ 1). Conversely, the more predictive primary model for DSM70449 strain resulted that based on carbohydrates (w4 ¼ 0.8), with a small contribution due to fatty acids and mixed regions models (w1 and w3 ¼ 0.1).

General model based on the response of the whole FTIR spectrum of cells under stress
The increased predictivity of the second model depends on the option to attribute a specific weight to each primary model equation considering only those areas which best respond to these specific stressing conditions. Using this procedure, regardless of the strain tolerance level, all test microorganisms become excellent sensors for the stressing conditions tested and, therefore, for the prediction of the RCs of inhibitory compounds normally found into lignocellulosic stillage.
Overall, the present study proposes a novel tool to plan proper dilution rates and/or flows for the ecotoxicological management of lignocellulosic stillages. The desirable diffusion of large-scale ethanol plants will require new simple and cheap approaches to reduce the inhibitor content to safe levels in view of different possible uses of stillage [1,2,12].
Furthermore, the final disposition of treated stillages will need rapid and reliable analysis of their inhibitor content to avoid any environmental hazard [14]. Moreover, most of the literature data refer to bench-scale systems and many post-disposal monitoring programmes are conducted in the short term, highlighting a consistent lack of data on the proper management of these waste and their reuse in the long term [35].
Furthermore, most of the bioassays so far reported are not tailored to quantify and classify the changes detected, without analysing the type of action and the strength of their activity on living cells. The approach applied in this work has already been designed to serve as an ecotoxicological assay and successfully applied in different fields including the qualitative and quantitative evaluation of the antagonistic effects of inhibitors mixtures on S. cerevisiae metabolism [33].

Conclusion
The present study provides a novel approach to support the ecotoxicological management of lignocellulosic stillage based on the modelling of FTIR spectra of S. cerevisiae cells challenged with quaternary cocktails of lignocellulosic inhibitors.
Compared to the traditional chemical analyses, FTIR spectroscopy offers the advantage of a rapid, non-invasive and cheap method also exploitable with a handheld instrument. This would open the possibility of testing the actual effect of the stillage, or any other type of compound or complex mixtures of toxic compounds, on living cells, and would be very useful in environmental sciences.
All test microorganisms resulted in excellent sensors for the stressing conditions tested and, therefore, for the prediction of the RCs of inhibitory compounds normally found in lignocellulosic stillage. The choice of S. cerevisiae cells as biosensors represents a valid model because their eukaryotic nature could allow the extension of the results of this bioassay also to other eukaryotes, including plants and man, while their microbial nature is ideal for the laboratory usage. Furthermore, depending on the case, the procedure could easily be applied to other test organisms through a simple a priori calibration of the model. Finally, this bioassay can be further upgraded by producing large FTIR libraries with the actual stillages coming from productive plants, improving the level of predictability without limiting the ease of application. The application of artificial neural networks (ANN) is a further perspective to potentiate this bioassay.
Ethics. For this work, no ethical assessment was required to be made prior to conducting the research, as no research on humans or animals was included.
Data accessibility. All the performance data are provided in the main text. FTIR spectra are available as electronic supplementary material.