Adding levels of complexity enhances robustness and evolvability in a multilevel genotype–phenotype map
Abstract
Robustness and evolvability are the main properties that account for the stability and accessibility of phenotypes. They have been studied in a number of computational genotype–phenotype maps. In this paper, we study a metabolic genotype–phenotype map defined in
1. Introduction
Classical evolutionary models do not account for the robustness and evolvability of phenotypes [1]. They thus fail to explain some evolutionary phenomena, such as punctuated equilibria [2,3], constrained evolution [4] or the origins of novelty [5,6]. In recent years, several research groups have tried to understand this important question by studying computational mappings of molecular genotypes to phenotypes. Some of these maps try to remain faithful to biological phenomena, such as RNA secondary structure [7–14], protein secondary structure [15–20], gene regulatory networks [21–24] and metabolic networks [25–28]. More abstract models have also been developed, such as the polyomino [29–31] and
Even though all these models focus on different aspects of molecular biology, all of them share some common properties. First, the mapping from genotype to phenotype is highly degenerate: many genotypes encode the same phenotype. Additionally, phenotype abundance (the number of genotypes encoding it) is not evenly distributed: most phenotypes are rare, while a few of them are extremely abundant. The probability density function associated with phenotype abundance is a log-normal distribution for a wide variety of models [14,34], although in some cases it has been found to be a power law [33,34]. This implies that rare phenotypes will not play a central role in evolution: they are hard to find in a genotype space that is filled with abundant phenotypes [12,14,37]. This degeneracy is usually accompanied by the formation of neutral networks, networks of genotypes encoding the same phenotype, in which two genotypes are connected if they differ in one point mutation—i.e. if the strings representing each genotype differ in one letter [7,16,21,25,38]. The degree of a node in such a neutral network—the number of neutral neighbours it has—is called genotypic robustness. It is usually normalized by the total number of neighbours in genotype space, and thus represents the fraction of neighbours with the same phenotype [5]. Inside a particular neutral network, the degree distribution is highly heterogeneous, but usually unimodal [13,39]. Additionally, in RNA, the average degree of a neutral network grows linearly with the logarithm of its abundance [13]—this positive correlation is also observed in the polyomino model [30], the HP model [31] and simple genotype–phenotype maps [34], and is suggested by empirical data [40]. Neutral networks of abundant phenotypes percolate genotype space: they contain genotypes that share almost no letters [5,7,21]. Conversely, most abundant phenotypes are easily accessed from each other: traversing a neutral network, many phenotypes can be found at its boundary [5,21,25]. Moreover, abundant phenotypes are typically found just a few mutations away from a random genotype [5,9,30]—i.e. they are highly evolvable. This means that these phenotypes are easily accessible from any other phenotype, so that the search for new phenotypes among abundant ones is a fast evolutionary process.
In [32], we presented Figure 1. Brief overview of
In this paper, we investigate the characteristics of the metabolic genotype–phenotype map of
2. Degeneracy of the genotype–phenotype map
The size of genotype space in (electronic supplementary material, §S1). For g = 2, this number is 5.5 × 1011, for g = 3, it is 1.9 × 1017, and for larger values of g it keeps growing almost exponentially. A complete exploration of these genotype spaces is well beyond our computational possibilities in general. However, using computational tricks, we have exhaustively analysed the g = 2 and g = 3 cases—i.e. we have limited our study to two-gene and three-gene genotypes.
We have restricted ourselves to studying those genotypes that are able to catabolize at least one metabolite—these will be called viable genotypes. The remaining (non-viable) genotypes are unable to catabolize any metabolite. For g = 2, there are 1.1 × 109 viable genotypes, representing little more than 0.2% of all genotypes. For g = 3, this number is 1.0 × 1015—approximately 0.5% of all genotypes. In both cases, the great majority of genotypes are unable to catabolize any metabolite. But note that the space of viable genotypes is still enormous.
Among these viable genotypes, many of them catabolize exactly the same metabolites—they encode the same metabolic phenotype. For g = 2, there are only 775 different phenotypes, corresponding to an average of 1.4 × 106 genotypes per phenotype. For g = 3, there are 26 492 phenotypes, corresponding to an average of 3.8 × 1010 genotypes per phenotype. In other words, for both g = 2 and g = 3, the degeneracy of the genotype–phenotype map is huge. From now on, we will refer to the set of phenotypes found for g = 2 and g = 3 as and
, respectively.
The distribution of phenotype abundances is highly skewed (figure 2a), similarly to what has been observed for other genotype–phenotype maps. In both cases, the distribution can be empirically fitted to a log-normal distribution. We obtained the parameters empirically from the log-transformed data, using maximum-likelihood. For both g = 2 and g = 3, the rank distributions (electronic supplementary material, figure S9) show a long tail, confirming that, indeed, while few phenotypes are very abundant, most of them are rare. For g = 3, this is especially striking, since 300 phenotypes in Figure 2. Neutral networks in represent nearly 99% of all genotypes—which means that the remaining 1% is distributed among approximately 26 000 phenotypes.
, where μ is the mean and σ is the standard deviation of the normally distributed logarithm of the variable. Here μ = 4.742 and σ = 1.224, obtained using maximum-likelihood. For g = 3, the distribution of phenotype abundances (S3, a(ii)) is again very close to a log-normal distribution with μ = 5.604 and σ = 1.838. The log-normal fit is worse than in (a) because there is a small bump in the right part of the distribution, where more abundant phenotypes are—due to the over representation of two-gene phenotypes (see text). (b) Average degree of nodes (circles) in neutral networks (see electronic supplementary material, figure S12 for the degree distribution) versus gene number g. The average degree 〈k〉 of a node grows linearly with gene number g, as 〈k〉 = − 27.6 + 17.8 g (line). (c) Average robustness (circles) versus gene number g. Robustness grows with gene number, and we can find a nonlinear relationship between both variables: 〈R〉 = 0.895 − 1.392/g (line). (d) There is a nonlinear relationship between g and 〈d∞〉, the final distance that is reached in a random walk in which genotypes are forced to get away from the starting genotype every step: 〈d∞〉 = 0.965 − 1.354/g (line). The circles represent 〈d∞〉. (e) There is a linear relationship between 〈d∞〉 and the average robustness of the genotypes as obtained in c, given by: 〈d∞〉 = 0.094 + 0.972〈R〉 (line), very close to the 〈d∞〉 = 〈R〉 fit. In all cases, the grey area encompasses two standard deviations, and the fits in (b–e) were obtained using the least-squares method. (Online version in colour.)
All phenotypes in are also found in
: we can always add a gene whose product does not fold into any protein to a viable two-gene genotype. A pertinent question, therefore, is how abundant the phenotypes belonging to
are in three-gene genotype space. We find that phenotypes in
take up 99.6% of genotypes in g = 3 (electronic supplementary material, §S4). This means that the 775 phenotypes in
dominate the space of phenotypes for g = 3. Only special combinations of three proteins and three promoters will yield most of the phenotypic diversity observed for g = 3. The majority of genotypes will be extensions of two-gene genotypes with a third gene that does not interfere with their function.
We can re-compute the histogram in figure 2a(ii) taking the 775 phenotypes from as a separate set from the remaining 25 717 phenotypes in
that are not in
(electronic supplementary material, §S4). This reveals that the small bump observed in the right part of the distribution of phenotypes in
(figure 2a) is due to the phenotypes in
. When we eliminate these phenotypes, the resulting distribution is much closer to a log-normal. In a sense, it is as if both sets were somehow independent: one is formed by two-gene genotypes with a third, non-interfering gene, and the other is formed by all combinations of three genes that encode new phenotypes. This influence of
phenotypes decays linearly when genotype size increases (electronic supplementary material, §S5), although they keep representing more than 80% of genotype space for g ≤ 13.
3. Neutral networks in toyLIFE
Robustness can be defined as , where k is the degree of a node in a neutral network, and
is the maximum number of point-mutation neighbours. In other words, R is the normalized degree of a node. We can sample genotypes for different genotype sizes, represented by g (gene number), and plot the histogram of values of R (electronic supplementary material, figure S12). All the resulting distributions are unimodal, as has been observed in other genotype–phenotype maps [5,13].
Also, taking into account that Figure 3. Phenotypic robustness is linearly related to the logarithm of phenotype abundance. For g = 2 and g = 3, we sampled 107 genotypes and computed their robustness. Then we assigned each of them to their corresponding phenotypes, and estimated phenotypic robustness, the average robustness for all genotypes encoding a given phenotype (see text). (a) Phenotypic robustness in g = 2 versus the logarithm of phenotype abundance. The line represents the power-law relationship R phenotypes dominate in
for g ≤ 13 (electronic supplementary material, §S5), we can estimate that Sg∼17.8gS2, so logSg∼q + glog17.8, where q is a constant. Combining this result with the linear relationship between g and 〈k〉, we obtain for
(green circles, above) from the rest (blue circles, below). Both sets show a power law relationship between phenotypic robustness and phenotypic abundance: Rp = 1.790 S0.0133 for phenotypes in
(green line) and Rp = 0.805 S0.0233 for the remaining 25 717 phenotypes (straight lines). Grey area encompasses two standard deviations, and the fits were obtained using the least-squares method. (Online version in colour.)
In other genotype–phenotype maps, neutral networks tend to have one giant component [18], although this is not always the case: too short RNA sequences form neutral networks that are highly disconnected [13]. Although network analysis is almost impossible for g≥3, as networks are enormous, for g = 2 we can perform network analyses on all 775 phenotypes exhaustively, and compute their connected components (electronic supplementary material, §S7). We observe that most phenotypes are distributed in highly fragmented neutral networks: the genotypes encoding a given phenotype form many disjoint connected components, which are typically small. Abundant phenotypes tend to have a larger number of connected components, and we can find a relatively good power-law fit between the abundance of the phenotype S2 and the number of components C: C = 0.25S0.72 (electronic supplementary material, §S7). We also observe a huge variation in the size of connected components in g = 2: although more than 98% of connected components are smaller than 1000 nodes, some of them reach up to approximately 107 nodes. The high fragmentation of neutral networks is due to the HP model that underlies protein folding in
For g≥2, we can estimate the distribution of neutral networks in genotype space using neutral random walks: starting at a randomly chosen genotype, we perform a mutation on it. If the resulting mutant genotype belongs to the same neutral network—if it encodes the same phenotype—the mutation is accepted. The random walk continues when we mutate the new genotype again. If the mutant genotype does not belong to the neutral network, the mutation is rejected, and we try to find a new neutral neighbour for the original genotype (this process will not work if the starting genotype does not have neutral neighbours, a rare case). We performed 1000 neutral random walks of length 10 000 for genotype sizes g = 2 to g = 5 (electronic supplementary material, figure S15). At each time step t, we computed dH(g0, gt), the Hamming distance (normalized number of different positions) between the original genotype g0 and the genotype visited at time t, gt. dH(g0, gt) is a random variable for each t, and so we can compute its average and standard deviation (electronic supplementary material, figure S15). If there were no restrictions to the nodes that can be visited in a random walk, we would expect when
. In other words, if there are no restrictions, the correlation between g0 and gt is lost when t grows, and the distance between them tends to the value it would have, on average, if we randomly picked two genotypes from the network. Thus, the evolution of dH(g0, gt) is a good measure of the size and extension of neutral networks in genotype space. For g = 2, 〈dH(g0, gt)〉 ∼ 0.25 when
, implying that networks do not extend very far. Considering that the total genotype space has diameter 40, this means that the average distance between the initial genotype and the final one is close to 10. This is not a very high value, and it is consistent with our previous analysis showing that neutral networks in g = 2 tend to be fragmented and small. For g > 2, 〈dH(g0, gt)〉 ∼ 0.4 when
, which implies that the fragmented networks of g = 2 space are becoming more connected as g grows, facilitating the navigability of genotype space. This suggests that neutral networks for g > 2 span large fractions of genotype space, a result consistent with other genotype–phenotype maps.
A different way to estimate the diameter of a neutral network is to perform neutral random walks in which we force dH(gt, gt+1) > dH(gt−1, gt). That is, in addition to imposing that a mutation is neutral in order to accept it, we also require it to increase the distance to the original genotype. More specifically, the process is computed as follows. We randomly choose a genotype, and perform mutations on it, increasing the distance every time step, until this distance can increase no longer—if, after a large number of trials, we cannot find a neutral mutant that is farther apart from the original genotype, we stop the process. We will denote the final distance obtained in such random walks by d∞. For g = 2 and g = 3 we randomly sampled 10 000 genotypes, whereas for g = 4 and g = 5 we sampled 1000 genotypes (figure 2d; electronic supplementary material, figure S16). Consistent with previous results, random walks did not get very far for g = 2, reaching an average final distance 〈d∞〉 ∼ 0.28. For g > 2, the final distance d∞ increases. This result confirms the previous observation that navigability in these genotype spaces is enhanced. For g = 3, 〈d∞〉 is a little over 0.5, while for g = 4 and g = 5 it reaches 0.6 and 0.7, respectively. In fact, the growth of 〈d∞〉 with g is very similar to the growth of 〈R〉 obtained in figure 2c. In that case, we had 〈R〉 = 0.895 − 1.392/g. Here, it is 〈d∞〉 = 0.965 − 1.354/g (figure 2d). Unsurprisingly, the similarity of the fits implies a linear relationship between 〈d∞〉 and 〈R〉: 〈d∞〉 = 0.094 + 0.972〈R〉 (figure 2e), very close to the identity function. This result has several implications. First, as g grows, neutral networks are more and more connected, and they span larger fractions of genotype space. It is easier to get from one extreme of the genotype network to the other without changing the phenotype. Secondly, this increased connectivity is due to the increase in robustness: the robustness of a genotype is a good predictor for the size of the connected component it belongs to. This can be easily explained in light of our previous discussion on robustness. Adding a new gene to a genotype will endow the latter with an average of 18 new neutral mutations with which to explore genotype space (figure 2b). Because the new gene will not interfere with the phenotype with a high probability, it follows that we can mutate most of its nucleotides, one by one, getting farther away from the original genotype. In other words, new genes in
The fact that robustness is a good predictor for the size of a genotype's connected component can be combined with the positive correlation between the logarithmic abundance of a phenotype and the size of its largest connected component (electronic supplementary material, §S7) to deduce the linear relationship between the logarithm of phenotype abundance and phenotypic robustness, giving yet another heuristic argument for this relationship. We now turn to compute it explicitly.
Phenotypic robustness is defined as the average of genotypic robustness for all genotypes encoding a phenotype , that is





4. Robustness and position in genotype
Instead of considering the degree of a node in a neutral network, we can focus on the neutrality of a given position of the genotype. A genotype formed by g genes can be thought of as a binary string of length 20g—remember that genes in


Figure 4. Different positions in the genome have different neutralities. We sampled 107 genotypes for g = 2 (a) and g = 3 (b) and 103 genotypes for g = 4 (c) and g = 5 (d), and measured ri for i = 1, …, 20. For each i we then computed 〈ri〉 and plotted them versus genomic position. Note the high robustness of the first position in the promoter region, and the low robustness in the coding regions. (Online version in colour.)
Inside the promoter region, which only affects gene regulation, the first position is particularly robust. This means that the regulatory changes it induces have no phenotypic effect at the metabolic level. This may be due to two reasons: either changes in the first position of the promoter region do not affect the regulatory function—the temporal pattern of gene expression determined by the interactions among proteins—or changes in the regulatory function rarely alter the metabolic phenotype. We performed the following simple test of these hypotheses. For each position in the promoter region, we sampled 104 genotypes of size g = 3. We then mutated that position and computed the new regulatory function and the new metabolic phenotype. From all 104 mutations in the first position, 40% were neutral in both the regulatory and the metabolic sense, 54% affected the regulatory function but did not affect the metabolic phenotype, and the remaining 6% changed both—this means that the robustness for the first position in this sample was 94%. For the rest of the positions, 27% of the mutations did not alter either the regulatory function or the metabolic phenotype, 32% changed regulation but not metabolism and 41% changed both. This corresponds to a robustness of 59%, consistent with what we observed in figure 4. In other words, for the first position only 9% of the mutations that affected regulation had any effect on the metabolic phenotype. In addition, 40% of mutations did not affect the regulatory function at all. For the rest of the positions, however, the number of mutations that altered regulation, 73%, was higher. Among these, roughly 55% had an effect on phenotype as well. So both reasons posited above are at work: not only is the number of mutations affecting regulatory function lower in the first position of the promoter region, but also when these mutations do alter the regulatory function, they rarely change the phenotype.
The lower robustness of coding regions, compared to promoter regions, is correlated with a higher evolvability, as will be discussed in the next Section.
5. Accessibility and evolvability
So far, we have limited our discussion of the properties of the genotype–phenotype map in
The neutral networks of different phenotypes tend to be highly interwoven in most computational genotype–phenotype maps, so that connections between them are very common. The Vienna RNA group described a property of RNA neutral networks called shape space covering [7,9]. It implies that one can find most common phenotypes a few mutations away from any given genotype. We checked for the existence of this property in Figure 5. Evolvability in
Shape space covering means that phenotypes are easily accessible from each other through a few number of mutations. A relevant detail in the metabolic genotype–phenotype map in
Another way to study evolvability is to compute the connections between different phenotypes directly. We say that two phenotypes are connected if at least two genotypes from each phenotype are one point mutation away from each other. We can then create a network of phenotypes, whose nodes will be the phenotypes themselves, and the edges the connections between them. This network of phenotypes is undirected and weighted—the weight of an edge between two phenotypes is the sum of all point mutations connecting two genotypes encoding each phenotype. This network admits self-loops, whose weight is twice the number of edges connecting genotypes encoding a single phenotype—in other words, it is the sum of the degrees of all the nodes encoding that phenotype, which is proportional to the phenotype's robustness. For g = 2, where we can compute the whole network of genotypes with their corresponding phenotypes, we can build this phenotype network exhaustively. The network is formed by a giant component that includes 767 phenotypes. We also find six additional tiny components, five of them with just one phenotype and the remaining one with three phenotypes (electronic supplementary material, §S9). Thus, for g = 2, some phenotypes will be unreachable by point mutations from other phenotypes. For g = 3, we cannot build the phenotype network exhaustively, but resorting to a numerical approximation using random walks (electronic supplementary material, §S9), we can estimate the network of connections between the 775 phenotypes in —in order to study how the addition of one gene alters the connections between these phenotypes. The results show that all phenotypes in
now belong to one giant component (electronic supplementary material, §S9). The number of connections between phenotypes has greatly increased as well. This is again due to the additional junk genes. They do not only increase robustness, but also allow for increased connections between phenotypes.
This increased connectivity can also be measured in an alternative way. In previous work [5], evolvability has been estimated as the number of new phenotypes discovered in a neutral random walk along a neutral network. In figure 5c,d, we have performed such an analysis for 10 000 genotypes for g = 2 and g = 3. The results show that evolvability is much higher for g = 3. While the number of discovered phenotypes almost stops growing for g = 2, it grows quickly in g = 3, and to a much higher value than for g = 2. Again, this is due to the higher average number of connections between phenotypes for g = 3 (electronic supplementary material, §S9).
6. Discussion and conclusion
Throughout this article, we have explored the properties of the metabolic genotype–phenotype map in
All of these properties have been described in other genotype–phenotype maps [7–31,33,34]. This is somewhat striking, given that the genotype–phenotype map in
Two particular results stand out. The first is the log-normal distribution of phenotype abundances, which has also been observed in RNA [14] and predicted for simple combinatorial genotype–phenotype maps [34]. The second is the positive correlation between phenotypic robustness and the logarithm of phenotype abundance, which has also been described before [13,30,31,34]. The fact that these two relationships (as well as other phenomena, such as shape space covering) are so widespread points to a general property of these maps, which must be related to combinatorics and network theory. Previous work [34] has shown that, when the abundance of a phenotype can be inferred from the genotype sequence in simple genotype–phenotype maps, we can use combinatorial arguments to explain the appearance of a log-normal distribution and the linear relationship between phenotypic robustness and the logarithm of phenotype abundance. These arguments would explain the presence of these properties in the case of RNA, but do not seem to be easily translatable to
The high robustness and evolvability of the metabolic phenotype in
Alternatively, we could change the folding process in
On a related note, our results show that adding genes to
Finally, the appearance of junk genes is possibly a result of the fact that interactions in
As a final comment, note that we have not defined fitness for
Data accessibility
There are no data associated with this paper.
Authors' contributions
P.C., S.M. and J.A.C. conceived the study. P.C. performed research and data analysis. P.C., A.W., S.M. and J.A.C. discussed the results, wrote the paper and approved the final version of the manuscript.
Competing interests
We declare we have no competing interests.
Funding
P.C., S.M. and J.A.C. acknowledge support by the Spanish Ministerio de Economía y Competitividad and FEDER funds of the EU through grants VARIANCE (FIS2015-64349-P) (P.C. and J.A.C.) and ViralESS (FIS2014-57686-P) (S.M.). A.W. acknowledges support by ERC Advanced Grant 739874, by Swiss National Science Foundation grant 31003A_146137, by an EpiphysX RTD grant from SystemsX.ch, as well as by the University Priority Research Program in Evolutionary Biology at the University of Zurich. P.C. acknowledges additional support from EMBO grant ASTF 652-2014.
Acknowledgments
We acknowledge the revision of six anonymous reviewers, which has helped in improving this paper.