Stable leaders pave the way for cooperation under time-dependent exploration rates
Abstract
The exploration of different behaviours is part of the adaptation repertoire of individuals to new environments. Here, we explore how the evolution of cooperative behaviour is affected by the interplay between exploration dynamics and social learning, in particular when individuals engage on prisoner’s dilemma along the edges of a social network. We show that when the population undergoes a transition from strong to weak exploration rates a decline in the overall levels of cooperation is observed. However, if the rate of decay is lower in highly connected individuals (Leaders) than for the less connected individuals (Followers) then the population is able to achieve higher levels of cooperation. Finally, we show that minor differences in selection intensities (the degree of determinism in social learning) and individual exploration rates, can translate into major differences in the observed collective dynamics.
1. Introduction
Humans often replicate the behaviours of others that exhibit higher social fitness, and do so by means of social learning and peer influence [1,2]. However, individuals can also unilaterally adopt new behaviours, e.g. through learning errors or innovations [3]. This way, populations are capable to explore behaviours otherwise inaccessible to them [4,5]. Hence, when studying population dynamics, it is usually assumed that the evolution of individual behaviours in socio-economic systems proceeds, in many respects, in a way that mimics the fundamental forces driving its biological counterpart [6–9], that is, through a combination of both selection and mutation (exploration). Here, we study the impact of exploration dynamics in the evolution of cooperative behaviour, framed within the context of evolutionary game theory. In particular, we explore the implications, on the population-wide dynamics, of introducing a finite exploration rate, as well as how the collective dynamics is affected when a transition from strong to weak exploration rates takes place.
Past literature in evolutionary game theory typically assumes scenarios in which both selection pressure and exploration rates are constant and homogeneous across the population [10–17] some exceptions include Szolnoki et al. [18,19] and Alam et al. [20]. Arguably, a more realistic scenario would consider time varying exploration rates, which can be triggered by external shocks or internal dynamics. For instance, it is known that human learners often adopt a high exploration rate at earlier stages when facing a new environment, progressively reducing such rate in time [21], a feature which allows for a better exploration of the available strategy space [22–24]. It has been also shown that some humans are very selective about when to use information from third parties, which leads to a heterogeneous distribution of learning strategies [25]. Moreover, several findings in biology [26–28] seemingly support the idea that populations can undergo variations in mutation rates as a response to environmental pressures. Such findings naturally raise questions on the impact of variable exploration rates in the evolution of cooperative behaviour, in particular on structured populations [10,12,16,29,30] and when individuals engage in prisoner’s dilemma (PD) [6,31].
The emergent population-wide evolutionary dynamics in structured populations can be strikingly different from what one would expect from the dilemma played in well-mixed populations [11,32]. For instance, when individuals are connected through a heterogeneous social network, the dynamics associated with a PD will be transformed into a coordination dilemma game [11,32], in sharp contrast with the defection dominance dynamics related to the pay-off structure of the game and how the game is locally perceived. In that sense, and because of the above properties of interaction structures in population dynamics, these structures have been suggested as one of the fundamental evolutionary mechanisms supporting the emergence of cooperative behaviour in populations of rationally bounded individuals [33]. But what happens when we take into account exploration dynamics? Recently, it was shown that a similar, but more complex, break in symmetry between individual-level and the emergent population-wide dynamics happens [34,35]. But what does it imply to populations that might undergo time-dependent variations in exploration rates? In that context, past research provides some insights by showing that the emergence of cooperation can be favour when individual’s social learning ability lowers with ageing [36].
Several empirical works have confirmed the positive impact of population structure in the evolution of cooperation [37–39]. However, in other instances [4,40,41] it was reported that some individuals would spontaneously change their strategy to defection, leading to the collapse of cooperation. An explanation proposed for such phenomena [38,40] involves the effect of ‘mutation/exploration dynamics’ (the individuals' willingness to explore the strategic space) as it affects the creation of the strategy assortments (i.e. stable clusters of cooperators) necessary for cooperation to thrive on networks. In other words, existing and unaccounted exploration rates—which can be due to the experimental set-up, stochastic effects, inconsistent individual’s strategic preferences and the population of subjects, among others—may lead to the reported mismatch between theory and empirical evidence. Naturally, this is an effect that affects all experiments, thus its relevance. Interestingly, recent results have shown that such ‘noisy’ players can also be vital for the promotion of cooperative behaviour. By strategically placing artificial agents with noisy behaviours (i.e. high exploration rates) in different locations of a social network, it was shown that such agents would facilitate the solving of complex tasks by a population of human participants [42]. Moreover, the selection pressure—i.e. how important the game pay-off is for the individual fitness—may also have a strong influence in decision-making in networked populations [15,32,40]. Thus, our work also aims at providing insights on the interplay between exploration rates, population structure and other sources of stochasticity, such as selection pressure, that could, e.g. lead to a better understanding of experimental results.
2. Results
We start by showing that the exploration rate has a profound impact on the effective dilemma individuals face. Secondly, we show that when facing a variation in the exploration rates, from strong to weak, populations can reach undesirable outcomes in the level of cooperation—for instance, when comparing with situations where exploration dynamics is very low or absent. Such dynamical outcomes are not possible to reproduce in well-mixed and homogeneously structured populations (see Methods). We further investigate how by decoupling the rate at which exploration levels vary between Leaders (highly connected individuals) and Followers (lower connected individuals) impacts the overall levels of cooperation. We show that allowing Leaders to converge faster to low exploration rates can benefit the evolution of cooperative behaviour. For simplicity, and since the network structures are static, individuals are assigned to either a role of Leader or Follower at the start of the evolution, depending on their number of contacts (degree), and remain in that role for the entirety of the simulation.
Let us consider that interactions among individuals occur in terms of the two-person PD, where players can either cooperate or defect during an interaction. Mutual cooperation leads to a reward, R, whereas mutual defection leads to a punishment P (<R). When one player cooperates and the other defects, the associated game pay-offs are, respectively, S (sucker’s pay-off) and T (temptation). The PD game is obtained when T > R > P > S, such that mutual cooperation (R) is preferred over unilateral cooperation (S) and mutual defection (P). Nonetheless, defection tends to be chosen due to the combination of two social tensions [43]: the temptation to cheat against a cooperator (T is the best possible outcome), and due to the fear of being cheated (S is the worst possible return). The ensuing pay-off structure and further details can be found in the Methods.
Although popular, the evolutionary dynamics of a finite population in the well-mixed approximation usually constitutes a strong assumption that is seldom realized. In some simple cases, one may have populations spatially and quasi-regularly distributed [12]. More often, population structures exhibit sizable irregularities, which translate into a structure well represented by heterogeneous graphs, often exhibiting fat tails whenever populations are sufficiently large [30] and heterogeneity in what concerns the distribution of the number of interactions [44,45]. Hence, in the remainder of the manuscript we probe the impact of variable exploration levels on degree heterogeneous structured populations [11] with a scale-free degree distribution, as representative example of such heterogeneous graphs. We have generated these networks using the Barabási–Albert algorithm of growth and preferential attachment [30] (see Methods), in this way capturing some of the features of real-world networks detailed above. Given this setting, we shall now address how the emergent macroscopic dynamics characterized in structured populations is altered for different exploration rates and selection pressures.
Behavioural dynamics is modelled via social learning, through the so-called pairwise comparison rule [46]. Here, an individual i with fitness fi imitates the strategy of a randomly chosen neighbour j (with fitness fj) with a probability that increases with the fitness difference (Δf = fj − fi). Such probability can be given by the Fermi distribution , where β provides a convenient measure of the strength of natural selection and the errors associated with the social learning process. The overall dynamics in well-mixed populations (i.e. a complete graph) can be described analytically by means of the gradient of selection G(k) = T+(k) − T−(k), representing the difference between the probabilities of increasing (T+(k)) and decreasing (T−(k)) the number of cooperators by one when there are k cooperators in the population. The internal roots of G(k) provide finite-population analogues of internal fixed points in infinite populations [31,47–51]. The addition of exploration prompts the emergence of an additional internal fixed point xL that emerges and move towards x = 0.5 with increasing μ (see orange lines in figure 1). In other words, exploration dynamics trivially transforms a purely defection-dominance dilemma into a coexistence game. In such cases, the final balance of cooperators and defectors is dictated by the location of the attractor xL (a finite population analogue of a stable fixed point in infinite populations).
Figure 1. Internal roots of the average gradient of selection at g = 100 generations (see Methods) on heterogeneous structured populations as a function of the exploration probability μ. Each panel shows results for different selection pressures: (a) β = 0.01, (b) β = 0.1 and (c) β = 10.0. The yellow background denotes the region μ > μC, the value at which two internal roots (the finite-population analogues of fixed points in infinite populations) of coalesce. When μ < μC the evolutionary outcome is dictated by the coordination point (x*) (i.e. a repeller, a finite population analogue of an unstable fixed point), meaning that depending on the initial fraction of cooperators the population could be driven towards any of the two possible basins of attraction. When μ > μC, evolution drives the population towards a coexistence, identified by the location of the attractor (a finite population analogue of a stable fixed point), that is dictated by the strength of the exploration dynamics. Results for μ = 0.0 (i.e. in absence of exploration dynamics), are represented in the left border of each panel. In such case the dynamics is characterized by a single repeller (x*). Orange lines depict the location of the internal coexistence point that characterizes the evolution of well-mixed populations under exploration dynamics (see Methods for a detailed discussion of the results for well-mixed populations). Other parameters are T = 1.25, S = −0.25, Z = 103 and network average connectivity 〈k〉 = 4.
This description is, however, too simple to account for the complex dynamics emerging from structured populations [3,11,29,35,52,53]. Alternatively, we resort to the average gradient of selection, , which is a numerically computed counterpart of G(k) [11,52,53]. Unlike G(k), (where μ is the exploration rate and t is the time) is a time-dependent quantity that allows to track the self-organization of cooperators and the effective game being played at different times. Moreover, the average gradient of selection (AGoS) can be computed for arbitrary intensity of selection, population structure and game parametrization. Past works have used the AGoS to link individual to the emergent population-level dynamics, showing that different types of social networks lead to different population-wide dynamics [53].
Figure 1 shows the location of the internal roots of the average gradient of selection strongly depends on the exploration rates. We picture the value of for a broad range of (fixed) exploration rates (10−4 ≤ μ ≤ 1.0) and for the case with T = 1.25 and S = −0.25 (PD domain with R = 1 and P = 0). Each panel of figure 1 shows the results obtained for different selection pressures β (see Methods): β = 0.01 (a), β = 0.1 (b) and β = 10.0 (c).
For μ = 0.0 and β = 0.01 (figure 1a) or β = 0.1 (figure 1b) the population-wide dynamics is dominated by a single internal repeller at x* (a finite population analogue of an unstable fixed point in infinite populations), which implies the population will evolve under a coordination dynamics and be driven to either x = 1.0 or x = 0.0 depending on its initial condition. For stronger selection (β = 10, figure 1c), additional roots appear in the close vicinity of x = 1.0 and x = 0.0 which, while playing no relevant role in the present analysis, result from specific assortments of strategies which lead, in the absence of exploration, to long fixation times [32]. The coordination nature of the population-wide dynamics, observed for μ = 0.0, implies that an increasing μ leads to the emergence of two internal probability attractors near x = 1.0 (xU) and x = 0.0 (xL), both approaching x = 0.5 with increasing μ.
From figure 1, one may easily infer that, as in many other complex adaptive systems [7,54–57], evolution in structured populations portrays critical thresholds (sometimes called tipping points) at which the population-wide dynamics abruptly shifts from one regime to another. In this case, two distinct dynamical pictures result with increasing exploration probabilities (μ) which also depend on the selection pressure. For β = 0.1 (figure 1b), x* coalesces with xL, whereas for β = 0.01 and 10.0 (figure 1a and 1c) x* coalesces with xU. In the present scenario, and since for μ = 0.0 the population-wide dynamics is characterized by single repeller, with increasing μ we will always observe the emergence of two additional internal roots: xU and xL. One of these additional roots will coalesce with the repeller, the level of μ at which the coalescence happens depends on the specific conditions of the simulation. Interestingly, whenever selection is strong (figure 1c), above a critical exploration rate, evolution proceeds as if the network is absent, as we abruptly recover the well-mixed dynamical profile with a single coexistence internal point (see orange lines). For lower selection pressures (figure 1a,b), such critical exploration rate defines the point above which population structure and degree heterogeneity become detrimental to cooperation when compared with populations without any interaction structure.
Figure 1 also suggests that, unlike the scenario associated with well-mixed populations, the evolutionary outcome under variable exploration rates on structured populations will also depend on the overall selection pressure: A parametric increase of the exploration rate will drive populations towards a coexistence of strategies, regardless of both (i) initial configuration of the population and (ii) selection pressure. However, a time-dependent decrease in the exploration rate implies the demise of cooperation for a wide range of selection pressures (β < 0.1 and β > 0.5, see figure 2a). Between these domains (more precisely, 0.1 < β < 0.5) the overall dynamics is dominated by an interior attractor, already when μ ≪ 1. These results highlight the existence of an optimal selection pressure at which cooperation is maximized [32].
Figure 2. Level of cooperation under variable, time-dependent, explorations for a wide range of selection pressures (a) and decaying time scales (b) on scale-free networks. Panel (a) shows the average final fraction of cooperation under variable exploration (black discs) when compared with the level of cooperation attained in a scenario without exploration (μ = 0, orange squares). Vertical dashed lines indicate the intensity of selection (β) values used in figure 1. Panel (b) shows the average final fraction of cooperation for different decaying time scales of the exploration rate for different selection pressures. Other parameters are S = −0.25, T = 1.25, Z = 103, λ = 0.95 and 〈k〉 = 4.
Let us now induce the population to undergo a transition from strong (μ = 1.0) to weak (μ = 10−5) exploration regimes. Let us start from a random composition of strategies at μ0 = 1.0. After each period τ (with a duration of Δ generations (see Methods) the population undergoes a decrease in μ by a fraction λ < 1.0: , this process being repeated while . Figure 2a shows the level of cooperation (see Methods) obtained this way with black dots. For comparison, orange squares depict the corresponding values in the limit μ = 0. We span three orders of magnitude of values for the selection pressure β at constant λ = 0.95. Figure 2b, in turn, shows the dependence of the level of cooperation on the decaying time-scale value Δ adopted, for different selection pressures.
Clearly, variable exploration rates (as described above) will always lead to lower levels of cooperation (compared with the μ = 0 case) which, for β > 0.5, result in the complete demise of cooperation. For lower selection pressures, the stochastic effects compensate this harsher dynamical picture (also evidenced in figure 1a). Moreover, as shown in figure 2b, the decrease in cooperation becomes more pronounced for larger Δ, that is, in regimes where the exploration rates decay slowly. Finally, in line with the static results of figure 1, there is a window of hope for cooperators within a range of selection pressures (around β = 0.1) which optimizes the resilience of populations to changes of exploration rates. In what concerns the impact of different decay time scales (figure 2b), when β = 0.01 the observed level of cooperation remains stable already for Δ > 5, while for increasing selection pressures they tend to stabilize at higher values of Δ.
The previous results present somewhat grim prospects for cooperation. Regardless of whether exploration is static or time dependent, the population will always achieve a lower level of cooperation when compared with situations where no exploration takes place. However, it is important to keep in mind that, in all cases, we considered ‘uniform populations’ in which all individuals explored the fitness landscape at the same rate. This may not always be the case. Indeed, individuals’ willingness to explore can be conditional on their level of social influence, or on how many social ties they hold. In the following, we investigate the consequences of this possibility by letting individuals with different number of neighbours adopt distinct exploration rates. To this end, we split the population in two classes: Leaders who have a degree greater than 2/3 of the maximum degree of the population (highly connected individuals), and Followers. Figure 3a shows how a decoupling of the exploration rates between such two classes—Leaders and Followers—impacts the level of cooperation. In order to highlight how these results deviate from a baseline condition we show the relative increase in the level of cooperation compared with the baseline scenario where no exploration takes place (μ = 0). Results worse than the baseline are shown in black; results better than the baseline are indicated with brighter colours, the more bright the larger the positive deviation. We fix the selection pressure to an intermediate value (β ≈ 0.1). Clearly, Followers with exploration rates higher than Leaders may favour the increase of cooperation levels by as much as . This however, only occurs for intermediate selection pressure, as for both strong and weak selection, the outcomes are always worse when compared with their baseline counterpart.
Figure 3. Level of cooperation under asymmetric exploration rates. In panel (a) the Y-axis represents the exploration rates of Leaders, here defined as individuals with a degree greater or equal than 2/3 of the maximum degree in the population, while the X-axis identifies the exploration rates of Followers, individuals with a degree lower than 2/3 the maximum degree in the population. Results depict the gains in cooperation compared with a baseline scenario where no exploration takes place. Black regions indicate situations in which the level of cooperation achieved is equal or worse than the baseline, while brighter tones indicate conditions in which the level of cooperation was greater than the baseline, the brighter the greater. Panel (b) shows how the levels of cooperation depend on the balance between the exploration rates of the Leaders (0.1 ≤ λL ≤ 0.95) and the Followers (λF). In panels (a) and (b), exploration rates are constant, not time-dependent, and set at the start of each simulation. Panel (c) shows how the levels of cooperation depend on the interplay between λL, λF and selection pressure (β) when exploration rates are time-dependent (Δ = 25). Other parameters, S = −0.25, T = 1.25, Z = 103 and 〈k〉 = 4.
In the light of the previous results, it becomes relevant to investigate to which extent the levels of cooperation are sensitive to a decay of the exploration rates that also depends on the connectivity of individuals. To this end, let us assume, as before, that the population starts from uniform and strong exploration rates (μ = 1.0), but that Leaders tend to converge to low exploration rates quicker than the Followers (λL < λF). Figure 3b shows how different values of λL impact the observed level of cooperation when λF = 0.95. A nonlinear dependence is obtained, where results show special sensitivity to the overall selection pressure. Notwithstanding, there seems to be an optimal balance in the decay of the exploration rates that allow the population to reach higher levels of cooperation more consistently.
The aforementioned sensitivity to β is shown in greater detail in figure 3c, where the level of cooperation is depicted as a function of the selection pressure for three values of λL (with λF = 0.95). When this additional feature is included in the evolutionary dynamics, it becomes possible to attain levels of cooperation of 100%, provided selection pressure is optimally selected.
3. Conclusion
The present work investigates in which way both static and time-dependent exploration rates can impact the evolutionary dynamics of cooperation in heterogeneous structured populations. For static and low exploration rates, we show that the population-wide dynamics no longer resembles the one emerging from the PD game that individuals locally face. Instead, multiple internal roots of the average gradient of selection emerge: two analogues of stable fixed points (probability attractors) separated by one analogue of an unstable fixed point (probability repeller), the latter one inducing a coordination-type dynamics. We also witness the occurrence of a regime shift as we keep increasing the exploration rate, the critical transition taking place at the point where two of the internal roots coalesce and disappear, there remaining a single attractor governing the overall evolutionary dynamics. We find that the critical μ at which this transition occurs depends on the overall selection pressure.
When the population undergoes a progressive decrease of exploration μ (from strong to weak), we always observe lower cooperation levels when compared with the situation without any exploration. Similarly to the case of constant exploration rates, we show a non-trivial interplay between exploration rates and selection pressure, and identify an interval of selection pressures that optimizes the resilience of populations against defectors as μ decreases in time (figure 2).
Such a grim scenario was obtained in situations where all individuals in the population experience the same variations in exploration rates. We find, however, that the overall levels of cooperation may significantly increase provided individuals adopt exploration rates that reflect their role in the underlying social network. At the simplest level, we show that a population with stable Leaders, that is, highly connected individuals that reduce their exploration rate faster than the remainder of their network peers (the Followers), confer to populations a better chance of reaching higher levels of cooperation. For static exploration rates, we show that cooperation can now benefit from exploration (compared with no exploration) if such exploration mostly occurs at the leaves (low connected nodes) of the network. Again, this feature is also sensitive to selection pressure. These results are in accord with those recently proposed by Shirado & Christakis [42], in that the strategic placement of noisy individuals in the right location of social networks can greatly benefit the social goals of a population.
Methods
The prisoner’s dilemma
Let us assume that individuals may adopt one of two possible behaviours/strategies: to cooperate (C) or to defect (D). Each individual collects a pay-off from each interaction he/she participates in with his/her neighbours. The total pay-off gathered by an individual is computed by
Evolutionary dynamics in well-mixed populations
Let us consider a finite population of size Z and model evolution in discrete time by means of a stochastic birth–death process, in which selection is implemented by the pairwise-comparison rule [13] and strategy exploration is carried out by individuals chosen randomly from the population with a uniform probability μ. The co-evolution of both processes—selection and exploration—can be summarized as follows: at each time step a randomly selected individual, A, explores the strategy space by adopting a different strategy (selected at random) with probability μ or, with probability 1 − μ, imitates the strategy of a random neighbour, B, with probability
Barabási–Albert algorithm
We generate scale-free networks using an algorithm of growth and preferential attachment proposed by Barabási and Albert [30]. In that sense, starting with m + 1 fully connected nodes, we iteratively add a new node that connects to m = 2 pre-existing nodes proportional to their degree. The algorithm stops when the number of nodes in the network reaches the desired target (Z). Networks generated following this procedure will have an average degree of four (m × 2) and a degree distribution that follows a power-law distribution.
Social network simulations
Each simulation starts with an equal number of Cs and Ds randomly assigned to the nodes of the network. At each time step an individual, i, is selected at random from the population. With probability μ, i adopts a different strategy (exploration). With probability 1 − μ, i copies the strategy of a random neighbour j with probability given by equation (4.2) (social learning). These steps are repeated for 2.5 million generations (one generation corresponds to Z iterations), or until the population reaches an absorbing state (k = 0, or k = Z), at which moment the number of cooperators in the population is measured.
Level of cooperation, designated by η, is estimated by averaging the number of cooperators at the end of 105 independent simulations. Each simulation lasts for 5 × 103 generations and starts from a fraction x0 of Cs that are randomly distributed across the network. Given the coordination nature of the population-wide dynamics in scale-free networks, η provides a good approximation for the likelihood that a population reaches a monomorphic state dominated by Cs. Whereas for dynamics dominated by an attractor, it provides a good estimation of its location. Along the manuscript we compute η for different scenarios changing exploration levels (μ), selection pressure (β) and game parameters T and S.
Connectivity classes are used to split the population between highly connected individuals (Leaders) that have a degree greater than 2/3 of the maximum degree of the population, and lower connected individuals (Followers) that have a degree lower than 2/3 of the maximum degree of the population.
Variable exploration rates are implemented by assuming that populations start with the highest possible exploration rates, subsequently decreasing in time geometrically after every predefined time period (Δ): every Δ generations (one generation equals Z update events) the exploration rates decay by a constant factor λ. Thus, the effective exploration rate after τ time windows is of . We consider two scenarios in the manuscript, when λ is the same for all individuals and when it is different depending on the degree/connectivity class to which they belong: λL for Leaders and λF for Followers. In the former scenario we consider the case of λ = 0.95, in the latter scenario we consider scenarios where the Leaders converge to weak exploration levels faster (λL < λF).
Average gradient of selection (), captures the population-wide dynamics on structured populations and represents the numerical counterpart of . It can be conveniently estimated by computing the difference between the population averaged probability to increase () and to decrease () the number of cooperators by one when the population is in a state with k cooperators (and Z − k defectors). Formally, this quantity is thus defined [11] as
Data accessibility
Data used to create the main text Figures and support our conclusions is available for download in the Dryad Digital Repository. A comprehensive description of the standard algorithms implemented to compute the evolutionary dynamics of strategies is provided in the Dryad Digital Repository https://doi.org/10.5061/dryad.m0cfxpp15 [58].
Authors' contributions
F.L.P., J.M.P. and F.C.S. designed and implemented the research, prepared all the figures, wrote and reviewed the manuscript.; F.L.P. conducted the experiment(s).
Competing interests
We declare we have no competing interests.
Funding
This research was supported by Fundação para a Ciência e Tecnologia (FCT) through grant nos. PTDC/MAT/STA/3358/2014, PTDC/MAT-APL/6804/2020, UIDB/04050/2020, UIDB/04152/2020, UIDB/50021/2020 and PTDC/CCI-INF/7366/2020.
Acknowledgements
The authors are thankful to Vítor V. Vasconcelos and Fernando P. Santos for the useful discussions and insights.
Footnotes
References
- 1.
Fowler JH, Christakis NA . 2010Cooperative behavior cascades in human social networks. Proc. Natl Acad. Sci. USA 107, 5334-5338. (doi:10.1073/pnas.0913149107) Crossref, PubMed, ISI, Google Scholar - 2.
Christakis NA, Fowler JH . 2008The collective dynamics of smoking in a large social network. N. Engl. J. Med. 358, 2249-2258. (doi:10.1056/NEJMsa0706154) Crossref, PubMed, ISI, Google Scholar - 3.
Pinheiro FL, Santos MD, Santos FC, Pacheco JM . 2014Origin of peer influence in social networks. Phys. Rev. Lett. 112, 098702. (doi:10.1103/PhysRevLett.112.098702) Crossref, PubMed, ISI, Google Scholar - 4.
Traulsen A, Hauert C, De Silva H, Nowak MA, Sigmund K . 2009Exploration dynamics in evolutionary games. Proc. Natl Acad. Sci. USA 106, 709-712. (doi:10.1073/pnas.0808450106) Crossref, PubMed, ISI, Google Scholar - 5.
Santos FP, Pacheco JM, Santos FC . 2016Evolution of cooperation under indirect reciprocity and arbitrary exploration rates. Sci. Rep. 6, 37517. (doi:10.1038/srep37517) Crossref, PubMed, ISI, Google Scholar - 6.
Nowak MA . 2006Evolutionary dynamics. Cambridge, MA: Harvard University Press. Crossref, Google Scholar - 7.
May RM, Levin SA, Sugihara G . 2008Complex systems: ecology for bankers. Nature 451, 893-895. (doi:10.1038/451893a) Crossref, PubMed, ISI, Google Scholar - 8.
Levin S et al.2013Social-ecological systems as complex adaptive systems: modeling and policy implications. Environ. Dev. Econ. 18, 111-132. (doi:10.1017/S1355770X12000460) Crossref, ISI, Google Scholar - 9.
Hamilton WD . 1964The genetical evolution of social behaviour. II. J. Theor. Biol. 7, 17-52. (doi:10.1016/0022-5193(64)90039-6) Crossref, PubMed, ISI, Google Scholar - 10.
Santos FC, Pinheiro FL, Lenaerts T, Pacheco JM . 2012The role of diversity in the evolution of cooperation. J. Theor. Biol. 299, 88-96. (doi:10.1016/j.jtbi.2011.09.003) Crossref, PubMed, ISI, Google Scholar - 11.
Pinheiro FL, Pacheco JM, Santos FC . 2012From local to global dilemmas in social networks. PLoS ONE 7, e32114. (doi:10.1371/journal.pone.0032114) Crossref, PubMed, ISI, Google Scholar - 12.
Szabó G, Fath G . 2007Evolutionary games on graphs. Phys. Rep. 446, 97-216. (doi:10.1016/j.physrep.2007.04.004) Crossref, ISI, Google Scholar - 13.
Traulsen A, Pacheco JM, Nowak MA . 2007Pairwise comparison and selection temperature in evolutionary game dynamics. J. Theor. Biol. 246, 522-529. (doi:10.1016/j.jtbi.2007.01.002) Crossref, PubMed, ISI, Google Scholar - 14.
Gokhale CS, Iwasa Y, Nowak MA, Traulsen A . 2009The pace of evolution across fitness valleys. J. Theor. Biol. 259, 613-620. (doi:10.1016/j.jtbi.2009.04.011) Crossref, PubMed, ISI, Google Scholar - 15.
Van Segbroeck S, Santos FC, Lenaerts T, Pacheco JM . 2011Selection pressure transforms the nature of social dilemmas in adaptive networks. New J. Phys. 13, 013007. (doi:10.1088/1367-2630/13/1/013007) Crossref, ISI, Google Scholar - 16.
Perc M, Szolnoki A . 2010Coevolutionary games—a mini review. BioSystems 99, 109-125. (doi:10.1016/j.biosystems.2009.10.003) Crossref, PubMed, ISI, Google Scholar - 17.
Allen B, Traulsen A, Tarnita CE, Nowak MA . 2012How mutation affects evolutionary games on graphs. J. Theor. Biol. 299, 97-105. (doi:10.1016/j.jtbi.2011.03.034) Crossref, PubMed, ISI, Google Scholar - 18.
Szolnoki A, Vukov J, Szabó G . 2009Selection of noise level in strategy adoption for spatial social dilemmas. Phys. Rev. E 80, 056112. (doi:10.1103/PhysRevE.80.056112) Crossref, ISI, Google Scholar - 19.
Szolnoki A, Perc M . 2016Leaders should not be conformists in evolutionary social dilemmas. Sci. Rep. 6, 23633. (doi:10.1038/srep23633) Crossref, PubMed, ISI, Google Scholar - 20.
Alam M, Nagashima K, Tanimoto J . 2018Various error settings bring different noise-driven effects on network reciprocity in spatial prisoner’s dilemma. Chaos, Solitons Fractals 114, 338-346. (doi:10.1016/j.chaos.2018.07.014) Crossref, ISI, Google Scholar - 21.
Wilson RC, Geana A, White JM, Ludvig EA, Cohen JD . 2014Humans use directed and random exploration to solve the explore–exploit dilemma. J. Exp. Psychol.: General 143, 2074. (doi:10.1037/a0038199) Crossref, PubMed, ISI, Google Scholar - 22.
Kirkpatrick S, Gelatt CD, Vecchi MP . 1983Optimization by simulated annealing. Science 220, 671-680. (doi:10.1126/science.220.4598.671) Crossref, PubMed, ISI, Google Scholar - 23.
Corne D, Dorigo M, Glover F, Dasgupta D, Moscato P, Poli R, Price KV . 1999New ideas in optimization. Maidenhead, UK: McGraw-Hill Ltd. Google Scholar - 24.
Sutton RS, Barto AG . 1998Introduction to reinforcement learning, vol. 135. Cambridge, UK: MIT Press. Crossref, Google Scholar - 25.
Toyokawa W, Saito Y, Kameda T . 2017Individual differences in learning behaviours in humans: asocial exploration tendency does not predict reliance on social learning. Evol. Hum. Behav. 38, 325-333. (doi:10.1016/j.evolhumbehav.2016.11.001) Crossref, ISI, Google Scholar - 26.
Torkelson J, Harris RS, Lombardo MJ, Nagendran J, Thulin C, Rosenberg SM . 1997Genome-wide hypermutation in a subpopulation of stationary-phase cells underlies recombination-dependent adaptive mutation. EMBO J. 16, 3303-3311. (doi:10.1093/emboj/16.11.3303) Crossref, PubMed, ISI, Google Scholar - 27.
Denamur E, Matic I . 2006Evolution of mutation rates in bacteria. Mol. Microbiol. 60, 820-827. (doi:10.1111/j.1365-2958.2006.05150.x) Crossref, PubMed, ISI, Google Scholar - 28.
Swings T, Van den Bergh B, Wuyts S, Oeyen E, Voordeckers K, Verstrepen KJ, Fauvart M, Verstraeten N, Michiels J . 2017Adaptive tuning of mutation rates allows fast response to lethal stress in Escherichia coli. eLife 6, e22939. (doi:10.7554/eLife.22939) Crossref, PubMed, ISI, Google Scholar - 29.
Santos FC, Pacheco JM, Lenaerts T . 2006Evolutionary dynamics of social dilemmas in structured heterogeneous populations. Proc. Natl Acad. Sci. USA 103, 3490-3494. (doi:10.1073/pnas.0508201103) Crossref, PubMed, ISI, Google Scholar - 30.
Barabási A-L, Albert R . 1999Emergence of scaling in random networks. Science 286, 509-512. (doi:10.1126/science.286.5439.509) Crossref, PubMed, ISI, Google Scholar - 31.
Sigmund K . 2010The calculus of selfishness. Princeton, NJ: Princeton University Press. Crossref, Google Scholar - 32.
Pinheiro FL, Santos FC, Pacheco JM . 2012How selection pressure changes the nature of social dilemmas in structured populations. New J. Phys. 14, 073035. (doi:10.1088/1367-2630/14/7/073035) Crossref, ISI, Google Scholar - 33.
Nowak MA . 2006Five rules for the evolution of cooperation. Science 314, 1560-1563. (doi:10.1126/science.1133755) Crossref, PubMed, ISI, Google Scholar - 34.
Pinheiro FLP . 2016Evolution of cooperation under variable mutation rates. In Characterization of self-organization processes in complex networks, ch. 5, pp. 109–118. Braga, Portugal: Universidade do Minho. Google Scholar - 35.
Ichinose G, Satotani Y, Sayama H . 2018How mutation alters the evolutionary dynamics of cooperation on networks. New J. Phys. 20, 053049. (doi:10.1088/1367-2630/aac2a7) Crossref, ISI, Google Scholar - 36.
Szolnoki A, Perc M, Szabó G, Stark HU . 2009Impact of aging on the evolution of cooperation in the spatial prisoner’s dilemma game. Phys. Rev. E 80, 021901. (doi:10.1103/PhysRevE.80.021901) Crossref, ISI, Google Scholar - 37.
Apicella CL, Marlowe FW, Fowler JH, Christakis NA . 2012Social networks and cooperation in hunter-gatherers. Nature 481, 497. (doi:10.1038/nature10736) Crossref, PubMed, ISI, Google Scholar - 38.
Rand DG, Nowak MA, Fowler JH, Christakis NA . 2014Static network structure can stabilize human cooperation. Proc. Natl Acad. Sci. USA 111, 17 093-17 098. (doi:10.1073/pnas.1400406111) Crossref, ISI, Google Scholar - 39.
Li X, Jusup M, Wang Z, Li H, Shi L, Podobnik B, Stanley HE, Havlin S, Boccaletti S . 2018Punishment diminishes the benefits of network reciprocity in social dilemma experiments. Proc. Natl Acad. Sci. USA 115, 30-35. (doi:10.1073/pnas.1707505115) Crossref, PubMed, ISI, Google Scholar - 40.
Traulsen A, Semmann D, Sommerfeld RD, Krambeck H-J, Milinski M . 2010Human strategy updating in evolutionary games. Proc. Natl Acad. Sci. USA 107, 2962-2966. (doi:10.1073/pnas.0912515107) Crossref, PubMed, ISI, Google Scholar - 41.
Grujic J, Gracia-Lázaro C, Milinski M, Semmann D, Traulsen A, Cuesta JA, Moreno Y, Sánchez A . 2014A comparative analysis of spatial prisoner’s dilemma experiments: conditional cooperation and pay-off irrelevance. Sci. Rep. 4, 4615. (doi:10.1038/srep04615) Crossref, PubMed, ISI, Google Scholar - 42.
Shirado H, Christakis NA . 2017Locally noisy autonomous agents improve global human coordination in network experiments. Nature 545, 370. (doi:10.1038/nature22332) Crossref, PubMed, ISI, Google Scholar - 43.
Macy MW, Flache A . 2002Learning dynamics in social dilemmas. Proc. Natl Acad. Sci. USA 99, 7229-7236. (doi:10.1073/pnas.092080099) Crossref, PubMed, ISI, Google Scholar - 44.
Caldarelli G . 2007Scale-free networks: complex webs in nature and technology. New York, NY: Oxford University Press. Crossref, Google Scholar - 45.
Amaral LA, Ottino JM . 2004Complex networks. Eur. Phys. J. B: Condens. Matter Complex Syst. 38, 147-162. (doi:10.1140/epjb/e2004-00110-5) Crossref, Google Scholar - 46.
Traulsen A, Nowak MA, Pacheco JM . 2006Stochastic dynamics of invasion and fixation. Phys. Rev. E 74, 011909. (doi:10.1103/PhysRevE.74.011909) Crossref, ISI, Google Scholar - 47.
Pacheco JM, Santos FC, Souza MO, Skyrms B . 2009Evolutionary dynamics of collective action in N-person stag hunt dilemmas. Proc. R. Soc. B 276, 315-321. (doi:10.1098/rspb.2008.1126) Link, ISI, Google Scholar - 48.
Santos FC, Pacheco JM . 2011Risk of collective failure provides an escape from the tragedy of the commons. Proc. Natl Acad. Sci. USA 108, 10 421-10 425. (doi:10.1073/pnas.1015648108) Crossref, ISI, Google Scholar - 49.
Chalub FA, Souza MO . 2016Fixation in large populations: a continuous view of a discrete problem. J. Math. Biol. 72, 283-330. (doi:10.1007/s00285-015-0889-9) Crossref, PubMed, ISI, Google Scholar - 50.
Vasconcelos VV, Santos FP, Santos FC, Pacheco JM . 2017Stochastic dynamics through hierarchically embedded Markov chains. Phys. Rev. Lett. 118, 058301. (doi:10.1103/PhysRevLett.118.058301) Crossref, PubMed, ISI, Google Scholar - 51.
Hindersin L, Wu B, Traulsen A, García J . 2019Computation and simulation of evolutionary game dynamics in finite populations. Sci. Rep. 9, 1-21. (doi:10.1038/s41598-019-43102-z) Crossref, PubMed, ISI, Google Scholar - 52.
Pacheco JM, Pinheiro FL, Santos FC . 2009Population structure induces a symmetry breaking favoring the emergence of cooperation. PLoS Comput. Biol. 5, e1000596. (doi:10.1371/journal.pcbi.1000596) Crossref, PubMed, ISI, Google Scholar - 53.
Pinheiro FL, Santos FC, Pacheco JM . 2016Linking individual and collective behavior in adaptive social networks. Phys. Rev. Lett. 116, 128702. (doi:10.1103/PhysRevLett.116.128702) Crossref, PubMed, ISI, Google Scholar - 54.
Scheffer M . 2009Critical transitions in nature and society. Princeton, NJ: Princeton University Press. Crossref, Google Scholar - 55.
Scheffer M, Carpenter S, Foley JA, Folke C, Walker B . 2001Catastrophic shifts in ecosystems. Nature 413, 591-596. (doi:10.1038/35098000) Crossref, PubMed, ISI, Google Scholar - 56.
Scheffer M et al.2012Anticipating critical transitions. Science 338, 344-348. (doi:10.1126/science.1225244) Crossref, PubMed, ISI, Google Scholar - 57.
Lade SJ, Tavoni A, Levin SA, Schlüter M . 2013Regime shifts in a social-ecological system. Theor. Ecol. 6, 359-372. (doi:10.1007/s12080-013-0187-3) Crossref, ISI, Google Scholar - 58.
Pinheiro FL, Pacheco JM, Santos FC . 2021Data from: Stable leaders pave the way for cooperation under time-dependent exploration rates. Dryad Digital Repository. (doi:10.5061/dryad.m0cfxpp15) Google Scholar


