Journal of The Royal Society Interface
You have accessResearch articles

Evolution of individual versus social learning on social networks

    Abstract

    A number of studies have investigated the roles played by individual and social learning in cultural phenomena and the relative advantages of the two learning strategies in variable environments. Because social learning involves the acquisition of behaviours from others, its utility depends on the availability of ‘cultural models’ exhibiting adaptive behaviours. This indicates that social networks play an essential role in the evolution of learning. However, possible effects of social structure on the evolution of learning have not been fully explored. Here, we develop a mathematical model to explore the evolutionary dynamics of learning strategies on social networks. We first derive the condition under which social learners (SLs) are selectively favoured over individual learners in a broad range of social network. We then obtain an analytical approximation of the long-term average frequency of SLs in homogeneous networks, from which we specify the condition, in terms of three relatedness measures, for social structure to facilitate the long-term evolution of social learning. Finally, we evaluate our approximation by Monte Carlo simulations in complete graphs, regular random graphs and scale-free networks. We formally show that whether social structure favours the evolution of social learning is determined by the relative magnitudes of two effects of social structure: localization in competition, by which competition between learning strategies is evaded, and localization in cultural transmission, which slows down the spread of adaptive traits. In addition, our estimates of the relatedness measures suggest that social structure disfavours the evolution of social learning when selection is weak.

    1. Introduction

    Learning is observed in a broad range of animal taxa and is of a pinnacle importance in shaping human behaviour. The unparalleled cultural diversity in modern humans (i.e. Homo sapiens), along with their global expansion into diverse environments, could not have been possible without their prominent learning capacity [1,2]. Learning is divided into two categories: individual learning and social learning (also known as exploration and exploitation, respectively) [2,3]. Individual (or asocial) learning, such as trial and error and insight, refers to learning that occurs independently of any social influences. Social learning, including imitation, involves the transmission of traits or behaviours through interaction with other individuals, in which a characteristic exhibited by a ‘model’ is acquired by an ‘observer’. There is a wide theoretical agreement that individual and social learning are adaptations to temporally and/or spatially variable environments, as innate behaviour is sufficient to cope with a constant environment [47]. The relative advantages of individual and social learning depend on the environmental constancy: in short, unless the environment is extremely stable so that innate behaviour is most advantageous, social learners (SLs) are advantageous in a moderately variable environment because they can acquire adaptive behaviour without engaging in costly individual learning, but run a risk of carrying an inept behaviour in a highly variable environment.

    Animal societies are structured in the sense that a given individual in a society is more likely to interact with certain individuals than with others in the same society. Social structure of this kind introduces a level of complexity to animal societies, which has been hypothesized to underlie the evolution of communicative complexity across species [8] as well as larger brains in primates [9]. Societies may be structured on the basis of genetic relatedness as well as other criteria. In humans, in particular, it has been emphasized that cultural variations within and between societies play an important role in creating social structure [10]. This adds a further complexity to human societies, which might be causally related to the evolutionary origins of the human cognitive capacity [11].

    Social structure also governs the flow of information via social learning. Recent empirical studies have shown that cultural transmission in humans and other animals is affected by social network architecture [1215]. Furthermore, the relative advantage of SLs depends on from whom they acquire information [1620], which indicates that social structure should play an essential role in the evolution of learning.

    However, possible effects of social structure on the evolution of learning have not been fully explored so far with some notable exceptions [2124]. As summarized by a recent study, social or spatial structure may affect the evolution of learning in two ways [23]. First, information flow is localized in a structured population in the way that social learning occurs only between spatially or socially neighbouring individuals (i.e. localization in cultural transmission). Thus, it takes more time for any cultural trait to spread in a structured population than in a well-mixed population, which disfavours social learning by increasing the time they spend with outdated behaviours. Second, competition is also localized in a structured population, so that individuals tend to compete with those who are genetically similar to themselves (i.e. localization in competition). This benefits SLs with maladaptive behaviour by letting them circumvent competing with individual learners (ILs).

    Then, a question arises: does social structure favour or disfavour the evolution of social learning after all? Or, to put it in another way, which of the two effects of localization in a structured population plays the dominant role? The question remains open, as recent studies have produced mixed results [21,23]. In a landmark study, Rendell et al. [21] conducted a series of simulations to explore evolution of social and asocial learning in a two-dimensional, lattice-structured population [21]. They found that the long-term average frequency of SLs is typically higher when both dispersal and learning are limited within neighbouring sites than when both occur globally. This supports the notion that population structure favours the evolution of social learning. More recently, Kobayashi & Wakano [23] investigated the evolution of social and individual learning using an infinite island model, in which a population is divided into infinitely many, finite-sized islands connected by migration [23]. According to their analytical results, social learning is favoured when each island is large or the migration rate is high, suggesting that spatial subdivision disfavours the evolution of social learning, which is in stark contrast to Rendell et al. [21].

    The discrepancy between the two studies implies that whether or not social structure favours the evolution of social learning depends on the precise nature of the structure. To understand the effect of structure, therefore, it is essential to formulate the evolution of learning in a social structure that is as general as possible. In this study, we model evolutionary dynamics of learning strategies in a general social network. In our analysis, we use the concept of relatedness, a statistical measure of association between interacting individuals applicable to both genetically and culturally inherited traits [2528]. Relatedness has been applied to a variety of biological phenomena [25,2931], particularly in spatially structured populations [3234]. We first describe our evolutionary model and derive the condition for natural selection to favour the evolution of social learning. We then obtain an approximated expression of the long-term average frequency of SLs for homogeneous networks, where each individual has the same number of neighbours, and specify the condition under which social structure favours the evolution of social learning as a function of relatedness measures. We also examine the validity of our approximation by Monte Carlo simulations in three types of social structures: complete graphs, regular random graphs and scale-free networks.

    2. Model

    2.1. Description of the model

    We investigate possible effects of social structure on the evolution of social and individual learning, following the general framework developed by earlier studies [46,23]. Consider a population consisting of N individuals in a fluctuating environment, where there is only one behaviour that is adaptive to the current state of the environment and all other behaviours are equally maladaptive. The environment changes every T generations, where T measures the environmental stability. We adopt the infinite environmental state model, which assumes that the environment never reverts when it changes [46,23].

    Two learning strategies are considered: each individual is either an IL or an SL. An IL engages in costly trial-and-error learning and always discovers the correct (adaptive) behaviour. An SL imitates others' behaviour without paying the cost of individual learning, but may acquire a wrong (maladaptive) behaviour as a result. Hence, there are three phenogenotypes, IL, SL with the correct behaviour (SLC) and SL with a wrong behaviour (SLW). The relative advantages of the three phenogenotypes are measured in terms of pay-offs (table 1). The pay-off to IL is reduced to 1 − c relative to SLC, where c represents the cost owing to having a more complex nervous system or engaging in trial-and-error learning (0 < c < 1). The pay-off to SLW reduces to 1 − s relative to SLC, where s represents the cost owing to behaving maladaptively (0 < s < 1).

    Table 1.Pay-offs to three phenogenotypes.

    phenogenotypepay-off
    IL1−c
    SLC1
    SLW1−s

    We regard each individual in a population as occupying a vertex of a network whose edges represent social links between individuals. We assume that network structure is constant for the duration of the evolutionary changes considered. In what follows, individuals occupying linked vertices are referred to as neighbours.

    We consider changes in the frequencies of IL, SLC and SLW over discrete time periods. Each time period is subdivided into two phases: the reproductive phase and the learning phase. Environmental changes may occur after the reproductive phase. In the reproductive phase, all individuals are synchronously replaced by newborns, where the individual currently occupying a given vertex will be replaced either by that individual's offspring or offspring of a randomly chosen neighbour. The offspring will inherit the learning strategy of the parent. Note that although we use the term ‘reproductive’, our model is also consistent with a process without biological reproduction. In the latter interpretation, we effectively assume pay-off-biased cultural transmission, in which each individual may copy a neighbour's learning strategy depending on the relative pay-offs.

    Consider a pair of individuals competing for a focal vertex, that is, the individual currently occupying that vertex (self) and a randomly chosen neighbour (opponent). We assume that the probability, f, that the self will be replaced by the opponent's offspring or adopt the opponent's learning strategy follows the logit choice rule (also known as the Fermi rule) [3537]:

    Display Formula
    2.1
    where πself and πopp are the pay-offs to the self and the opponent, respectively, and α is the intensity of selection.

    In the learning phase, all individuals acquire correct or wrong behaviour. IL always acquires the correct behaviour, whereas SL copies the behaviour of cultural models. The cultural model of SL occupying a given vertex is chosen at random from those who occupied that vertex or the neighbouring vertices before the reproductive phase, which approximates the process of vertical and oblique transmission [1] (see the electronic supplementary material for a model assuming vertical and horizontal transmission). When an environmental change occurred, none of the pre-existing behaviours is adaptive and, as a result, SL cannot acquire the correct behaviour after an environmental change.

    2.2. Overview of the analysis

    From equation (2.1), we can derive the probability that an individual is SL after the reproductive phase, depending on phenogenotypes of its neighbours and costs of IL and SLW. Under the assumption of large population size, averaging the probability over the population gives the frequency of SL after the reproductive phase or a recursion equation for the frequency of SL in the population. From the recursion equation, we obtain the condition under which SL are selectively favoured over IL. We also identify a periodically stable state of changes in the frequency of SL to derive the long-term average frequency of SL in terms of coefficients of relatedness, which measure the association between two neighbouring individuals. See the electronic supplementary material for more details.

    2.3. Relative advantage of social learning

    Besides, we consider only homogeneous networks, where each individual has the same number of neighbours. In a homogeneous network, everybody is equally likely to be somebody's opponent. In a heterogeneous network, on the other hand, there may be hub individuals having much more neighbours than others and thus highly likely to be somebody's opponent. Suppose, for example, a network consisting of one (hub) individual and five neighbours, with each neighbour being connected only to the hub (i.e. a star graph). In this network, if the hub is SL and the others are IL, the expected frequency of SL among the opponents of a randomly chosen individual is 5/6, which is larger than the frequency of SL in the population, 1/6. Hence, in this specific case, the frequency of SL may increase simply because SL is more likely to be somebody's opponent than IL is, even if no pay-off-dependent selective process such as modelled by equation (2.1) is at work. Because our main focus here is on investigating the behaviour of the system dominated by the pay-off-dependent selective process, we limit our attention to homogeneous networks. See the electronic supplementary material for more general analysis allowing for such cases.

    We derive the condition under which SL is selectively favoured over IL under weak selection. Let λX,Y denote the probability that the self is X and the opponent is Y, where X and Y denote either IL, SL or SLW. As shown in the electronic supplementary material, SLs increase in frequency if

    Display Formula
    2.2

    Thus, social learning is more likely to become prevalent in such social structure that renders the ratio (λIL,SLW + λSLW,IL)/(λIL,SL + λSL,IL) smaller. This ratio measures the extent to which IL and SLW are socially connected in comparison with the extent of connection between IL and SL, and is likely to be small when, for example, SLW forms a cluster that is surrounded by SLC, thereby separated from clusters of IL (e.g. the edge effect in two-dimensional lattice; see [21]).

    2.4. Long-term average frequency of social learners

    Here, we derive the long-term average frequency of SL assuming weak selection and stable coexistence of the three phenogenotypes. Let p and p′ denote the frequency of SL before and after the reproductive phase, respectively. Because we assume periodic change of environment, the population will not reach an equilibrium state where IL, SLC and SLW coexist in a constant ratio. However, the long-term average of p will be stable if change in p is, on average, zero over T time periods between two consecutive environmental changes. When this condition is met, we regard the population as having reached a periodically stable state. We also denote by w the proportion of SLW among SL.

    For our purpose, we need to define three statistics, R, r and ρ, as follows. Note first that, from the symmetry between the self and opponent in homogeneous networks, we can write λIL,SL = λSL,IL = (1 − P(SL|SL))p, where P(SL|SL) is the probability that one of two neighbouring individuals is an SL under the condition that the other is an SL. This can be rewritten as λIL,SL = λSL,IL = (1 − R)(1 − p)p, where

    Display Formula
    2.3

    The coefficient R gives the standard genetic relatedness between two neighbouring individuals, which can be rewritten in terms of variances and covariances [38].

    Second, we can write λIL,SLW = λSLW,IL = (1 − r)(1 − p)pw, where

    Display Formula
    2.4

    P(SL|SLW) is the probability that one of two neighbouring individuals is an SL under the condition that the other is an SLW. Coefficient r, which measures the association between an individual's genotype and the neighbour's phenotype, has been formalized as the ‘gene-to-culture relatedness' [39].

    Finally, we introduce the following statistic

    Display Formula
    2.5
    where P(SL′|SLW) is the probability that an individual sampled after the reproduction phase is an SL under the condition that its cultural model is an SLW. Coefficient ρ measures the extent to which social learning occurs non-randomly, where positive ρ indicates that SL is more likely to imitate SLW than they would in well-mixed populations.

    To proceed further, let us assume that once a periodically stable state is reached, changes in p as well as R, r and ρ are much smaller than those in w, so that they can be regarded as constant. This assumption may be justified when selection is weak. Under this assumption, we obtain (see the electronic supplementary material)

    Display Formula
    2.6
    where the hat symbol represents long-term averages. The long-term average proportion of SLW among SL is also given by
    Display Formula
    2.7
    Note that Inline Formula in a well-mixed population.

    2.5. The effects of social structure

    The frequency of SL at a periodically stable state in a homogeneous network can be greater than that would be in a well-mixed population in that case we can conclude that social structure promotes the evolution of social learning. From equation (2.6), this is the case if

    Display Formula
    2.8

    Quantities Inline Formula and Inline Formula jointly specify characteristics of a homogeneous network, and both are one in large well-mixed populations. In structured populations, SL tends to form clusters, which render Inline Formula and Inline Formula larger. Because only IL can behave correctly after an environmental change, SL in the inside of a cluster of SL tends to acquire a maladaptive behaviour. On the contrary, SL on the edge of the cluster can acquire the correct behaviour from IL. As a result, SLW's opponent may be more likely to be SL than SLC's opponent is. This is true when Inline Formula in that case competition between SLW and IL tends to be evaded. As this effect is more prominent when Inline Formula is larger, this quantity can be regarded as a measure of localization in competition. Further, as mentioned above, Inline Formula represents the extent to which social learning occurs non-randomly, and thus Inline Formula can be considered as a measure of localization in cultural transmission. In sum, what inequality (2.8) proves is that social structure favours the evolution of social learning if the effect of localization in competition is stronger than that of localization in cultural transmission [23,24].

    2.6. Monte Carlo simulations

    To examine the validity of equations (2.6) and (2.7), we conduct a series of Monte Carlo simulations. We demonstrate that our approximations are valid for a wide range of parameter values. In fact, this turns out to be true even for heterogeneous networks and under relatively strong selection. The simulation is designed to model the same process as in our analytical model except that we now allow for mutation on the learning strategy in order to avoid the population becoming monomorphic by stochastic effect. Mutation, which switches an individual's learning strategy, is assumed to occur after the reproductive phase with probability μ = 10−5 per individual per time period. We obtain long-term average frequencies of phenogenotypes for T∈{5, 10, … 100} and α∈{1, 10}, using s = 0.1 and c = 0.02. Results with other values for s and c are described in the electronic supplementary material. Throughout our simulations, we set N = 10 000.

    We track changes in the frequencies of the phenogenotypes in three types of networks: complete graphs, regular random graphs and scale-free networks. Complete graphs, in which each of N individuals is connected to N − 1 others, are considered as a model of well-mixed populations. Regular random graphs, in which each individual has the same number, k, of neighbours chosen at random from the N − 1 others (1 < k < N − 1), represent homogeneous networks. Scale-free networks, in which the number of neighbours follows the power law with mean k, are included as an example of heterogeneous networks to evaluate whether or not our approximation derived under the assumption of homogeneous networks may be also applicable to heterogeneous networks. We generated scale-free networks based on the Barabási–Albert model [40]. In our simulations, we use k∈{8, 16}. Results presented are for k = 8 and they are not altered qualitatively by using k = 16 (see the electronic supplementary material).

    In the beginning of each simulation run, either IL or SL is randomly assigned to each individual with equal probability. Each simulation run consists of 50 000 time periods. Long-term average frequencies of the three phenogenotypes are obtained by averaging over the last 10 000 time periods and then further averaged over 100 simulation runs. For complete graphs, the 100 runs are of the same network structure. For regular random graphs and scale-free networks, the 100 runs consist of five runs for each of 20 different network structures of the same type.

    We assume that for a given network structure, R, r and ρ converge to certain values as long as selection is sufficiently weak. Accordingly, we conduct additional simulations for regular random graph and scale-free network in which selection is absent (α = 0) to obtain estimates of the three coefficients. In these simulations, environmental change occurs only once at time period 40 000, and the three coefficients for each of the 100 time periods following the environmental change are calculated. We run 1000 simulations and then obtain estimates of Inline Formula, Inline Formula and Inline Formula as the median of the 100 × 1000 values of the three coefficients. Table 2 provides the estimates thus obtained.

    Table 2.Estimates of the three relatedness measures for regular random graphs and scale-free networks.

    Inline FormulaInline FormulaInline Formula
    regular random graph0.1006860.1869130.220344
    scale-free network0.0998560.1804530.220302

    The upper panels of figure 1 compare the long-term average frequency of SL for complete graphs obtained numerically by simulations and analytically by equation (2.6). For the parameter values examined, the equation predicts the numerical results very accurately. The middle panels of figure 1 show the long-term average frequency of SL for regular random graphs obtained numerically and analytically. For a broad range of T, equation (2.6) predicts the numerical results very accurately except when T is very small. The lower panels of figure 1 compare numerical results for scale-free networks with the analytical results. Numerical results for scale-free networks are almost identical with those for regular random graphs, suggesting that equation (2.6) is also applicable to scale-free networks for the parameter values examined.

    Figure 1.

    Figure 1. The long-term average frequency of SL, plotted against the environmental stability, T. The circles represent the results by Monte Carlo simulations and the curves represent the analytical approximations. The results for well-mixed populations (a), regular random graphs (b) and scale-free networks (c), respectively. For weak (α = 1) (i) and strong (α = 10) (ii) selection, respectively. Parameter values used are s = 0.1, c = 0.02.

    The upper panels of figure 2 compare the long-term average proportions of SLW among SLs for well-mixed populations obtained analytically by equation (2.7) and numerically by Monte Carlo simulations. Under weak selection (α = 1), equation (2.7) predicts the numerical results very accurately except when T is small, whereas under strong selection (α = 10), the prediction is slightly less accurate. The discrepancies between the analytical and numerical results are smaller for intermediate values of T, for which our assumption of stable coexistence of the three phenogenotypes is more likely to hold. The middle and lower panels of figure 2 show the results for regular random graphs and scale-free networks, respectively, obtained by equation (2.7) and Monte Carlo simulations. The general trends for the two types of networks are qualitatively the same as those for well-mixed populations, although equation (2.7) is less accurate for scale-free networks when T is large even when selection is weak.

    Figure 2.

    Figure 2. The long-term average proportion of SLW among SL, plotted against the environmental stability, T. The circles represent the results by Monte Carlo simulations and the curves represent the analytical approximations. The results for well-mixed populations (a), regular random graphs (b) and scale-free networks (c), shown respectively. For weak (α = 1) (i) and strong (α = 10) (ii) selection, shown respectively. Parameter values used are: s = 0.1, c = 0.02.

    Our estimates of the relatedness measures shown in table 2 do not satisfy inequality (2.8) for regular random graphs or scale-free networks. This suggests that social structure disfavours the evolution of social learning when selection is weak.

    3. Discussion

    We have explored the evolution of learning strategies on social networks. First, we have derived the condition under which natural selection favours the evolution of social learning, as opposed to individual learning, for a broad range of social networks (equation (2.2)). Second, we have obtained an analytical approximation of the long-term average frequency of SLs in homogeneous networks (equation (2.6)), from which we have specified the condition, as a function of three relatedness measures, for social structure to promote the long-term evolution of social learning (inequality (2.8)). Third, by Monte Carlo simulations, we have confirmed that our approximation is valid for wide range of parameter values in complete graphs, regular random graphs, and scale-free networks. Finally, we have found that our estimates of the relatedness measures (table 2) do not satisfy inequality (2.8) for regular random graphs or scale-free networks, suggesting that social structure generally disfavours the evolution of social learning under weak selection.

    Our mathematical analysis has shown that the long-term average frequency of SL depends on three coefficients, R, r and ρ, that represent non-random association between phenogenotypes. It has also demonstrated that the relative magnitudes of two quantities, (1 − R)/(1 − r) and 1/(1 − ρ), which measure localization in competition and cultural transmission, respectively, determine whether the long-term average frequency of SL is larger or smaller in a homogeneously structured population than in an unstructured population. Therefore, the evolution of social learning is facilitated by the presence of such a social structure that localizes competition to a greater extent than it localizes cultural transmission.

    In a structured population, SL acquires the correct behaviour less efficiently than in a well-mixed population because the structure localizes cultural transmission. When the degree of localization in cultural transmission is strong so that the inequality in (2.8) is reversed, SL tends to compete with neighbouring IL without acquiring the correct behaviour. This results in lower long-term average frequency of SL in structured populations than in well-mixed populations. On the other hand, population structure also localizes competition in a way that competition between SL and IL tends to be evaded. More to the point, certain structures hinder competition between SLW and IL to a greater extent than they hinder competition between SLC and IL, in which case the left-hand side of inequality (2.8) exceeds 1. When the degree of localization in competition is relatively strong so that inequality (2.8) holds, SLW forms a cluster surrounded by SLC, whereby competition between SLW and IL is evaded. Some previous studies on the evolution of social learning on two-dimensional lattices have reported that the long-term average frequency of SL can be higher than those in unstructured populations, suggesting that lattice-structured populations can be more likely to favour the evolution of social learning, because the separation by SLC of SLW and IL can be readily achieved in this type of structure [21,24]. In such a structure, SL's disadvantage owing to the delay in information transmission can be more than compensated for by the advantage of readiness in surrounding clusters of ILs. The edge effect proposed by Rendell et al. is an example of this case [21].

    While most of previous studies on the evolution of individual versus social learning heavily depended on numerical simulations, Kobayashi and co-workers have provided analytical formula for the approximated long-term average frequency of SLs [23,39]. In particular, Kobayashi & Ohtsuki's [39] inclusive-fitness analysis has pointed out the importance of a balance of two types of localization caused by population structure [39]. In one sense, our analysis is less restrictive than Kobayashi & Ohtsuki's [39]: a major difference between the two studies is that while Kobayashi and Ohtsuki used Wright's island model, which assumes a population consisting of infinitely many islands, our approximate formulas have been shown to be applicable to a fairly wide range of population structure. In another sense, our analysis is more restrictive: of the three mechanisms for stable coexistence of ILs and SLs investigated by Kobayashi & Ohstuki [39], which are temporal fluctuation of environments, local population sparsity and spatial environmental heterogeneity, we focused on the first one. Our equation (2.6) is comparable with equation (2.1) in Kobayashi & Ohtuski [39]. These equations are consistent in showing the importance of R and r, whereas ρ appears only in our equation. A possible reason why an additional measure is required in our model is as follows. In Wright's island model, each individual within an island is connected with all other individuals within the island, so that cultural transmission can be considered to occur within a small-sized well-mixed population, or possible cultural models for all individuals within an island is identical. In contrast, in a general homogeneous network, possible cultural models of two individuals can be different even if they are neighbours.

    When reproduction and social learning are mediated by the same social network, as have been assumed in this study, the two types of localization tend to go hand in hand. In this case, therefore, social structure can favour or disfavour the evolution of social learning depending on details of the structure and parameter values, which may, in part, explain the discrepancy between the previous studies [21,23]. In some combinations of population structures and parameter values, localization in competition can overwhelm localization in cultural transmission [21]. It may be interesting to consider two networks simultaneously, one through which individuals update learning strategies and the other through which SLs acquire cultural traits [41,42].

    Social structure is also relevant to Rogers' paradox, which states that the mean fitness in a given population where ILs and SLs coexist at equilibrium is equal to that in the absence of SLs [43]. Rendell et al.'s [21] simulations suggested that the mean fitness at the stable coexistence of ILs and SLs tends to be lower in a lattice-structured population than in a well-mixed population, where the former harbours more SLs that the latter [21]. Kobayashi & Wakano [23] named this phenomenon strong Rogers' paradox and confirmed that the paradox also holds in their infinite island model [23]. Using equation (2.7), we derive the expectation of the mean pay-off, Inline Formula, in a periodically stable state

    Display Formula
    3.1

    Hence, the mean pay-off in a periodically stable state is on average smaller than the mean pay-off in the absence of social learning, 1 − c, if Inline Formula is met. In a well-mixed population, for R = r always holds, the mean pay-off in a periodically stable state equals the mean pay-off in the absence of social learning. In other words, equation (3.1) demonstrates that strong Rogers' paradox holds whenever social structure creates a positive effect of localization in competition.

    We have not found any significant difference between regular random graphs and scale-free networks in terms of the long-term average frequency of SL. This may seem at odds with previous studies, in particular, those that addressed the evolution of cooperation. Those studies suggested that heterogeneous networks tend to facilitate the evolution of cooperation when compared with homogeneous networks ([4446] but see [47,48]). This effect is likely to depend on the fact that in heterogeneous networks, there exist hub players who can enjoy large total pay-offs derived from a large number of interactions. Indeed, it has been pointed out that the advantage of scale-free networks in facilitating the evolution of cooperation vanishes if an individual's fitness is determined not by the total pay-off but by the averaged pay-off over interactions [49]. Because pay-offs do not accumulate over interactions in our model, our finding that homogeneous and heterogeneous networks produce almost the identical evolutionary outcomes does not necessarily contradict with the previous studies.

    Our results have some anthropological and sociological implications. Kobayashi & Ohtsuki [39], as well as this study, showed that the evolution of social learning tends to be facilitated in unstructured populations, where there are more potential social partners than in structured populations. Based on their mathematical analysis and previous findings in anthropology [9,50,51], they proposed a possible evolutionary association between the heavy reliance on social learning and large group size in modern humans. In the electronic supplementary material, we confirmed that a larger number of neighbours, k, can provide the higher frequency of SLs even in the same type of network structure (electronic supplementary material, figure S1), suggesting that Kobayashi & Ohtsuki's [39] idea can be supported by models with explicit network structure. Because these results indicate the possible existence of a feedback loop between the heavy reliance of humans on social learning and increasing group size, developing a mathematical model to consider the coevolution of sociality and learning strategies under various costs of having many social partners is a possible future direction of this study.

    Homophily, or the degree to which individuals associate with similar others, may affect the spread of innovations. Although homophilic contacts are better at achieving mutual understanding, the importance of heterophilic contacts [52] and the ‘strength of weak ties' [53] in the spread of innovations have been emphasized. In this study, equation (2.7) suggests that whenever social structure promotes localization of competition (i.e. Inline Formula), the proportion of SLW among SL is heightened than expected in a well-mixed population. This may be in accord with the idea that homophilic contacts slow down the spread of innovations.

    It has been argued that heterogeneity in a social network can affect cultural evolution in multiple ways. On the one hand, heterogeneity may promote the spread of adaptive cultural traits. Hub individuals in a heterogeneous network may have a greater influence on others than non-hub individuals, leading to one-to-many transmission, which has been considered as a key to rapid cultural evolution ([54,55] but see [56,57]). On the other hand, heterogeneous networks may also promote maladaptive cultural evolution. While adaptive aspects of culture have been emphasized, some maladaptive behaviours may be also culturally transmitted. For instance, the fertility transitions, prestige-seeking behaviours and contagious suicide have been of particular interest to anthropologists as well as social scientists [1,2,51,5866]. Learning biases and modes of cultural transmission have been deemed as factors contributing to maladaptive cultural evolution [1,2,61,63]. An interesting question is whether spread of some maladaptive behaviours can be also attributable to heterogeneous network architectures [66]. Using estimated values of Inline Formula, Inline Formula and Inline Formula for regular random graphs and scale-free networks (table 2), we compared the long-term average proportion of SLW among SL between the two types of network architectures. It turns out that they are not significantly different, suggesting that heterogeneity plays only a minor role in the spread of cultural traits. Nevertheless, it is possible that heterogeneity makes a difference when it works in coordination with certain learning biases, including the pay-off-dependent bias. The pay-off-dependent bias may play an important role in shaping structure of cultural transmission networks: real cultural transmission networks are heterogeneous and the heterogeneity may have emerged from adaptive learning biases [67,68]. Theoretical investigation on a dynamic formation of cultural transmission networks through a process in which individuals preferentially choose models to acquire their behaviours may provide better understanding about human learning bias and the nature of cultural transmission.

    Acknowledgements

    We thank three anonymous reviewers for their helpful comments.

    Funding statement

    K.T. is supported by Grant-in-Aid for JSPS fellows and CREST, JST. Y.K. is supported by Monbukagakusho grant no. 22101004.

    Footnotes

    References