Discrete and continuous mathematical models of sharp-fronted collective cell migration and invasion
Abstract
Mathematical models describing the spatial spreading and invasion of populations of biological cells are often developed in a continuum modelling framework using reaction–diffusion equations. While continuum models based on linear diffusion are routinely employed and known to capture key experimental observations, linear diffusion fails to predict well-defined sharp fronts that are often observed experimentally. This observation has motivated the use of nonlinear degenerate diffusion; however, these nonlinear models and the associated parameters lack a clear biological motivation and interpretation. Here, we take a different approach by developing a stochastic discrete lattice-based model incorporating biologically inspired mechanisms and then deriving the reaction–diffusion continuum limit. Inspired by experimental observations, agents in the simulation deposit extracellular material, which we call a substrate, locally onto the lattice, and the motility of agents is taken to be proportional to the substrate density. Discrete simulations that mimic a two-dimensional circular barrier assay illustrate how the discrete model supports both smooth and sharp-fronted density profiles depending on the rate of substrate deposition. Coarse-graining the discrete model leads to a novel partial differential equation (PDE) model whose solution accurately approximates averaged data from the discrete model. The new discrete model and PDE approximation provide a simple, biologically motivated framework for modelling the spreading, growth and invasion of cell populations with well-defined sharp fronts. Open-source Julia code to replicate all results in this work is available on GitHub.
1. Introduction
Continuum partial differential equation (PDE) models have been used for over 40 years to model and interpret the spatial spreading, growth and invasion of populations of cells [1–3]. PDE models have been used to improve our understanding of various biological processes including wound healing [4–9], embryonic development [10–13], tissue growth [14–16] as well as disease progression, such as cancer [17–24]. For a homogeneous population of cells with density , a typical PDE model can be written as
where is the flux of cells and is a source term that can be used to model proliferation and/or cell death. Different PDE models are specified by choosing different forms of and . Within the context of modelling homogenous cell populations, the most common choice for the flux term is based on the assumption that cells move randomly [25], giving rise to linear diffusion with a flux term given by Fick’s law, , where is the cell diffusivity [3,7,8]. A standard choice for the source term is to specify a logistic term to represent carrying capacity-limited proliferation, , where is the proliferation rate and is the carrying capacity density [3,7,8]. These choices of and mean that equation (1.1) is a multi-dimensional generalization of the well-known Fisher–Kolmogorov model [26–29], which has been successfully used to interpret a number of applications including in vivo tumour progression [18], in vivo embryonic development [10], in vitro wound healing [7,8] and tissue growth [15,16].
Figure 1a shows experimental images of a simple two-dimensional in vitro cell migration experiment, called a barrier assay [30,32]. These experiments are initiated by uniformly placing approximately 30 000 fibroblast cells as a monolayer inside a circular barrier of radius 3 mm. In these experiments, cells are pre-treated with an anti-mitotic drug that prevents proliferation [33], and there is no observed cell death [30]. Accordingly, we model this experiment by setting in equation (1.1). The experiment proceeds by lifting the barrier at and observing how the population of cells spreads over time, with the rightmost image in figure 1a showing the extent to which the population has spread after days. Two key features of this experiment are immediately clear from these images: (i) the population of cells spreads symmetrically with time and (ii) the experimental image at days shows a clear well-defined sharp front at the leading edge of the population as it spreads. Images in figure 1b show a numerical solution of equation (1.1) with and the standard choice of linear diffusion, , for a typical choice of [30]. Consistent with the experiments in figure 1a , we see that the simulated population spreads symmetrically, but plotting the density along the line in the rightmost panel of figure 1b shows that we have for all which is inconsistent with the well-defined sharp fronts at the leading edge in the experimental images. This property of having for all persists for all , which is a well-known deficiency of linear diffusion [34]. Figure 1c shows a numerical solution of equation (1.1) with and a nonlinear degenerate diffusive flux, , for a typical choice of in this model [7,8,16,35–37]. Consistent with the experiments we see that the simulated population spreads symmetrically, and plotting the solution along the line in the rightmost panel of figure 1c shows that we have a well-defined sharp front: for and for , where is the front location at time . Full details of our numerical method for solving equation (1.1) are given in appendix A.

Figure 1. ( a ) Experimental images showing a population of non-proliferative fibroblast cells spreading in a two-dimensional barrier assay. The image at shows the population just as the barrier is lifted, and the image at days shows the population of migrating cells spreading symmetrically with a sharp front. Images reproduced from Simpson et al. [30] with permission. Numerical solutions of equation (1.1) with ( b ) and ( c ) . Both numerical solutions have and inside a disc of radius 3 mm, and elsewhere to match the initial distribution of cells in the experiments shown in ( a ). The numerical domain is a square of side length 10, and equation (1.1) is discretized on a uniform mesh. The numerical solution of equation (1.1) at days is given in the middle panel of ( b , c ), and the details of the density profile are shown in the rightmost panels where is plotted at (red) and days (blue). Details of the leading edge of the profiles are highlighted in the green rectangle near illustrating that the density profile in ( b ) has at all locations, whereas the density profile in ( c ) has compact support, which is consistent with the experimental images in ( a ). All values of and in ( b , c ) measure location in terms of mm to be consistent with the experimental images in ( a ). The numerical solution in ( b ) corresponds to a typical value of for linear diffusion [30], and in ( c ) we set to satisfy , which ensures that both the linear and nonlinear diffusion models lead to a similar amount of spreading over the experimental time scale [31].
The qualitative comparison between the solution of the linear diffusion equation, the nonlinear degenerate diffusion equation and the experimental images in figure 1 has been made with so that the continuum PDE model is consistent with the experiments where proliferation is suppressed. However, the difference between spreading cell fronts having sharp or smooth fronts is also relevant for models with [3,7,8]. Throughout the first part of this work, we set , noting that the difference between smooth and sharp-fronted solutions of equation (1.1) is, in general, determined by the choice of rather than . We will come back to this point in §2.5 and provide evidence to support this claim.
As mentioned above, many continuum models of homogeneous cell populations adopt a simple linear diffusive flux, , and this approximation is often made with the implicit or explicit acknowledgement that solutions of this PDE model fail to predict a well-defined front as observed experimentally. In contrast, working with the degenerate nonlinear diffusion model by setting can lead to a better match with experimental data with well-defined sharp fronts [7,8,16,36,38,39]. With this choice of flux and , equation (1.1) is also known as the porous medium equation [40–44]. Working with the degenerate nonlinear diffusion model is complicated by the fact that this model is one member of a family of models obtained by setting , where is some constant. Solutions of equation (1.1) with this more general choice of nonlinear flux also lead to symmetric spreading with a well-defined sharp front like we saw in figure 1c for all values of . These sharp-fronted solutions with compact support are similar to moving boundary problems in the sense that there is a well-defined front location with zero density, and the position of this front evolves with time which we can interpret as a model of the position of the cell front in an experiment [40–44]. The question of how to choose the value of the exponent remains unclear. For example, Sherratt & Murray [2] studied an in vivo wound healing experiment with and 4, and showed that all three choices of exponent could be used to make their reaction–diffusion PDE model match their experimental data. Later, Jin et al. [36] studied a series of in vitro scratch assays by setting and and concluded that led to the best match to their experimental data without attempting to provide a biological motivation or interpretation of this choice of . Similarly, McCue et al. [45] studied a series of two-dimensional in vitro wound closure experiments and also found that provided the best match to their experimental data. Other continuum modelling studies have simply worked with without explicitly considering other choices of the exponent [14,16,46]. In summary, a key challenge in using continuum PDE models with this generalized nonlinear degenerate diffusivity is that the exponent often acts as a fitting parameter [38], and lacks a clear biological interpretation. In addition to using these kinds of degenerate diffusion models to interpret biological observations, there is also a great deal of inherent mathematical interest in these models and their solutions [35,47].
An alternative to working with a continuum model to understand the collective spatial spreading, growth and invasion of cell populations is to work with a discrete modelling framework that considers the stochastic motion of individual cells [19,25]. Many kinds of discrete models of cell populations have been implemented to interpret experimental observations ranging from simple lattice-based models [48,49] to more complicated lattice-free [50,51] and vertex-based models [52–55]. An attractive feature of working with discrete models is that experimental images and time-lapse movies showing individual cellular-level behaviours can be translated into a set of individual rules that can be implemented with a stochastic framework to provide a high-fidelity simulation-based model capturing the key biological processes of interest [56,57]. Discrete models can be implemented to visualize snapshots of the spreading population in a way that is directly analogous to performing and imaging an experiment to reveal the positions of individual cells within the population. Another advantage of working with discrete stochastic models is that the discrete mechanism can be coarse-grained into an approximate continuum model, which means that we can encode different individual-level rules into a simulation-based model, and then convert these rules into approximate continuum PDE models, and the solution of these coarse-grained models can be compared with averaged discrete data obtained by repeated simulation [48,58–66]. As described previously, there has been a great deal of effort devoted to understanding how different forms of continuum PDE models predict smooth or sharp-fronted solution profiles; however, far less attention has been devoted to understanding what individual-level mechanisms lead to smooth or sharp fronts in discrete models of cell migration.
All mathematical models discussed so far are simple in the sense that they involve a single PDE or a single population of agents in a discrete framework that can be used to describe the spreading of a homogeneous population of cells. Of course, there are many other more complicated models of collective cell spreading that can lead to sharp-fronted solution profiles. These models include coupled reaction–diffusion models of multiple interacting cell populations [67] as well as discrete models describing multiple populations [68]. Other families of mathematical models include models that describe cell migration that involves biased movement along chemical gradients, such as chemotaxis or haptotaxis [69,70]. Here, we will focus on more fundamental mathematical models of simple homogeneous populations composed of one cell type only, and we do not explicitly consider any biased migration mechanism, such as chemotaxis or haptotaxis.
In this work, we propose a simple, biologically motivated, lattice-based discrete model of collective cell migration and proliferation. The discrete model explicitly models how individual cells in a two-dimensional in vitro experiment produce a biological substrate (e.g. biological macromolecules and extracellular material) that is deposited onto the surface of the tissue culture plate [14,71,72]. Substrate is produced at a particular rate, and deposited locally by individuals within the simulated population. Individual agents within the stochastic model undergo an unbiased random walk at a rate that is proportional to local substrate concentration, and crowding effects are incorporated by ensuring that each lattice site can be occupied by no more than a single agent. As we will demonstrate, this simple biologically inspired mechanism allows us to simulate cell spreading experiments similar to those in figure 1a . Through simulation, we first show that altering the rate of substrate deposition visually impacts the sharpness of the agent density front. A deeper mathematical understanding of these observations is obtained by coarse-graining the discrete mechanism to give a novel PDE model whose solution describes the average behaviour of the stochastic model. One way to interpret this new PDE model is that it naturally describes a linear diffusion mechanism at spatial locations well behind the leading edge of the population, as well as more complicated transport mechanisms at the leading edge of the spreading population that give rise to sharp-fronted solution profiles consistent with experimental observations. We show that averaged data from the discrete model can be very well approximated by numerical solutions of the new continuum limit PDE. In particular, both the continuum and discrete models predict the formation of sharp-fronted density profiles. A careful examination of the new continuum limit PDE model allows us to interpret how the different terms in the model lead to the formation of sharp, sometimes non-monotonic fronts. We conclude this study by incorporating a minimal model of cell proliferation into the discrete model, coarse-graining the proliferative discrete mechanism, and comparing averaged data from the discrete model with proliferation to numerical solutions of the new PDE model.
2. Results and discussion
2.1. Stochastic model and simulations
To account for crowding effects, we implement a lattice-based exclusion process where each lattice site can be either vacant or occupied by, at most, a single agent [48,58]. From this point forward, we will use the word agent to refer to individuals within the simulated population and the word cell to refer to individuals within an experimental population of biological cells. For simplicity, we implement the model on a two-dimensional square lattice with lattice spacing . Each site is indexed by , where , and each site has position . The lattice spacing is taken to be the size of a typical cell diameter [30,48]. In any single realization of the stochastic model, the occupancy of each site is a binary variable , with if the site is occupied, and if the site is vacant. Each site is also associated with a substrate concentration, which is a continuous function of time, , where is the maximum amount of substrate that can be accommodated at each lattice site. For simplicity, we write , where is the non-dimensional substrate density.
A random sequential update method is used to advance the stochastic simulations through time. If there are agents on the lattice, during the next time step of duration , agents are selected independently, at random, one at a time with replacement, and given the opportunity to move. If the chosen agent is at site , the agent will attempt to move with probability , where is the probability that an isolated agent will attempt to move during a time interval of duration . The target site for all potential motility events is selected at random from one of the four nearest neighbour lattice sites, and the potential motility event will be successful if the target site is vacant [30,48]. Once potential movement events have been attempted, the density of substrate is updated by assuming that agents deposit substrate at a rate of per time step, so that the amount of substrate at each occupied lattice site increased by an amount , taking care to ensure that the maximum non-dimensional substrate density at each site is 1. In addition to specifying initial conditions for the distribution of agents and the initial density of substrate, we must specify values of two parameters to implement the stochastic simulation algorithm: determines the motility of agents, and determines the rate of substrate deposition. With this framework, a typical cell diffusivity is given by [48].
To illustrate how the discrete model can be used to model the barrier assay in figure 1a , we perform a suite of simulations summarized in figure 2. The radius of the barrier assay is 3 mm, and a typical cell diameter is approximately 20 µm [30,48]. This means we can simulate the initial placement of cells within the barrier by taking a circular region of radius lattice sites to represent the disc enclosed by the barrier. The experiments in figure 1a are initiated by placing approximately 30 000 cells uniformly, as a monolayer, within the circular barrier. In the simulations, we have lattice sites within the simulated barrier, and we initialize the simulations by randomly populating each lattice site within the barrier with probability . With the discrete model, we can simulate a population of cells with a typical cell diffusivity of m2 h−1 [30] by choosing and h. This means that simulating 500 time steps of duration h is equivalent to 1 day in the experiment.

Figure 2. Discrete simulations illustrating the role of the substrate deposition rate . All simulations are performed on a square lattice where the lattice spacing corresponds to 20 μm making the diameter of the simulated population distribution in the left column equal to the diameter of the barrier assay at in figure 1a . Simulations are initiated by randomly occupying sites within a circular region of radius 150 lattice sites so that the expected number of agents at the beginning of the simulation is 30 000. Simulations are performed by setting and h, with values of as indicated. Results in ( a ) correspond to initializing at all lattice sites, whereas results in ( b – d ) correspond to initializing at all lattice sites. Each day of simulation corresponds to 500 time steps in the discrete model, and snapshots are reported in terms of the index of the lattice, which can be re-scaled to give the dimensional coordinates noting that . Each panel shows a dashed line indicating the initial placement of the circular barrier. Comparing the extent of the spreading populations with this dashed line gives a visual indication of the extent of spreading.
Results in figure 2a show a preliminary simulation with this initial condition where we set at all lattice sites at the beginning of the simulation. This first simulation corresponds to the simplest possible case where all lattice sites have the maximum amount of substrate present at the beginning of the experiment which means that the simulation does not depend upon the rate of deposition, . In figure 2a , we see that the population of agents spreads symmetrically, and after 3 days, we have a symmetric distribution of individuals without any clear front at the leading edge of the population. In fact, by days, we see that some agents within the simulated population become completely isolated, having spread far away from the bulk of the population as a result of chance alone. This situation is inconsistent with the experimental images in figure 1a , where we see a clear front at the leading edge of the population and a complete absence of individuals that become separated from the bulk population. Open-source Julia code to replicate these stochastic simulations is available on GitHub.
Additional simulation results in figure 2b–d involve setting up the same initial distribution of agents as in figure 2a except that we set at all lattice sites at the beginning of the simulation. These simulations in figure 2b–d are more biologically realistic than the simulations in figure 2a , because in the real experiment, cells are placed into the barriers at the beginning of the experiment without having any chance to deposit significant amounts of substrate onto the surface of the tissue culture plate before the barrier is lifted. Simulations in figure 2b–d are shown for different substrate deposition rates, . If the substrate is deposited sufficiently fast, as in figure 2b , the distribution of individual agents at days is visually indistinguishable from the case in figure 2a , where we do not observe a clear front in the spreading population. As is reduced, results in figure 2c,d show that the populations spread symmetrically with time, and now we see an increasingly well-defined sharp front as the population of agents spreads. The snapshot of individuals in figure 2d shows that after days, there are very few individual agents that are isolated away from the bulk of the population, and this distribution is consistent with the experimental observations in figure 1a .
2.2. Continuum limit partial differential equation model
We now provide a greater mathematical understanding and interpretation of the discrete simulation results in figure 2 by coarse-graining the discrete mechanism to give an approximate continuum limit description in terms of a PDE model in the form of equation (1.1). We begin by considering the average occupancy of site , where the average is constructed by considering a suite of identically prepared simulations to give
where is the binary occupancy of lattice site at time in the th identically prepared realization. With this definition, we treat as a smooth function of time, and for notational convenience, we will simply refer to this quantity as .
Under these conditions, we can write down an approximate conservation statement describing the change in average occupancy of site during the time interval from time to time [48,58]:
where, for notational convenience, we write
The first term on the right of equation (2.2) approximately describes the increase in expected occupancy of site owing to motility events that would place agents on that site. Similarly, the second term on the right of equation (2.2) approximately describes the decrease in expected occupancy of site owing to motility events associated with agents leaving site . We describe these terms as approximate as we have invoked the mean-field assumption that the average occupancies of lattice sites are independent [58]. While this assumption is clearly questionable for any particular realization of a discrete model, when we consider the expected behaviour of an ensemble of identically prepared simulations this approximation turns out to be quite accurate [48,58]. Note that setting at all lattice sites means that this conservation statement simplifies to previous discrete conservation statements that have neglected the role of the substrate [48].
To proceed to the continuum limit, we identify and with smooth functions and , respectively. Throughout this work, we associate uppercase variables with the stochastic model and lowercase variables with the continuum limit model. We expand all terms in equation (2.2) in a Taylor series about , neglecting terms of and smaller. Dividing the resulting expressions by , we take limits as and , with the ratio held constant [25] to give
where
which relates parameters in the discrete model: and , to parameters in the continuum model: and . In practice, for given values of ∆, τ, P and Γ, we will use the approximation and
The evolution equation for , equation (2.7), arises directly from our discrete model where we assume that each lattice site can occupy a maximum amount of substrate. This leads to a mechanism that is very similar to an approach that has been recently adopted to study a generalization of the well-known Fisher–KPP model where the nonlinear logistic source term is replaced with a linear saturation mechanism [73–75]. Solutions of these saturation-type models of invasion involve moving boundaries that form as a result of the saturation mechanism since this provides a natural moving boundary between regions where and . Later in §§2.3 and 2.4, we will show that equations (2.6) and (2.7) can also be interpreted as moving boundary problems in exactly the same way as [73–75].
The form of equations (2.6) and (2.7) provides insight into the population-level mechanisms encoded with the discrete model. To see this, we write the flux encoded within equation (2.6) as
Written in this way, we can now interpret how these two components of the cell flux impact the population-level outcomes. One way to interpret these terms is to note that the first term on the right of equation (2.9) is proportional to which is similar to a diffusive flux, and the second term on the right of equation (2.9) is proportional to which acts like a nonlinear advective flux. In particular, this non-linear advective flux is similar to fluxes often encountered in mathematical models of traffic flow [76].
We can also interpret how the two components of in equation (2.9) give rise to different features in the solution of the model depending on the location within a spreading population of individuals, such as the discrete populations shown in figure 2. For example, in regions that have been occupied by agents for a sufficiently long period of time, such as regions near the centre of the spreading populations in figure 2 where , locally we will eventually have and . This means that equations (2.6) and (2.7) simplify the linear diffusion equation since the nonlinear advective flux vanishes and the diffusive-like flux simplifies to Fick’s law of diffusion. In contrast, for regions that have been recently occupied by agents, such as near the leading edge of a population, we have and . Under these conditions, the diffusive flux is similar to a nonlinear diffusion term where the diffusive flux of is proportional to , which reflects the fact that agent motility in the discrete model is directly proportional to the local density of substrate. The advective-like component of the flux acts like a nonlinear advection term since the flux is proportional to [76], meaning that the advective flux vanishes when and , and is a maximum when . The direction of the nonlinear advective flux is opposite to . The nonlinear advective flux explicitly includes crowding effects encoded into the discrete model by enforcing that each lattice site can be occupied by, at most, a single agent.
2.3. Continuum–discrete comparison
We now examine how well the numerical solution of equations (2.6) and (2.7) matches averaged data from the discrete model. The experimental images and stochastic simulations in figures 1 and 2 correspond to a radially symmetric polar coordinate system, which can be described by writing equations (2.6) and (2.7) in terms of a radial coordinate system. Instead, we consider a second set of discrete simulations, shown in figure 3, where we consider a rectangular domain with a width of 300 lattice sites, and height of 20 lattice sites. Simulations are initialized by setting at all lattice sites, and uniformly occupying all sites within with agents. Reflecting boundary conditions are imposed along all boundaries to ensure that the distribution of agents remains, on average, independent of vertical position [48,59], and simulations are performed for and per time step, as shown in figure 3. Simulation results are consistent with previous results in figure 2, where we see that simulations performed with sufficiently large substrate deposition rates lead population spreading with a smooth front, without any obvious well-defined front position. Simulations with larger lead to population spreading with a visually noticeable defined front. Our main motivation for performing simulations on a rectangular-shaped lattice is that we can work with equations (2.6) and (2.7) in a one-dimensional Cartesian coordinate system [48].

Figure 3. Discrete simulations with , h, and (a–f) various values of , as indicated. All simulations are performed on a rectangular lattice of width and height . Simulations are initialized by setting at all lattice sites, and all sites with are occupied by agents. Snapshots are shown at and days. Each day of simulation corresponds to 500 time steps in the discrete model, and snapshots are reported in terms of the index of the lattice, which can be re-scaled to give the dimensional coordinates noting that .
Averaged agent density data are extracted from the simulations illustrated in figure 3 by considering identically prepared realizations of the discrete model, averaging the occupancy of each lattice site across these realizations and then further averaging the occupancy along each column of the lattice to give [48]
where is the height of the lattice. Numerical solutions of equations (2.6) and (2.7) in a one-dimensional Cartesian coordinate system are obtained for parameter values and initial data consistent with the discrete simulations. Details of the numerical method used to solve the continuum PDE model are given in appendix A. Results in figure 4 compare numerical solutions of equations (2.6) and (2.7) with averaged data from the discrete simulations, given by equation (2.10) for various values of , as indicated.

Figure 4. Averaged discrete data (dots) superimposed on numerical solutions of equations (2.6) and (2.7) (solid). Each panel compares averaged discrete data, constructed using equation (2.10) with and , with a numerical solution of equations (2.6) and (2.7). Four sets of solutions are shown for (a–d) and per time step, as indicated. Discrete simulations are initialized by occupying all lattice sites with , and with and h. Within each panel, a comparison is made at and days shown in blue, green orange and yellow, respectively, as indicated. Each day of simulation corresponds to 500 time steps in the discrete model, and snapshots are reported in terms of the index of the lattice, which can be re-scaled to give the dimensional coordinates noting that . The arrows within each panel show the direction of increasing time.
Results in figure 4 indicate that the quality of the continuum–discrete match is very good for all values of considered. For sufficiently large values of the substrate deposition rate in figure 4a,b we see that the density profiles are smooth, with no clear well-defined front location at the low-density leading edge. These results are consistent with the preliminary numerical results in figure 1a,b for the barrier assay geometry. In contrast, for sufficiently small values of the substrate deposition rate, density profiles in figure 4c,d show that we have a well-defined sharp front at the leading edge of the spreading populations. The density profiles in figure 4d indicate that the solution of equations (2.6) and (2.7) for has compact support, and the density profiles are non-monotonic with a small dip in density just behind the leading edge. Interestingly, we see the small dip in density behind the leading edge in the averaged discrete data. This indicates that the continuum limit PDE model provides an accurate approximation of the average densities from the discrete simulations. As far as we are aware this dip in density just behind the leading edge has not been measured experimentally.
2.4. Front structure
Now that we have confirmed that averaged data from the discrete model can be approximated by numerical solutions of equations (2.6) and (2.7), we will briefly describe and summarize the general features of the front structure in a simple one-dimensional Cartesian geometry analogous to the results in figure 4. This discussion of the front structure is relevant for initial conditions of the form for all , and , where H is the usual Heaviside step function so that initially we have for and for . In all cases considered we impose zero flux boundaries on at both boundaries of the one-dimensional domain. Figure 5 shows a typical solution of equations (2.6) and (2.7). This schematic solution corresponds to the most interesting case with sufficiently small that we see a clear sharp-fronted solution, and both and are discontinuous at some moving location . As discussed in §2.2, the moving boundary at arises because of the saturation mechanism governing the dynamics of in equation (2.7). This kind of moving boundary problem has been previously studied in the case of a generalized Fisher–KPP model [73–75], except that these previous investigations have not involved any discrete stochastic models, or any kind of coarse-graining to arrive at an approximate PDE model.

Figure 5. Schematic front structure. ( a ) Profiles for (green) and (yellow) within Region 1 for and Region 2 for . ( b ) Using the profiles for and in ( a ), we show the corresponding spatial distribution of (red) and (blue). ( c ) Using the profiles for and in ( a ), we show the spatial distribution of the total flux (black). The locations of and are shown with vertical dashed lines with . In ( b ), we include a horizontal dashed line at to emphasize the point that the diffusive flux changes sign at .
The schematic showing and in figure 5a motivates us to consider two regions within the solution:
— | Region 1: , where . | ||||
— | Region 2: , where . |
Ahead of Region 2 where we have , and so we consider to be the front of the solution. In Region 1, we have and , which means that the evolution of in Region 1 is governed by the linear diffusion equation and the flux of simplifies to . This simplification explains why, for this initial condition, is a monotonically decreasing function of within Region 1 because solutions of the linear diffusion equation obey a maximum principle [77].
Region 2 is characterized by having with . The interface between Region 1 and Region 2 has and , for some value . Within Region 2, the flux of is given by . The advective component of the flux, , is directed in the positive direction, which means that the flux of entering Region 2 across the interface at is partly advected in the positive direction due to the advective flux term that acts within Region 2 only. This additional advective flux in the positive direction within Region 2 explains why there can be a local minima in at . The diffusive component of the flux in Region 2, , can act in either the positive direction when or in the negative direction when . The schematic in figure 5a shows and across Regions 1 and 2. Associated schematic in figure 5b shows and , and the schematic in figure 5c shows for the and profiles in figure 5a . These plots of the fluxes show that while the total flux across both Regions 1 and 2, we see that vanishes everywhere except within Region 2, and within Region 1, but changes sign within Region 2 in this case.
Exploring numerical solutions of equations (2.6) and (2.7) indicates that the width of Region 2, , decreases with . This is both intuitively reasonable and consistent with the observations in figure 2 regarding how the structure of the front appeared to vary with the deposition rate in the discrete model. Numerical solutions of equations (2.6) and (2.7) indicate that as , we have and . Since the width of Region 2 vanishes for sufficiently large , the solution of equations (2.6) and (2.7) can be accurately approximated by the solution of the linear diffusion equation, which is independent of . Again, this outcome is consistent with the discrete simulations in figure 2, where we observed that simulations with large deposition rates were visually indistinguishable from simulations where all lattice sites were initialized with the maximum substrate concentration where the continuum limit of the discrete model is the linear diffusion equation [48].
The schematic profiles of and in figure 5 can also be interpreted in terms of the mechanisms acting in the discrete model. When agents within Region 2, close to the front, move in the positive direction to a lattice site that has never been previously occupied, that agent will experience at the new site. This means that the agent will be stationary for a period of time until that agent deposits substrate, which means that increases. While there is empty space behind that agent, for example, at site where it was previously located, the agent cannot easily move back until a sufficient amount of time has passed to build up the amount of substrate. Therefore, Region 2 within the discrete model acts as a low-motility zone where agents become momentarily stationary until sufficient substrate is produced to enable the agents to continue to move.
In addition to presenting a physical interpretation of the front structure, from both the continuum and discrete points of view summarized here, we also attempted to examine the structure of the front more formally using an interior layer analysis by identifying as a small parameter in the system of governing equations. Unfortunately, this leads to a nonlinear PDE as the problem that determines the shape of the interior layer. Since we are unable to solve the problem we did not proceed any further with this approach.
2.5. Proliferation
The experimental image in figure 1a shows a barrier assay describing the spatial spreading of a population of fibroblast cells that are pre-treated to prevent proliferation [30,33]. All subsequent discrete and continuum modelling in this work so far has focused on conservative populations without any death or proliferation mechanisms so that these simulations are consistent with the preliminary experimental observations in figure 1. In the discrete model, this is achieved by simulating a population of agents, where is a constant. In the continuum model, this is achieved by working with PDE models like equation (1.1) with . To conclude this study, we now re-examine all discrete and continuum models by incorporating a minimal proliferation mechanism motivated by the additional experimental results summarized in figure 6. The proliferation mechanism involves agent division only without any death process because there is no evidence of cell death in the experiments that motivate these simulations [30]. The leftmost image in figure 6 shows a barrier assay initialized with approximately 30 000 fibroblast cells just after the barrier is lifted at . The central image in figure 6 shows the outcome of a barrier assay where the fibroblast cells are pre-treated to suppress proliferation [30,33], and the rightmost image shows the outcome of a barrier assay that is initialized in the same way except that the fibroblast cells are not pre-treated to suppress proliferation. This means that the rightmost image in figure 6 shows the outcome of a barrier assay in which fibroblast cells are free to move and proliferate [30]. The motile and proliferative population expands symmetrically, and the leading edge of the population remains sharp. The main difference between the outcome of the barrier assays for the motile and proliferative population compared with the population where proliferation is suppressed is that cell proliferation leads to more rapid spatial expansion of the population.

Figure 6. Circular barrier assay images comparing the spatial spreading of motile and non-proliferative populations with the spatial spreading of a motile and proliferative population of fibroblast cells. The leftmost image shows a barrier assay at days just after the circular barrier is lifted. This experiment is initiated by placing approximately 30 000 fibroblast cells uniformly inside a barrier of radius 3 mm. The central image shows the spatial extent of the population at days where the cells are pre-treated to suppress proliferation. The rightmost image shows the spatial extent of the population at days where the cells are motile and proliferative. All images reproduced from Simpson et al. [30] with permission.
A minimal model of proliferation is now incorporated into the discrete model described previously in §2.1. The key difference is that previously the number of agents remained fixed during the stochastic simulations, whereas now is a non-decreasing function of time. Within each time step of the discrete model, after giving randomly selected agents an opportunity to move, we then select another agents at random, one at a time with replacement, and given the selected agents an opportunity to proliferate with probability . We take a simple approach and assume that the proliferation is independent of the local substrate density. If a selected agent is going to attempt to proliferate, the target site for the placement of the daughter agent is randomly selected from one of the four nearest neighbour lattice sites [78]. If the target site is occupied then the proliferation event is aborted owing to crowding effects, whereas if the target site is vacant a new daughter agent is placed on the target site. At the end of every time step, we update to reflect the change in total population owing to proliferation during that time step [48,78]. A set of preliminary simulations comparing results with and are given in figure 7. In these simulations, we compare the spatial spreading of 30 000 agents uniformly distributed within a circular region of diameter 3 mm. Results in figure 7a are made in the case where we set at all lattice sites at the beginning of the experiment, and we repeat the comparison for simulations with with and in figure 7b–d , respectively.

Figure 7. Discrete simulations illustrating the role of the substrate deposition rate in the spatial spreading of a population of motile agents without proliferation with the spatial spreading of a population of motile and proliferative agents. All simulations are performed on a square lattice where the lattice spacing corresponds to 20 μm making the diameter of the simulated populations in the left column equal to the diameter of the populations at in figures 1 and 6. Simulations are initiated by randomly occupying sites within a circular region of radius 150 lattice sites so that the expected number of agents at the beginning of the simulation is 30 000. Simulations of motile and non-proliferative populations correspond to , and days, with values of as indicated. Simulations of motile and proliferative populations correspond to , and days, with values of as indicated. Results in ( a ) correspond to initializing at all lattice sites, whereas results in ( b – d ) correspond to initializing at all lattice sites. Each day of simulation corresponds to 500 time steps in the discrete model, and snapshots are reported in terms of the index of the lattice, which can be re-scaled to give the dimensional coordinates noting that . Each panel shows a dashed line indicating the initial placement of the circular barrier. Comparing the extent of the spreading populations with this dashed line gives a visual indication of the extent of spreading.
Similar to the experiments in figure 6, our simulations in figure 7 show that incorporating proliferation increases the rate at which the growing populations spread and invade the surrounding area. As for the non-proliferative simulations in figure 2, we see that the front of the spreading populations is poorly defined when is sufficiently large, with a relatively diffuse distribution of agents that includes many isolated individuals that have migrated well ahead of the bulk population. In contrast, reducing leads to visually well-defined sharp fronts with a clearer boundary at the leading edge of the proliferative population. These sharper fronts contain very few isolated individual agents. Visual comparison of the proliferative and non-proliferative snapshots in figure 7c,d indicates that incorporating proliferation leads to an increasingly sharp and well-defined sharp front. These simulations indicate that having substrate-dependent motility and substrate-independent proliferation are sufficient to produce sharp and well-defined fronts in the discrete simulations.
To interpret the differences between the motile populations and the motile and proliferative populations in figure 7, we coarse-grain the discrete model by following a similar approach taken in §2.2. To proceed we write down an approximate conservation statement describing the change in average occupancy of site during the time interval from time to time :
The new term on the right of equation (2.11) approximately describes the increase in expected density of site owing to proliferation events that would place an agent on that site provided that the target site is vacant [48]. To proceed to the continuum limit, we again identify and with smooth functions and , respectively, and expand all terms in (2.2) in a Taylor series about , neglecting terms of and smaller. Dividing the resulting expression by , we take limits as and , with the ratio held constant [25] to give
where
which provides relationships between parameters in the discrete model: and , to parameters in the continuum model: , and . The additional term in equation (2.13) is simply a logistic source term with carrying capacity of unity, which reflects the fact that the occupancy of any lattice site is limited to a single agent. The numerical method we use to solve equations (2.13) and (2.14) is given in appendix A.
It is straightforward to choose parameters to mimic known biological observations. An important parameter for applying these models to biological experiments is the ratio , which compares the relative frequency of motility to proliferation events for isolated agents in regions where . Key parameters in an experiment are the cell diameter , the cell diffusivity and the proliferation rate , which is related to the cell doubling time, , by . Using equation (2.15), we have , noting that this ratio is independent of . With typical values of , and h, we have , which means setting and correspond to biologically relevant parameter values of the discrete model. One way of interpreting this choice of parameters is that the average time between proliferation events for an isolated agent is 500 times longer than the average time between motility events in regions where .
We now repeat the comparison of averaged discrete data with the solution of equations (2.13) and (2.14) in a one-dimensional Cartesian coordinate system for the same domain, initial conditions and parameter values considered previously in figure 4 except now we consider simulations that include proliferation with . The quality of the continuum–discrete match in figure 8 is very good for all values of considered. Comparing the solution profiles in figures 4 and 8 shows that the presence of proliferation over a period of 4 days increases the distance that the population front moves in the positive direction, just as we demonstrated using the stochastic model in figure 7. In addition to noting that the numerical solution of equations (2.13) and (2.14) provides a reasonable match to averaged discrete data, it is important to note that the presence of proliferation in figure 8 does not alter the trends established previously in figure 4 regarding how affects the sharpness of the front, namely that sufficiently large substrate deposition rates lead to smooth-fronted profiles whereas reduced substrate deposition rates lead to sharp-fronted profiles, with the possibility of having a non-monotonic shape.

Figure 8. Averaged discrete data (dots) superimposed on numerical solutions of equations (2.13) and (2.14) (solid). Each panel compares averaged discrete data, constructed using equation (2.10) with , , and with a numerical solution of equations (2.13) and (2.14). Four sets of solutions are shown for (a–d) and , as indicated. Within each panel, a comparison is made at and days shown in blue, green, orange and yellow, respectively, as indicated. Each day of simulation corresponds to 500 time steps in the discrete model, and snapshots are reported in terms of the index of the lattice, which can be re-scaled to give the dimensional coordinates noting that .
Visually comparing results in figures 2 and 7 provides a simple and easy-to-interpret qualitative comparison of the impact of proliferation. In contrast, comparing results in figures 4 and 8 provides a quantitative comparison of the impact of proliferation since these plots involve identical initial conditions, geometry and visualization of agent densities over the same time scales. The only difference is that figure 4 involves motility only whereas figure 8 involves combined motility and proliferation. Results in figure 8 correspond to biologically relevant parameter estimates where where the mean-field approximation is known to be accurate [48,79,80]. Other parameter choices are not sufficiently small lead to situations where the solution of the mean-field model does not match averaged data from the discrete model as explored in [48,79,80]. Should the reader wish to explore the implications of parameter choices where is not sufficiently small they may use the open-source code on GitHub to make additional continuum–discrete comparisons. In this work, we restrict our attention to biologically relevant parameter estimates where the mean-field approximation is reasonably accurate.
3. Conclusion and outlook
In this work, we have revisited the question of using continuum PDE models to study spatial spreading and invasion of populations of cells. While many continuum PDE models involve linear diffusion, solutions of these models do not have compact support, and do not replicate clearly defined fronts that are often observed experimentally. Previously, this issue has been addressed by generalizing the linear diffusion flux, , to a degenerate nonlinear diffusion flux, , where . The motivation for working with degenerate nonlinear diffusion is that the flux vanishes when and the solution of the PDE model has a well-defined sharp front that can match experimental observations [40–43]. While PDE models with this kind of degenerate nonlinear diffusion flux lead to solutions with well-defined sharp fronts, the biological motivation for these models and a biological interpretation of the exponent remain unclear. In this work, we have revisited the question of modelling spatial spreading and cellular invasion from the point of view of developing a simple lattice-based discrete model. In the discrete model, we assume that agents produce an external substrate (e.g. biomacromolecules and extracellular matrix) that is deposited locally on the lattice, and the rate of randomly directed agent migration is taken to be proportional to the density of substrate at each lattice site. We explicitly incorporate crowding effects in the discrete model by allowing each lattice site to be occupied by, at most, one single agent. This simple, biologically motivated mechanism allows us to model collective spreading and invasion with well-defined sharp fronts provided that the rate of substrate deposition is sufficiently small. Stochastic simulations that mimic the spatial spreading of cells in a two-dimensional circular barrier assay illustrate that our discrete model is capable of replicating key features of the experiment, namely symmetric spreading of the population with a well-defined sharp front at the leading edge of the population.
Coarse-graining the discrete mechanisms leads to a PDE model with a novel flux term that simplifies to linear diffusion in the bulk of the population, and has features similar to a degenerate nonlinear diffusion flux at the leading edge of the population. Importantly, these features arise within the context of a simple, biologically motivated discrete mechanism that is capable of replicating sharp-fronted density profiles and our approach does not involve specifying a degenerate nonlinear diffusivity function that is difficult to relate to biological mechanisms.
Numerical solutions of the new PDE model provide us with a computationally efficient, accurate approximation of averaged data from the stochastic model. Careful examination of the solutions of the PDE indicates that the structure of the leading edge depends upon the rate of substrate deposition. For sufficiently fast substrate deposition the substrate profile approaches a step function at the leading edge of the spreading population, and the nonlinear PDE model simplifies the linear diffusion equation. In contrast, for sufficiently slow substrate deposition the leading edge of the population behaves like a moving boundary problems where the density profile has compact support, and the shape of the density profile at the leading edge can be non-monotonic. The first set of stochastic simulations and coarse-grained PDE models presented in this work focus on conservative populations where cell proliferation and cell death are absent. To understand how the shape of the front could change when considering a proliferative population we present a second set of simulations and coarse-grained PDE models that incorporate a minimal proliferation mechanism. In this case, the coarse-grained PDE model takes the form of a reaction–diffusion model. We solve the new PDE numerically, using biologically motivated parameter values which show that numerical solutions of the PDE model match averaged data from the discrete simulations very well, and confirm that sharp-fronted density profiles occur in the presence of proliferation. In fact, for the biologically motivated parameter values considered in this work, we find that incorporating proliferation tends to sharpen the density fronts at the leading edge relative to non-proliferative stochastic simulations.
There are many options for extending the work presented in this study. One obvious avenue for exploration is to introduce additional details into the discrete model since our approach in this work is to introduce very simple mechanisms only. An interesting option for further examination would be to generalize the transition probabilities in the following way. In the current model, the transition probability for an agent undergoing a motility event from site to site is proportional to , which indicates that the transition probability is a linearly increasing function of local substrate density . An interesting extension would be to generalize the transition probability to be proportional to , where is a smooth function describing how the motility probability for individual agents depends upon the substrate density. In the context of modelling cell migration, it is natural to assume that is an increasing function. Taking the continuum limit of the discrete mechanism under these circumstances leads to
which is a generalization of setting . Returning to our initial discussions in §1, choosing , for means that the diffusive flux term in equations (3.1) and (3.2) is analogous to the flux term in the generalized porous medium equation [40,41,44]. All results presented in this study involve working with the simple choice of ; however, generating and comparing averaged discrete data with numerical solutions of equations (3.1) and (3.2) would be very interesting to explore how different choices of might impact the quality of the discrete–continuum match and the shape of the front. Another extension would be to couple the probability of proliferation in the discrete model to the substrate density. This would, in effect, introduce a substrate-dependent proliferation rate into equation (3.1). Again, the question of generating and comparing averaged discrete density data for this generalization would be interesting and a relatively straightforward extension of the current discrete and continuum modelling frameworks established in this work.
Another extension would be to examine long-time travelling wave solutions of equations (2.13) and (2.14) [3]. In the current work, we have limited our examination of this model to relatively short-time simulations of the discrete model and relatively short-time numerical solutions of the continuum limit PDE which is relevant when using these models to mimic standard experimental protocols. Standard experimental protocols examining collective cell migration and proliferation are typically limited to durations of 24 or 48 h [32,36]. This means that for a typical cell line with a doubling time of 12–24 h, these standard experimental protocols last for approximately 1–4 times the cell doubling time. This means that standard experimental protocols are perfectly suited to examine the effects of proliferation that will be evident over these typical time scales. Our theoretical comparison of averaged discrete data and the solution of the continuum limit PDE in figure 8 is relevant for such typical experimental durations since we compare the evolution of the front position over 4 days for a population with a doubling time of 18 h, which is just over five times the doubling time. Despite the fact that we have considered numerical solutions of equations (2.13) and (2.14) over time scales that are five times the doubling time. It is clear that the numerical solutions in figure 8 have not had sufficient time to approach a constant speed, constant shape travelling wave solution [3]. Therefore, taking a more theoretical point of view, it would be mathematically interesting to examine time-dependent numerical solutions of equations (2.13) and (2.14) over much longer time scales and study the resulting travelling wave behaviour as . This could be achieved by transforming the time-dependent PDE model into the travelling wave coordinate, , where is the long-time asymptotic speed of the travelling wave solutions. Properties of the solution of the resulting dynamical system could then be studied in phase space to provide information about the relationship between parameters in the continuum PDE model and the travelling wave speed and the shape of the travelling wave profile [3,72]. We leave both of these potential extensions for future consideration.
Ethics
This work did not require ethical approval from a human subject or animal welfare committee.
Data accessibility
All code and data are freely available on GitHub [81].
Declaration of AI use
We have not used AI-assisted technologies in creating this article.
Authors’ contributions
M.J.S.: conceptualization, data curation, formal analysis, funding acquisition, investigation, methodology, project administration, resources, software, supervision, visualization, writing—original draft; K.M.M.: conceptualization, formal analysis, investigation, methodology, writing—review and editing; S.W.M.: conceptualization, supervision, writing—review and editing; P.R.B.: conceptualization, supervision, writing—review and editing.
All authors gave final approval for publication and agreed to be held accountable for the work performed therein.
Conflict of interest declaration
We declare we have no competing interests.
Funding
M.J.S. and P.R.B. are supported by the Australian Research Council (DP230100025).
Acknowledgements
We thank the Faculty of Science at QUT for providing K.M.M. with a mid-year research fellowship to support this project.
Appendix A. Numerical methods
Results in figure 1 involve generating numerical solutions of
on a square domain centred at the origin with side length . To solve equation (A 1) we discretize all spatial derivative terms on a uniform square mesh with mesh spacing so that the mesh point with index is associated with location . Applying a standard central difference approximation to the spatial derivative terms in equation (1.1) at the central nodes leads to
This central difference formula is adjusted along the domain boundaries to enforce no-flux boundaries. When we apply this discretization to simulate linear diffusion we set , and when simulating nonlinear degenerate diffusion we set . This system of coupled nonlinear ordinary differential equations is solved using the DifferentialEquation.jl package in Julia, which uses automatic time-stepping routines to minimize truncation error. Results in figure 1 are obtained with , which is sufficiently small to ensure that these numerical results are grid independent.
Results in the main text include numerical solutions of equations (2.6) and (2.7) in a one-dimensional Cartesian geometry,
on . To solve equations (A 3) and (A 4), we discretize all spatial derivative terms on a uniform mesh with mesh spacing so that the th mesh point is associated with position . Applying a standard central difference approximation to the spatial derivative terms in equation (A 3) gives the following system of coupled nonlinear ordinary differential equations at the th node:
The discrete equation for , equation (A 6), holds for all mesh points because there are no spatial derivative terms in equation (A 4) and no boundary conditions need to be imposed. In contrast, the discrete equation for , equation (A 5), holds only on the interior mesh points . Applying no-flux boundary conditions at and means that we impose the constraints and , respectively. This system of coupled nonlinear ordinary differential equations is solved using the DifferentialEquation.jl package in Julia, which implements automatic time-stepping routines to control temporal truncation error. All numerical results in this work correspond to , which is sufficiently small to ensure that our numerical results are grid independent for the problems that we consider. Open source Julia code to solve equations (A 5) and (A 6) is available on GitHub. Results in this study are obtained using a Runge–Kutta method within the Tsit5 algorithm within the DifferentialEquations.jl package.