Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences
You have accessResearch articles

Optimal control of epidemic spreading in the presence of social heterogeneity

G. Dimarco

G. Dimarco

Department of Mathematics and Computer Science, University of Ferrara, Ferrara, Italy

Google Scholar

Find this author on PubMed

,
G. Toscani

G. Toscani

Department of Mathematics ‘F. Casorati’, University of Pavia, Pavia, Italy

Institute for Applied Mathematics and Information Technologies (IMATI), Via Ferrata, 5/A, Pavia, Italy

Google Scholar

Find this author on PubMed

and

    Abstract

    The spread of COVID-19 has been thwarted in most countries through non-pharmaceutical interventions. In particular, the most effective measures in this direction have been the stay-at-home and closure strategies of businesses and schools. However, population-wide lockdowns are far from being optimal, carrying heavy economic consequences. Therefore, there is nowadays a strong interest in designing more efficient restrictions. In this work, starting from a recent kinetic-type model which takes into account the heterogeneity described by the social contact of individuals, we analyse the effects of introducing an optimal control strategy into the system, to limit selectively the mean number of contacts and reduce consequently the number of infected cases. Thanks to a data-driven approach, we show that this new mathematical model permits us to assess the effects of the social limitations. Finally, using the model introduced here and starting from the available data, we show the effectiveness of the proposed selective measures to dampen the epidemic trends.

    This article is part of the theme issue ‘Kinetic exchange models of societies and economies’.

    1. Introduction

    Non-pharmaceutical interventions (NPIs) have had a strong impact in limiting the spread of SARS-CoV-2 [1]. The introduced NPIs span from basic personal hygiene precautions to more severe actions such as the mandatory stay-at-home and business and schools closures.

    These measures were intended to diminish transmission rates and, to this end, aimed at reducing person-to-person contacts via social distancing. In fact, even if the precise role played by those interventions has still to be carefully understood in view of a deeper comprehension of the transmission mechanisms of the disease [2,3], it has been recognized how the social attitudes of a population are a crucial feature for the spreading of airborne diseases [4,–8]. Thus, it appears that understanding the interplay between social heterogeneity and clinical characteristics of a disease is an essential feature to design efficient and robust countermeasures in the near future.

    The classical epidemiological models [9] are usually based on a compartmentalization of the population, denoted in the following by C, in which the agents composing the system are split into epidemiologically relevant states, like susceptible, who can contract the disease, and several other classes of agents who may transmit it. Therefore, to model the transmission of a pathogen through a population, if Ni(t) represents the number of individuals in the compartment iC with |C|=n>1, one generally considers a system of differential equations that can be written in compact form as

    dN(t)dt=P(N(t)),1.1
    where N(t) is the n-dimensional vector with components Ni(t), and P is the n-dimensional vector whose components Pi(N), iC, are the transition rates between compartments, namely the variation of the number of individuals in the ith compartment due to interactions with individuals in other compartments.

    Several variations of system (1.1) have been introduced to forecast the penetration of a virus and to prevent its spread, see e.g. [9,–12]. Generalizations of the mentioned compartmental models include additional factors characterizing the transmission dynamics, such as the introduction of age structure, contacts heterogeneity and mobility, see e.g. [–4,13,15].

    In this work, we propose a kinetic approach to derive consistent transition rates in a population characterized by a heterogeneous number of contacts together with selective NPIs. Indeed, the number of social contacts has been recently identified by the scientific community as one of the main relevant causes of the potential pathogen transmission [6,–8,16]. In this respect, the NPIs imposed by the different governments were mainly based on restrictions whose scope was to reduce as much as possible the number of contacts among individuals.

    In more detail, we aim at giving a deeper understanding of the mitigation effects due to the reduction of the social interactions among individuals in the dynamics of SARS-CoV-2. This is done through a combination of a multiscale mathematical model which takes into account both the statistical information on the distribution of social contacts in a society as well as a control strategy whose target is to point the population towards a given benchmark number of contacts. The underlying theoretical framework that we consider is the one coming from the kinetic theory of collective social phenomena [17]. The diffusion of epidemics is here described as a result of the interactions between a large number of individuals, each one having a different social propensity. In this respect, as shown in Dimarco et al. [18] and Zanella et al. [19], the fundamental tools of statistical physics are capable of providing a useful insight into epidemiological dynamics linking classical compartmentalization approaches with a statistical understanding of the social aspects. In fact, the essential multiscale nature of kinetic theory permits us to determine the resulting macroscopic (or aggregate) and measurable characteristics of the evolution of the disease [19]. One of the key directions we pursue in this research is given by the combination of consolidated theoretical and modelling tools [17] with data-driven techniques. More precisely, starting from the data at our disposal, we calibrate our mathematical model passing through the determination of the relevant epidemiological parameters of the epidemic from the data. Subsequently, we concentrate on modelling the lockdown policies through an optimal control approach. In particular, we show how different kinds of control may result in very different mitigation effects, that deeply depend on the heterogeneity in the contact distribution of the population. In the last part, starting from the calibrated model at our disposal, we dedicate ourselves to the study of the effects induced by alternative control strategies.

    We stress that, even if in the rest of the paper we mainly refer our results to a compartmental model with exposed and asymptomatic, i.e. the so-called SEIAR compartmentalization, the present approach is quite general, and applies both to simple [9,–12] as well as to more complex epidemiological models [20,–22]. For this reason, we will keep the presentation as general as possible.

    2. Modelling epidemic dynamics in the presence of contact heterogeneity

    We discuss in this part a possible improvement of the classical compartmental description of epidemic spreading which takes into account the statistical description of the social behaviour of individuals. This permits us to determine the distribution of the average number of connections in a system of N interacting individuals, see [18,19], and to relate this quantity with the infection dynamics. In this new approach, agents which are supposed to belong to a given compartment in term of response to a given pathogen are additionally characterized by their daily number of contacts x0, whose statistical description is obtained through the distribution functions fi(x,t), t0, where iC denotes the ith compartment (susceptible, infected, asymptomatic, etc.), such that

    iCfi(x,t)=f(x,t),R+f(x,t)dx=N(t).
    In this setting, the time evolution of the functions fi(x,t), iC is obtained by integrating the compartmentalization epidemiological description with the simultaneous characterization of the formation of social contacts in a society [18]. This merging is expressed by the system
    f(x,t)t=P(x,f(x,t))+1τQ(f(x,t)).2.1
    As in (1.1), P is the n-dimensional vector whose components Pi(x,f), iC, are the transition rates between compartments (now depending both on the contact variable x and on the vector f(x,t)), while Q is the n-dimensional vector whose components Qi(fi), iC are suitable relaxation operators which describe the formation of equilibrium distribution of contacts of the underlying compartments. Details about this emerging equilibrium distribution, together with its main properties, will be provided in the following. Lastly, τ>0 measures the time scale at which these equilibrium distributions of social contacts are reached by relaxation.

    In general, the description of the time evolution of epidemic spreading in terms of the distributions fi(x,t), as given by (2.1), is richer than the classic one obeying (1.1). This is due to the fact that, in (2.1), the system depends on an additional variable allowing a more realistic characterization of the transition rates between compartments. In particular, by assuming that the pathogen transmission does not depend upon the contact among individuals, the proposed description (2.1) collapses to the standard approach (1.1) as shown in [18].

    In our model, the transition rates can now be assumed to be dependent on the intensity of social contacts in the compartments. As detailed later, in fact, they are obtained by taking into account, in a very natural way, the heterogeneity of the population in the form of the variance of the distributions fi(x,t) [18,23]. This point of view provides a better understanding of the role of the different individual behaviours in the spreading of a disease [5].

    In general, the knowledge of the densities fi(x,t) allows us to evaluate various macroscopic quantities, independent from the number of contacts, the so-called moments of the distributions, obtained by integration with respect to the variable xR+. In this respect, the zero-order moment gives the number of individuals Ni(t) in each compartment at time t0

    Ni(t)=R+fi(x,t)dx,iC.2.2
    In the same way, one can quantify the mean numbers x¯i(t) of contacts of individuals in each compartment at time t0 evaluating
    x¯i(t)=1Ni(t)R+xfi(x,t)dx,iC.2.3

    Let now fS, SC, indicate the contact distribution of susceptible people (people at risk of becoming infected), and let the subset IC, |I|<|C| include the compartments of people that may transmit the infection. Since the transition rate PS(x,f(x,t)) contains the details about the spreading of the infection among susceptible individuals, the new formulation (2.1) allows us in particular to characterize this transmission in terms of the intensity of social contacts between the compartments of susceptible and infectious individuals. This intensity is measured at time t0 by relating the densities fS(x,t) and fj(x,t), jI. Following [18], the transition rate PS is then expressed by introducing the function K(fS,{fj}jI), the so-called local incidence rate, given by

    K(fS,{fj}jI)(x,t)=fS(x,t)R+jIκj(x,y)fj(y,t)dy.2.4
    In (2.4), the contact functions κj(x,y) are non-negative functions growing with respect to the number of contacts x and y of the populations of susceptible and infectious agents in the compartment jI, and such that κj(x,0)=0. In particular, the choice
    κj(x,y)=βjxαyα,jI
    with constants α,βj>0 corresponds to consider an incidence rate dependent on the product of the number of contacts of susceptible and infectious people. When α=1, for example, the incidence rate takes the simpler form
    K(fS,{fj}jI)(x,t)=xfS(x,t)jIβjx¯j(t)Nj(t).2.5

    We discuss now the details of the dynamics of formation of the social contacts which are modelled by the operators Qi(fi) in (2.1). These operators are of Fokker–Planck-type with variable coefficient of diffusion and linear drift

    Qi(fi(x,t))=12x[σ2(xfi(x,t))x+μ(xx¯i(t)1)fi(x,t)],iC,2.6
    coupled with no-flux boundary conditions such that the Fokker–Planck-type equation
    tfi(x,t)=Qi(fi)(x,t)
    is mass and momentum preserving. In (2.6), σ and μ are positive constants which characterize the intensities of the diffusion and drift terms. The above description through Fokker–Planck operators is obtained thanks to an upscaling of an underlying microscopic dynamics which takes into account the typical aspects of human behaviour for what concerns the social attitudes [18]. In particular, the microscopic description takes into account the interactions due to the basic daily needs such as work duties, schools and the search, in the absence of epidemics, for opportunities for socialization. These aspects are embedded in the microscopic model by relying on the so-called prospect theory of Kahneman & Tversky [24] as detailed later.

    The kernels of the operators Qi furnish the class of equilibrium densities, that are obtained by solving the first-order differential equations

    σ(xfi(x))x+μ(xx¯i1)fi(x)=0,iC,iCNi(t)=N(t).
    It is immediate to verify that, in this setting, the equilibrium distributions fi(x) of given mass Ni and local mean x¯i are the Gamma distributions
    fi(x)=fi(x,x¯i,λ)=Ni(λx¯i)λ1Γ(λ)xλ1exp{λx¯ix},2.7
    characterized by the shape parameter λ=μ/σ. It is important to stress the role of the positive constant λ which is a measure of the heterogeneity of the population, statistically distributed according to (2.7). It is also remarkable that the above equilibrium dynamics, satisfying the condition λ>1, is in agreement with the experimental results in [6]. This study represents one of the first attempts in which a large scale experimental measure of the relation between contact and spread of infectious diseases has been done. To better characterize the equilibrium distribution, we also note that the condition on λ identifies the Gamma densities for which fi(x=0)=0. Moreover, higher-order moments of this class of distributions depend only on Ni, x¯i and λ. In particular, the local variance is expressed by
    Vi=1NiR+(xx¯i)2fi(x,x¯i,λ)dx=x¯i2λ.2.8
    Hence, expression (2.8) permits us to relate the large values of λ with small heterogeneity of the population as claimed before.

    Before introducing in the model the aspects related to the control policies, we briefly discuss how the Fokker–Planck-type operators (2.6) can be derived from collisional kinetic equations describing the evolution of social connections [25,26]. These kinetic equations are obtained by looking to a possible characterization of the personal behaviour of a single agent in terms of daily contacts starting from some well-established assumptions. Indeed, the total amount of contacts can be simply viewed as the result of a repeated upgrading: the daily life of each person is based on a certain number of activities each carrying a certain number of contacts. Moreover, independently of the personal choices or needs, this number contains a certain amount of randomness, that has to be taken into account. These assumptions give, letting x be the microscopic number of social contacts resulting from the interaction,

    x=xΦϵ(xx¯i)x+ηϵx,2.9
    with ηϵ a centred random variable such that E[ηϵ2]=ϵσ2 with bounded moments up to order three, and Φϵ() a transition function of the form
    Φϵ(s)=μeϵ(s1)1eϵ(s1)+1,s0.2.10
    The transition function has been constructed by taking into account that while is very easy to reach a high number of social contacts for need or will, going below a given threshold is very difficult, since contacts are forced by basic activities which cannot be avoided in general. This asymmetry between growth and decrease, as exhaustively discussed in [18,25,28], can be suitably modelled by (2.10) which plays the role of the so-called value function in the prospect theory of Kahneman & Tversky [24] and implements the advocated asymmetry in the contact distribution formation.

    Hence, with the above discussed choices, at the aggregate scale the evolution of a contact distribution fi,ϵ is given by the Boltzmann-type equation in weak form

    tR+φ(x)fi,ϵ(x)dx=1ϵE[R+B(x)(φ(x)φ(x))fi,ϵ(x,t)dx],2.11
    where B(x)=1/x is the interaction kernel assigning a low probability to interactions for individuals exhibiting a large number of contacts and assigning a high probability to interactions when x is small. Letting finally ϵ0+ (the limit of grazing interactions) one shows that (2.11) converges, up to extraction of a subsequence, to the Fokker–Planck equation (2.6), see e.g. [18,29]. This result is particularly useful since it permits us to characterize the equilibrium state of the system, i.e. it permits us to find the equilibrium Gamma distribution (2.7) as discussed in [18] and experimentally found in [6]. Moreover, it allows us to analytically consider and study control problems at the kinetic level, i.e. to define effective actions at the microscopic level the scope of which is to modify the behaviour of the single individuals. Subsequently, thanks to well established kinetic theory tools [17], it is possible to upscale the microscopic actions in order to compute its reflection at the aggregate, measurable, scale of description and of interest.

    (a) Observable effects of selective social restrictions

    In this part, we model the lockdown measures intended to reduce the social interactions by introducing an additive term u at the level of the microscopic formation of the contacts (2.9) to limit selectively the social activities. This gives the controlled elementary interaction

    x=xΦϵ(xx¯i)x+ϵτW(x)u+ηϵx,W0.2.12
    The function W(x) is such that if W1 the control variable u is independent from the number x0 of the social contacts, while if W(x)x the control is considered heavier for people with a higher number of contacts. Note that in (2.12), the size of the controlled variable is tuned by the small parameters ϵ and τ, which ensures that the control acts at the right scale with respect to both the grazing variable ϵ characterizing the passage from a Boltzmann to a Fokker–Planck dynamics and the relaxation variable τ characterizing the speed at which an equilibrium state for the distribution of contact is reached, compared to the speed at which the epidemic dynamics travels.

    Then, the so-called optimal control u is determined as the minimizer of a given cost functional

    u=argminuUJ(x,u),2.13
    subject to the constraint (2.12). In (2.13), the minimum is taken on the set U of all admissible controls. A classical choice for J is a quadratic cost functional of the form
    Ji(x,u)=12E[(xxT,i)2+νui2]iC,2.14
    ν>0 being a penalization coefficient measuring the economic and social cost of the control and xT,i>0 the desired target number of social contacts for the compartment iC. According to (2.14), the cost of the control increases quadratically with the distance to the desired target of average number of contacts. The presence in (2.14) of the mean operator is related to the fact that, by hypothesis, a control cannot act on the random fluctuations introduced to mimic the unanticipated daily interactions among agents. Let us observe that other convex cost functionals may be considered as well. However, they often lead to problems whose analytical solution cannot be obtained explicitly. Proceeding now as in [30], the control introduced in (2.12) can be solved explicitly giving
    u=ϵτW(x)ν+ϵτW2(x)(xxT,iΦϵ(xx¯i)x),2.15
    which generates the optimal constrained law of growth of the social contacts
    x=xνν+ϵτW2(x)Φϵ(xx¯i)xϵτν+ϵτW2(x)(xxT,i)W2(x)+ηϵx.2.16

    Now, in the presence of the above described microscopic controlled interactions, one can consider as before the construction of a kinetic collisional equation for what concerns the description of the formation of the social structure

    tR+φ(x)fi,ϵ(x)dx=1ϵE[R+(φ(x)φ(x))xfi,ϵ(x,t)dx],2.17
    where we recall that the frequency of interaction has been chosen such that B(x)=1/x. Then, thanks to the passage to the grazing limit one can show that the controlled kinetic model converges to a Fokker–Planck type equation which contains an additional drift term, that quantifies the role of the control mimicking lockdown measures. In fact, since by hypothesis ϵ1, we can write
    φ(x)φ(x)=(xx)xφ(x)+12(xx)2x2φ(x)+16(xx)3x3φ(x^),
    with x^(min(x,x),max(x,x)). Then using (2.16) one gets
    tR+φ(x)fi,ϵ(x)dx=1ϵR+νν+ϵτW2(x)Φϵ(xx¯i)xφ(x)fi,ϵ(x,t)dxR+τν+ϵτW2(x)(xxT,i)xW2(x)xφ(x)fi,ϵ(x,t)dx+σ22R+x2φ(x)xfi,ϵ(x,t)dx+RΦϵ(fi,ϵ)(x,t),
    where the remainder RΦϵ(fi,ϵ)(x,t) can be shown to go to zero in the limit ϵ0+ since by assumption φ and its derivatives are bounded in R+ and the stochastic variable η is assumed to have bounded moments at least of order 3 as previously stated. We point the reader to [17,18] for a detailed discussion on this aspect. In the limit ϵ0+, one gets
    tR+φ(x)fi(x)dx=R+Φ(xx¯i)xφ(x)fi(x,t)dxR+τν(xxT,i)xW2(x)xφ(x)fi(x,t)dx+σ22R+x2φ(x)xfi(x,t)dx2.18
    since for ϵ1 we have
    Φϵ(xx¯i)ϵμ2(xx¯i1)=ϵΦ(xx¯i).
    Integrating back by part (2.18), and considering the no flux boundary condition at x=0+ to preserve mass and momentum, the relaxation equations take the form
    fi(x,t)t=1τQi(fi(x,t))+Ci(fi(x,t)),iC,2.19
    where the operator Ci(fi) in (2.19) is given by
    Ci(fi(x,t))=1νx[xxT,ixW2(x)fi(x,t)],2.20
    and Qi(fi(x,t)) remains the same as in (2.6). Note in particular that in (2.20) the control term does not depend on the time scale τ of the Fokker–Planck operator.

    Therefore, in the presence of social restrictions modelled by the control term introduced above, system (2.1) is modified as

    f(x,t)t=P(x,f(x,t))+C(f(x,t))+1τQ(f(x,t)),2.21
    where the n-dimensional vector C quantifies the effect of the control on the epidemic spreading. According to the previous analysis, the components of the vector control C are given by the operators of type (2.20), where the values of the penalization coefficients and of the target values may depend on the specific compartment iC.

    In the next part, we focus on a specific compartmental model for which we give the details of the controlled strategy for the reduction of the social contacts.

    Remark 2.1.

    We observe that the cost functional (2.14) introduced at the level of microscopic interactions can be seen as an instantaneous approximation of a macroscopic functional obtained by considering (2.15) in the limit ϵ,τ0+ whose form is

    Ji=12R+(1+W2(x)ν)(xxT,i)2fi(x)dx,iC.
    In the above functional, we consider the contributions at the final time T and the steady distribution fi of contacts defined in (2.7) instead of the instantaneous single interaction among agents.

    (b) A SEIAR-type compartmentalization

    We consider the case C={S,E,I,A,R}, i.e. we concentrate on a model with exposed and asymptomatic compartments and we discuss the observable effects of the introduced control at the level of the mean number of connections in two leading examples.

    The choice C={S,E,I,A,R} gives, according to (2.21), the following system of kinetic equations relating the epidemiological dynamics with the number of daily contacts expressed in terms of a probability distribution fi(x,t)

    tfS(x,t)=xfS(x,t)j{I,A}βjx¯jNj(t)+CS(fS)(x,t)+1τQS(fS)(x,t),tfE(x,t)=xfS(x,t)j{I,A}βjx¯jNj(t)γEfE(x,t)+CE(fE)(x,t)+1τQE(fE)(x,t),tfI(x,t)=ξγEfE(x,t)γIfI(x,t)+CI(fI)(x,t)+1τQI(fI)(x,t),tfA(x,t)=(1ξ)γEfE(x,t)γAfA(x,t)+CA(fA)(x,t)+1τQA(fA)(x,t),andtfR(x,t)=γIfI(x,t)+γAfA(x,t)+CR(fR)(x,t)+1τQR(fR)(x,t),}2.22
    where, to have a completely definite system, it remains to clarify the role of the parameter ξ which gives the fraction of exposed individuals becoming successively infected and its counterpart (1ξ) which gives the fraction of asymptomatic individuals after the exposure to the virus. In addition, the functions γI and γA represent the recovery rates of infected and asymptomatic individuals respectively, which give the frequency at which individuals move to the compartment of recovered individuals. In the present setting, the choice of the constants βI,βA>0 takes into account that the population of the infected compartment is considered recognized by the health system through a swab test and therefore subject to isolation. We collect in the asymptomatic compartment unrecognized infections that are not subject to isolation. We highlight that we preferred to neglect the compartment of deceased people since they do not influence the spreading of the disease. Anyway, the inclusion of this new compartment to assess the risks related to the infection can easily be considered.

    Since the Qi and Ci components in (2.22), coupled with no-flux boundary conditions, are mass preserving, we can integrate the equations of the system with respect to the contact variable x to obtain a macroscopic system of equations describing the evolution of the fraction of individuals in each compartment independently of the number of daily social contacts. This gives the system

    dNSdt=x¯SNSj{I,A}βjx¯jNj,dNEdt=x¯SNSj{I,A}βjx¯jNjγENE,dNIdt=ξγENEγINI,dNAdt=(1ξ)γENEγANA,anddNRdt=γINI+γANA,}2.23
    which is not closed since the evolution in time of the vector solution N(t) depends on the mean number of contacts x¯i. Since the Qi components, under the same hypothesis on the boundary fluxes, are also momentum preserving, by multiplying the distributions fi(x,t) by x and integrating with respect to the contact variable, it is possible to recover a further macroscopic set of equations describing the constrained dynamics of the mean number of social contacts in each compartment. However, one immediately realizes that this second set of macroscopic observables depends upon higher order moments of the distributions fi(x,t). In other words, in general, the system of macroscopic equations which can be obtained by integration of (2.22) over the number of social contacts is not closed. In fact, system (2.23) depends on the time evolution of x¯i(t),i=S,E,I,A,R while the system giving the time evolution of x¯i(t),i=S,E,I,A,R depend upon second-order moments of the fi(x,t),i=S,E,I,A,R. This represents nothing else than the classical so-called closure problem often encountered in kinetic theory [17].

    However, letting τ0 forces the distributions fi(x,t) to converge towards the Gamma vector distribution of components (2.7) with mass fractions Ni(t), and local mean values x¯i(t), iC={S,E,I,A,R}. This amounts to assuming that social dynamics are much faster than epidemic dynamics. In this situation, it is possible to obtain a closed set of macroscopic equations. In particular, the second order moments of the distributions fi(x,t) can be explicitly computed in terms of the mass fractions and mean values resorting to (2.8). This gives the system

    dx¯Sdt=1λx¯S2j{I,A}βjx¯jNj+C(fS),dx¯Edt=x¯SNSNE(λ+1λx¯Sx¯E)j{I,A}βjx¯jNj+C(fE),dx¯Idt=ξγE(x¯Ex¯I)NENI+C(fI),dx¯Adt=(1ξ)γE(x¯Ex¯A)NENA+C(fA),anddx¯Adt=γI(x¯Ix¯R)NINR+γA(x¯Ax¯R)NANR+C(fR),}2.24
    where
    C(fi)(t)=1νNi(t)R+(xT,ix1)W2(x)fi(x,t)dx.2.25
    Equations (2.24) coupled with the system (2.23) describe in closed form the time evolution of an epidemic where the transition functions from one compartment to another depend upon the mean level of social contacts in the population. In addition, these latter depend on the heterogeneity of the population through the parameter λ=μ/σ characterizing the distributions fi(x,t), while the term (2.25) models the effects of the lockdown measures.

    In the rest of the paper, we focus on two specific benchmark cases for which it is possible to compute explicitly the effect of the social restrictions on the system. In particular, we first concentrate on the case of uniform restrictions, corresponding to W(x)1, and second on the case of selective restrictions. This last situation describes an agent system in which restrictions are stronger for individuals exhibiting a larger number of contacts, and it is based on the choice W(x)=x/x¯i. By direct computation, we have

    C(fi)={1ν(λλ1xT,ix¯i1)W(x)=1,1ν(1+λλx¯ixT,i)W(x)=xx¯i.2.26

    Hence, for small penalizations ν>0 of the control, the mean number of connections stabilizes towards the values

    x¯i={xT,iλλ1>xT,iW(x)=1,λ1+λxT,i<xT,iW(x)=xx¯i.
    It is worth noting that, in both situations, the role of the penalization parameter ν>0 is limited to a scaling of time, without changing the final goal of the control. Thus, a smaller penalization parameter is reflected in a quicker convergence of the solution towards the target value as expected. We also note that the selective control, i.e. the one especially operating on individuals with larger social nets, allows us to reach for the global population a value below the desired target xT. In particular, the mean contact rate is lower than xT of an amount directly proportional to the degree of heterogeneity of the population. This can be explained by the fact that individuals with larger social networks are mainly responsible for the creation of the contact dynamics. This situation suggests that it is important to act on this class of people in order to limit the spread of a disease instead of taking measures with uniform impact over the whole population.

    From an operative point of view, it is clear that the application of a uniform control, which is based on restrictive measures uniformly valid for the global population, are simpler to apply with respect to measures which provide for heavier restrictions for the categories that show a greater number of social contacts. In this direction, the previous analysis seems to suggest that a mixture of the two types of controls will present the best compromise between containment results and effective applicability of containment measures.

    3. Numerical tests

    In this section, using (2.23)–(2.24), we compare possible interventions for the containment of an epidemic based on the limitation of the mean number of contacts. These non-pharmaceutical measures are designed to drive the population towards a prescribed target dependent on the relevant epidemiological state of agents, namely xT,i, iC in the considered framework.

    To this end, we first estimate the relevant epidemiological parameters of the second order macroscopic SEIAR model taking into account the available dataset provided by John Hopkins University.1 After the calibration of the model with the data at our disposal, we assess the impact of different control strategies leading to the observed epidemic dynamics in different countries where lockdown policies have been implemented.

    We will mainly focus on the so-called first wave of infection lasting the first half of 2020 since in this period similar contact restrictions have been proposed in all Western countries. In the following, we concentrate on the cases of Italy and the UK.

    (a) Calibration of the second order model

    We consider two countries that adopted the stay-at-home policies during early stages of the infection. In table 1, we report the days of implementation of such measures together with a sketch of the timeline of the epidemic. In the second column, we highlight the begin of restrictions at the global level for each different country. Anyway, it is worth mentioning that often local restriction measures, i.e. isolation of a small portion of a territory, have been implemented as well.

    Table 1. Timeline of COVID-19 pandemic.

    country first detected case begin restrictions βI λ
    Italy 30 January 9 March 0.0042 4.9593
    UK 31 January 23 March 0.0043 5.8502

    Before proceeding with the estimation of the model parameters some remarks are deserved. First, we stress that the calibration of epidemiological models is generally difficult and requires special care at the numerical level. Furthermore, available data typically represent a lower bound with respect to the true numbers of an infection, especially in its initial phases, due to a limited testing capacity. Here, we do not consider this aspect and we refer to recent attempts to overcome this issue based on tools of uncertainty quantification [4,19] that can be successfully employed in this context. For related approaches, we mention also [31,–34].

    For the calibration of the model (2.23)–(2.24), we have adopted a bilevel approach. In particular, in the phase preceding the lockdown, we estimated the epidemic parameters in the unconstrained regime assuming that no restriction of social contacts was activated, i.e. C0. We also fixed the known clinical parameters according to values reported in the literature: see [21,35]. We then set γE=1/3.32, γI=1/10 and γA=2γI, βA=2βI and we also assumed that ξ=0.2, such that 80% of infections come from asymptomatic or paucisymptomatic individuals, in agreement with the recent serological campaigns promoted in Italy.2 The proposed choice for βA,βI>0 is coherent with existing literature, see e.g. [36].

    We then solved a least-square problem based on the minimization of the relative L2 norm of the difference between reported number of infected N^I and recovered N^R, and the theoretical evolution of the model NI(t) and NR(t) for t[t0,t] with NE(t0)=NI(t0)=NA(t0)=NR(t0)=1, i.e. we assume that in t0 the pandemic was nearly started. We also considered x¯S(t0)=x¯E(t0)=x¯A(t0)=x¯R(t0)=10, in agreement with the experimentally observed mean number of contacts in a Western country before the pandemic [6], and x¯I(t)3 for recognized infected, corresponding to the average number of family contacts. Thus once the known parameters have been fixed, we considered the following minimization problem

    minβI,λ[(1θ)||NI(t)N^I(t)||L2([t0,t])+θ||NR(t)N^R(t)||L2([t0,t])],
    where θ[0,1], ||||L2([t0,t]) the relative norm over the time horizon [t0,t] and under the constraints βI[0,1/100], λ[0,50]. In the third and fourth columns of table 1, we report the estimated epidemiological parameters obtained for θ=103 to get a better resolution of the infected dynamics. It is worth noticing that, according to these values, Italy manifests higher heterogeneity linked to the contact distribution compared to the UK.

    (b) Assessing the impact of non-pharmaceutical interventions

    Once the relevant epidemiological parameters are estimated, we focus on the subsequent lockdown phase where we look for the optimal value xT in the control term which permits us to fit the data. We assume this value xT to be equal for each compartment and we study the two cases of uniform and selective restrictions, respectively obtained by fixing W(x)=1 and W(x)=x/x¯i and C(f) of the form (2.26) in the dynamics. Hence, we solve an optimization problem in the lockdown time interval [t+1,tf], for a sequence of time steps tn over a time window of one week

    minxT(tn)R+[(1θ)||NI(t)N^I(t)||L2([tnκ,tn+κr])+θ||NR(t)N^R(t)||L2([tnκ,tn+κr])],
    with κ=3, κr=4.

    In figure 1, we show the obtained trends for the registered cases for Italy and the UK. We may observe how for both controls the macroscopic trends are coherent with the observed ones. Therefore, the adopted fitting procedure allows us to compare the estimated target xT,i for mean contacts in the considered control strategies. In particular, we observe how the uniform control implies a stronger contact reduction with respect to the selective one, while a similar cost, computed as the sum JS+JE+JA+JR, is produced by the two containment strategies. In other words, the adoption of selective-type containment strategies based on the average number of contacts allows us to mitigate the severity of lockdown policies.

    Figure 1.

    Figure 1. (a,b) obtained fitting of the observed number of infected (labelled with Data) together with estimated number of exposed and asymptomatic cases. (c,d) Estimated target number of contacts xT. (e,f) Evolution of the sum of cost functionals JS+JE+JA+JR. In all the presented tests, we compared the cases of uniform control (W(x)=1) and selective control (W(x)=x/x¯i, iC). (Online version in colour.)

    In the previous part, we focused on getting the target mean number of contacts for both the control strategies. We remark that the NPIs implemented by both countries have been mainly uniform in the first wave of the epidemic, meaning that, in the present model framework, the average number of contacts were approximately the ones defined by the red line with circle markers in the second row of figure 1. Hence, it is interesting to consider a retrospective analysis where the estimated xT,i, iC, is instead implemented in the dynamics with selective control. In figure 2, we present the results of such a study, where the evolution of the disease in the presence of selective control with target given by the uniform control is reported. It is evident how the amplitude of the epidemic peak is strongly reduced and the fraction of asymptomatic cases is under control in both cases using this alternative strategy. Furthermore, it is worth observing that a selective-type control is more effective in reducing the peak of infection in the presence of high heterogeneity like in the case of Italy.

    Figure 2.

    Figure 2. Retrospective analysis of the epidemic dynamics in the presence of selective control strategy with target xT,i, iC, estimated in the uniform case. (Online version in colour.)

    4. Conclusion

    The recent spreading of COVID-19 epidemic has been thwarted in most countries through non-pharmaceutical interventions, intended to diminish transmission rates and, to this end, reduce person-to-person contact via social distancing. Even if the confinement measures appear to be a correct tool to limit the infection spreading, a precise quantification of the effects played by them has still to be carefully understood.

    Aiming to investigate the impact of the number of daily contacts in the spread of infectious diseases, in [18], we introduced a mathematical description of the epidemic spreading by integrating the epidemiological dynamics of the classical SIR-type compartmental model with a kinetic term quantifying the population-based contacts.

    The kinetic description of the formation of social contacts considered in [18], based on elementary interactions, turns out to be very flexible with respect to external inputs, like a control forcing the number of daily social contacts towards a fixed target. From the modelling point of view, this strategy leads to integrating the compartmental models with social contacts with a new term quantifying in a precise way the effect of the control strategy depending on the population heterogeneity.

    The main conclusion of our theoretical analysis, confirmed by numerical experiments, is that a selective control, when compared with a uniform control, and at the same social cost, leads to a marked reduction of the epidemic spreading. This reduction is further linked to the parameter of social heterogeneity, so that the selective control gives a better performance in a population with higher heterogeneity.

    Data accessibility

    All data used in this work are publicly available at https://github.com/CSSEGISandData/COVID-19.

    Authors' contributions

    G.D., G.T. and M.Z. conceived and wrote the paper; G.D. and M.Z. designed and performed the experiments. All authors gave final approval for publication and agreed to be held accountable for the work performed therein.

    Competing interests

    We declare we have no competing interests.

    Funding

    The research was partially supported by the Italian Ministry of Education, University and Research(MIUR: Project ‘Optimal mass transportation, geometrical and functional inequalities with applications’, and Dipartimenti di Eccellenza Program(2018–2022)—Dept. of Mathematics ‘F. Casorati’, University of Pavia. G.D. would like to thank the Italian Ministry of Instruction, University and Research (MIUR) for supporting this research with funds coming from PRIN Project 2017(no. 2017KKJP4X entitled ‘Innovative numerical methods for evolutionary partial differential equations and applications’).

    Acknowledgements

    This work has been written within the activities of GNFM and GNCS groups of INdAM (National Institute of High Mathematics).

    Footnotes

    One contribution of 17 to a theme issue ‘Kinetic exchange models of societies and economies’.

    Published by the Royal Society. All rights reserved.