Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences
You have access

Cluster formation in complex multi-scale systems

J.D Gibbon

J.D Gibbon

Department of Mathematics, Imperial College LondonLondon SW7 2AZ, UK

Google Scholar

Find this author on PubMed

and
E.S Titi

E.S Titi

Department of Computer Science and Applied Mathematics, Weizmann Institute of SciencePO Box 26, Rehovot 76100, Israel

Departments of Mathematics and Mechanical and Aerospace Engineering, University of CaliforniaIrvine, CA 92697-3875, USA

Google Scholar

Find this author on PubMed

    Abstract

    Based on the competition between members of a hierarchy of length scales in complex multi-scale systems, it is shown how clustering of active quantities into concentrated sets, like bubbles in a Swiss cheese, is a generic property that dominates the intermittent structure. The halo-like surfaces of these clusters have scaling exponents lower than that of their kernels, which can be as high as the domain dimension. Possible examples include spots in fluid turbulence and droplets in spin-glasses.

    1. Introduction

    It has long been recognized that active quantities in complex systems of many types are not distributed evenly across a domain but cluster strongly into irregular bubbles, as in a Swiss cheese. The nomenclature, the nature and shape of the bubbles, and the physics in each subject may be substantially different. Some examples are: spottiness in high Reynolds number fluid turbulence (Batchelor & Townsend 1949; Kuo & Corrsin 1971; Meneveau & Sreenivasan 1991; Frisch 1995; Zeff et al. 2003) and boundary layers (Emmons 1951); droplet formation in spin-glasses (Fisher & Huse 1986; Bray & Moore 1987; Palassini & Young 2000); clustering behaviour in networks (West et al. 1999; Albert & Barabási 2002); the preferential concentration of inertial particles (Eaton & Fessler 1994; Sigurgeirsson & Stuart 2002; Bec 2003; Bec et al. 2005; Holm & Putkaradze 2005), with applications to rain initiation by cloud turbulence (Falkovich et al. 2002); the clustering of luminous matter (Bak & Chen 2001, 2002; Paczuski & Hughes 2004; Bak & Paczuski 2005) and magnetic bubbles in astrophysics (Zweibel 2002). Clusters display strong features whose typical length scales are much shorter than their averages, thus raising the question of the nature of the interface between them and the surrounding longer scale regions. For instance, in spin glasses Palassini & Young (2000) have shown that the ‘surface’ of the droplets has a fractal-like structure, whereas the droplets themselves have the full domain dimension. In fluid turbulence the concentrated sets on which vorticity accumulates are tubes and sheets, although the fractal nature of these is unclear. These sets dominate the associated Fourier spectra which display a spikiness that is the hallmark of what is usually referred to as intermittency (Batchelor & Townsend 1949; Kuo & Corrsin 1971; Kerr 1985; Meneveau & Sreenivasan 1991; Vincent & Meneguzzi 1994; Frisch 1995; Zeff et al. 2003).

    While one cannot hope to make a uniform theory for so many disparate examples whose origins, scale and governing equations widely differ, the ubiquity of clustering phenomena suggests the existence of an underlying set of organizing principles. The function of this paper is primarily mathematical; using simple but broadly applicable ideas, it will demonstrate that a dominant principle behind clustering is the existence of a hierarchy of length scales whose members are in competition.

    2. Competition within a hierarchy of length scales

    Consider a d-dimensional system whose smallest characteristic (integral) scale L is such that the system is statistically homogeneous on boxes Ω=[0,L]d. Moreover, it is endowed with the following two properties. Firstly, at each point xΩ, it possesses an ordered set of length scales n=n(x) associated with a hierarchy of features labelled by n≥2

    Display Formula
    (2.1)
    The n could be thought of as an ordered set of correlation or coherence lengths; their inverses Inline Formula clearly obey 1<nn+1. Secondly, the assumption is that the ensemble averages of the n(x) are bounded above by some ordered, positive parameters of the system satisfying 1<RnRn+1,
    Display Formula
    (2.2)
    The ensemble average 〈.〉 is a spatial average with respect to the Lebesgue measure over Ω. Thus, while the ordering of the n(x) must be respected at each point, the n themselves could be quite rough, e.g. they could consist of a series of step functions.

    If they become very small near points x* then they must obey Inline Formula (r=|xx*| and ϵ>0) so as not to violate equation (2.2).

    Following an idea used by Gibbon & Doering (2003, 2005), consider the real arbitrary parameters 0<μ<1 and 0<α<1 such that μ+α=1. A version of Hölder's inequality, with A and B positive, is

    Display Formula
    (2.3)
    With a choice of p=μ−1 and q=α−1, we have
    Display Formula
    (2.4)
    Re-arranging and factoring out a term Inline Formula gives
    Display Formula
    (2.5)
    Lower bounds on the ratio Inline Formula can be found from equations (2.1) and (2.2) thereby turning equation (2.5) into
    Display Formula
    (2.6)
    While it is possible that the integrand in equation (2.6) could be positive everywhere in Ω, this cannot be assumed; the generic case is that the integrand could take either sign. With the definition Inline Formula we have the pair of inequalities
    Display Formula
    (2.7)
    for which ≥ is valid on regions where the integrand is positive, designated as good regions, and negative (<) on bad regions. The term Inline Formula on the right-hand side of equation (2.7) remarkably contains the arbitrary parameter μ which lies in the range 0<μ<1. Its existence is important because the ordering in equation (2.1) applied to equation (2.7) makes it clear that everywhere within the bad regions (< in equation (2.7)) there are large lower bounds on κn with exponents containing 1/μ
    Display Formula
    (2.8)
    Let Inline Formula be the set on which Inline Formula and Inline Formula the set on which Inline Formula. Then all the bad regions (< in equation (2.7)), designated by the clusters of black kernels in figure 1, lie in Inline Formula. The grey halos also lie in Inline Formula, and correspond to those parts of the good regions (≥ in equation (2.7)), neighbouring the bad, for which Inline Formula; see also figure 2. It is in these halos where the lower bound Inline Formula becomes operative. The white areas of figure 1 belong to Inline Formula in which the κn can be randomly distributed subject to their ordering in equation (2.1). It is clear from equation (2.6) that the existence and location of the clusters may differ for each n although no information is obtained regarding their distribution.
    Figure 1

    Figure 1 A cartoon-like illustrative slice through Ω for one value of n: the black kernels, surrounded by grey halos may take many shapes depending on the problem. Very small-scale behaviour concentrates on the black and grey regions which constitute the set Inline Formula. The halos have scaling exponents lower than those of the black kernels.

    Figure 2

    Figure 2 An illustration of a single cluster with its grey halo (left figure) and their corresponding regions (right figure); these lie in the range Inline Formula. For many clusters the corresponding plot of n versus x would have an intermittent structure.

    To show that the volume Inline Formula of Inline Formula comprises a small part of Ω, Chebychev's inequality relates the normalized Lebesgue measure Inline Formula to the integral of n over Inline Formula

    Display Formula
    (2.9)
    Together with the relation Inline Formula we have
    Display Formula
    (2.10)
    Hence Inline Formula is significantly smaller than unity and decreases as Rn increases. Thus Inline Formula can fill, at most, a small fraction of Ω.

    3. Scaling exponents

    To estimate the fractal or Hausdorff dimensions of the set Inline Formula in a precise set-theoretic sense requires more information than is available. However, something very close to this can be found by estimating scaling exponents; see Hentschel & Procaccia (1983) and Frisch (1995). This entails making a third assumption of self-similarity to estimate the smallest number of balls Inline Formula of radius Inline Formula needed to cover Inline Formula. Defining Inline Formula as

    Display Formula
    (3.1)
    for some p>1, it is clear that Inline Formula cannot be large enough when p=1 because of equation (2.2). However, any value1 of p≫1 will do that makes Inline Formula large enough to be a member of Inline Formula. The simplest and worst estimate would be to write
    Display Formula
    (3.2)
    Inequality (2.10), however, shows that Inline Formula occupies only a small fraction of Ω. A multiplicative factor of Inline Formula is introduced thus
    Display Formula
    (3.3)
    Instead of using equation (2.10) to estimate Inline Formula, an assumption of self-similar scaling is introduced that requires that the change in volume of the balls with respect to n should scale as Inline Formula scales to Ld. Thus
    Display Formula
    (3.4)
    We observe that the definition of the set Inline Formula in principle involves the length scales L and Inline Formula, but not overtly Inline Formula. Yet the good and bad sets involve all three scales; L, Inline Formula and Inline Formula. The self-similarity assumption (3.4) is an assumption about the nature of the set Inline Formula that relates successive length scales Inline Formula and Inline Formula in an ad hoc, yet reasonable fashion. Using equation (3.4) in (3.3) we have
    Display Formula
    (3.5)
    From these, two estimates for Inline Formula emerge, one each for the grey halo and black kernel regions of figure 1, whose scaling exponents2 are independent of p
    Display Formula
    (3.6)
    For the halos, the > direction of the inequality in equation (2.7) has been used together with a simple Hölder inequality
    Display Formula
    (3.7)
    whereas for the kernels κnκn+1 has been used. In contrast, without any evidence of contraction of volume, the formula corresponding to equation (3.3) for Inline Formula is
    Display Formula
    (3.8)
    where Inline Formula satisfies Inline Formula. The uniform scaling exponents in equation (3.6) satisfy (table 1)
    Display Formula
    (3.9)
    whereas Inline Formula=d from equation (3.8). The coefficients Inline Formula in equations (3.5)–(3.8) reflect the fact that this effect is taking place only at length scales smaller than Inline Formula where the better lower bound Inline Formula becomes effective in equation (2.7). The grey halo clearly plays the role of an interface of small but finite thickness between the d-dimensional (white) outer region and the (black) inner kernel whose dimension can be as high as d but could be less; figure 2 illustrates the correspondence between a kernel with its halo and the magnitude of n. The full system, with many clusters, would manifest many excursions away from the ensemble average that would dominate the intermittent structure. When Inline Formula saturates its upper bound we have
    Display Formula
    (3.10)
    For any system in question, a numerical experiment would be necessary to estimate the Rn by finding the maximum value of the ensemble average 〈κn〉. In principle μ could then be found from numerical estimates of Inline Formula within the black kernels, although if the κn take very large values there it might not be possible to achieve resolution. μ itself may have upper and lower bounds that are themselves n-dependent, as in Gibbon & Doering (2003, 2005).

    Table 1Summary regarding the sets Inline Formula and the regions in figure 1

    figure 1blackgreywhite
    setInline FormulaInline FormulaInline Formula
    inequality (2.7)<(bad)≥(good)≥(good)
    exponentdd(1−μ)=d

    4. Some final remarks

    The arguments in this paper have revolved around a statistically steady system with an ensemble average based on a Lebesgue measure. It has been shown that when a system of this type has an ordered hierarchy of length scales, and ensemble averages are finite, then intermittency is generally inevitable. Very short length scales crowd into clusters whose boundaries are fractal-like. This crowding effect can be illustrated in an alternative way: a Schwarz inequality, in combination with equation (2.2), yields a lower bound on the ensemble average of n,

    Display Formula
    (4.1)
    whereas in the black clusters (2.8) shows that there is a point-wise upper bound,
    Display Formula
    (4.2)
    Comparing the two shows that the length scales within these clusters are very much smaller than the lower bound on the averages expressed in equation (4.1).

    Historically, the first ideas on clustering came more than half a century ago from Batchelor & Townsend (1949) who observed intermittent behaviour in their high Reynolds number turbulent flow experiments, closely followed by observations in boundary layers by Emmons (1951). Batchelor & Townsend (1949) called this phenomenon ‘spottiness’ and suggested that the energy associated with the small scale components is distributed unevenly in space and roughly confined to regions which concomitantly become smaller with eddy size (Kuo & Corrsin 1971). Mandelbrot (1974) then suggested that these clustered sets on which energy dissipation is the greatest might be fractal in nature. In measurements of the energy dissipation rate in the atmospheric surface layer, Meneveau & Sreenivasan (1991) interpreted the intermittent nature of their signals in terms of multi-fractals. A newer generation of experiments measuring intense dissipation in turbulent flows have been pursued by Zeff et al. (2003).

    Standard cascade theories of turbulence (Frisch 1995) are generally statistically stationary, but there are some significant challenges in applying the formulation of this paper to time-evolving three-dimensional Navier–Stokes turbulence. Thin sets of high vorticity and strain, taking on the nature of short-lived quasi-one-dimensional tubes and quasi-two-dimensional sheets, evolve rapidly in time (Kerr 1985; Vincent & Meneguzzi 1994; Frisch 1995). To include time and space in the average 〈.〉 means a different measure could be necessary because of the semi-infinite nature of the time-axis. With the exception of the paper by Caffarelli et al. (1982) on the Navier–Stokes (potentially) singular set, methods of analysis are unfortunately not advanced enough to deal with the full space–time three-dimensional Navier–Stokes equations. Conventional methods use Sobolev norms to L2-average the velocity field and its derivatives over space while the pressure is removed by projection (Constantin & Foias 1988; Foias et al. 2001; Majda & Bertozzi 2002) leaving only time as an independent variable. Using this approach Gibbon & Doering (2003, 2005) have reached a half-way stage in this process by showing that an ordered hierarchy of κn(t) can be constructed for the three-dimensional Navier–Stokes equations that are comprised of ratios of norms (of derivatives of order n). Because the κn(t) are functions only of time, 〈.〉 means time-average. The clusters of figure 1 appear as gaps in the time-axis, as in figure 2, although conclusions about the halos would not be applicable when time is the only variable. Gibbon & Doering (2005) found it necessary to prove that these gaps are finite in width and decreasing with increasing Reynolds number, which involves finding bounds on μ.

    Thus for time-evolving intermittent systems such as the Navier–Stokes equations, the challenge is to include both space and time while showing that these equations also possess a space–time hierarchy of scales. The appearance of short-lived clusters in numerical computations of the vorticity field suggests that such hierarchies may exist in local sub-domains of the flow for finite times.

    A further example is that of the low-temperature phase of spin glasses (Sherrington & Kirkpatrick 1975; Mézard et al. 1987). In this case 〈.〉 means ensemble average in the sense defined in this paper. The role played by the competition between members of a hierarchy of length scales is consistent with the observation of ultrametricity, a term that is used to denote the presence of a hierarchy of scales (Mézard et al. 1987; Parisi & Ricci-Tersenghi 2000; Berthier et al. 2004). This has been observed in computations on the low-temperature spin glass phases of the Sherrington–Kirkpatrick (Hed et al. 2004) and Edwards–Anderson models (Stariolo 2001), as well as in dynamic phenomena in complexity (Boettcher & Paczuski 1996). The results in this paper, particularly with reference to equation (3.10), are consistent with the droplet theory (Fisher & Huse 1986; Bray & Moore 1987; Palassini & Young 2000) where the kernel of the droplet is of full dimension d but its surface has a scaling exponent <d. Palassini & Young (2000) have shown numerically that Inline Formula when d=3 and Inline Formula when d=4.

    The physics behind many clustering phenomena is often difficult to express in terms of well-posed boundary value problems that can be subjected to rigorous analysis. Instead, the approach taken in this paper, highlighting the competition between length scales as the dominant mechanism, may be a useful paradigm in explaining the behaviour of multi-scale systems.

    We wish to acknowledge discussions with Steve Cowley, Charles Doering, Darryl Holm, Roy Jacobs, Robert Kerr, Michael Moore, Maya Paczuski, Andrew Parry, Greg Pavliotis, Jaroslav Stark and Christos Vassilicos. J.D.G. would like to thank the Isaiah Berlin Foundation for travel support and the hospitality of the Faculty of Mathematics and Computer Science of the Weizmann Institute of Science. E.S.T. was supported in part by the NSF grant number DMS-0204794, an MAOF Fellowship of the Israeli Council of Higher Education, the USA Department of Energy under contract number W-7405-ENG-36 and the ASCR Program in Applied Mathematical Sciences.

    Footnotes

    As p→∞, Inline Formula, which certainly lies within Inline Formula. The p-labelling of Inline Formula is suppressed.

    Since we expect Inline Formula, the estimate (3.5) implies that Inline Formula. This is consistent with κn>L−1 as in equation (2.1) but technically imposes an additional constraint.