Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences

    Abstract

    This paper demonstrates an equivalence between rotating magnetized shear flows and a stressed elastic beam. This results from finding the same form of dynamical equations after an asymptotic reduction of the axis-symmetric magnetorotational instability (MRI) under the assumption of almost-critical driving. The analysis considers the MRI dynamics in a non-dissipative near-equilibrium regime. Both the magnetic and elastic systems reduce to a simple one-dimensional wave equation with a non-local nonlinear feedback. Under transformation, the equation comprises a large number of mean-field interacting Duffing oscillators. This system was the first proven example of a strange attractor in a partial differential equation. Finding the same reduced equation in two natural applications suggests the model might result from other applications and could fall into a universal class based on symmetry.

    1. Introduction

    In a non-dissipative, purely hydrodynamical context, the famous Rayleigh criterion [1,2] of outwardly increasing angular momentum governs the stability of rotating shear flows. That is,

    ddr(r4Ω(r)2)<01.1
    gives a necessary condition for the axis-symmetric instability of a given radially varying cylindrical rotation profile, where r represents the outward directed radial coordinate, and Ω(r) represents the local angular rotation rate. During the 1950s and 1960s, Chandrasekhar [3,4] in The West, and Velikov [5] in the former Soviet Union considered the effect of an axial magnetic field on the stability of a cylindrically swirling conducting fluid, thereby modifying the Rayleigh criterion. In these works, both authors found independently that (in the ideal non-dissipative regime), the presence of a magnetic field can catalyse an instability for a merely inwardly increasing angular frequency profile; rather than angular momentum. That is, a magnetohydrodynamic (MHD) instability only requires
    ddr(Ω(r)2)<0,1.2
    independent of the strength of the axial magnetic field (in a sufficiently large domain). However, in spite of its discussion in Chandrasekhar's famous book on fluid instabilities [6], this result went mostly unnoticed until its application in astrophysics.

    In the early 1990s, Balbus & Hawley [7] auspiciously applied the destabilizing of nature of a weak magnetic field to solve a previous astrophysical paradox. Specifically, in a fluid system orbiting a central mass, M, a balance between gravitational and centrifugal acceleration chiefly determines the rotation profile, (r)2=GM/r2; G represents Newton's gravitational constant. This Keplerian profile implies

    ddr(r4Ω(r)2)=GM>0,1.3
    which ostensibly guarantees stability to infinitesimal perturbations, according to the Rayleigh criterion. However, the stability of Keplerian differential rotation does not reconcile with observations of large brightnesses in proto-planetary discs [8]. Balbus and Hawley invoked the presence of a weak background magnetic field and employed the more favourable instability requirement
    ddr(Ω(r)2)=3Ω(r)2r<0.1.4
    This argument, along with their nonlinear numerical simulations, provided very strong evidence supporting the hypothesis of magnetized accretion disc dynamics [9,10].

    The advent of the magnetorotational instability (MRI) in astrophysics produced many notable results over the last three decades [8]. Little doubt exists regarding the operation and efficiency of the MRI in hot astrophysical disc systems. But questions do remain concerning the instability's cessation and saturation [11,12]. This paper explores the dynamics of the MRI in a controlled setting. This simple approach helps explain some mysterious aspects regarding how the instability transports momentum and magnetic flux, which proves useful in understanding possible saturation mechanisms.

    In contrast to a magnetized fluid, an elastic material is a continuum substance that returns to its original form with the removal of applied loads. In spite of clear differences, elastodynamics and MHD contain many deep physical and mathematical similarities. And just with forced fluids, elastic materials possess an extremely rich range of behaviours, specifically including dynamical instabilities.

    When a solid column buckles, it loses the ability to support a load while retaining its elastic integrity. Because of its obvious importance to the stability of structures, the scientific investigation of buckling dates well back into antiquity, and quantitative investigations started at the very beginning of the modern scientific era. As early as the 1480s, Leonardo da Vinci produced empirical criteria addressing the stability and lateral deflection of columns under compression [13]. In the eighteenth century, Euler and Bernoulli began to consider time-dependent elastic deformations and buckling [14]. In particular, Euler derived a practical formula for calculating the critical load on a slender beam. Engineers still use this formula (and its extensions) in modern construction design [15].

    Elastic substances exhibit many more types of instabilities than simple buckling [16]. After much success with linear dynamics of elastic solids, work in the 1970s began to focus heavily on nonlinear finite deformations. Aeronautical engineering and manufacturing particularly requires understanding the time-dependent effects of solid materials under a wide range of situations. The introduction of methods from mathematical dynamical systems theory allowed rigorous analysis for a wide range of scenarios [17]. In a series of work, Holmes and Marsden considered a class of nonlinear, non-local models for buckling and fluttering [18,19]. They proved the first example in a partial differential equation of chaos and a strange attractor; i.e. an infinity of arbitrarily long periodic orbits.

    This paper finds an almost identical equation arising naturally from the weakly nonlinear theory of the MRI in a simple geometry. This link gives qualitative insight into the astrophysical setting, and a more quantitative understanding of magneto-Taylor–Couette flow in laboratory experiments [20]. In contrast to astrophysical discs, the model may also apply to the inside of some stars with stronger ambient magnetic field and more modest differential rotation. However, even if no astrophysical object exists near-equilibrium, understanding nonlinear behaviour near a phase transition can uncover fundamental interactions. In this case, the current model helps elucidate transport of momentum and magnetic flux in an important MHD instability. To use a zoological analogy: underneath many obvious differences, both the buckling beam and the wild-type MRI contain a morphologically equivalent primitive skeleton.

    The remainder of this paper is as follows: §2 analyses both the linear and nonlinear theory of the MRI. The main result of the paper follows from (2.63) and (2.64) and its analogy to (3.24); §3 derives the buckling instability of a slender elastic beam; §4 provides a detailed discussion on several aspects of the MRI and the buckling beam; §5 shows computational results of the nonlinear solutions and §6 provides conclusion.

    2. Instability of a rotating magnetohydrodynamic shear flow

    The first part of this paper considers a two-dimensional model for the MRI of an incompressible magnetized fluid with linear shear flow. The system remains invariant along the shear direction, but contains all three components of flow and magnetic field. The full primitive equations describe the dynamics in a rotating coordinate system,

    tv+vv+2Ωz^×v+p=BB+ν2v,2.1
    v=0,2.2
    tB+vB=Bv+η2B2.3
    andB=0.2.4
    The above equations govern the dynamics of all three components of flow and magnetic field, but make the implicit assumption of axis-symmetry ∂y=0.

    The stream-function and magnetic-scalar-potential formulations enforce the solenoidal character of flow and magnetic field (2.2) and (2.4),

    v=[v0(x)+v(x,z)]y^y^×ψ(x,z)2.5
    and
    B=b(x,z)y^y^×[a0(x)+a(x,z)],2.6
    where v(x,z), b(x,z) represent the flow and magnetic field in the stream-wise direction, respectively, and ψ(y,z), a(y,z) represent the poloidal stream function and scalar potential in the plane perpendicular to the streaming flow. Both v(x,z) and a(x,z) represent perturbations relative to the linear background profiles
    v0(x)=SxSd2anda0(x)=BxBd2.2.7
    The constant terms in the above equation ensures the mean of both quantities vanishes over the domain 0≤xd. This represents an arbitrary gauge for the magnetic potential, but complies with the definition of the global rotations rate for the shear flow. The linear background magnetic potential implies a uniform background magnetic field in the vertical direction, B z^. The parameter S represents the local background shear rate. Figure 1a depicts the basic geometry and background configuration.
    Figure 1.

    Figure 1. (a) Fluid system diagram. The model considered in §2 assumes a narrow-gap Cartesian limit of an axis-symmetric Taylor–Couette cylinder. Different rotation rates on the inner and outer cylinders manifests as a local shear in the rotating frame. (b) Elastic system diagram. The left-side image depicts an un-stressed beam of length L, and thickness d. The right-side image depicts the solution to the slightly buckled beam with clamped-end boundary conditions.

    Obtaining equations for the stream function and magnetic scalar potential requires taking the two-dimensional curl of the x and z components of the flow and magnetic field equations. The incompressible MHD equations for each scalar variable are

    tb+J(ψ,b)=J(a,v)+BzvSza+η2b,2.8
    ta+J(ψ,a)=Bzψ+η2a,2.9
    tv+J(ψ,v)(f+S)zψ=J(a,b)+Bzb+ν2u2.10
    andt2ψ+J(ψ,2ψ)+fzv=J(a,2a)+Bz2a+ν4ψ,2.11
    where
    J(p,q)xpzqzpxq,2x2+z22.12
    represent the nonlinear transport of two quantities, and two-dimensional Laplacian, respectively. For simplicity, we measure the magnetic field in Alfvén units, i.e. μ0ρ0=1. We restore these parameters at the conclusion of the derivations. For the remaining parameters, f=2Ω represents the background vorticity owing to the frame rotation, Ω. The dissipation parameters, ν and η represents viscosity and magnetic diffusivity, respectively. The following analysis considers the cases of both small and/or completely vanishing diffusion coefficients.

    (a) Linear analysis

    Temporarily neglecting the nonlinear and dissipative terms in (2.8)–(2.11) helps determine the relevant spatio-temporal scales. After linearizing, and with some simplifications,

    (t2B2z2)b=fBz2ψ,2.13
    ta=Bzψ,2.14
    (t2B2z2)tv=((f+S)t2SB2z2)zψ2.15
    and(t2B2z2)22ψ+f((f+S)t2SB2z2)z2ψ=0.2.16
    For a given ψ(t,x,z), (2.13), (2.14) and (2.15) determine b(t,x,z), a(t,x,z) and v(t,x,z), respectively. (2.16) determines the stream function, subject to the impenetrable boundary conditions
    ψ|x=0=ψ|x=d=0.2.17
    Wave-like solutions
    ψ=Ψei(kz+ωt)sinπxd+c.c.2.18
    produce the ‘dispersion relation’ for the (possibly complex-valued) frequency, ω, versus wavenumber k, and other parameters,
    k2+π2d2λ2f(f+S)k2λf2B2k4=0,2.19
    where
    λω2B2k2.2.20
    equation (2.19) gives purely real solutions for λ. Therefore, a transition from purely real frequencies (ω2>0) to purely imaginary frequencies (ω2<0) characterizes any transition of stability. For any real k,
    B2k2+π2d2+fS=02.21
    gives the boundary (ω2=0) between these two regimes. Instability (ω2<0) exists for
    fSB2k2+π2d2>π2B2d2.2.22
    For a finitely thick layer,
    Scrit.π2B2fd22.23
    defines the critical shear needed to drive the MRI.

    The difference between the critical shear and actual shear helps simplify the analysis in the vicinity of the instability,

    σScrit.S.2.24
    Assuming long-wavelength perturbations |kd|≪1, and near-critical shear |σ|≪|Scrit.|, we can expand the dispersion relation to lowest order and find
    ω2B2d2k2(B2k2fσ)π2B2+f2d2.2.25
    The above equation shows that ω=O(σ), and k=O(|σ|) for f|σ|d2/B2≪1. Figure 2a shows the behaviour of (2.25).
    Figure 2.

    Figure 2. (a) A ‘dispersion curve’ showing the complex-valued frequency as a function of vertical wavenumber. The shaded grey region shows purely imaginary values, and the white region to the right of the dashed line corresponds to purely real waves. Assuming σ>0, k0fσ|/|B|, ω0fσd/π2B2+f2d2. The maximum growth rate occurs for k=k0/2 and ω=ω0/2. (b) The solid line shows the critical stability curve for σ(k) with finite dissipation. The dashed line shows the corresponding curve with no dissipation. See (4.6)–(4.8) for details.

    Reintroducing space and time variables via ω→−it and k→−iz implies (heuristically) that (2.25) represents the simplified (unstable) wave equation

    t2ψB2d2π2B2+f2d2(B2z2+fσ)z2ψ+nonlinear terms.2.26
    The above equation implicitly contains missing nonlinearities required to saturate exponential growth. The next section determines these unknown terms.

    (b) Nonlinear asymptotic analysis

    Rendering the dynamical equations in dimensionless form often helps to simplify derivations. In this case, working with scaled quantities partially obscures the physical meaning of different dynamical ingredients. Therefore, the following analysis introduces a small non-dimensional parameter (ε≪1) only to keep track of the relative magnitudes of the different terms. We rescale all relevant dynamical variables in terms of particular powers of ε and conduct an asymptotic analysis accordingly. From (2.25), we deduce the consistent rescaling

    SS0ε2σ,tε2t,xx,zεz2.27
    The linear system (2.13)–(2.15) provides the relative amplitudes of b, a and v in terms of ψ. The fully nonlinear dissipative system, (2.8)–(2.11), constrains the amplitude of ψ, and also gives the magnitude of ν and η that allow dynamically significant dissipation. That is,
    bε2b,aεa,vεv,ψε2ψ2.28
    and
    ηε2η,νε2ν.2.29

    Upon rescaling, (2.8)–(2.11) become

    z(BvS0a)=εJ(a,v)+ε2(tbσzaηx2b)+O(ε3),2.30
    z(Bx2afv)=εJ(a,x2a)+ε2(tx2ψBz3aνx4ψ)+O(ε3),2.31
    taBzψηx2a=εJ(ψ,a)2.32
    andtvBzb(f+S0)zψνx2v=ε(J(a,b)J(ψ,v))+O(ε2).2.33

    Considering this system order-by-order, the analysis halts when the first time derivative from the right-hand side enters the balance. Therefore, we may automatically neglect terms that are formally smaller than the highest order time-derivative. Other than not writing higher order terms, (2.30)–(2.33) are completely equivalent to (2.8)–(2.11).

    We expand all dynamical variables in the following series:

    v=v0+εv1+ε2v2+,a=a0+εa1+ε2a2+2.34
    and
    b=b0+εb1+ε2b2+,ψ=ψ0+εψ1+ε2ψ2+,2.35
    substitute this into (2.30)–(2.33), and collect terms order-by-order in powers of ε. The leading-order balance produces
    z(Bv0S0a0)=0,z(Bx2a0fv0)=02.36
    and
    ta0Bzψ0=ηx2a0,tv0Bzb0(f+S0)zψ0=νx2v0.2.37

    Solving (2.36) and (2.37) in general form requires introducing a potential function φ(t,z)

    v0=S0zφsinπxd,2.38
    a0=Bzφsinπxd,2.39
    ψ0=(tφ+γ1φ)sinπxd2.40
    andb0=1B(ftφ+((f+S0)γ1S0γ2)φ)sinπxd,2.41
    where
    γ1π2ηd2,γ2π2νd2,S0=π2B2fd2.2.42

    At this point, φ(t,z) represents all the dynamical degrees of freedom left undetermined by (2.36) and (2.37). This minimalist approach solves (2.36) and (2.37) without adding any more information about the form of the solution until nonlinearity dictates it. This single unknown function represents a scalar-valued ‘order parameter’ in the language of critical phenomena. The typical value of φ(t,z) grows in amplitude and becomes increasingly disordered as the system becomes more unstable.

    (2.42) matches with the critical value resulting from linear theory, (2.23). We choose boundary conditions for the magnetic field that match the natural non-dissipative profiles. Other choice of boundary conditions would bring boundary layers into the analysis. We filter these effects for simplicity.

    The next-order balance produces

    z(Bv1S0a1)=J(a0,v0)=0,2.43
    z(Bx2a1fv1)=J(a0,x2a0)=0,2.44
    ta1Bzψ1ηx2a1=J(ψ0,a0)=πB2dsin2πxd(t(zφ)2+2γ1(zφ)2z((tφ+γ1φ)zφ))2.45
    andtv1Bzb1(f+S0)zψ1νx2v1=J(a0,b0)J(ψ0,v0)=π(fS0)2dsin2πxd(t(zφ)2+2γ3(zφ)2z((tφ+γ3φ)zφ)),2.46
    where
    γ3fγ1S0γ2fS02.47
    denotes a weighted average of the magnetic and viscous damping coefficients. (2.43) and (2.44) determine the strictly z-dependent components of v1 and a1, which exactly mimic the form of v0 and a0. (2.43) and (2.44) do not determine the z-mean components of these variables.

    Averaging (2.45) and (2.46) in the vertical direction eliminates all pure z derivatives and produces the balances

    (t+4γ1)a1=πB2dsin2πxd(t+2γ1)|zφ|22.48
    and
    (t+4γ2)v1=π(fS0)2dsin2πxd(t+2γ3)|zφ|2,2.49
    where
    qlimL12LLLq(z)dz.2.50
    Defining the amplitudes,
    a1Asin2πxdandv1Vsin2πxd2.51
    simplifies the mean dynamics such that
    (t+4γ1)A=πB2d(t+2γ1)|zφ|22.52
    and
    (t+4γ2)V=π(fS0)2d(t+2γ3)|zφ|2.2.53
    The vertical variance of ∂zφ drives purely x-dependent corrections to the mean background field and shear momentum.

    Closing the system in terms of φ requires generating a second-order equation for a2. After combining second-order versions of (2.36),

    B3zx2+π2d2a2=π((3π2B2+fd2S0)ABfd2V)d3z2φsinπxdsin3πxd(f(fS0)(t2φ+2γ3tφ+γ4γ1φ)+fσB2z2φ+B4z4φ)sinπxd,2.54
    where
    γ42γ3γ1.2.55
    The solvability condition requires that all terms multiplying sin(πx/d) on the right-hand side of (2.54) cancel identically. Otherwise, (2.54) admits no finite solutions for a2 satisfying the boundary conditions [21]

    Grouping all evolution equations together,

    (1+q)(t+γ4)(t+γ1)φ+σ^B2fz2φ+B4f2z4φ=0(t+4γ1)A=πB2d(t+2γ1)|zφ|2and(t+4γ2)V=πf(1+q)2d(t+2γ3)|zφ|2,2.56-2.57
    where
    qS0f=B2π2f2d22.58
    defines the ratio of timescales between rotation and critical shear; i.e. the Rossby number. Note that q=34 corresponds to a Keplerian profile, and q=1 corresponds to the Rayleigh instability threshold. In the case of fixed q, we may think of the background magnetic field taking the role of the critical parameter.

    (2.56) contains the mean-field shear parameter

    σ^σπ(2π2BAfd2V)fd3.2.59
    The feedback from magnetic flux and momentum transport produce the only differences between the linear model (including dissipation) and a nonlinear model that can saturate the MRI.

    In terms of q,

    γ3=γ1+qγ21+qandγ4=(1q)γ1+2qγ21+q.2.60
    For 0<q<1, both γ3>0 and γ4>0 are well-defined averages of γ1 and γ2. The dissipation coefficients take only one of the two possible orderings
    γ1<γ3<γ4<γ2orγ2<γ4<γ3<γ1.2.61

    Assuming η=ν=0 allows reducing the system even further. In this case, (2.56) and (2.57) integrate explicitly so that

    A=πB2d|zφ|2,V=πf(1+q)2d|zφ|22.62
    and
    (1+q)t2φ+σ^B2fz2φ+B4f2z4φ=0,2.63
    where
    σ^=σf3q(1+3q)2B2|zφ|2.2.64
    equation (2.63) represents the most important result of this paper. The next section shows that (2.63) is almost identical to a model of weakly nonlinear loaded elastic beam. Holmes & Marsden [19] found that the forced and damped version of this model possesses an infinite number of chaotic solutions. This work has been an influential paradigm for chaos in PDEs. The remainder of this paper will elaborate on some of the interesting implications for this mathematical analogy between MHD and elastodynamics.

    Equation (2.63) differs significantly from previous attempts at a weakly nonlinear model for the MRI [22,23]. Those cases consider geometry more similar to accretion discs, and as a result required significantly higher (leading-order) dissipation. Past results produce a dissipative Ginzburg–Landau equation, not the high Reynolds number model produced here. However, for large dissipation (2.56) reduces to a first-order equation in ∂t and hence more like Ginzburg–Landau.

    Liverts et al. [24] derived an ordinary differential Duffing equation for the thin-disc MRI. We see in §5 that (2.63) represents a possible infinity of coupled Duffing equations. This correspondence shows strong evidence that the ideal Duffing-type dynamics underlies the MRI at a fundamental level.

    3. Buckling instability of an elastic beam

    This section derives a dynamical nonlinear model for an elastic beam in the vicinity of a buckling instability. Holmes gave a very short equivocal version of this same derivation [18]. The current context needs enough details to see the similarities and differences to the magnetic case. We use an asymptotic expansion in terms of the beam aspect ratio. In this section,

    εdL13.1
    where 0≤zL represents the length of the beam along the direction of applied load, and −d/2≤xd/2 represents cross-sectional thickness on the beam. Figure 1b depicts the basic geometry and background configuration. A fully complete treatment of fully nonlinear elastodynamics would lead to a very complicated asymptotic analysis in powers of ε. Even though we apply some intuitive reasoning, a completely systematic analysis gives the same eventual answer.

    The nonlinear strain tensor characterizes a general Lagrangian deformation of a continuum solid [25]. Therefore,

    |dx+dξ|2|dx|2=2dxEdx=2(Ex,xdx2+2Ex,zdxdz+Ez,zdz2),3.2

    where |dx|2=dx2+dz2 represents the original distance between nearby points, and |dx+dξ|2 represents the distance between nearby points after a Lagrangian displacement, ξ(t,x). The explicit components of Green's strain tensor are

    Ex,x=xξx+|xξx|2+|xξz|22,Ez,z=zξz+|zξx|2+|zξz|223.3
    and
    Ex,z=Ez,x=12[xξz+zξx+zξzxξz+xξxzξx].3.4

    We consider a homogeneous isotropic Hookean solid with linear stress–strain relationship

    S=λTr(E)I+2μE,3.5
    where Tr(E)=Ex,x+Ez,z, and I represents the identity matrix. The parameters λ and μ represent Lamb's constants, but using the alternative definitions proves advantageous,
    μ(1ν)2Yandλν(1ν)12νY,3.6
    where ν denotes Poisson's ratio (note: this is not the same as the viscosity parameter in the MRI analysis), and Y represents Young's modulus. Some texts use Y/(1−ν2) to denote Young's modulus e.g. [25]. Our current definition coincides with [18] and allows for a simpler derivation.

    The thin aspect ratio leads to small displacements and variation in the longitudinal direction, while the perpendicular displacements and variation remain order unity. Kinetic energy also must balance stresses. Therefore, we replace

    zεzandtε2t.3.7
    Considering leading-order balances produces
    ξx=u(z)ε2u(z)22+ν1νxw(z)+xu(z)2x2u(z)2+O(ε4)3.8
    and
    ξz=ε(w(z)xu(z))+O(ε3),3.9
    where u(z) and w(z) represent unknown mean displacements in the x and z directions, respectively, and primes, e.g. u′(z), denote z derivatives. (3.8) and (3.9) imply
    Ez,z=ε2w(z)+u(z)22xu(z)+O(ε3)3.10
    and
    Ex,x=ν1νEz,z+O(ε3),Ex,z=O(ε3).3.11
    These imply the stresses,
    Sz,z=ε2Yw(z)+u(z)22xu(z)+O(ε3)andSx,x,Sx,z=O(ε3).3.12
    The stress–strain relationship implies the elastic potential energy density
    U=12i,jSi,jEi,j=ε42Yw(z)+u(z)22xu(z)2+O(ε5),3.13
    for which only the Ez,z and Sz,z components contribute to leading order. To leading order, only the horizontal displacement contributes to the kinetic energy
    K=ε4ρ0|tu|22+O(ε5).3.14
    The explicit x dependence in the potential energy allows integrating out this dimension, which produces the effective one-dimensional Lagrangian density
    Ld/2d/2(KU)dxρ0|tu|22Y2zw+|zu|222+d212|z2u|2.3.15
    The corresponding action follows such that
    A0T0LLdzdt.3.16
    Varying the action with respect to w(t,z) implies
    zLw=0.3.17
    Or,
    zw+|zu|22=E0,3.18
    where E0 denotes an integration constant (independent of z, but depending on t). Varying the action with respect to u(t,z) implies that
    tLu˙+zLu=0.3.19
    Or,
    ρ0t2u=YE0z2uYd212z4u.3.20
    Determining E0 in (3.18) requires integrating over the entire length of the beam such that
    E0=ΔLL+12L0L|zu|2dz,3.21
    where ΔLw(z=L)−w(z=0) gives the total change in the length of the rod.

    From the definition of Young's modulus applied to the entire solid,

    Γ=YΔLL,3.22
    where Γ denote the total applied compression (Γ>0), or tensile (Γ<0) load. Therefore,
    zw=ΓY+12L0L|zu|2dz|zu|22.3.23
    The final reduced dynamical equation for the horizontal deflection follows such that
    ρ0t2u=Γ+Y2L0L|zu|2dzz2uYd212z4u.3.24
    The above equation is equivalent to the models found in [18,19], but with all parameters explicit.

    4. Discussion

    (a) Dynamical correspondences

    Inspecting each term in (2.63) and (3.24) highlights the analogies between the various physical parameters in the two cases. To simplify the derivations in §2, (2.8)–(2.11) use Alfvén units such that μ0ρ0=1, where μ0=4π in cgs units. This section restores the general parameters in order to make comparisons between MHD and elastic parameters.

    The shear criticality in the MHD system and the applied compression/tension loading correspond such that,

    Γσf+|S0|B2μ0=σf(1+q)B2μ0.4.1
    The magnetic pressure, B2/μ0 lends the correct units to the right-hand side of the relationship.

    The magnetic tension in the background magnetic field corresponds to Young's modulus in the elastic setting

    Y12|S0|π2(f+|S0|)B2μ0=12qπ2(1+q)B2μ0.4.2

    Together, (4.1) and (4.2) relate the shear criticality in the fluid to the total strain (relative compression) in the solid

    ΔLL=ΓYπ2σ12|S0|=π2σ12fq.4.3

    Lastly, the comparing the nonlinear terms in both systems gives interpretation for the scalar potential in MHD in terms of the horizontal beam displacement.

    |zu|2π4(f+3|S0|)12|S0|d2|zφ|2=π4(1+3q)12qd2|zφ|2.4.4
    Or, up to non-dimensional factors, u(t,z)∝φ(t,z)/d.

    The potential φ does not relate naturally to the horizontal Lagrangian displacement in the fluid system. The definition of the potential function is somewhat arbitrary, but (2.38)–(2.41) produce the most straightforward solution to (2.36) and (2.37) in terms of derivatives; b,ψ∝∂tφ; a,v∝∂zφ. The constants of proportionality are more chosen according to style. In terms of (fluid) Lagrangian displacements, ξx∝∂zφ; ξzφ, whereas for the elastic solid ξxu,∂zw; ξzw,∂zu. The correspondence between the parameter in the MRI and the buckling beam would allow the demonstration of some aspects of MHD with a more manageable slender elastic rod.

    (b) Saturation

    How the MRI saturates remains an open question in the context of accretion disc modelling [12,24]. Intuitive understanding in the case of the elastic beam, helps clarify the saturation mechanism of the MRI in this simple model. An elastic beam saturates a buckling instability by setting one side of the beam under tension and the other side under compression. In a sense, the instability transports stress (equivalently strain and density) dynamically from the tension side to the compression side. In the MHD case, (2.51) and (2.62) imply that the instability puts the left-half (0<x<d/2) of the domain under a slight excess of magnetic flux and slight deficit of linear momentum. The right-half (d/2<x<d) receives the opposite feedback such that the total of both quantities remains conserved. In the linear phase of the instability, momentum and magnetic flux compete to both drive and suppress growing perturbations. The nonlinear transport rearranges the background so that both halves of the domain experience more stringent stability criteria. In their early MRI studies, Balbus & Hawley [10] demonstrated positive outward angular momentum flux. From (2.62), 〈V 〉<0 also corresponds to positive outward transport. Even in the local, incompressible, Cartesian geometry, the system can correctly discern inward and outward. Along with angular momentum transport, the exchange of magnetic potential reconciles with past accretion disc models [26].

    (c) Symmetry

    The buckling beam analogy is not the first between MHD and elastic systems [27,28]. There are deep mathematical reasons for this relationship. Chandrasekhar speculated that the reason for not recovering Rayleigh's criteria (1.1) in the limit of zero magnetic field ‘must lie in the circumstances that … the lines of magnetic force are permanently attached to the fluid’ [6], section 81, p. 389. This is another way of saying that specifying the location of the magnetic field lines is the same as specifying the location of the fluid parcels.

    Like an elastic medium, MHD supports shear stresses and shear waves in the form of Alfvén displacements. Symmetry breaking provides the mechanism underlying this simple physical fact. When considering a collection of N particles, phase space generically comprises 3N momenta and 3N coordinates for a total of 6N dimensions. For a fluid, the range scales of fluctuations determines the effective N. Independence of the system on one or more of the coordinates implies conservation of the corresponding momenta. Ordinary fully compressible hydrodynamics is only 5N dimensional, 3N for the velocities, 1N for the density, and 1N for entropy (or equivalently the pressure). The missing third set of coordinates implies the conservation of potential vorticity (effectively a component of momenta). There is no restoring force for one component of the momentum (more accurately one subset of size N). For incompressible hydrodynamics a much smaller set of unique coordinate labels implies Kelvin's Circulation Theorem which implies a much larger set of conserved momenta. Magnetic field changes this picture.

    The frozen-in condition [29] provides the magnetic fluid with enough Lagrangian coordinate labels to break potential vorticity conservation. In fact, MHD seems to provide more coordinate labels than necessary to specify a location in phase space completely. The divergence condition, and other constraints, remedy the over-counting problem; see [30] for more details. With a full accounting for fluid parcels, there remain no point-wise conserved momenta, and almost all degrees of freedom must experience a non-zero force of some kind. Lack of Lagrangian-particle relabelling symmetry is an imperative feature MHD shares with elastodynamics. In the latter case, the dynamics depends explicitly on all components of the displacement field through the strain tensor.

    Along with their primary discovery, Balbus & Hawley [31] produced an analogy between magnetic field lines between Keplerian fluid parcels and a simple elastic spring connecting two orbiting masses. In this example, the connecting spring breaks the conservation of the individual masses angular momentum and Laplace–Runge–Lenz vectors. This example shows that the tension in the magnetic field and/or spring provides both stabilization via tension, and acts to liberate rotational energy. The buckling beam differs in this respect. The product of magnetic field and shear compares to the compression in (4.1). The magnetic field alone (without destabilizing shear) compares to Young's modulus in (4.2).

    (d) Non-locality

    The non-local nature of the nonlinear feedback requires some consideration. The nonlinear feedback in (2.63) appears surprising at first. Especially given that the original MHD model does not seem to contain non-local terms. How does the non-locality arise?

    Answering this requires pointing out a number of small facts. (i) In spite of appearances the original equations (2.8)–(2.11) are actually non-local. This results from evolving ωy=∇2ψ, rather than ψ alone in (2.11). Computing the inverse Laplacian amounts to convolving against the appropriate Green's function. This is a non-local operation. (ii) Unlike the incompressible MHD equations, the full elastodynamic equations are completely local. (iii) Therefore, the reason for non-locality in (2.63) cannot result from the non-locality (2.11). But similar physics underlies both situations. (iv) Non-locality in incompressible MHD results physically from fast acoustic waves equalizing the pressure field on timescales much faster than dynamical timescales. Rather than experiencing a small delay, distant points respond instantaneously to fluid motions, but in a fashion that attenuates with distance away from the disturbance. (v) The buckling beam achieves non-locality through of the separation of timescales between extremely fast elastic waves, and unstable flexural waves. (vi) In addition to the unstable MRI modes, the full linear dispersion relation (2.19) contains two additional fast modes for each wavenumber. (vii) These mixed Alfvén–Coriolis modes propagate with the phase speed

    ωfastk±B2+f2d2π2+O(σ2).4.5
    (viii) Both magnetic tension and the Taylor–Proudman effect [32] provide rigidity in the vertical direction and transmit magnetic and kinetic stresses through the fluid.

    Incompressibility implies that non-locality is more common than often realized. The surprising aspect about (2.63) is that feedback is entirely non-local. (2.64) shows no spatial variation or attenuation. We could express (2.64) in local form by stating zσ^=0; but we require some other condition to determine the actual value of σ^. Conservation of total-integrated momentum and magnetic flux provides the solution. The global conservation of these quantities also underlies the saturation mechanism.

    (e) Dissipative stability

    Also, in contrast to the ideal stability theory in §2a the nonlinear analysis elucidates stability with a small amount of diffusion. In that case there exist a critical shear curve as a function of wavenumber, i.e.

    σc(k)=B2k2f+(1+q)γ1γ4fB2k2,4.6
    where the first term on the right-hand side gives the stability curve in the ideal limit. Minimizing over k, gives
    kc=(1+q)1/4γ11/4γ41/4fBandσc(kc)=2(1+q)γ1γ4.4.7
    Figure 2b shows the difference in stability for dissipative versus non-dissipative dynamics. This adds further insight into breaking the apparent degeneracies associated with the small-dissipation limit of the MRI; also see [33,34]. In the limit as the background magnetic field vanishes, we find a necessary condition in terms of the magnetic Reynolds number,
    ReMSd2η>2π2+O(B2).4.8

    5. Nonlinear solutions

    Rescaling the time, space and amplitude variables simplifies the discussion of nonlinear solutions to (2.63) and (3.24). Assuming η=ν=0, and defining a non-dimensional length, time and amplitude leads to

    t2φ+(μ|zφ|2)z2φ+z4φ=0,5.1
    where μ=±1. It is straight forward in both the MHD and elastic cases to find the relevant re-scalings that yield (5.1).

    The completely non-local character of the nonlinearity in (5.1) implies a particularly interesting class of solutions. The spatial Fourier transform

    φk(t)12πφ(t,z)eikzdz5.2
    implies
    φ¨k(t)=k2μk2k2|φk(t)|2dkφk(t).5.3
    Assuming φ(t,y) is real implies the Hermitian symmetry φk(t)=φk(t). Each mode linearly self interacts, and couples to all other active modes only through the integrated feedback. For periodic solutions, the integral in (5.3) becomes a discrete sum. It is now clear that system is equivalent to a large collection of mean-field interacting Duffing oscillators.

    The original nonlinear system (with η=ν=0) is conservative, and the reduced system remains Hamiltonian with the total conserved energy

    H12(|φ˙k(t)|2k2(μk2)|φk(t)|2)dk+14k2|φk(t)|2dk2.5.4
    (5.3) results canonically from (5.4).

    The completely mean-field character of (5.3) implies interesting symmetries when considering the evolution of the complex-valued amplitude in terms of the real amplitude and phase

    φk(t)=rk(t)eiθk(t).5.5
    In this case, the feedback is independent of the phase such that the imaginary component of (5.3) implies
    ddt(rk(t)2θ˙k(t))=0.5.6
    That is, for each individual value of k, each ‘angular momentum’ remains constant in time
    Lkrk(t)2θ˙k(t).5.7
    The conservation of Lk results from the symmetry for each k, φkφkexp(iχk), for any time-independent χk.

    The system further reduces to the following real equations:

    r¨k(t)Lk2rk(t)3=k2μk2k2rk(t)2dkrk(t),θ˙k(t)=Lkrk(t)25.8
    The angular momentum Lk is an arbitrary function of k and is independent of time. The total energy now becomes
    H=12r˙k(t)2+Lk2rk(t)2k2(μk2)rk(t)2dk+14k2rk(t)2dk2.5.9

    In terms of the complex initial data,

    φ0,k=φk|t=0andφ˙0,k=φ˙k|t=0,5.10
    time-integrating (5.8), requires the angular momenta, phases, radii and radii velocities, respectively as,
    Lk=Im[φ0,kφ˙0,k],θk(0)=Im[logφ0,k],5.11
    and
    rk(0)=|φ0,k|andr˙k(0)=Re[φ0,kφ˙0,k]|φ0,k|.5.12

    Furthermore, the inverse Fourier transform of Lk gives a point-wise conserved quantity in the spatial domain,

    Λ(t,z)[φ˙(t,z+z)φ(t,z)φ(t,z+z)φ˙(t,z)]dz,5.13
    which represents the antisymmetric component of the cross-correlation function. For every z, ∂tΛ(t,z)=0. The conservation of Λ(t,z) is equivalent to the conservation of
    Λs(t)tφ(t,z)zsφ(t,z)dz,5.14
    for all odd values of s. The case s=1,3, respectively, correspond to magnetic-helicity and cross-helicity conservation in the original MHD system. Even though the system contains an infinite number of conserved quantities, it is not apparently integrable.

    (5.8) shows that the dynamics is equivalent to a collections of particles all moving in an averaged potential, and each individually conserving its angular momentum. The non-local character of the nonlinearity implies that every Fourier truncation of the system represent an exact solution. In other words, the system does not cascade energy to smaller scales than are present in the initial conditions. This unusual form of memory implies an extremely diverse class of solutions. The model displays one of the simplest types of mean-field dynamics, but with non-trivial results.

    Figure 3 shows a typical solution for a modest number of initial modes. The solution lies in a periodic domain 0≤z≤2π, with μ=+1. The initial conditions consist of K=20 Gaussian-random unit-amplitude complex-valued Fourier modes for both φ and φ˙. Using a fourth-order adaptive Runge–Kutta scheme, we time integrate the system of ODEs for 30 non-dimensional time units,

    φ¨k(t)=k2μk2k=1Kk2|φk(t)|2φk(t).5.15
    The spatial form of the solution is reconstructed such that
    φ(t,z)=k=1Kφk(t)exp(ikz)+c.c.5.16
    The computations were computed with the Dedalus code. Dedalus is a very general toolkit for computing the pseudo-spectral solution to a large number of PDEs and related problems. For more information and links to the source code, see dedalus-project.org. The script used to compute the numerical solutions can be found at https://bitbucket.org/gvasil/wnlmri-beam/
    Figure 3.

    Figure 3. Solutions to (5.1) for random initial conditions. The vertical squiggle on the left-hand side of each image represents the initial state of φ(t=0,z) and ∂tφ(t=0,z), respectively. The horizontal squiggle at the bottom of each image represents a slice through each space–time diagram at z=12. The z-axis runs from 0 to 2π, and time runs from 0 to 10. For φ, the plots are scaled between ±1.5. For ∂tφ, the plots are scaled between ±4. (Online version in colour.)

    Figure 3 shows a pattern of large-scale travelling waves with faster small-scale dynamics superimposed. The small-scale fast dynamics show up clearly in the velocity pattern, ∂tφ. The qualitative aspects of this pattern seem typical for an array of initial conditions. A more comprehensive numerical study of the space of solutions both with and without dissipation lies beyond the scope of this paper. For an example of the possible range-rich behaviours, Eugeni et al. [35] recently studied the forced nonlinear beam dynamics with multiple interacting degrees of freedom.

    6. Conclusion

    Heuristically, one can use the concept of entropy to interpret two different physical systems producing the same reduced governing equations. Near an instability, only a small number of degrees of freedom grow to significance. There are many fewer ways to organize a small number of modes than the large number active in a strongly driven system. Or, to borrow from Tolstoy, ‘Weakly nonlinear systems are all alike; every strongly nonlinear system is nonlinear in its own way’.1 This is especially true in the presence of symmetry. This fact is apparent in that so many systems reduced to a small set of well-known canonical equations. For example, the complex Ginzburg–Landau equation (and its further simplifications) derives systematically from an extremely wide range of physical starting points [36].

    This paper points out a new class of non-local models deriving from at least two different natural systems. More physical examples likely exist. In a particularly fundamental way, MHD is more similar to elastodynamics than pure hydrodynamics. The magnetic field provides a final material coordinate label, and therefore eliminates fluid particle relabelling symmetry, and conservation of potential vorticity. In elastodynamics, material particles contain explicit labels, and the issue of potential vorticity never really arises. This implies that fluid systems without conserved potential vorticity, and a spring-like restoring mechanism could produce similar dynamics as the MRI or buckling beam. Over-stable double-diffusive convection satisfies these conditions, and (speculatively) may provide an additional example. It is quite possible that this mean-field network of Duffing oscillators forms a new class of universal near-equilibrium dynamics.

    More generally, there are infinitely more ways to produce a non-local interaction than a local one. The multiple scales assumption allows for information to travel asymptotically fast and produce the instantaneous interaction we see in our model. In both the buckling beam, and MRI, the reason for the particular type of nonlinearity arises from conservation laws. In the elastic case, the integral terms result from fixing the total mass of the beam. In the MHD case, the integral terms arise from conservation of total linear momentum, and magnetic flux. In both cases, these quantities remain conserved with the inclusion of dissipative effects. Rotating and/or magnetized free-slip Rayleigh–Bénard convection, displays non-local nonlinear terms arising from conservation of total momentum and/or magnetic flux; just as in the MRI problem [37,38]. These correspondences perhaps imply that global conservation principles and non-locality relate more deeply. Intriguingly, the non-local terms in the MRI model allow for a very large class of symmetries and this hints at the deeper link to conservation principles.

    Elastodynamics and MHD probably part ways with the introduction of more physical ingredients. The systems are likely very different from each other in three dimensions. Dissipative boundary layer dynamics (filtered in this paper with judicious choice of boundary conditions), interactions with other fluid instabilities, and more complex geometric effects all likely pull the correspondence further apart. Nevertheless, the link between elastic buckling and the MRI produces interesting insight regarding the growth and saturation of the MRI, albeit in a regime far removed from traditional astrophysical applications. The type of derivation presented in this paper would probably work in a thin disc-like geometry, which is more suitable to accretion discs. This interesting case will produce several additional complications resulting from boundary conditions alone. Attempting to understand more complex systems requires adding dynamical elements back into the minimalist model and considering the consequences. Hopefully adding richer dynamical ingredients will still allow simple mathematical progress. But even when interactions become too numerous to consider analytically, understanding the relevant spatio-temporal scales and possible couplings will greatly help streamline numerical simulation of more complex models.

    Data accessibility

    This work does not have any experimental data.

    Acknowledgements

    The author would like to thank Edgar Knobloch for originally pointing out Holmes and Marsden's work on elastic buckling. The computations used the Dedalus code: dedalus-project.org. The author thanks the remainder of the Dedalus collaboration: Keaton Burns, Daniel Lecoanet, Jeff Oishi and Ben Brown for their significant contributions to the code project. The author thanks two anonymous referees for suggesting improvements to the manuscript.

    Funding statement

    The Australian Research Council supports the author through a Discovery Early Career Researcher Award, no. DE140101960. Part of this work was conducted with funding from the University of California Berkeley Theoretical Astrophysics Center.

    Conflict of interests

    I have no competing interests.

    Footnotes

    1 Tolstoy's original quote: ‘Happy families are all alike; every unhappy family is unhappy in its own way.’, concisely describes how some macro states can contain vastly different numbers of possible micro states. Anna Karenina was published within a couple years of Boltzmann's original mathematical formulation of entropy.

    References