Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences
Open AccessResearch article

Density of convex intersections and applications

Published:https://doi.org/10.1098/rspa.2016.0919

    Abstract

    In this paper, we address density properties of intersections of convex sets in several function spaces. Using the concept of Γ-convergence, it is shown in a general framework, how these density issues naturally arise from the regularization, discretization or dualization of constrained optimization problems and from perturbed variational inequalities. A variety of density results (and counterexamples) for pointwise constraints in Sobolev spaces are presented and the corresponding regularity requirements on the upper bound are identified. The results are further discussed in the context of finite-element discretizations of sets associated with convex constraints. Finally, two applications are provided, which include elasto-plasticity and image restoration problems.

    1. Introduction

    Convex constraint sets K as subsets of an infinite-dimensional Banach space X are common to many fields in mathematics such as calculus of variations, variational inequalities and control theory. Such constraints are induced by physical limitations of control and/or state variables, but also emerge through Fenchel dualization of convex problems; e.g. [13] for fundamental concepts in variational analysis. In this vein, given a set of functions satisfying an arbitrary convex constraint, density properties of more regular functions satisfying the same restriction are of utmost importance. In abstract terms, given some dense subspace Y of X, the central point of interest is whether the closure property

    K(Y)¯X=K,1.1

    with K(Y)={uY :uK}=KY , is fulfilled, and how this problem is intimately linked to the solution of constrained optimization and variational inequality problems.

    In the literature, problems of dense intersections appear in connection with the discretization of variational inequality problems in Sobolev spaces and the convergence analysis for finite-element methods under minimal regularity (e.g. [46]). Moreover, the limiting behaviour of singular perturbations of elliptic variational inequalities can be traced back to the density issue (see [7] and references therein). This also pertains to the deduction of a vanishing viscosity limit for hyperbolic variational inequalities with an obstacle constraint [8]. In the context of plasticity problems, certain density properties represent an important step towards the determination of appropriate relaxed formulations (cf. [9,10]). However, to the best of our knowledge, the investigation of problem (1.1) is restricted to special cases and the literature still lacks a general and systematic treatment of the density issue.

    To motivate the study of the abstract problem (1.1), §2 provides a novel unifying framework for various perturbation approaches to non-smooth constrained optimization and variational inequality problems. The general setting includes regularization, Galerkin approximation and singular perturbations, and, most remarkably, it allows to reduce the study of the corresponding limit problems for a wide range of practically relevant perturbations to the study of the density property (1.1). In particular, we prove that the dense intersection (1.1) is a necessary and sufficient stability condition for the retrieval of the original problem in the (joint) limit of vanishing regularization and/or discretization parameters.

    Starting from §3 we focus on the setting where X=X(Ω) is a (Rd-valued) vector space of functions over a bounded domain Ω of RN and K=K(X) denotes the subset of elements in X(Ω) bounded pointwise by a prescribed measurable function α:ΩR{+}, i.e.

    K(X(Ω))={wX(Ω):|w(x)|α(x) a.e. (almost everywhere) in Ω},
    with |⋅| denoting an Rd-norm. Particularly in this part, X(Ω) refers to a Lebesgue or Sobolev space and Y =Y (Ω) refers to the space of continuous or infinitely differentiable functions up to the boundary. We also use the notation K(X(Ω);|⋅|) whenever it is necessary to make the dependence on the norm |⋅| explicit. Despite the fact that a small number of specific density results for very regular bounds α are available [4,9,11], a systematic investigation of density properties in terms of the regularity of α seems not available in the literature.

    In order to close this gap, we prove new density results for continuous obstacles (§4), and we also consider different classes of discontinuous obstacles. In fact, in §4a, the density issue is studied in the context of the regularity of the obstacle as a Sobolev function. More precisely, we prove that results of the type (1.1) cannot be expected if the obstacle is just a Sobolev function by providing a counterexample. The density results are then proved to be valid even for certain classes of lower semicontinuous obstacles; see §4b,c. Subsequently, in §4d, a different approach is considered for obstacles that originate from the solution of a partial differential equation (PDE).

    In §5, we focus on the Mosco convergence of finite-element discretized convex sets, which, in general, is a delicate matter, and only a limited number of results for more regular obstacles are known (e.g. [4,5]). In this respect, the construction of a recovery sequence essentially reduces to the verification of density properties of the type (1.1). Making use of the density results provided by the preceding sections, we prove several new Mosco convergence results in the Hilbert spaces L2,H1 and H(div) for different types of finite-element discretizations of K, even for discontinuous obstacles α. The results are extended to a more general constraint setting involving pointwise restrictions on partial derivatives. We conclude the paper by presenting two important applications that further highlight the paramount significance of dense intersections. First, we consider the regularization of an elasto-plastic contact problem, where the closure property turns out to be fundamental for the efficient solution by a semismooth Newton method. Secondly, we discuss an example from total variation-based image restoration with a distributed non-smooth regularization parameter. Here, the density property arises as an essential condition for the equivalent reformulation of the problem in the Hilbert space H(div) by means of Fenchel duality.

    2. Motivation

    (a) Optimization with convex constraints

    In many variational problems, one seeks the solution in a given convex, closed and non-empty subset K of an infinite-dimensional Banach space (X,∥.∥). To illustrate the problem, let us consider the following abstract class of optimization problems:

    infF(u),over uX,s.t.uK.}2.1
    We assume that F:XR is continuous, coercive and sequentially weakly lower semicontinuous, but not necessarily convex. Thus, problem (2.1) admits a solution provided X is reflexive. The problem class (2.1) is ubiquitous, encompassing numerous fields, such as the variational form of PDEs, variational inequality problems of potential type, optimal control of PDEs with constraints on the state and/or control, and many other. The analysis of (2.1) and the design of suitable solution algorithms often involve the general concepts of perturbation or dualization methods comprising regularization, penalization or discretization approaches or possibly a combination of the latter (e.g. [15] and references therein). The central result of this section is that the stability of (2.1) with respect to a large class of perturbations can be characterized by the closure property (1.1), i.e.
    K(Y)¯X=K,
    where Y is some dense subspace of X (in the norm topology of X), and K(Y) is given by
    K(Y)={uY:uK}=KY.
    In what follows, we will identify a very general class of perturbations for which the stability analysis effectively reduces to the study of the density property (1.1).

    (i) A class of quasi-monotone perturbations

    To subsume as many of the above-mentioned methods as possible, we consider the sequence of perturbed problems

    infF(u)+Rn(u),over uX,2.2
    defined by a given sequence of functions
    Rn:XR{+},nN,
    that are perturbations of the indicator function iK:XR{+} in the following sense: there exist functions R_n:XR{+} and R¯n:XR{+} such that
    0R_nRnR¯nnN,
    having the additional properties
    R_nR_n+1nN,limn+R_n(u)=iK(u)uXandR_n is sequentially weakly lower semicontinuousnN}2.3
    and
    R¯nR¯n+1,nN,limn+R¯n(u)=iKY(u)uX.2.4
    We call mappings (Rn) that share the above features quasi-monotone perturbations of the indicator function iK with respect to the (dense) subspace Y . Note that no additional assumptions are made on (Rn) itself.

    The first main result of this section states that the dense intersection property implies the stability of any quasi-monotone perturbation scheme. The proof is deferred to appendix A.

    Theorem 2.1 (Sufficient condition)

    Let the Banach space X be reflexive or assume that the dual space X* is separable. For a closed, convex and non-empty set K⊂X, let (Rn) be a sequence of quasi-monotone perturbations of iK with respect to the dense subspace Y according to (2.2). If the density property (1.1) holds true, then F+iK is the Γ-limit of (F+Rn) in both, the weak and strong topology.

    Under the assumptions of theorem 2.1, one may infer that, provided each problem (2.2) admits a global minimizer un, each weak cluster point of the sequence of minimizers (un) is a global minimizer of (2.1); see [12] for an introduction to Γ-convergence. At the end of this section, it is further clarified that theorem 2.1 is sharp in the sense that the stability result in general fails if (1.1) does not hold. We also remark that in case the (sequential) weak and strong Γ-limits coincide, one usually uses the notion Mosco convergence.

    In the following, we present a selection of approximation methods that fit into the general class of perturbations defined by (2.2), which bear high practical relevance. In favour of generality, we do not leave the abstract setting.

    Example 2.2 (Tikhonov regularization)

    Let (Y,∥…∥Y) be a Banach space which is densely and continuously embedded into X. For a sequence of positive non-decreasing parameters (γn) with γn+ and fixed α>0, consider in (2.2) the Tikhonov regularization

    Rn(u)=iK(u)+12γnuYα,2.5
    where it is understood that Rn(u)=+ if uY . In fact, set R_n:=iK for all nN and R¯n:=Rn. Obviously, (2.3) and (2.4) are satisfied such that (Rn) fits into the context of quasi-monotone perturbations according to (2.2).

    Example 2.3 (Conforming discretization)

    Let X be a separable Banach space. Suppose (2.1) is approximated by a Galerkin approach using nested and conforming finite-dimensional subspaces Xn, i.e. XnX and XnXn+1 for all nN, such that the Galerkin approximation property

    nNXn¯X=X
    is fulfilled. The resulting discrete counterpart of problem (2.1) is given by (2.2) with Rn(u)=iKXn. Setting R_n=iK, (2.3) is clearly fulfilled. Define Y=nNXn, then (2.4) is fulfilled with R¯n=Rn.

    Example 2.4 (Combined Moreau–Yosida/Tikhonov regularization)

    Let X be a Hilbert space and (Y,∥…∥Y) be a Banach space that is densely and continuously embedded into X. For two sequences of positive non-decreasing parameters (γn),(γn′) with γn,γn+ and fixed α>0, consider the simultaneous Moreau–Yosida and Tikhonov regularization

    Rn(u)=γn2infvKuv2+12γnuYα,2.6
    with α>0 fixed, where it is understood that Rn(u)=+ if uY . Setting R_n(u)=(γn/2)infvKuv2, standard properties of the Moreau–Yosida regularization ensure that R_n satisfies (2.3) (e.g. [2], Prop. 17.2.1). Defining R¯n(u)=iK(u)+(1/2γn)uYα, (2.4) is verified as in the previous example.

    Example 2.5 (Conforming discretization and Moreau–Yosida regularization)

    Let X be a separable Hilbert space and (γn) a sequence of positive non-decreasing parameters converging to +. The combination of regularization and discretization leads to the definition

    Rn(u)=γn2infvKuv2+iXn(u),2.7
    where the sequence of spaces (Xn) is defined as in example 2.3. Setting R_n(u)=(γn/2)infvKuv2 and R¯n(u)=iKXn(u), (2.3) and (2.4) are fulfilled with Y=nNXn and the framework of (2.2) applies.

    Consequently, each of these perturbations is stable with respect to (2.1) provided the density result (1.1) is satisfied. It should also be emphasized that these examples only represent an assorted variety of perturbations that fit into the problem class (2.2).

    Moreover, the density property (1.1) is also a necessary condition for the stability of perturbation schemes in the following sense: first, the Γ-limit of the approximation schemes defined in examples 2.2 and 2.3 can be calculated using similar arguments as in the proof of theorem 2.1. In fact, under the same conditions on X, one obtains F+iKY¯ as the weak and strong Γ-limit in both cases. Secondly, in the combined approaches of examples 2.4 and 2.5, theorem 2.1 guarantees that F+iK is obtained as the weak-strong Γ-limit for any coupling of regularization parameter pairs [γn,γn′] and [Xn,γn], respectively. Let us put this statement into a perspective by means of the combined Galerkin/Moreau–Yosida approach (example 2.5). In this case, it is possible to prove the existence of a suitable combination of n and γn to recover F+iK in the Γ-limit without resorting to the density property (1.1), see [13], Prop. 2.4.6. However, the proof is non-constructive and thus not immediately useful for the design of a stable numerical algorithm. On the other hand, if (1.1) is violated, the Γ-convergence to the original problem (2.1) cannot be guaranteed independently from the choice of the regularization/discretization parameter pair. In fact, the following result, which we prove in appendix A, holds true.

    Proposition 2.6 (Necessary condition)

    Consider example 2.5 with the corresponding definitions ofYand (Rn). Further suppose thatKY¯K. Then for allxKKY¯there exists a strictly increasing sequence (γn) withγnsuch that

    F(yn)+Rn(yn)F(x),
    for all (yn)⊂Xwithynx, i.e. there exists no recovery sequence atxin the norm topology.

    The analogous statement is valid in the case of combined Moreau–Yosida/Tikhonov regularizations given a fixed sequence (γn′); cf. example 2.4. In conclusion, theorem 2.1 is sharp with respect to condition (1.1) in the sense of proposition 2.6 and the preceding discussion.

    (b) Elliptic variational inequalities

    The density of convex intersections of the type (1.1) is also of fundamental importance for the analysis of perturbations of variational inequalities. Assuming X to be a Hilbert space and KX non-empty, closed and convex, we consider the general variational inequality problem of the first kind,

    find uX:Au,vu+iK(v)iK(u)l,vu,vX;2.8
    e.g. [7,14] for an introduction. Here, lX* is a linear, bounded operator and A:XX* denotes a, in general, nonlinear operator on X. We further assume A to be Lipschitz continuous and strongly monotone, i.e. there exists κ>0 with
    AvAu,vuκvu2,u,vX.
    In the following, we investigate three main classes of perturbations of (2.8) and their relation to the density properties of convex intersections.

    (i) Quasi-monotone perturbation

    Consider the perturbed variational inequality problem,

    find unX:Anun,vun+Rn(v)Rn(un)ln,vun,vX,2.9
    where An and ln are appropriate perturbations of A and l, respectively, and (Rn) is a quasi-monotone perturbation of iK with respect to a dense subspace Y of X. The stability of the approximation scheme (2.9) hinges on the density property (1.1). In fact, if the latter condition is fulfilled, then the sequence (Rn) Mosco converges to iK provided R_n is weakly lower semicontinuous. Under mild assumptions on (An) and (ln) one may then invoke known stability results, cf. [7, p.99, 15], to conclude the consistency of the perturbation scheme with respect to the limit problem (2.8).

    (ii) Galerkin approximation of variational inequalities

    In general, finite-dimensional approximations of K are neither conforming nor nested as it was the case in examples 2.3 and 2.5, where K was ‘discretized’ by KXn, which is numerically realizable only in special cases. Instead, it is often more favourable to consider non-nested approximations KnXn that may contain infeasible elements, such that KnK does not hold true in general [4,5]. As a result, the finite-dimensional variational inequality problems,

    find unXn:Anun,vun+iKn(v)iKn(un)ln,vun,vXn,2.10
    do not fit into the framework of (2.9). Again, under mild assumptions on (An) and (ln), the Mosco convergence of (Kn) to K ensures that the approximation (2.10) is stable with respect to the limit problem (2.8). However, Mosco convergence requires the existence of a recovery sequence (see definition 4.5) for any element uK. To construct this sequence in the context of finite-element methods, one may use an interpolation procedure which typically is only defined on the (supposedly) dense subset KY of K, where Y=Ck(Ω¯) for some kN0 (cf. [4], II, Theorem 2.3 and §5). This leads again to problem (1.1).

    (iii) Singular perturbations

    The closure property (1.1) also plays a role in the limiting behaviour of singular perturbations. In fact, let A1:YY * be a Lipschitz continuous and strongly monotone operator on a Hilbert space (Y,∥…∥Y) that embeds densely and continuously into X. For a sequence of regularization parameters (γn) with γn+ consider the perturbed problems,

    find unKY:(A+1γnA1)un,vunl,vun,vKY.2.11
    Observe that problem (2.11) admits a unique solution unKY provided that KY is closed in Y . The appropriate limit problem is then given by
    find uKY¯X:Au,vul,vu,vKY¯X.2.12
    Note that (2.12) corresponds to the initial variational inequality problem if the density property (1.1) holds true. In this case, the sequence (un) converges strongly in X to the solution of (2.8). Here, the assumptions on A1 may be alleviated. This type of application also plays a role in the analysis and the design of algorithms for hyperbolic variational inequalities through the vanishing viscosity approach. For details, [7, section 4.9, 8] may be consulted.

    3. Density results for continuous obstacles

    We first fix some notation. In this section, ΩRN denotes a bounded Lipschitz domain. The space of functions that are restrictions to Ω of smooth functions with compact support on RN is denoted by D(Ω¯),

    D(Ω¯)={φ|Ω:φCc(RN)}.
    The standard Lebesgue and Sobolev spaces over Ω are denoted by Lp(Ω),W1,p(Ω) and W01,p(Ω), and we also employ the spaces
    H(div;Ω)={uL2(Ω;RN):divuL2(Ω)}
    and
    H0(div;Ω)=Cc(Ω;RN)¯H(div;Ω)={uH(div;Ω):uν=0 on Ω}.
    In the recent paper [11], it has been shown that for any αC(Ω¯) with
    essinfxΩα(x)>0,3.1
    the following density result for the spaces X(Ω){Lp(Ω)d,W01,p(Ω)d,H0(div;Ω)}, and 1p<+, holds true:
    K(Cc(Ω)d)¯X(Ω)=K(X(Ω)),3.2
    where the constraint set K(X(Ω)) with respect to a given subspace
    X(Ω)L1(Ω)d
    is defined by a pointwise constraint on an arbitrary norm |⋅| on Rd, i.e.
    K(X(Ω)):={wX(Ω):|w(x)|α(x) a.e. in Ω}.3.3
    Here, α:ΩR{+} is a given non-negative Lebesgue measurable function. It is further understood that d=N in (3.2) if X(Ω)=H0(div;Ω).

    When considering the case X=W1,p instead of W01,p in (3.2), the choice of the approximating sequence from [11], Theorem 1, which relies on the trivial extension of Sobolev functions, fails. As a result, a different extension operator has to be employed.

    Theorem 3.1

    Let αC(Ω¯) fulfil (3.1) and 1p<+. Then it holds that

    K(D(Ω¯)d)¯W1,p(Ω)d=K(W1,p(Ω)d),3.4
    i.e. K(D(Ω¯)d) is dense in K(W1,p(Ω)d) with respect to the norm topology in W1,p(Ω)d.

    Proof.

    Let wK(W1,p(Ω)d). Since Ω is a bounded Lipschitz domain we may extend w to a function in W1,p(RN)d using for each component the extension-by-reflection operator. The resulting operator

    E:W1,p(Ω)dW1,p(RN)d3.5
    has the properties Ew|Ω=w for all wW1,p(Ω)d and EL(W1,p(Ω)d,W1,p(RN)d); see, for instance, [16]. Since E is obtained by a partition of unity argument using local reflection with respect to the Lipschitz graphs into which ∂Ω can be decomposed, the property |w(x)|≤α(x) in Ω is preserved by the extension in that
    |(Ew)(x)|EC(Ω¯)α(x), a.e. xRN,3.6
    where EC(Ω¯):C(Ω¯)C(RN) denotes the application of the extension by reflection procedure to bounded uniformly continuous functions, i.e. (EC(Ω¯)α)|Ω=α. Further inspecting the construction of E, it may also be observed that the support of Ew is compactly contained in RN. Analogously, we obtain EC(Ω¯)αCc(RN). For a sequence (ρn) of smooth mollifiers
    ρn(x)=nNρ(nx),3.7
    where
    ρD(RN),ρ0,ρ(x)=0if |x|1,Ωρdx=1,
    we define the approximating sequence Sn(w,Ω) to w by
    Sn(w,Ω)(x):=(ρnEw)(x)=RNEw(y)ρn(xy)dy,xRN.3.8
    It is well known that
    Sn(w,Ω)|Ωw in W1,p(Ω)das n,3.9
    and, since Ew has compact support in RN, it holds that Sn(w,Ω)|ΩD(Ω¯)d. In order to achieve feasibility, we use the scaling sequence
    βn:=(1+supxRN|αn(x)EC(Ω¯)α(x)|minxΩ¯α(x))1,
    where αn(x):=((EC(Ω¯)α)ρn)(x),xRN. Since EC(Ω¯)αCc(RN), αn converges to EC(Ω¯)α uniformly in RN and thus βn→1 as n. In addition, (3.6) together with (3.8) yields |Sn(w,Ω)|≤αn(x) for xRN such that
    βn1α(x)=α(x)+supxRN|αn(x)EC(Ω¯)α(x)|minxΩ¯α(x)α(x)αn(x)|Sn(w,Ω)|,3.10
    for all xΩ. As a result, βnSn(w,Ω)K(D(Ω¯)d) and, taking account of (3.9), the proof is accomplished. ▪

    Remark 3.2 (boundary conditions)

    • (i) In order to incorporate a homogeneous Dirichlet boundary condition in the context of theorem 3.1, one may use an additional reparametrization to construct a suitable approximating sequence; see [11].

    • (ii) If the set K(W1,p(Ω)d) is additionally restricted by an inhomogeneous Dirichlet boundary condition given by a function gW1−1/p,p(∂Ω)d with |g(x)|≤α(x) on ∂Ω, the proof of theorem 3.1 fails. In fact, the sequence (3.8), which is based on the standard mollifier, does not preserve a given trace condition. In any case, the regularity of g (and ∂Ω) determines an a priori regularity limitation for the functions in Y in order to be compatible with a closure property analogous to (3.4), e.g. if Y=C(Ω¯) and gC(∂Ω), then KY =∅. In this case, a different mollification approach needs to be pursued; cf. also §7 for an outlook on this matter.

    4. Density results for discontinuous obstacles

    (a) Obstacles in Sobolev spaces

    Note that theorem 3.1 requires continuous obstacles. In some applications, such as in the regularization and discretization of elasto-plastic contact problems or image restoration problems (see §6), it may be useful to consider obstacles that are not continuous. Under such circumstances, the following example shows that density properties of the type (3.2) or (3.4) cannot be expected if the obstacle is just a Sobolev function: without loss of generality, assume that 0ΩRN with N≥2 and denote by

    Bε(x):={yRN:|xy|2ε},
    the open ball with centre xRN and radius ε>0 with respect to the Euclidean norm |⋅|2 in RN. Let {xk:kN} be a countable dense subset, i.e.
    {xk:kN}¯=Ω¯,
    and r>0 such that Br(0)⊂Ω. Consider the function
    φ(x):=φ~(x)ln(ln(c|x|21)),cer fixed,4.1
    where φ~Cc(Br(0)) is a smooth cut-off function with φ~(x)0 for all xBr(0) and φ~1 on Br/2(0). We note that φ is non-negative with a singularity at the origin, and its zero extension belongs to W1,N(RN); cf. [17, Example 4.43]. Further set
    g(x):=k=1k2φ(xxk),xΩ,4.2
    and note that gW1,N(Ω) with g being unbounded at each xk; see [18], p.247, Example 4. Further take a function ϕC1(R) with 0≤ϕ(t)<1, ϕ(t)→1 for t+ and ϕ′ uniformly bounded in R. By the chain rule for Sobolev functions, the obstacle
    α:=2ϕg4.3
    belongs to W1,N(Ω); e.g. [14], Lemma A.3. Notice also that α is bounded away from zero and that it is basically equal to 1 on the dense set {xk:kN}. Consequently, any continuous function w with wα a.e. in Ω fulfils w≤1 on Ω:

    Assume that the latter implication is false. Then there exist k0N as well as μ>0,δ>0 such that

    w(x)1+μxBδ(xk0).4.4
    Let R>0 be such that ϕ(t)≥1−μ/2 for all tR. By continuity, there also exists δ′>0 such that φ(xxk0)Rk02 a.e. in Bδ(xk0) such that
    g(x)k02φ(xxk0)R, a.e. xBδ(xk0),
    which implies
    w(x)α(x)=2ϕ(g(x))1+μ2, a.e. xBδ(xk0),
    contradicting (4.4). Hence, any sequence of continuous functions approximating α from below is bounded above by 1. However, as α(x)>1 for a.e. xΩ by definition, and convergence in the norm topology of Lp(Ω) implies convergence pointwise a.e. (along a subsequence), we obtain that
    αK(Lp(Ω))K(C(Ω)Lp(Ω))¯Lp(Ω),4.5
    for any 1p+, and
    αK(W1,p(Ω))K(C(Ω)W1,p(Ω))¯W1,p(Ω),4.6
    for all pN, where α is defined by (4.3).

    Remark 4.1 (Complements on the counterexample)

    An interesting point in the preceding counterexample is the structure of the set of singularities S where g(x) is not well defined as a real number by the infinite sum (4.2), if φ from (4.1) is understood as a function in C(Ω∖{0}). Extending φ to Ω by setting φ(0):=+, we obtain g(xk)=+ for all kN and, understanding g:ΩR{+} as an extended real-valued function, we arrive at the following definition:

    S:={xΩ:g(x)=+ with g(x) defined by (4.2) where φ(0)=+}.
    Using the Baire category theorem, one may show that the set S is a non-meagre set with vanishing Lebesgue measure [19].

    The previous construction of the counterexample is the basis for the following result.

    Theorem 4.2

    Let ΩRN be a bounded Lipschitz domain. The following density results hold true:

    • (i) Let N≥2 and 1p+. Then there exists an obstacle αW1,N(Ω)L(Ω) satisfying (3.1) such that

      K(C(Ω)Lp(Ω))¯Lp(Ω)K(Lp(Ω)),
      the inclusion being strict.

    • (ii) Let N≥2 and 1≤p≤N. Then there exists an obstacle αW1,N(Ω)L(Ω) satisfying (3.1) such that

      K(C(Ω)W1,p(Ω))¯W1,p(Ω)K(W1,p(Ω)),
      the inclusion being strict.

    • (iii) Let N<p<+ or p=N=1. For any measurable obstacle function α:ΩR{+} which satisfies (3.1), it holds that

      K(D(Ω¯)d)¯W1,p(Ω)d=K(W1,p(Ω)d).

    Proof.

    We only prove assertion (iii) since (i) and (ii) follow immediately from (4.5) and (4.6). As a consequence of the Sobolev imbedding theorem, any wK(W1,p(Ω)d) is contained in C(Ω¯)d. Let wK(W1,p(Ω)d). Setting

    α^(x)=max(|w(x)|,essinfxΩα(x)),
    it follows that |w(x)|α^(x) a.e. in Ω. Since α^C(Ω¯) and (3.1) holds with α^ instead of α, we may invoke theorem 3.1 to infer that there exists a sequence (wn) with wnD(Ω¯)d, wnw in W1,p(Ω)d and |wn(x)|α^(x)α(x) a.e. in Ω. This entails that wnK(D(Ω¯)d) for all nN, which accomplishes the proof. ▪

    We immediately infer the corresponding statements for Sobolev spaces incorporating homogeneous Dirichlet boundary conditions.

    Corollary 4.3

    LetΩRNbe a bounded Lipschitz domain. The following density results hold true:

    • (i) LetN≥2 andpN. Then there exists an obstacleαW1,N(Ω)L(Ω)satisfying (3.1) such that

      K(C(Ω)W01,p(Ω))¯W01,p(Ω)K(W01,p(Ω)),
      the inclusion being strict.

    • (ii) LetN<p<+orp=N=1. For any measurable obstacle functionα:ΩR{+}which satisfies (3.1) it holds that

      K(Cc(Ω)d)¯W01,p(Ω)d=K(W01,p(Ω)d).

    Proof.

    • (i) Define the upper bound α by (4.3). Let φ^Cc(Ω) be a smooth cut-off function with 0φ^1 a.e. on Ω and φ^1 except on a sufficiently small neighbourhood of ∂Ω. Then it holds that αφ^K(W01,p(Ω)) and the assertion now follows directly from the discussion preceding remark 4.1.

    • (ii) Taking account of (3.2), statement (ii) can be proven as theorem 4.2 (iii). ▪

    (b) Lower semicontinuous obstacles and Lebesgue spaces

    The preceding counterexample provides a regularity limit in terms of the upper bound α for which the density property (3.2) in the space X(Ω)=Lp(Ω)d can be expected to hold. In this regard, however, uniform continuity is far from being a necessary condition. In order to enlarge the space of obstacles compatible with (3.2), we first consider upper bounds that allow for a lower semicontinuous representative, i.e. there exists a lower semicontinuous function in the equivalence class of functions that are Lebesgue-almost everywhere equal to α.

    Theorem 4.4

    Let ΩRN be a bounded Lipschitz domain and 1p<+. If α:ΩR{+} has a lower semicontinuous representative that fulfils (3.1), then it holds that

    K(Cc(Ω)d)¯Lp(Ω)d=K(Lp(Ω)d).

    Proof.

    Let wK(Lp(Ω)d). Consider a lower semicontinuous function α:ΩR{+} that fulfils (3.1). Without loss of generality, we may assume that infxΩα(x)>0. Denote by α~ the extension of α given by α~(x):=α(x),xΩ, α~(x):=infxΩα(x) on RNΩ, and note that α~ is lower semicontinuous (l.s.c.) on RN. The Lipschitz regularization of α~,

    αn(x)=infyRN{α~(y)+nxy},
    yields a sequence (αn) with αnC(Ω¯), infxΩα(x)αn(x)α(x) for all xΩ,nN and αn(x)→α(x) a.e. in Ω; see, e.g. [2], Theorem 9.2.1. Now consider the functions
    wn(x):=min{|w(x)|,αn(x)}w(x)|w(x)|,
    where it is understood that wn(x):=0 if w(x)=0. It follows from Lebesgue's theorem on dominated convergence that wnw in Lp(Ω)d. Further observe that wnKn(Lp(Ω)d) where
    Kn(X(Ω)):={wX(Ω):|w(x)|αn(x) a.e. on Ω}.
    Let ε>0. According to (3.2), for each nN, wn can be approximated by a smooth function w~nKn(Cc(Ω)d)K(Cc(Ω)d) such that
    wnw~nLp(Ω)d<ε2.
    For sufficiently large n, we conclude that
    ww~nLp(Ω)dwwnLp(Ω)d+wnw~nLp(Ω)d<ε2+ε2=ε,4.7
    which concludes the proof. ▪

    We proceed by considering the important special case of a piecewise continuous upper bound; suppose there exists a partition of Ω into open subsets ΩlΩ with Lipschitz boundary such that Ω¯=l=1LΩ¯l, ΩiΩj=∅ for ij and

    α|ΩlC(Ω¯l),infxΩlα|Ωl(x)>0,l=1,,L.4.8
    Theorem 4.4 ensures that for obstacles of this class the density result in the norm topology of the Lp-spaces holds true.

    (c) Lower semicontinuous obstacles and Sobolev spaces

    Conditions on the obstacle α so that the density results for Sobolev spaces hold can be relaxed from assuming that αC(Ω¯) to lower regularity requirements with the aid of Mosco convergence of closed and convex sets. The following definition goes back to [15].

    Definition 4.5 (Mosco convergence)

    Let X be a reflexive Banach space and (Kn) a sequence of closed convex subsets with KnX for all nN. Then KnMK as n+, i.e. (Kn) is said to Mosco converge to the set KX, if and only if

    K{vX:((vk)X:vkKnkkN,vkv)}M 1
    andK{vX:((vn)X,NN:vnKnnN,vnv)}.M 2

    Here, (Knk) denotes an arbitrary subsequence of (Kn) and the subset notation (vk)⊂X has to be understood in the sense that {vk}⊂X. The following class of obstacles encompasses functions in W1,q(Ω) that fulfil a generalized lower semicontinuity condition.

    Definition 4.6

    We denote by Wq(Ω) for q≥1 the set of functions αW1,q(Ω) for which there exists a sequence of functions (αn) with αn satisfying (3.1), αnα a.e. in Ω and αnC(Ω¯)W1,q(Ω) for all nN such that αnα in W1,q(Ω).

    Note that the class Wq(Ω) is strictly contained in W1,q(Ω). Additionally, any obstacle αWq(Ω) has a lower semicontinuous representative, which follows easily from definition 4.6 and by extraction of a pointwise almost everywhere converging subsequence. However, the functions in Wq are not necessarily continuous: it suffices to consider the example from (4.1) for Ω=Br(0), N>1 and

    α(x)=ln(ln(c|x|1)),cer fixed.4.9
    It follows that αW1,q(Ω) for all qN (see [17], Example 4.43), αC(Ω¯), and the sequence (αn) defined as αn(x)=min(α(x),n) for nN satisfies the requirements of the definition of Wq(Ω).

    Theorem 4.7

    Let ΩRN be a bounded Lipschitz domain. Let 1p< and αWp(Ω). Then the following density results hold true:

    K(D(Ω)d;||)¯W01,p(Ω)d=K(W01,p(Ω)d;||)andK(D(Ω¯)d;||)¯W1,p(Ω)d=K(W1,p(Ω)d;||),
    where K(X(Ω);||)={wX(Ω):|w(x)|α(x) a.e. xΩ}.

    Proof.

    Without loss of generality, consider the one-dimensional case d=1. Let wK(W01,p(Ω);||) and (αn)⊂W1,p(Ω) according to definition 4.6. By Mazur's lemma, we may as well assume that (αn) converges strongly to α in W1,p(Ω) since convex combinations preserve order and continuity. Hence, one obtains the Mosco convergence result

    Kn±(W01,p(Ω))MK±(W01,p(Ω))
    for the unilateral constraint sets
    Kn(X(Ω)):={wX(Ω):w(x)αn a.e. in Ω},Kn+(X(Ω)):={wX(Ω):w(x)αn a.e. in Ω},K(X(Ω)):={wX(Ω):w(x)α a.e. in Ω}andK+(X(Ω)):={wX(Ω):w(x)α a.e. in Ω}.
    Consequently, there exist two recovery sequences,
    wn±Kn±(W01,p(Ω)),4.10
    with wn±w in W01,p(Ω). Using the continuity of
    max(,0),min(,0):W01,p(Ω)W01,p(Ω),
    it follows that the sequence
    wn=max(wn+,0)+min(wn,0),
    converges to w in W01,p(Ω). Moreover, it holds that |wn|≤αn for all nN. For each nN, the assumptions on αn allow to use (3.2) to infer the existence of a smooth function w~nCc(Ω) with |w~n|αnα a.e. in Ω that approximates wn arbitrarily well. Using wnw in W01,p(Ω)d, the assertion follows by an ε/2-argument as in (4.7). The proof for the case X(Ω)=W1,p(Ω)d follows analogously by invoking theorem 3.1. ▪

    (d) Supersolutions of elliptic partial differential equations

    By now, density properties for pointwise constraints in Sobolev spaces of the type

    K(Cc(Ω)d)¯W01,p(Ω)d=K(W01,p(Ω)d),orK(D(Ω¯)d)¯W1,p(Ω)d=K(W1,p(Ω)d),
    have been obtained on the basis of mollification and a subsequent procedure to enforce feasibility. An alternative approach is the approximation of a function via the solution of an appropriate sequence of elliptic PDEs. Using standard regularity theory, one may prove higher regularity of the approximating sequence and one is left to prove feasibility. In this section, we focus on obstacles which are solutions of an elliptic PDE. Therefore, consider a general second-order differential operator A in divergence form;
    A=i,j=1Nxiaij(x)xj+i=1nbi(x)xi+c(x),4.11
    where aij,bi,cL(Ω) for 1≤i,jN. Here, the matrix [aij(x)] is symmetric a.e. and uniformly elliptic, i.e. there exists a κa>0 such that
    i,j=1Naij(x)ξiξjκa|ξ|2,ξRN,
    for a.e. xΩ. It is further assumed that aij,bi,c are such that A:H01(Ω)H1(Ω) is strongly monotone, i.e. there exists κ>0 such that
    Au,uκuH01(Ω)2,uH01(Ω),
    where 〈…,…〉 denotes the duality pairing in H−1(Ω). For example, this is the case if bi≡0 for 1≤iN and c(x)≥0 a.e. in Ω. We call a function αH1(Ω) weak supersolution with respect to the elliptic operator A, if ≥0 in the H−1(Ω)-sense, that is
    Aα,v0,vH01(Ω),v0 a.e. in Ω.4.12
    The subsequent theorem covers density properties for obstacles that are weak supersolutions of an elliptic PDE of type (4.11).

    Theorem 4.8

    Let Ω be a bounded domain. Let α∈H1(Ω) be a weak supersolution for some A as in (4.11) in the sense of (4.12), with α≥0 on ∂Ω. For X(Ω){L2(Ω)d,H01(Ω)d}, it holds that

    K(Y(Ω),||)¯X(Ω)=K(X(Ω),||),
    in the following cases:
    • (i) aijC0,1(Ω¯) or aij∈C1(Ω): Y(Ω)=(Hloc2(Ω)H01(Ω))d,

    • (ii) ∂Ω∈C1,1 or Ω convex, aijC0,1(Ω¯): Y(Ω)=(H2(Ω)H01(Ω))d,

    • (iii) aij,bi,c∈Cm+1(Ω), mN0: Y(Ω)=(Hlocm+2(Ω)H01(Ω))d and

    • (iv) ∂Ω∈Cm+2, aij,bi,cCm+1(Ω¯), mN0: Y(Ω)=(Hm+2(Ω)H01(Ω))d.

    Proof.

    Without loss of generality, assume d=1. First observe that the maximum principle implies α(x)≥0 a.e. in Ω. Let wK(X(Ω)) be arbitrary. Consider the sequence (wn), where wn is defined as the unique solution to the problem,

    find yH01(Ω):1nAy+y=w in H1(Ω).4.13
    We denote by Tn the solution mapping to (4.13), i.e. wn=Tn(w).

    Step 1: Tn-invariance of K(H01(Ω)): We now prove that for any nN, we have that −αwnα a.e., i.e.

    Tn:K(L2(Ω))K(H01(Ω)),4.14
    given that ≥0 in the H−1(Ω). Proceeding as in [20], we consider (wnα)+ as a test function on (4.13) and add to both sides −〈(1/n)+α,(wnα)+〉. Then,
    κn(wnα)+H01(Ω)2+(wnα)+L2(Ω)2(1nA+I)(wnα),(wnα)+wα1nAα,(wnα)+1nAα,(wnα)+0,
    where we have used that wα≤0 a.e. in Ω. Therefore, wnα a.e. in Ω. Analogously, we obtain that wn≥−α a.e. by considering (−αwn)+ as a test function and by adding to both sides −〈(1/n)+α,(−αwn)+〉. This proves (4.14), i.e. wnK(H01(Ω)).

    Step 2: Some convergence results for singular perturbations.

    The desired convergence modes of the approximating sequences rely on standard arguments for singular perturbations, cf. [7], Theorems 9.1 and 9.4 for the case of singularly perturbed variational inequalities. First, for yL2(Ω) it holds

    limnyn=y in L2(Ω)y^n:=Tn(yn)y in L2(Ω).4.15
    Second, for yH01(Ω), we prove that
    limnyn=y in H01(Ω)limny^n=y in H01(Ω).4.16
    In fact, since ynH01(Ω) and A is strongly monotone, we observe that
    κny^nynH01(Ω)2+y^nynL2(Ω)2(1nA+I)(y^nyn),y^nyn=1nAyn,yny^n1nAynH1(Ω)yny^nH01(Ω),
    where we have used that y^n solves (4.13) with yn as right-hand side. Hence (y^n) is bounded in H01(Ω). Employing (4.15) one obtains that y^ny in H01(Ω) along a subsequence, and by uniqueness, it holds y^ny for the entire sequence (y^n). Finally, from the inequalities above, we have
    κlim supn|y^nyn|H01(Ω)2lim supnAyn,yny^n=0,
    so that y^n=Tn(yn)y in H01(Ω) and thus (4.16) is proven.

    Thirdly, in addition to wn=Tn(w), we define wnq=Tnq(w) where Tnq(w):=Tn(Tnq1(w)) for qN,q2, Tn1(w):=Tn(w)=wn and wn0:=w. It can be deduced from (4.15) and (4.16) by induction that

    limnwnq=win L2(Ω),qN{0},4.17
    for wL2(Ω), and
    limnwnq=win H01(Ω),qN{0},4.18
    for wH01(Ω), respectively.

    Step 3: Regularity and convergence of the approximating sequences

    The extra regularity of the H01(Ω)-solution Tn(w) to (4.13) is different with respect to the statement cases: if aijC0,1(Ω¯) or aijC1(Ω) for 1≤i,jN, the solution Tn(w) belongs to H01(Ω)Hloc2(Ω) (see [21] for the first case and [18] for the second one). The solution Tn(w) belongs to H01(Ω)H2(Ω) if ∂Ω is C1,1-smooth [21] or when Ω is convex [22].

    In case wK(L2(Ω)), (4.15) with ynw ensures that wnw in L2(Ω). In conjunction with the regularity and the feasibility of wn=Tn(w) described above, we have then established (i) and (ii) for X(Ω)=L2(Ω). Secondly, note that if wK(H01(Ω)), then wnw in H01(Ω) by (4.16) with ynw, and as seen above, wnK(H01(Ω)). This, together with the regularity of wn=Tn(w) established above, proves in turn (i) and (ii) for X(Ω)=H01(Ω).

    It is left to argue for (iii) and (iv) as follows. If aij,bi,cCm+1(Ω) for 1≤i,jN, then for each nN, the operator Tn has the following increasing regularity properties [18],

    wHlock(Ω)Tn(w)Hlock+2(Ω)H01(Ω),0km;
    and if aij,bi,cCm+1(Ω¯) for 1≤i,jN and ∂Ω is of class Cm+2, for each nN,
    wHk(Ω)Tn(w)Hk+2(Ω)H01(Ω),0km.
    Finally, this proves (iii) given that wnqHlocm+2(Ω)H01(Ω) for 2qm+2, wnqK(H01(Ω)), and wnqw as n in L2(Ω) or H01(Ω) depending on the regularity of w, cf. (4.17) and (4.18). The analogous reasoning applies to (iv). ▪

    Let us briefly comment on the relation to the density results from theorem 4.4 and theorem 4.7. First, note that we do not require the obstacle to be bounded away from zero as we did in the preceding paragraphs. Secondly, the maximal regularity of the feasible approximation hinges on the coefficients of the elliptic operator associated with the obstacle and the smoothness of the boundary. Concerning the semicontinuity requirements of the upper bound, a classical result from Trudinger [23], Cor. 5.3 for the case without lower order terms (bi≡0,c≡0) states that any weak supersolution in the sense of (4.12) is upper semicontinuous. By contrast, the consideration of upper bounds that are weak subsolutions of an elliptic PDE is not useful as these functions may easily fail to be non-negative on Ω. For example, this is the case if a weak subsolution satisfies a Dirichlet boundary condition.

    5. Application to finite elements

    (a) Finite-element discretized convex sets

    In the following, we investigate the issue of the Mosco convergence (definition 4.5) of finite-dimensional approximations Kn of a convex constraint set K(X(Ω)) of the type (3.3); see §2b(ii) for a general motivation in the context of variational inequality problems. In this section, it is assumed that the sets (Kn) result from a suitable finite-element discretization such that the parameter n is associated with a sequence of mesh widths (hn) tending to zero. The convergence of (Kn) in the sense of definition 4.5 ensures that the solutions of the discrete problems converge to the solution of the original infinite-dimensional problem irrespectively of the regularity of the data or the obstacle defining K(X(Ω)); see [7], ch. 4, Theorem 4.1. Mosco convergence results of this type are rarely found in the literature and are typically confined to simpler constraint sets and higher regularity assumptions on the obstacle; see, for instance, [4] for the case of an H1(Ω)C(Ω¯)-bound in the context of the obstacle problem. The density results from the preceding sections provide the basis for new Mosco convergence results under minimal regularity (of the solution) and under weaker assumptions on the regularity of the obstacle α. We further provide novel Mosco convergence results for discretized constraints on partial derivatives, including Raviart–Thomas finite-element approaches for problems in H(div). As a general rule, density results of the type (1.1) represent a powerful means to verify the convergence of finite-element methods for convex constrained problems under minimal regularity. Applications involving constraint sets of the type (3.3) with low regularity of α are manifold and comprise, for instance, the discretization of variational problems in mechanics, such as in elasto-plasticity with hardening [24], and in image restoration, with regard to the predual problem of TV-regularization [25]. Moreover, the issue occurs in fixed point-based approaches to the solution of quasi-variational inequalities through the implicit definition of obstacles.

    Remark 5.1

    In some textbooks on finite-dimensional approximations of variational inequalities, cf. e.g. [4,6], condition (M2) is replaced by the following criterion:

    there exists a dense subset K~K and an operator rn:K~Xsuch that for all vK~ it holds rnvv in X and there exists n0Nsuch that rnvKn for all nn0.}M2′
    It is easy to show that (M2′) implies (M2). In fact, let vK and denote by πKnv its (not necessarily uniquely determined) projection onto Kn. By density, for ε>0, there exists vεK~ such that ∥vεv∥≤ε. Thus, it holds
    vπKnv=infvnKnvvnvrnvεε+vεrnvε,
    for sufficiently large n such that limnvπKnvε, where ε was arbitrary.

    The condition (M2′) turns out to be convenient especially in the context of finite-dimensional approximations, where (rn) is given by suitable interpolation operators, which typically are only well defined on a dense subset Y (Ω) of X(Ω) giving rise to sets K~ of the type K(Y (Ω)). This is precisely the point where the density results of §3 are needed.

    Note that Mosco convergence is a powerful tool whenever the discrete spaces are fixed a priori, i.e. regardless of the data of the specific problem. The resulting sequence of finite-dimensional problems can be understood as an approximation of any problem in a given problem class.

    By contrast, adaptive finite-element methods intend to design the sets Kn in order to approximate the solution of a specific problem. However, rigorous convergence proofs with regard to adaptive discretizations of variational inequalities are restricted to special cases and usually rely on rather strong assumptions. For instance, in the case of the obstacle problem with a piecewise affine obstacle, we mention the article [26]. Moreover, density results may still be useful in the convergence analysis of adaptive schemes which require interpolation operators (cf. [27]).

    (b) Finite-element spaces and interpolation operators

    In this section, we assume that ΩRN is polyhedral. Together with Ω, a sequence of geometrically conforming affine simplicial meshes (Th)h>0 of Ω with mesh size

    h:=maxTThdiamT
    is assumed to be given. For details, we refer to [28]. In analogy to the case N=2, we refer to each Th as a triangulation. The (N-dimensional) Lebesgue measure of an element TTh is denoted by λ(T). We also admit the standard assumption that the sequence (Th) is shape-regular, i.e.
    c>0:diam(T)ρTchTTh,5.1
    where diam(T)=maxx,yT|xy| denotes the diameter of T and ρT designates the diameter of the largest ball that is contained in T. We further write xT for the (barycentric) midpoint of an element T, and Mh={xT:TTh}, Nh and Eh for the set of element midpoints, triangulation nodes and edges with respect to Th, respectively. By abuse of notation, we write |Mh| and |Nh| for the cardinality of the respective set. Let χT:ΩR designate the characteristic function of T with respect to Ω, that is
    χT(x)=0,xT,χT(x)=1,xT.
    We further make use of the standard H1(Ω)-conforming finite-element space of globally continuous, piecewise affine functions denoted by
    P1,h(Ω):={uC(Ω¯):u|TP1TTh}.
    Here, P1 denotes the space of polynomials of degree less than or equal to one. Together with the finite-dimensional subspace P1,h(Ω) and its standard nodal basis {φx:xNh}, we consider the global interpolation operator
    Ih:C(Ω¯)P1,h(Ω)andIhu:=xNhu(x)φx.5.2
    Note that Ih is only defined on a dense subspace of H1(Ω). For the discretization of variational problems in H(div;Ω), it is customary to use the conforming space of Raviart–Thomas finite elements of lowest order
    RTh(Ω):={wL2(Ω)N:w|TRTTTh,[wν]|EΩ=0EEh},5.3
    where RT:={wP1d:aRd,bR:w(x)=a+bx} and ν denotes the unit outer normal to T. To incorporate homogeneous Neumann boundary conditions, one uses the H0(div;Ω)-conforming subspace
    RT0,h(Ω):=RTh(Ω)H0(div;Ω).
    The construction of suitable edge-based basis functions {φE:EEh} can be found in the literature, cf., for instance, [29], such that the boundary condition in the definition of RT0,h(Ω) can be easily accounted for. The global Raviart–Thomas interpolation operator is given by
    IhRT:W1,1(Ω)NRTh(Ω),IhRTw:=EEh(EwνdHN1)φE.5.4

    (c) Mosco convergence results under minimal regularity

    We emphasize that the subsequent results may be extended to finite elements of higher order, which are typically useful when the solution to the variational problem, e.g. (2.8), displays a higher regularity. In this regard, higher regularity assumptions on the data and the obstacle are required and the concept of Mosco convergence is not binding to prove the convergence of the finite-element method, and a priori error estimates with a rate can be derived (cf. e.g. [30]). However, we do not want to deviate from minimal regularity assumptions on the data. Further, even for simple variational problems such as the classical elasto-plastic torsion problem, there is a regularity limitation for the solution regardless of the smoothness of the data (cf. [4]).

    Note also that the subsequently covered problems comprise situations where the discrete feasible sets Kh are not necessarily nested and non-conforming in the sense that they are in general not contained in the feasible set K(X). In the following, c denotes a positive constant, which may take different values on different occasions.

    Lemma 5.2

    LetΩRNbe a polyhedral domain andαC(Ω¯)withα(x)≥0 inΩ. Further let (wh) be a sequence that fulfils for allh, whP1,h(Ω)dand |wh(xT)|≤α(xT) for allTTh. Ifwhwforh→0 inL2(Ω)d, then it holds that |w|≤αa.e. inΩ.

    Proof.

    It suffices to show that iK(w)=0, where

    K:={wL2(Ω)d:|w|α a.e.}.
    Moreover, it holds that iK=j*, where j* denotes the Fenchel conjugate
    j(v):=supvL2(Ω)d{(v,v)j(v)},
    of the mapping j:L2(Ω)dR,j(v):=Ωα|v|dx. Here,
    |v|=supvRd{0}vv/|v|
    denotes the dual norm of |⋅|. From the definition of j*, we obtain that iK(w)=0 is equivalent to
    (w,v)Ωα|v|vL2(Ω)d.5.5
    By a density argument, it suffices to prove this result for all vCc(Ω)d. Denote by
    αh:=TThα(xT)χTandvh:=TThv(xT)χT,5.6
    the piecewise constant interpolants of α and v, respectively. By definition of (ah) and (vh) as well as the uniform continuity of α and v it follows that αhα and vhv, both in L(Ω). By the weak convergence of (wh), the strong convergence of (αh) and (vh) as well as the midpoint quadrature rule, we obtain
    ΩwvdxΩwhvhdx=TThTwhvhdx=TThλ(T)wh(xT)vh|TdxTThλ(T)α(xT)|vh|T|dx=Ωαh|vh|dxΩα|v|dx,5.7
    which proves (5.6). ▪

    Lemma 5.3

    LetΩRNbe a polyhedral domain andαC(Ω¯)withα(x)≥0 inΩ. Let (wh) be a sequence that fulfils for allh, whP1,h(Ω)dand |wh(x)|≤α(x) for allxNh. Ifwhwforh→0 inL2(Ω)dthen it holds that |w|≤αa.e. inΩ.

    Proof.

    The assertion follows by a slight modification of the proof of lemma 5.2. Instead of the piecewise constant interpolant we define αh as the piecewise affine interpolant of α, i.e. αh=Ihα, which fulfils α(x)=(Ihα)(x) for all xNh and αhα strongly in L(Ω)d. By (5.8), we obtain

    ΩwvdxΩwhvhdx=TThλ(T)N+1xNhTwh(x)vh|TdxTThλ(T)N+1xNhT|wh(x)||vh|T|TThλ(T)N+1xNhTα(x)|vh|T|=Ωαh|vh|dxΩα|v|dx.

    Theorem 5.4

    Let ΩRN be a polyhedral domain and αC(Ω¯) such that (3.1) holds true. Then the sets

    Kh={wP1,h(Ω)d:|w(xT)|α(xT) for all TTh}5.8
    Mosco converge for h→0 to the set K(H1(Ω)d) in H1(Ω)d.

    Proof.

    Since weak convergence in H1(Ω) implies weak convergence in L2(Ω), the preceding lemma 5.2 shows that (M1) is fulfilled. We now show (M2′). To prove the assertion, we may use a strategy that is similar to the one in [4], ch. II and requires (3.4). Note that theorem 3.1 implies that the set

    K~:={φC(Ω¯)d:|φ(x)|<α(x) for all xΩ¯}5.9
    is also dense in K(H1(Ω)d) with respect to the H1(Ω)d-norm. For the global interpolation operator Ih defined in (5.3), we have the classical estimate,
    uIhuL(Ω)ch2uW2,(Ω)uW2,(Ω).5.10
    Here, c denotes a constant independent of h on account of the shape-regularity of the triangulation (5.2) (cf. [28, p.61]). We further define rh:K~P1,h(Ω)d by
    rhw:=[Ihw1,,Ihwd],
    and it follows that rhww as h→0 in H1(Ω)d for all wK~; see [28, Corollary 1.109]. Applying estimate (5.11) to the components of wK~ and using the equivalence of norms on Rd, one obtains that
    |wrhw|L(Ω)ch2wW2,(Ω)d,5.11
    for a suitable modification of c. This implies
    |rhw(x)||w(x)|+ch2wW2,(Ω)dxΩ.5.12
    Since any wK~ is uniformly bounded away from α, there exists h0=h0(w) such that rhwKhhh0, which implies (M2′). ▪

    Corollary 5.5

    Under the conditions of theorem 5.4, the sequence (Kh) defined in (5.8) Mosco converges forh→0 to the setK(L2(Ω)d) inL2(Ω)d.

    Proof.

    Again, lemma 5.2 implies that (M1) with X=L2(Ω)d holds true. For K~ defined in (5.10) it holds that K~ is also dense in K(L2(Ω)d) with respect to the L2(Ω)d-norm (cf. (3.2)). Thus, (M2′) follows analogously to the proof of theorem 5.4. ▪

    Corollary 5.6

    Under the conditions of theorem 5.4, the node-based discrete sets

    Kh:={wP1,h(Ω)d:|w(x)|α(x)xNh},5.13
    Mosco converge forh→0 toK(H1(Ω)d) inH1(Ω)d.

    Proof.

    The proof is analogous to the proof of theorem 5.4, noting that (5.13) also implies rhwKhhh0 with Kh according to the node-based definition (5.14). ▪

    Remark 5.7

    With the help of the density property (3.2) for uniformly continuous upper bounds, the above results on the Mosco convergence of discretized convex sets carry over to spaces involving homogeneous Dirichlet boundary conditions. In this context, the set P1,h(Ω) in the definitions of the discretized sets Kh in (5.9) and (5.14) has to be replaced by the space

    P1,hΩ:={uC(Ω¯):u|TP1TTh,u(x)=0xNhΩ}.
    The resulting discrete sets Kh incorporate the zero boundary condition and the corresponding results on Mosco convergence for h→0 remain valid replacing H1(Ω)d by H01(Ω)d.

    With the help of the density result (3.2), one obtains the following result for the discrete approximation of pointwise constraint sets in H(div;Ω) by the Raviart–Thomas finite-element space RTh(Ω) (cf. (5.4)).

    Theorem 5.8

    Let ΩRN be a polyhedral domain. Let αC(Ω¯) such that (3.1) is satisfied. Then the sets

    Kh:={wRT0,h(Ω):|w(xT)|α(xT)TTh}.
    Mosco converge to K(H0(div;Ω)) in H(div;Ω) and to K(L2(Ω)N) in L2(Ω)N.

    Proof.

    Let whKh for all h. First observe that if (wh) weakly converges to w in H(div;Ω), then it also weakly converges to w in L2(Ω)N. Analogously to the proof of lemma 5.2 one concludes that |w|≤α a.e. in Ω. The continuity of the normal trace mapping

    H(div;Ω)wwν,vH1/2(Ω),H1/2(Ω)R
    for fixed vH1(Ω) implies =0 in H−1/2(∂Ω). We conclude that wK(H0(div;Ω)) whence it follows that (M1) is satisfied. Secondly, note that
    K(Cc(Ω)N)¯H(div;Ω)=K(H0(div;Ω));
    cf. (3.2). For the global Raviart–Thomas interpolation operator defined in (5.5), the following interpolation error estimate holds true [28, Corollary 1.115]:
    uIhRTuL(Ω)N+divudivIhRTuL(Ω)chuW1,(Ω)N,5.14
    for all uW2,(Ω)N. Setting rhw:=IhRTw for any wK~, where
    K~:={wCc(Ω)N:|w(x)|<α(x),xΩ},
    and taking account of the fact that IhRTww in H(div) for all wK~, we may proceed analogously to the proof of theorem 5.4 to verify (M2′). ▪

    The previous approach can also be applied to derive approximation results for constraint sets involving pointwise bounds on partial derivatives. To begin with, we consider the gradient-constraint sets

    K(X(Ω))={wX(Ω):|w|α a.e. in Ω},
    for X(Ω)⊂H1(Ω)d and an arbitrary norm |⋅| on RN×d.

    Theorem 5.9

    Let ΩRN be a polyhedral domain. Let αC(Ω¯) such that (3.1) is satisfied. Define

    Kh:={wP1,hΩ(Ω)d:|w|T|α(xT)TTh}.5.15
    Then the sets Kh Mosco converge to K(H01(Ω)d) in H01(Ω)d.

    Proof.

    To prove (M1), it suffices to notice that if whw in H1(Ω)d then whw in L2(Ω)N×d. Similar to the proof of lemma 5.2, one obtains for vCc(Ω)N×d that

    Ωw:vdxΩwh:vdxΩ|wh||v|dxΩαh|v|dxΩα|v|dx,
    using αh from (5.7). Therefore, (5.6) holds with ∇w in place of w, and (M1) is verified.

    To prove (M2′), we consider again the global interpolation operator Ih from (5.3). The standard estimate

    uIhuL(Ω)NchuW2,(Ω),uW2,(Ω),
    holds true (e.g. [28]). Note also that K(Cc(Ω)d) is dense in K(H01(Ω)d) for the H1(Ω)d-norm [11, Theorem 4]. Thus, the set
    K~:={wCc(Ω)d:|w(x)|<α(x)xΩ}
    is also dense in K(H1(Ω)d). Therefore, one may argue as in the proof of theorem 5.4 to deduce (M2′). ▪

    Next we consider pointwise constraints on the divergence. For X(Ω)⊂H(div;Ω) let

    Kdiv(X(Ω)):={wX(Ω):|divw|α a.e. in Ω}.5.16
    Using Raviart–Thomas finite elements, a discrete realization of the inequality constraint in (5.17) can be achieved by imposing the inequality on the midpoints of the triangulation. The following statement ensures that the resulting approach is stable as the mesh width goes to zero.

    Theorem 5.10

    Let ΩRN be a polyhedral domain. Let αC(Ω¯) fulfil (3.1). Then the sets

    Kh:={wRT0,h(Ω):|divw|T|α(xT),TTh}
    Mosco converge in H(div;Ω) to the set Kdiv(H0(div;Ω)) as defined in (5.17).

    Proof.

    Taking account of the fact that whw in H(div;Ω), whKh, implies divwhdivw in L2(Ω), (M1) follows analogously to the corresponding part of the proof of corollary 5.9. Since Kdiv(Cc(Ω)N) is dense in Kdiv(H0(div;Ω)) [11, Theorem 4], the set

    K~:={wCc(Ω)d:|divw(x)|<α(x),xΩ}
    is also dense in Kdiv(H0(div;Ω)). Setting rh=IhRT, the estimate (5.15) implies rhww in H(div;Ω) and
    divwdivrhwL(Ω)chwW2,(Ω)N,
    for all w in K~. In particular, one may argue as in the proof of theorem 5.4 to verify (M2′). ▪

    For a general Lp-function as upper bound, a point-based discretization is obviously not possible. As a remedy, the construction of the discrete sets Kh typically involves some kind of averaging process. For this purpose, we define the integral mean

    Tαdx:=Tαdxλ(T)
    over some given subset TΩ (with positive measure).

    Now we have to take into account that the density results of the type (3.2) and (3.4), which represent the main ingredient to prove the consistency of the finite-element approximation, may fail to hold true (e.g. theorem 4.2). On the other hand, the results from §4 indicate that the density property is still guaranteed for a large class of discontinuous obstacles. To maintain the greatest level of generality, we assume that the non-negative measurable function α:ΩR{+} allows for the density property

    K(C(Ω¯))¯L2(Ω)d=K(L2(Ω)d).5.17
    Here, we concentrate on the consistency in the L2-topology but an extension to the other cases is possible by appropriately modifying assumption (5.18). We stress the fact that assumption (5.18) is fulfilled in relevant situations (cf. e.g. theorem 4.4).

    Lemma 5.11

    LetΩRNbe a polyhedral domain andαL2(Ω) withα(x)≥0 a.e. inΩ. Let (wh) be a sequence that fulfils for allh, whP1,h(Ω)dand |wh(xT)|≤ |wh(xT)|Tαdxfor allTTh. Ifwhwforh→0 inL2(Ω)dthen it holds that |w|≤αa.e. inΩ.

    Proof.

    The assertion follows analogously to the proof of lemma 5.2 by a slight modification of the definition of αh. Instead of the piecewise constant interpolant we consider the piecewise constant quasi-interpolant αh:=TThχTTαdx. Observe that αh converges strongly to α in L2(Ω)d, which is sufficient for the above argument. ▪

    Theorem 5.12

    Let ΩRN be a polyhedral domain. Let α∈L2(Ω) with (3.1) such that (5.17) holds true. Then the sets

    Kh:={wP1,h(Ω)d:|w(xT)|Tαdx,TTh}.
    Mosco converge for h→0 to the set K(L2(Ω)d) in L2(Ω)d.

    Proof.

    We only need to prove (M2′) since corollary 5.11 implies (M1). First note that (3.1) and (5.18) imply that the set

    K~:={wCc(Ω)d:δ=δ(w)>0 such that |w(x)|α(x)δ a.e. in Ω},
    is also dense in K(L2(Ω)d). Furthermore, we set
    rhw:=[Ihw1,,Ihwd],
    for wK~ and Ih as in (5.3). Integrating estimate (5.13) yields
    |Trhwdx|T|w|dx+ch2wW2,(Ω)d,TTh.
    Let wK~ be fixed. Since rhw is affine on each TTh, an application of the midpoint rule shows
    |rhw(xT)|T|w|dx+ch2wW2,(Ω)d,TTh,
    which implies
    |rhw(xT)|Tαdxδ(w)+ch2wW2,(Ω)d,TTh.5.18
    This implies rhwKh for all wK~ and hh0(w). By (5.11), it holds that rhww in L2(Ω)d for h→0, which proves (M2′). ▪

    6. Further applications

    (a) Regularization of elasto-plastic contact problems

    In the context of the one time-step problem of quasi-static elasto-plasticity with an associative flow law, the deformation of a material represented by a bounded Lipschitz domain Ω subject to given applied forces is modelled by the evolution of the displacement, the material stress and strain as well as certain internal variables (cf. [6]). An elasto-plastic contact problem arises if the movement of the material is additionally restricted by the presence of a rigid obstacle. From a mathematical point of view, the problem can be equivalently reformulated in terms of the normal stress z* at the (sufficiently smooth) contact boundary Γc⊂∂Ω, and a variable q that is related to the deviatoric part of the material stress; for details we refer to [24, p.154]:

    minG([z,q])z,ψover [z,q]H1/2(Γc)×L2(Ω)ds.t.zH+1/2(Γc),|q|2β a.e. in Ω.}6.1
    Here, d:=N(N+1)/2−1 and G is a strongly convex, continuous and coercive functional that models the elasto-plastic material behaviour subject to given external loads. Furthermore, a contact constraint on the normal component of the displacement is imposed by a function ψ, which lies in the trace space H1/2(Γc). The upper bound βL2(Ω) is determined by the hardening law, and it is bounded away from zero by the positive yield stress σy, i.e. β(x)≥σy a.e. in Ω. The normal stress z* is constrained to lie in the polar cone
    H+1/2(Γc):={zH1/2(Γc):z,z0zH+1/2(Γc)}
    to the cone of non-negative functions
    H+1/2(Γc)={zH1/2(Γc):z0 a.e. on Γc},
    where H1/2(Γc)* designates the topological dual space of H1/2(Γc). From an algorithmic point of view, it is favourable to replace (6.1) by a combined Moreau–Yosida/Tikhonov regularization given by
    minG([z,q])(z,ψ)L2(Γc)+γn2z+L2(Γc)2+γn2[(|q|2β)]+L2(Ω)2+12γn[z,q]H1(Γc)×H1(Ω)d2,over[z,q]H1(Γc)×H1(Ω)d,}6.1γ
    where (γn) and (γn′) are sequences with γn,γn+ as n+. In contrast to (6.1), (6.2) can be solved efficiently by the semismooth Newton method in the infinite-dimensional setting. As a consequence, the Newton iterates are superlinearly convergent, and the convergence rate is mesh-independent upon discretization. For details, see [24, section 5]. In order to prove the stability of (6.2) with regard to the limit problem (6.1) in the sense of theorem 2.1, we show that the problems (6.2) define a quasi-monotone perturbation of iK with respect to the dense subspace H1(ΓcH1(Ω)dH1/2(Γc)*×L2(Ω)d (cf. definition (2.2)). Here, we write for XH1/2(Γc)×L2(Ω)d,
    K(X):={[z,q]X:zH+1/2(Γc),|q|2β a.e. in Ω},
    and K:=K(H1/2(Γc)×L2(Ω)d). In fact, setting
    Rn([z,q]):=γn2z+L2(Γc)2+γn2[(|q|2β)]+L2(Ω)2+12γn[z,q]H1(Γc)×H1(Ω)d2,
    where it is understood that Rn([z,q])=+, unless [z*,q]∈H1(ΓcH1(Ω)d, it is easily seen that
    R¯n([z,q]):=iK([z,q])+12γn[z,q]H1(Γc)×H1(Ω)d2,
    fulfils (2.4). Moreover, we set
    R_n([z,q]):=γn2r(z)+γn2[(|q|2β)]+L2(Ω)2,
    where
    r(z):=(max{supzH+1/2(Γc),zH1/2(Γc)=1z,z,0})2.
    The validity of (2.3) is an immediate consequence of the following lemma.

    Lemma 6.1

    The functionalr:H1/2(Γc)Ris weakly l.s.c. and it fulfils

    • (i) r(z*)=0 for allz*∈H1/2+(Γc)*,

    • (ii) r(z*)>0 for allz*∈H1/2(Γc)*∖H1/2+(Γc)*,

    • (iii) r(z)z+L2(Γc)2for allzL2(Γc).

    Proof.

    As a composition of a convex, continuous and monotone function with a supremum of l.s.c. and convex functions, r:H1/2(Γc)R is weakly l.s.c. Assertions (i) and (ii) are direct consequences of the definition of H+1/2(Γc). For zL2(Γc)={zL2(Γc):z≤ 0 a.e. in Ω}, it holds r(z)=0 and (iii) is always satisfied. Now let zL2(Γc)∖L2(Γc). By the density of H1/2+(Γc) in L2+(Γc) it holds that

    r(z)1/2=supz~H+1/2(Γc)z~H1/2(Γc)=1z,z~>0.
    Moreover, one obtains
    z+L2(Γc)=supz~L2(Γc)z~01z~L2(Γc)(z+,z~)supz~L2(Γc)z~0,z~0 a.e. 1z~L2(Γc)(z,z~)supz~H+1/2(Γc)z~01z~H1/2(Γc)(z,z~)=r(z)1/2,
    which implies (iii). ▪

    From the discussion in the introduction and theorem 2.1, it is known that the consistency of the regularization scheme (6.2) with respect to (6.1) hinges on the density of K(H1(Ω)d) in K(L2(Ω)d), where

    K(X(Ω)):={qX(Ω):|q|2β a.e. in Ω},
    in accordance with the notation from the preceding sections. Owing to the results of §§3 and 4, this is always fulfilled for kinematic hardening, as β is a positive constant in this case. In the same way, it is also fulfilled for large classes of discontinuous obstacles β in the case of combined isotropic-kinematic hardening. Once the density property is ensured, one may use monotonicity properties of G to derive strong convergence properties of regularized (normal) stresses, strains and displacement; cf. [24] for details.

    (b) Fenchel duality in image restoration

    Optimization problems with total variation regularization have been successfully considered in the image restoration context. In the denoizing setting, an original image utrue that belongs to the space of functions of bounded variation BV(Ω), ΩR2, is sought to be recovered from a noise perturbed measurement f=utrue+η with ηL2(Ω), η=0 and |η|2=σ2. This motivates the optimization problem

    min12Ω|uf|2dx+αΩ|Du|1over uBV(Ω),
    for αR in the seminal work by Rudin et al. [31]. Here, Du, the distributional gradient of uBV(Ω), is a Borel measure and |Du|1 is its total variation measure with total mass Ω|Du|1, which is characterized via duality as
    Ω|Du|1=sup{Ωudivvdx:vCc1(Ω;R2),|v(x)|1,xΩ}.

    The drawback of the above reconstruction scheme is that the choice of the regularization parameter α is global: A good reconstruction locally requires high values of α in some regions of Ω (e.g. flat regions of utrue) and low values in other regions (e.g. locations of details of utrue). A recent approach in [32,33] proposes the following alternative: For a function α:ΩR such that (3.1) holds true, consider the optimization problem

    min12Ω|uf|2dx+Ωα(x)|Du|1over uBV(Ω),6.2
    where Ωα(x)|Du|1 stands for the integral of α on Ω with respect to the measure |Du|1. Hence, α needs to be a |Du|1-integrable function in order for Ωα|Du|1 to be correctly defined. A sufficient condition for this is given by αC(Ω), the space of continuous functions on Ω.

    As usual in convex optimization, it is convenient to consider the problem in (6.3) from the point of view of Fenchel duality. In fact, (6.3) can be characterized as the Fenchel dual problem of the following constrained optimization problem:

    min12divp+fL2(Ω)2over pH0(div;Ω)s.t.pK(H0(div;Ω),||),}6.3
    if the following density result holds true:
    K(Cc1(Ω)2),||)¯H0(div;Ω)=K(H0(div;Ω),||),
    where, according to the above notational convention,
    K(H0(div;Ω),||)={qH0(div;Ω):|q(x)|α(x) a.e. in Ω}.

    7. Conclusion

    We investigate the stability of a large number of perturbation and dualization approaches to variational inequality and constrained optimization problems in the context of density properties of a convex constraint set. If the intersection with certain dense subspaces is dense in the feasible set, one may prove the unconditional consistency of various perturbation schemes including Galerkin approximations. In this regard, the class of quasi-monotone perturbations provides a unified framework.

    The abstract motivation leads to the study of density properties of constraint sets in Sobolev spaces with respect to spaces of smooth functions. We focus specifically on sets that are defined by a pointwise constraint on the norm of the function value. In this case, the density property is determined by the regularity of the upper bound. Whereas the case of a uniformly continuous obstacle gives rise to positive density results in various Sobolev spaces, the result fails to be valid in general, if the obstacle is just a Sobolev function. However, a large variety of discontinuous upper bounds still remains compatible with the density property. This includes functions that fulfil a generalized lower semicontinuity condition as well as supersolutions of elliptic PDEs.

    Density results further allow to deduce the Mosco convergence of various finite-element discretized constraint sets in Sobolev spaces. Finally, the previous results are applied in the context of the regularization of quasi-static elasto-plastic contact problems and the dualization of total variation-based image restoration problems.

    Our future research is concerned with the refined characterization of the class of upper bounds that comply with the density property. Another interesting direction of future research, which we plan to pursue, is related to constraint qualifications (CQs) in the context of Fenchel–Legendre dualization in convex and possibly non-smooth optimization. Here it appears that the density of convex intersections may provide a suitable constraint qualification implying duality without a duality gap. Such a density CQ appears to neither imply nor be contained in currently known constraint qualifications like those used in the work by Hedy Attouch and co-authors (e.g. [34]). Another current research aspect concerns the ramification of remark 3.2(ii). In the presence of an inhomogeneous Dirichlet boundary condition, the construction of suitable trace-preserving mollification operators with variable support appears promising. Those operators are also of utmost interest in the context of non-smooth variational problems in image restoration.

    Data accessibility

    This paper has no additional data.

    Authors' contributions

    All authors contributed equally to this work and gave final approval for publication.

    Competing interests

    We declare we have no competing interests.

    Funding

    This research was carried out in the framework of Matheon supported by the Einstein Foundation Berlin within the ECMath projects OT1, SE5 and SE15 as well as project HI1466/7-1 ‘Free Boundary Problems and Level Set Methods’ funded by the DFG. The authors further gratefully acknowledge the support of the DFG through the DFG-SPP 1962: Priority Programme ‘Non-smooth and Complementarity-based Distributed Parameter Systems: Simulation and Hierarchical Optimization’ within projects 10, 11 and 13. The publication of this article was funded by the Open Access fund of the Weierstrass Institute.

    Acknowledgements

    We thank two anonymous reviewers for their helpful comments.

    Appendix A. Properties of quasi-monotone perturbations

    Proof of theorem 2.1

    Denote by

    Γ-lim supn+Gn(u):=supUN(u)lim supn+infuUGn(u),
    the Γ-upper limit at u of a sequence of functions Gn:XR{+} in the norm topology. Here, N(u) denotes the set of all open neighbourhoods of u in the norm of X. By analogy, define Γw-lim supn+Gn, the Γ-upper limit of Gn in the weak topology of X, as well as the lower limit counterpart Γw-lim infn+Gn. We write
    Γw-limn+Gn=Γw-lim supn+Gn=Γw-lim infn+Gn,
    for the weak Γ-limit of (Gn) provided the latter equality is satisfied. For the corresponding definitions we refer to the monograph [12]. Further denote by scG the lower semicontinuous envelope of G:XR{+}. Exploiting the relations between Γ- and pointwise convergence [12, ch. 5], one obtains with (2.4) and the continuity of F,
    Γw-lim supn+(F+Rn)Γ-lim supn+(F+Rn)Γ-lim supn+(F+R¯n)=sc(F+iKY)=F+iKY¯,
    where we use [12, Prop. 6.3, Prop. 6.7, Prop. 5.7, Prop. 3.7]. Similarly, (2.3) together with [12, Prop. 6.7, Prop. 5.4] implies
    Γw-lim infn+(F+Rn)Γw-lim infn+(F+R_n)=limn+scw(F+R_n),A 1
    where scw(F+R_n) denotes the lower semicontinuous envelope of F+R_n in the weak topology of X. Further note that the coercivity and the sequential weak lower semicontinuity of F+R_n imply that the level sets {uX:F(u)+Rn(u)≤t}, tR, are bounded and sequentially weakly closed. If X is reflexive or if the dual space X* is separable, then the sequential weak closure of bounded subsets of X coincides with the weak closure, see [12], Prop. 8.7, Prop. 8.14, such that F+R_n is weakly lower semicontinuous, which entails
    Γw-lim infn+(F+Rn)limn+(F+R_n)=F+iK,
    by (A 1). Eventually, it holds that
    F+iKΓw-lim infn+(F+Rn)Γw-lim supn+(F+Rn)Γ-lim supn+(F+Rn)F+iKY¯,
    such that Γ-limn+(F+Rn)=Γw-limn+(F+Rnu)=F+iK, if (1.1) holds true. ▪

    Proof of proposition 2.6

    Let xKKY¯ and ρ>0 such that Bρ(x)¯KY¯=, where Bρ(x):={yX:∥xy∥<ρ}.

    (a) We first prove the following result:

    nNγn>0:[yXdist(y,KBρ(x)¯)2<1γnyXn].A 2
    Assume that the opposite holds, i.e.
    n0N:[nNxnXn0,vnKBρ(x)¯:xnvn21n].
    Since vnBρ(x)¯K for all nN and Bρ(x)¯K is convex, bounded and closed, there exists a subsequence (vnk) of (vn) with vnkv and vBρ(x)¯K. As xnvn→0, one also obtains xnkv and thus vXn0. Hence, vXn0KBρ(x)¯=, which is a contradiction.

    (b) Non-existence of a strong recovery sequence:

    Choose (γn) according to (7.2) and assume that there exists a recovery sequence (yn) to x, which means that ynx and F(yn)+(γn/2)dist(yn,K)2+iXn(yn)→F(x). The continuity of F implies that ynXn for sufficiently large n and that (γn/2)dist(yn,K)2→0. Consequently, using ynx and xK, there exists n1N such that

    dist(yn,K)2=dist(yn,KBρ(x))21γn,
    for all nn1. With the help of part (a), we conclude that ynXn for all nn1, which is a contradiction. ▪

    Footnotes

    Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.

    References

    • 1
      Ekeland I, Temam R. 1987Convex analysis and variational problems. Classics in applied mathematics, vol. 28. Philadelphia, PA: Society for Industrial and Applied Mathematics. Google Scholar
    • 2
      Attouch H, Buttazzo G, Michaille G. 2014Variational analysis in Sobolev and BV spaces: applications to PDEs and optimization, 2nd edn. Philadelphia, PA: SIAM. CrossrefGoogle Scholar
    • 3
      Clarke FH. 1990Optimization and nonsmooth analysis, 2nd edn. Philadelphia, PA: Society for Industrial and Applied Mathematics. CrossrefGoogle Scholar
    • 4
      Glowinski R. 1984Numerical methods for nonlinear variational problems. Berlin, Germany: Springer. CrossrefGoogle Scholar
    • 5
      Lions J-L, Glowinski R, Trémolières R. 1981Numerical analysis of variational inequalities. Amsterdam, The Netherlands: North-Holland. Google Scholar
    • 6
      Han W, Reddy BD. 2013Plasticity: mathematical theory and numerical analysis, 2nd edn. New York, NY: Springer. CrossrefGoogle Scholar
    • 7
      Rodrigues JF. 1987Obstacle problems in mathematical physics. Amsterdam, The Netherlands: North-Holland. Google Scholar
    • 8
      Rodrigues JF. 2004On hyperbolic variational inequalities of first order and some applications. Monatshefte für Mathematik 142, 157–177. (doi:10.1007/s00605-004-0238-3) Crossref, ISIGoogle Scholar
    • 9
      Dal Maso G, DeSimone A, Mora MG. 2006Quasistatic evolution problems for linearly elastic-perfectly plastic materials. Arch. Ration. Mech. Anal. 180, 237–291. (doi:10.1007/s00205-005-0407-0) Crossref, ISIGoogle Scholar
    • 10
      Bartels S, Mielke A, Roubíček T. 2012Quasi-static small-strain plasticity in the limit of vanishing hardening and its numerical approximation. SIAM J. Numer. Anal. 50, 951–976. (doi:10.1137/100819205) Crossref, ISIGoogle Scholar
    • 11
      Hintermüller M, Rautenberg CN. 2015On the density of classes of closed convex sets with pointwise constraints in Sobolev spaces. J. Math. Anal. Appl. 426, 585–593. (doi:10.1016/j.jmaa.2015.01.060) Crossref, ISIGoogle Scholar
    • 12
      Dal Maso G. 1993An introduction to Γ-convergence. Boston, MA: Birkhäuser. CrossrefGoogle Scholar
    • 13
      Mielke A, Roubíček T. 2015Rate-independent Systems. Berlin, Germany: Springer. CrossrefGoogle Scholar
    • 14
      Kinderlehrer D, Stampacchia G. 1980An introduction to variational inequalities and their applications. New York, NY: Academic Press. Google Scholar
    • 15
      Mosco U. 1969Convergence of convex sets and of solutions of variational inequalities. Adv. Math. (N.Y) 3, 510–585. (doi:10.1016/0001-8708(69)90009-7) Crossref, ISIGoogle Scholar
    • 16
      Alt HW. 2012Lineare Funktionalanalysis. Berlin, Germany: Springer. CrossrefGoogle Scholar
    • 17
      Adams RA, Fournier JJF. 2003Sobolev spaces, 2nd edn. Pure and Applied Mathematics Series, vol. 140. Amsterdam, The Netherlands: Elsevier Science. Google Scholar
    • 18
      Evans LC. 1998Partial differential equations. Providence, RI: American Mathematical Society. Google Scholar
    • 19
      Hintermüller M, Rautenberg CN, Rösel S. 2016Density of convex intersections and applications. Berlin, Germany: WIAS. WIAS Preprint No. 2333. Google Scholar
    • 20
      Toyoizumi H. 1991Continuous dependence on obstacles in variational inequalities. Funkcialaj Ekvacioj 34, 103–115. Google Scholar
    • 21
      Nečas J. 1967Les méthodes directes en théorie des équations elliptiques. Prague, Czechoslovakia: Academia. Google Scholar
    • 22
      Grisvard P. 1985Elliptic problems in nonsmooth domains. London, UK: Pitman Publishing. Google Scholar
    • 23
      Trudinger NS. 1971On the regularity of generalized solutions of linear, non-uniformly elliptic equations. Arch. Ration. Mech. Anal. 42, 50–62. (doi:10.1007/BF00282317) Crossref, ISIGoogle Scholar
    • 24
      Hintermüller M, Rösel S. 2016A duality-based path-following semismooth Newton method for elasto-plastic contact problems. J. Comput. Appl. Math. 292, 150–173. (doi:10.1016/j.cam.2015.06.010) Crossref, ISIGoogle Scholar
    • 25
      Hintermüller M, Kunisch K. 2004Total bounded variation regularization as bilaterally constrained optimization problem. SIAM J. Appl. Math. 64, 1311–1333. (doi:10.1137/S0036139903422784) Crossref, ISIGoogle Scholar
    • 26
      Siebert KG, Veeser A. 2007A unilaterally constrained quadratic minimization with adaptive finite elements. SIAM. J. Optim. 18, 260–289. (doi:10.1137/05064597X) Crossref, ISIGoogle Scholar
    • 27
      Siebert KG. 2011A convergence proof for adaptive finite elements without lower bound. IMA J. Num. Anal. 31, 947–970. (doi:10.1093/imanum/drq001) Crossref, ISIGoogle Scholar
    • 28
      Ern A, Guermond JL. 2004Theory and practice of finite elements. Berlin, Germany: Springer. CrossrefGoogle Scholar
    • 29
      Bahriawati C, Carstensen C. 2005Three MATLAB implementations of the lowest-order Raviart-Thomas MFEM with a posteriori error control. Comput. Methods Appl. Math. 5, 333–361. (doi:10.2478/cmam-2005-0016) CrossrefGoogle Scholar
    • 30
      Brezzi F, Hager WW, Raviart PA. 1977Error estimates for the finite element solution of variational inequalities. Part I. Primal theory. Numerische Mathematik 28, 431–444. (doi:10.1007/BF01404345) Crossref, ISIGoogle Scholar
    • 31
      Rudin LI, Osher S, Fatemi E. 1992Nonlinear total variation based noise removal algorithms. Phys. D 60, 259–268. (doi:10.1016/0167-2789(92)90242-F) Crossref, ISIGoogle Scholar
    • 32
      Hintermüller M, Rautenberg CN. 2017Optimal selection of the regularization function in a generalized total variation model. Part I: Modelling and theory. J. Math. Imaging Vis. (doi:10.1007/s10851-017-0744-2) PubMed, ISIGoogle Scholar
    • 33
      Hintermüller M, Rautenberg CN, Wu T, Langer A. 2017Optimal selection of the regularization function in a generalized total variation model. Part II: Algorithm its analysis and numerical tests. J. Math. Imaging Vis. (doi:10.1007/s10851-017-0736-2) PubMed, ISIGoogle Scholar
    • 34
      Attouch H, Brezis H. 1986Duality for the sum of convex functions in general Banach spaces. In Aspects of mathematics and its applications. North-Holland Mathematical Library, vol. 34, pp. 125–133. Amsterdam, The Netherlands: North-Holland. Google Scholar