1 Introduction

New technologies for medical imaging, non-destructive testing, or geophysical exploration are often based on mathematical inverse coefficient problems, where the coefficient of a partial differential equation is to be reconstructed from (partial) knowledge of its solutions. A prominent example is the emerging technique of electrical impedance tomography (EIT), cf. [1, 12, 14, 23, 24, 31, 62, 63, 83, 86, 88, 90, 95, 101, 103], and the references therein for a broad overview. Inverse coefficient problems are usually highly non-linear and ill-posed, and uniqueness and stability questions, as well as the design and the theoretical study of reconstruction algorithms are extremely challenging topics in theoretical and applied research.

In practical applications, only finitely many measurements can be made, the unknown coefficient has to be parametrized with finitely many variables (e.g., by assuming piecewise constancy on a given partition), and physical considerations typically limit the unknown coefficient to fall between certain a-priori known bounds. Thus, after shifting and rescaling, a practical inverse coefficient problem can be put in the form of finding the zero \(x\in [0,1]^n\) of a non-linear function \(F:\ \mathbb {R}^n\rightarrow \mathbb {R}^m\),

$$\begin{aligned} F(x)=0. \end{aligned}$$

It is of utmost importance to determine what measurements make this finite-dimensional inverse (or zero-finding) problem uniquely solvable, to evaluate the stability of the solution with respect to model and measurement errors, and to design convergent numerical reconstruction algorithms.

In this paper, we will study this problem under the assumption that F is a pointwise monotonic and convex function, which often arises in elliptic inverse coefficient problems (cf. our remarks at the end of this introduction). We will derive a simple criterion that implies the existence of a zero, and the injectivity of F in a certain neighborhood of \([0,1]^n\). It also allows us to quantify the Lipschitz stability constant of the left inverse \(F^{-1}\) and, for \(n=m\), ensures global convergence of Newton’s method. The criterion is easy to check, as it merely requires evaluating the directional derivative \(F'(z)d\) for a finite number of evaluation points z and finitely many directions d.

We then show how our result can be applied to the inverse problem of determining a Robin transmission coefficient in an elliptic PDE from the associated Neumann-to-Dirichlet operator, which is motivated by EIT-based corrosion detection [53]. We assume that the Robin coefficient is piecewise constant on an a-priori known partition of the interface into n parts, and that we a-priori know upper and lower bounds for the Robin coefficient’s values. We then show how to construct n boundary measurements that uniquely determine the n unknown Robin values, and for which a standard Newton method globally converges. We also quantify the stability of the solution with respect to errors, and numerically demonstrate our result in a simple setting.

Let us give some references to put our result in the context of existing literature. The arguably most famous elliptic inverse coefficient problem is the Calderón problem [27, 28], which arises in EIT, cf. [10, 30, 34, 44, 72, 79–81, 89, 98] for an incomplete list of seminal breakthroughs for the uniqueness question in an infinite-dimensional setting.

Several recent works have addressed uniqueness and Lipschitz stability questions for the problem of determining finitely many parameters (e.g., by assuming piecewise constancy) from finitely or infinitely many measurements in inverse coefficient problems, cf. [2–7, 11, 15, 18–22, 32, 48, 51, 53, 68, 69, 71, 76, 77, 87, 93, 96, 104, 105]. To the knowledge of the author, the result presented herein is the first to explicitly calculate those measurements that uniquely determine the unknown parameters, and, together with [53], the first to explicitly calculate the Lipschitz constant for a given setting. Moreover, we obtain the unique existence of a solution also for noisy measurements, so that Lipschitz stability also yields an error estimate in the case of noise.

Reconstruction algorithms for inverse coefficient problems typically rely on Newton-type approaches or on globally minimizing a non-convex regularized data-fitting functional. Both approaches require an initial guess close to the unknown solution, so that most algorithms are only known to converge locally. For the sake of rigor, it should be noted at this point that the difficulty in this context is not to construct globally convergent methods but computationally feasible globally convergent methods. To elaborate on this point, let us consider the minimization of a continuous functional \(f:\ I\rightarrow \mathbb {R}\) over a multidimensional interval \(I\subseteq \mathbb {R}^n\). A trivial optimization algorithm is to choose a countable dense subset \((q_m)_{m\in \mathbb {N}}\subset I\), and set \(x_0:=q_0\),

$$\begin{aligned} x_m:=\left\{ \begin{array}{l l} q_m &{}\quad \text { if } \,f(q_m)<f(x_{m-1})\\ x_{m-1} &{}\quad \text { else.} \end{array}\right. \end{aligned}$$

Then, obviously, any accumulation point of \((x_m)_{m\in \mathbb {N}}\) is a global minimizer of f. But this type of approach requires a completely infeasible number of function evaluations and is thus usually disregarded in practice. Note, however, that together with estimates on the convergence range of an iterative algorithm as in the recent preprint of Alberti and Santacesaria [3], and the progress of parallel computing power, this type of approach may become feasible at least for low dimensions n.
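For concreteness, the following minimal Python sketch (our own illustration, not from the cited works) implements this exhaustive strategy, using successively refined dyadic grids as the countable dense subset; the refinement depth and the test functional are arbitrary choices:

```python
import itertools
import numpy as np

def dense_search(f, n, levels=6):
    """Trivial global minimization over I = [0,1]^n: enumerate successively
    finer dyadic grids (a countable dense subset of I) and keep the best point."""
    best_x, best_f = None, np.inf
    for level in range(levels):
        grid = np.linspace(0.0, 1.0, 2**level + 1)
        for q in itertools.product(grid, repeat=n):
            fq = f(np.array(q))
            if fq < best_f:
                best_x, best_f = np.array(q), fq
    return best_x, best_f

# Already for moderate n the cost, (2**level + 1)**n evaluations per level,
# is prohibitive -- exactly the feasibility issue discussed above.
x_min, f_min = dense_search(lambda x: np.sum((x - 0.3)**2), n=2)
```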

To obtain (computationally feasible) globally convergent algorithms, quasi-reversibility and convexification ideas have been developed in the seminal work of Klibanov et al., cf., e.g., [16, 17, 74, 75]. Alternatively, one can use very specific properties of the considered problem, cf., e.g., the global convergence result of Knudsen, Lassas, Mueller and Siltanen [78] for the d-bar method for EIT, or resort to only reconstructing the shape of an anomaly, cf. [54, 55, 58, 59, 64, 66, 67] for some globally convergent approaches. The theory developed in this work shows that, somewhat surprisingly, global convergence holds for the standard zero-finding Newton method when the right measurements are used, and this also implies fast quadratic convergence. On the other hand, so far, our theory does not allow us to utilize more measurements than unknowns or to explicitly add regularization, which would be advantageous in practical applications. Moreover, the calculated measurements tend to become more and more oscillatory for higher-dimensional problems. Hence, so far, we can only expect our approach to be practically feasible for relatively few unknowns, where discretization sufficiently regularizes the ill-posed problem.

On the methodological side, this work builds upon [48, 53] and stems from the theory of combining monotonicity estimates with localized potentials, cf. [9, 13, 25, 35, 43, 45, 46, 50–52, 56–59, 61, 94] for related works, and [29, 37–40, 49, 54, 55, 60, 85, 97, 99, 100, 102, 106] for practical monotonicity-based reconstruction methods. In this work, the monotonicity and convexity of the forward function is based on the so-called monotonicity relation which goes back to Ikehata, Kang, Seo, and Sheen [65, 70]. The existence of measurements that fulfill the extra criterion on the directional derivative evaluations follows from localized potentials arguments [41]. Hence, it might be possible to extend the theory developed herein to other elliptic inverse coefficient problems where monotonicity and localized potentials results are also available. Note, however, that this extension is not obvious, since the localized potentials results for the herein considered Robin transmission problem are stronger than those known for other coefficient problems such as EIT.

Finally, it should be noted that the monotonicity and localized potentials techniques evolved from the factorization method [26, 42, 47, 73, 82], and that global convergence for Newton’s method for finite-dimensional zero-finding problems is a classical result for pointwise convex functions that are inverse monotonic (also called Collatz monotone [33]), cf., e.g., the book of Ortega and Rheinboldt [91, Thm. 13.3.2]. Such problems arise, e.g., in solving non-linear elliptic PDEs. Roughly speaking, one might be tempted to say that elliptic forward coefficient problems lead to inverse monotonic convex functions, and inverse elliptic coefficient problems lead to forward monotonic convex functions. Our extra criterion on the directional derivative evaluations allows us to write our forward monotonic function as an affine transformation of an inverse monotonic function in a certain region, and (together with some technical arguments ensuring that the iterates stay in this region) this is the major key in proving our global convergence result.

The paper is organized as follows. In Sect. 2, we prove uniqueness, stability and global convergence of the Newton method for continuously differentiable, pointwise convex and monotonic functions under a simple extra condition on the directional derivatives. In Sect. 3, we apply this result to an inverse Robin coefficient problem, and show how to determine those measurements that uniquely and stably determine the unknown coefficient with a desired resolution via a globally convergent Newton iteration. We also give a numerical example in Sect. 3. Throughout this paper, we take the somewhat lengthy, but hopefully reader-friendly, approach of first presenting less technical intermediate results to motivate our arguments.

2 Uniqueness, stability and global Newton convergence

We consider a continuously differentiable, pointwise convex and monotonic function

$$\begin{aligned} F:\ U\subseteq \mathbb {R}^n\rightarrow \mathbb {R}^m \end{aligned}$$

where \(n,m\in \mathbb {N}\), \(m\ge n \ge 2\), and U is a convex open set. In this section, we will derive a simple criterion that implies injectivity of F on a multidimensional interval. The criterion also allows us to estimate the Lipschitz stability constant of the left inverse \(F^{-1}\) and, for \(n=m\), ensures global convergence of Newton’s method.

Remark 1

Throughout this work, "\(\le \)" is always understood pointwise for finite-dimensional vectors and matrices, and \(x\not \le y\) denotes the converse, i.e., that \(x-y\) has at least one positive entry.

Monotonicity and convexity are understood with respect to this pointwise partial order, i.e., \(F:\ U\subseteq \mathbb {R}^n\rightarrow \mathbb {R}^m\) is monotonic if

$$\begin{aligned} x\le y \quad \text { implies } \quad F(x)\le F(y) \quad \text { for all } x,y\in U, \end{aligned}$$

and F is convex if

$$\begin{aligned} F((1-t)x+ty)\le (1-t)F(x)+tF(y) \quad \text { for all } x,y\in U,\ t\in [0,1]. \end{aligned}$$

We also say that a function F is anti-monotone if \(-F\) is monotonic.

For continuously differentiable functions, it is easily shown that monotonicity is equivalent to

$$\begin{aligned} F'(x)y \ge 0 \quad \text { for all } x\in U,\ 0\le y\in \mathbb {R}^n, \end{aligned}$$
(1)

and thus equivalent to \(F'(x)\ge 0\). Convexity is known to be equivalent to

$$\begin{aligned} F(y)-F(x)&\ge F'(x)(y-x) \quad \text { for all } x,y\in U, \end{aligned}$$
(2)

cf., e.g., [91, Thm. 13.3.2]. All the proofs in this section use the monotonicity and convexity assumption in the form (1) and (2).

Throughout this work, we denote by \(e_j\in \mathbb {R}^n\) the jth unit vector in \(\mathbb {R}^n\), \(\mathbb {1}:=(1,1,\ldots ,1)^T\in \mathbb {R}^n\), and \(e_j':=\mathbb {1}-e_j\). \(I_n\in \mathbb {R}^{n\times n}\) denotes the n-dimensional identity matrix, and \(\mathbb {1} \mathbb {1}^T\in \mathbb {R}^{n\times n}\) is the matrix containing 1 in all of its entries.
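For readers who want to follow the finite-dimensional arguments numerically, this notation translates directly into numpy. The following small helper sketch (our own addition; the later snippets assume these definitions) fixes the conventions:

```python
import numpy as np

def e(j, n):
    """The j-th unit vector e_j in R^n (j is 1-based, as in the text)."""
    v = np.zeros(n)
    v[j - 1] = 1.0
    return v

def e_prime(j, n):
    """e_j' = 1 - e_j, i.e. ones everywhere except in component j."""
    return np.ones(n) - e(j, n)

n = 3                              # any n >= 2
one = np.ones(n)                   # the all-ones vector
I_n = np.eye(n)                    # the identity matrix I_n
one_one_T = np.outer(one, one)     # the matrix 1 1^T with 1 in all entries
```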

2.1 A simple criterion for uniqueness and Lipschitz stability

Before we state our result in its final form in Sect. 2.3, we derive two weaker (and less technical) results that motivate our arguments and may be of independent interest. We first show a simple criterion that yields injectivity of F and allows us to estimate the Lipschitz stability constant of its left inverse \(F^{-1}\).

Theorem 1

Let \(F:\ U\subseteq \mathbb {R}^n\rightarrow \mathbb {R}^m\), \(m\ge n\ge 2\), be a continuously differentiable, pointwise convex and monotonic function on a convex open set U containing \([-1,3]^n\). If, additionally,

$$\begin{aligned} F'(-e_j+ 3e_j') \left( e_j - 3e_j'\right) \not \le 0 \quad \text { for all } j\in \{1,\ldots ,n\}, \end{aligned}$$
(3)

then the following holds:

(a)

    F is injective on \([0,1]^n\).

(b)

    \(F'(x)\in \mathbb {R}^{m\times n}\) is injective for all \(x\in [0,1]^n\).

(c)

    With

    $$\begin{aligned} L:=2 \left( \min _{j=1,\ldots ,n}\ \max _{k=1,\ldots ,m} e_k^T F'(-e_j+ 3e_j') \left( e_j - 3e_j'\right) \right) ^{-1}>0, \end{aligned}$$
    (4)

we have that for all \(x,y\in [0,1]^n\)

    $$\begin{aligned} \Vert x-y\Vert _\infty \le L \Vert F(x)-F(y)\Vert _\infty , \quad \text { and } \quad \Vert F'(x)^{-1}\Vert _\infty \le L, \end{aligned}$$

    where \(F'(x)^{-1}\in \mathbb {R}^{n\times m}\) denotes the left inverse of \(F'(x)\in \mathbb {R}^{m\times n}\).

To prove Theorem 1, we will first formulate an auxiliary lemma, which will also be used in our more technical results. Note that assumption (5) in the following lemma simply means that a row permutation of the non-negative matrix \(F'(x)\in \mathbb {R}^{m\times n}\) is strictly diagonally dominant in its first n rows.

Lemma 1

Let \(F:\ U\subseteq \mathbb {R}^n\rightarrow \mathbb {R}^m\), \(m\ge n\ge 2\), be continuously differentiable, pointwise convex and monotonic on a convex open set U, and let \(L>0\).

(a)

    If \(x\in U\) fulfills

    $$\begin{aligned} \max _{k=1,\ldots ,m} e_k^T F'(x) \left( e_j - e_j'\right) \ge L^{-1} \quad \text { for all } j\in \{1,\ldots ,n\}, \end{aligned}$$
    (5)

then \(F'(x)\in \mathbb {R}^{m\times n}\) is injective, and its left inverse fulfills \( \Vert F'(x)^{-1}\Vert _\infty \le L\).

(b)

    If, additionally, \(y\in U\) also fulfills (5), then

    $$\begin{aligned} \Vert x-y\Vert _\infty \le L \Vert F(x)-F(y)\Vert _\infty . \end{aligned}$$

Proof

(a)

    For all \(0\ne d \in \mathbb {R}^n\), at least one of the entries of \(\frac{d}{ \Vert d\Vert _\infty }\) must be either 1 or \(-1\), so that there exists \(j\in \{1,\ldots ,n\}\) with either

    $$\begin{aligned} \frac{d}{ \Vert d\Vert _\infty }\ge e_j-e_j', \quad \text { or } \quad \frac{d}{ \Vert d\Vert _\infty }\le -e_j+e_j'. \end{aligned}$$

    We thus obtain from the monotonicity assumption (1) that either

    $$\begin{aligned} F'(x)\frac{d}{ \Vert d\Vert _\infty }\ge F'(x)(e_j-e_j'), \quad \text { or } \quad F'(x)\frac{-d}{ \Vert d\Vert _\infty }\ge F'(x)(e_j-e_j'). \end{aligned}$$

    In both cases, it follows from (5) that

    $$\begin{aligned} \frac{ \Vert F'(x)d\Vert _\infty }{ \Vert d\Vert _\infty }\ge \max _{k=1,\ldots ,m} e_k^T F'(x) \left( e_j - e_j'\right) \ge L^{-1}. \end{aligned}$$

    This proves injectivity of \(F'(x)\) and the bound on its left inverse.

(b)

    Likewise, for \(x\ne y\), either

    $$\begin{aligned} \frac{x-y}{ \Vert y-x\Vert _\infty }\ge e_j-e_j', \quad \text { or } \quad \frac{y-x}{ \Vert x-y\Vert _\infty }\ge e_j-e_j', \end{aligned}$$

    so that by monotonicity (1) and assumption (5), either

    $$\begin{aligned} \max _{k=1,\ldots ,m} e_k^T F'(x)\frac{y-x}{ \Vert x-y\Vert _\infty } \ge L^{-1}, \text { or } \max _{k=1,\ldots ,m} e_k^T F'(y)\frac{x-y}{ \Vert x-y\Vert _\infty } \ge L^{-1}. \end{aligned}$$

    By convexity (2), it then follows that

    $$\begin{aligned} \Vert F(y)-F(x)\Vert _\infty&= \max _{k=1,\ldots ,m} |e_k^T (F(y)-F(x))|\\&= \max _{k=1,\ldots ,m} \max \{ e_k^T (F(y)-F(x)), e_k^T (F(x)-F(y)) \}\\&\ge \max _{k=1,\ldots ,m} \max \{ e_k^T F'(x)(y-x), e_k^T F'(y)(x-y) \}\\&\ge \Vert y-x\Vert _\infty L^{-1}. \end{aligned}$$

\(\square \)

We can now prove Theorem 1 with Lemma 1.

Proof of Theorem 1

Let \(j\in \{1,\ldots ,n\}\). Writing

$$\begin{aligned} z^{(j)}:=-e_j+ 3e_j'\in [-1,3]^n\subset U, \end{aligned}$$

we have that

$$\begin{aligned} e_j - 3 e_j' \le x - z^{(j)}\le 2e_j - 2e_j' \quad \text { for all } x\in [0,1]^n. \end{aligned}$$

Thus we deduce from (1) and (2) that for all \(x\in [0,1]^n\)

$$\begin{aligned} F'(x) \left( e_j - e_j'\right)&\ge \frac{1}{2} F'(x) \left( x - z^{(j)}\right) \ge \frac{1}{2} \left( F(x) - F(z^{(j)}) \right) \\&\ge \frac{1}{2} F'(z^{(j)}) \left( x - z^{(j)}\right) \ge \frac{1}{2} F'(z^{(j)}) \left( e_j - 3e_j'\right) . \end{aligned}$$

With the definition of L in (4), this shows that

$$\begin{aligned} \max _{k=1,\ldots ,m} e_k^T F'(x) \left( e_j - e_j'\right) \ge L^{-1} \quad \text { for all } x\in [0,1]^n,\ j\in \{1,\ldots ,n\}, \end{aligned}$$
(6)

so that (a), (b) and (c) follow from Lemma 1. \(\square \)
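As an illustration, criterion (3) and the constant L from (4) are straightforward to evaluate numerically once the Jacobian of F is available. The following sketch is one possible rendering, where `dF` stands for a user-supplied routine returning the \(m\times n\) Jacobian \(F'(z)\) (an assumption; it is not provided here), and `e`, `e_prime` are the helpers from above:

```python
def check_criterion_3(dF, n):
    """Check criterion (3) for a Jacobian routine dF(z) -> (m x n) array and,
    if it holds, return the Lipschitz constant L from (4); else return None."""
    row_maxima = []
    for j in range(1, n + 1):
        z = -e(j, n) + 3.0 * e_prime(j, n)   # evaluation point -e_j + 3 e_j'
        d = e(j, n) - 3.0 * e_prime(j, n)    # direction e_j - 3 e_j'
        v = dF(z) @ d                        # the vector F'(z^(j)) d^(j) in R^m
        if np.max(v) <= 0.0:                 # (3) demands that v is "not <= 0"
            return None
        row_maxima.append(np.max(v))
    return 2.0 / min(row_maxima)             # the constant L from (4)
```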

2.2 A simple criterion for global convergence of the Newton iteration

We will now show how to ensure that a convex monotonic function F has a unique zero, and that the Newton method globally converges to this zero.

Theorem 2

Let \(F:\ U\subseteq \mathbb {R}^n\rightarrow \mathbb {R}^n\), \(n\ge 2\), be continuously differentiable, pointwise convex and monotonic on a convex open set U. If \([-2,n(n+3)]^n\subset U\), and

$$\begin{aligned} F'(z^{(j)})d^{(j)}\not \le 0 \quad \text { for all } j\in \{1,\ldots ,n\}, \end{aligned}$$
(7)

with

$$\begin{aligned} z^{(j)}:=-2 e_j + n(n+3) e_j',\quad \text { and } \quad d^{(j)}:=e_j - (n^2+3n+1) e_j', \end{aligned}$$

then the following holds:

(a)

    F is injective on \([-1,n]^n\), \(F'(x)\) is invertible for all \(x\in [-1,n]^n\), and for all \(x,y\in [-1,n]^n\)

    $$\begin{aligned} \Vert x-y\Vert _\infty \le L \Vert F(x)-F(y)\Vert _\infty , \quad \text { and } \quad \Vert F'(x)^{-1}\Vert _\infty \le L, \end{aligned}$$
    (8)

    where

    $$\begin{aligned} L:=(n+2) \left( \min _{j=1,\ldots ,n}\ \max _{k=1,\ldots ,n} e_k^T F'(z^{(j)})d^{(j)}\right) ^{-1}>0. \end{aligned}$$
    (9)
(b)

    If, additionally, \(F(0)\le 0\le F(\mathbb {1})\), then there exists a unique

    $$\begin{aligned} {\hat{x}}\in \left( \textstyle -\frac{1}{n-1},1+\frac{1}{n-1}\right) ^n\quad \text { with } \quad F({\hat{x}})=0, \end{aligned}$$

    and this is the only zero of F in \([-1,n]^n\). The Newton iteration sequence

    $$\begin{aligned} x^{(k+1)}:=x^{(k)}- F'(x^{(k)})^{-1}F(x^{(k)})\quad \text {with initial value}\, x^{(0)}:=\mathbb {1} \end{aligned}$$
    (10)

is well defined (i.e., \(F'(x^{(k)})\) is invertible in each step) and converges to \({\hat{x}}\). Furthermore, for all \(k\in \mathbb {N}\), \(x^{(k)}\in (-1,n)^n\), and

    $$\begin{aligned} 0\le M{\hat{x}}\le M x^{(k+1)}\le M x^{(k)}\le Mx^{(0)}=(n+1)\mathbb {1}, \end{aligned}$$

where \(M:=\mathbb {1}\mathbb {1}^T+ I_n\in \mathbb {R}^{n\times n}\). The rate of convergence of \(x^{(k)}\rightarrow {\hat{x}}\) is superlinear. If \(F'\) is locally Lipschitz, then the rate of convergence is quadratic.
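Before turning to the proof, the following sketch shows the plain Newton iteration (10) in action. The toy function F is our own construction for illustration only: it is pointwise convex and monotonic, satisfies \(F(0)\le 0\le F(\mathbb {1})\), and for small coupling eps one can verify criterion (7) by direct computation:

```python
def newton(F, dF, n, tol=1e-12, max_iter=50):
    """Newton iteration (10) with the initial value x^(0) = 1 from Theorem 2."""
    x = np.ones(n)
    for _ in range(max_iter):
        step = np.linalg.solve(dF(x), F(x))   # requires F'(x) invertible
        x = x - step
        if np.linalg.norm(step, np.inf) < tol:
            break
    return x

# Toy example (an assumption for illustration, not from the paper):
# F_i(x) = exp(x_i) - 2 + eps * sum_j x_j, so F'(x) = diag(exp(x)) + eps*1 1^T >= 0.
eps = 1e-3
F  = lambda x: np.exp(x) - 2.0 + eps * np.sum(x)
dF = lambda x: np.diag(np.exp(x)) + eps * np.ones((len(x), len(x)))
x_hat = newton(F, dF, n=3)   # converges to roughly log(2) in each component
```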

To prove Theorem 2 we will first show the following lemma.

Lemma 2

Under the assumptions and with the notations of Theorem 2, the following holds:

(a)

For all \(x\in [-1,n]^n\) and \(j\in \{1,\ldots ,n\}\),

    $$\begin{aligned} \max _{k=1,\ldots ,n} e_k^T F'(x)(e_j-n e_j')\ge L^{-1}. \end{aligned}$$
(b)

    F is injective on \([-1,n]^n\), \(F'(x)\in \mathbb {R}^{n\times n}\) is invertible for all \(x\in [-1,n]^n\), and, for all \(x,y\in [-1,n]^n\),

    $$\begin{aligned} \Vert x-y\Vert _\infty \le L \Vert F(x)-F(y)\Vert _\infty , \quad \text { and } \quad \Vert F'(x)^{-1}\Vert _\infty \le L. \end{aligned}$$
(c)

    For all \(x\in [-1,n]^n\), and all \(0\ne y\in \mathbb {R}^n\)

    $$\begin{aligned} F'(x)y\ge 0 \quad \text { implies } \quad \max _{j=1,\ldots ,n} y_j= \Vert y\Vert _\infty \ \text { and } \ \min _{j=1,\ldots ,n} y_j> -\frac{1}{n} \Vert y\Vert _\infty . \end{aligned}$$
(d)

    M is invertible, \(M^{-1}=I_n-\frac{1}{n+1}\mathbb {1} \mathbb {1}^T\). For all \(x\in [-1,n]^n\), and \(y\in \mathbb {R}^n\)

    $$\begin{aligned} F'(x)y\ge 0 \quad \text { implies } \quad My\ge 0, \end{aligned}$$

    and thus \(M F'(x)^{-1}\ge 0\).

Proof

(a)

    Let \(j\in \{1,\ldots ,n\}\). Using \(z^{(j)}=-2 e_j + n(n+3) e_j'\), we have that for all \(x\in [-1,n]^n\)

    $$\begin{aligned} d^{(j)}=e_j - (n^2+3n+1) e_j' \le x-z^{(j)}\le (n+2)e_j - n(n+2)e_j' \end{aligned}$$

    and thus it follows from monotonicity (1) and convexity (2) that

    $$\begin{aligned} F'(x) \left( e_j - n e_j'\right)&=\frac{1}{n+2} F'(x) \left( (n+2)e_j - n(n+2)e_j'\right) \\&\ge \frac{1}{n+2} F'(x) \left( x - z^{(j)}\right) \ge \frac{1}{n+2} \left( F(x) - F(z^{(j)}) \right) \\&\ge \frac{1}{n+2} F'(z^{(j)}) \left( x - z^{(j)}\right) \ge \frac{1}{n+2} F'(z^{(j)}) d^{(j)}, \end{aligned}$$

    which proves (a) using the definition of L in (9).

(b)

    Since (a) implies a fortiori that for all \(x\in [-1,n]^n\)

    $$\begin{aligned} \max _{k=1,\ldots ,n} e_k^T F'(x)(e_j- e_j')\ge L^{-1}, \end{aligned}$$

    the assertion (b) follows from Lemma 1.

(c)

    Let \(x\in [-1,n]^n\), and \(0\ne y\in \mathbb {R}^n\). If there exists an index \(j\in \{1,\ldots ,n\}\) with \(y_j\le -\frac{1}{n} \Vert y\Vert _\infty \), then \(y\le -\frac{1}{n} \Vert y\Vert _\infty e_j + \Vert y\Vert _\infty e_j'\), so that

    $$\begin{aligned} - \frac{n}{ \Vert y\Vert _\infty } y \ge e_j -n e_j', \quad \text { and thus, by (a),} \quad - \frac{n}{ \Vert y\Vert _\infty } F'(x) y\not \le 0. \end{aligned}$$

    By contraposition, this shows that

    $$\begin{aligned} F'(x)y\ge 0 \quad \text { implies } \quad \min _{j=1,\ldots ,n} y_j> -\frac{1}{n} \Vert y\Vert _\infty , \end{aligned}$$

    which also shows that \( \Vert y\Vert _\infty = \max _{j=1,\ldots ,n} y_j\).

(d)

    It is easily checked that

    $$\begin{aligned} \left( I_n-\frac{1}{n+1}\mathbb {1} \mathbb {1}^T \right) \left( \mathbb {1} \mathbb {1}^T+I_n\right) =I_n, \end{aligned}$$

which shows that M is invertible and \(M^{-1}=I_n-\frac{1}{n+1}\mathbb {1} \mathbb {1}^T\).

Moreover, using (c) it follows that \(F'(x)y\ge 0\) implies that for all \(k\in \{1,\ldots ,n\}\),

$$\begin{aligned} \sum _{j=1}^n y_j + y_k\ge \max _{j=1,\ldots ,n} y_j + n \min _{j=1,\ldots ,n} y_j\ge 0, \end{aligned}$$

    so that \(F'(x)y\ge 0\) implies \(My=(\mathbb {1} \mathbb {1}^T+I_n)y\ge 0\). This also shows that

    $$\begin{aligned} F'(x) F'(x)^{-1} y=y\ge 0 \quad \text { implies } \quad M F'(x)^{-1} y\ge 0, \end{aligned}$$

    and thus \(M F'(x)^{-1}\ge 0\).

\(\square \)

Note that by Lemma 2(d), \({\tilde{F}}(x):=F(M^{-1}x)\) is a convex function with Collatz monotone derivative [33], i.e. \({\tilde{F}}'(x)^{-1}= M F'(M^{-1} x)^{-1}\ge 0\). If the Newton iterates do not leave the region where convexity and Collatz monotonicity hold, then classical results on monotone Newton methods (cf., e.g., Ortega and Rheinboldt [91, Thm. 13.3.4]) yield global Newton convergence for \({\tilde{F}}\), and thus for F, since the Newton method is invariant under linear transformations. The following lemma utilizes some arguments from the classical result [91, Thm. 13.3.4] for our situation.
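The algebra behind this remark is easy to verify numerically, e.g. with the toy Jacobian `dF` from the Newton sketch above (a quick check, not a proof):

```python
n = 3
M = np.outer(np.ones(n), np.ones(n)) + np.eye(n)       # M = 1 1^T + I_n
M_inv = np.eye(n) - np.outer(np.ones(n), np.ones(n)) / (n + 1)
assert np.allclose(M @ M_inv, np.eye(n))               # the formula from Lemma 2(d)

x = np.zeros(n)                                        # a sample point in [-1, n]^n
assert np.all(M @ np.linalg.inv(dF(x)) >= -1e-12)      # M F'(x)^{-1} >= 0
```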

Lemma 3

Let \(F:\ U\subseteq \mathbb {R}^n\rightarrow \mathbb {R}^n\) be continuously differentiable and pointwise convex on a convex open set U containing zero, and let \(M\in \mathbb {R}^{n\times n}\). We assume that for some point \(x\in U\), \(F'(x)\in \mathbb {R}^{n\times n}\) is invertible,

$$\begin{aligned} MF'(x)^{-1}\ge 0,\quad F(x)\ge 0\ge F(0), \quad \text { and } \quad Mx\ge 0. \end{aligned}$$
(11)

Then for all \(t\in [0,1]\)

$$\begin{aligned} x^{(t)}:=x- t F'(x)^{-1} F(x) \end{aligned}$$

fulfills \(0\le Mx^{(t)}\le Mx\). Moreover, if \(x^{(t)}\in U\) then \(F(x^{(t)})\ge 0\).

Proof

The assumptions (11) and the convexity (2) yield that for all \(t\in [0,1]\)

$$\begin{aligned} 0&\le - t M F'(x)^{-1} F(0)\\&= Mx^{(t)} - \left( M x- tM F'(x)^{-1} F(x)\right) - tM F'(x)^{-1}F(0)\\&= Mx^{(t)} - M x + t M F'(x)^{-1} \left( F(x) - F(0)\right) \\&\le Mx^{(t)} - M x + tM F'(x)^{-1} F'(x) ( x - 0)=Mx^{(t)}-(1-t)Mx. \end{aligned}$$

Moreover,

$$\begin{aligned} Mx-Mx^{(t)}=t M F'(x)^{-1} F(x)\ge 0, \end{aligned}$$

so that \(Mx\ge Mx^{(t)}\ge (1-t)Mx\ge 0\) is proven.

If \(x^{(t)}\in U\), then we also obtain from the convexity assumption (2) that

$$\begin{aligned} F(x^{(t)})-F(x)\ge F'(x)( x^{(t)} - x)=-t F(x), \end{aligned}$$

which shows that \(F(x^{(t)})\ge (1-t)F(x)\ge 0\). \(\square \)

Proof of Theorem 2

Theorem 2(a) has been proven in Lemma 2(b).

To prove (b), let \(x^{(k)}\in (-1,n)^n\) with \(F(x^{(k)})\ge 0\) and \(0\le M x^{(k)}\le M\mathbb {1}\). Then, by (a), \(F'(x^{(k)})\in \mathbb {R}^{n\times n}\) is invertible, so that

$$\begin{aligned} x^{(k+t)}:=x^{(k)}- t F'(x^{(k)})^{-1}F(x^{(k)})\in \mathbb {R}^n, \quad t\in [0,1], \end{aligned}$$

is well defined.

We will prove that \(x^{(k+1)}\in (-1,n)^n\). We argue by contradiction, and assume that this is not the case. Then, by continuity, there exists \(t\in (0,1]\) with \(x^{(k+t)}\in [-1,n]^n{\setminus } (-1,n)^n\) and, by Lemma 3,

$$\begin{aligned} F(x^{(k+t)})\ge 0 \quad \text { and } \quad 0\le M x^{(k+t)}\le M x^{(k)}\le M\mathbb {1}. \end{aligned}$$
(12)

Convexity (2) then yields that

$$\begin{aligned} F'(x^{(k+t)})(x^{(k+t)}-0)\ge F(x^{(k+t)})-F(0)\ge 0, \end{aligned}$$

and using Lemma 2(c) this would imply that

$$\begin{aligned} \min _{j=1,\ldots ,n}\, x^{(k+t)}_j&>-\frac{1}{n}\, \max _{j=1,\ldots ,n}\, x^{(k+t)}_j \ge -1. \end{aligned}$$
(13)

For all \(l\in \{1,\ldots ,n\}\), we obtain from (12) and (13)

$$\begin{aligned} n+1&= e_l^T M\mathbb {1} \ge e_l^T M x^{(k+t)}=x^{(k+t)}_l + \sum _{j=1}^n x^{(k+t)}_j > 2 x^{(k+t)}_l - (n-1). \end{aligned}$$

Hence, \(\max _{j=1,\ldots ,n}\, x^{(k+t)}_j< n\), so that \(x^{(k+t)}\in (-1,n)^n\). Since this contradicts \(x^{(k+t)}\in [-1,n]^n{\setminus } (-1,n)^n\), we have proven that \(x^{(k+1)}\in (-1,n)^n\).

Using Lemma 3 again, this shows that for all

$$\begin{aligned} x^{(k)}\in (-1,n)^n\ \text { with } F(x^{(k)})\ge 0, \ \text { and } \ 0\le M x^{(k)}\le M\mathbb {1}, \end{aligned}$$

the next Newton iterate \(x^{(k+1)}\) is well-defined and also fulfills

$$\begin{aligned} x^{(k+1)}\in (-1,n)^n\ \text { with } F(x^{(k+1)})\ge 0, \ \text { and } \ 0\le M x^{(k+1)}\le M x^{(k)}\le M\mathbb {1}. \end{aligned}$$

Hence, for \(x^{(0)}:=\mathbb {1}\), the Newton algorithm produces a well-defined sequence \(x^{(k)}\in (-1,n)^n\) for which \(Mx^{(k)}\) is monotonically non-increasing and bounded. Hence, \((Mx^{(k)})_{k\in \mathbb {N}}\) and thus also \((x^{(k)})_{k\in \mathbb {N}}\) converge. We define

$$\begin{aligned} {\hat{x}}:=\lim _{k\rightarrow \infty } x^{(k)}\in [-1,n]^n. \end{aligned}$$

Since F is continuously differentiable and \(F'({\hat{x}})\) is invertible, it follows from the Newton iteration formula (10) that \(F({\hat{x}})=0\). Also, the monotone convergence of \((Mx^{(k)})_{k\in \mathbb {N}}\) shows that

$$\begin{aligned} 0\le M{\hat{x}} \le Mx^{(k)} \le M\mathbb {1} \quad \text { for all } k\in \mathbb {N}. \end{aligned}$$

Moreover, since this is the standard Newton iteration, the convergence speed is superlinear and the speed is quadratic if \(F'\) is Lipschitz continuous in a neighbourhood of \({\hat{x}}\).

It only remains to show that \({\hat{x}}\in (-\frac{1}{n-1},\frac{n}{n-1})^n\subset (-1,2)^n\). For this, we use the convexity to obtain

$$\begin{aligned} F'({\hat{x}})({\hat{x}}-0)&\ge F({\hat{x}})-F(0)\ge 0,\\ F'(\mathbb {1})(\mathbb {1}-{\hat{x}})&\ge F(\mathbb {1})-F({\hat{x}})\ge 0, \end{aligned}$$

which then implies by Lemma 2(c) that

$$\begin{aligned} \min _{j=1,\ldots ,n}\, {\hat{x}}_j&> -\frac{1}{n} \max _{j=1,\ldots ,n}\, {\hat{x}}_j,\\ \min _{j=1,\ldots ,n}\, (1-{\hat{x}}_j)&> -\frac{1}{n} \max _{j=1,\ldots ,n}\, (1-{\hat{x}}_j). \end{aligned}$$

From this we obtain that

$$\begin{aligned} \min _{j=1,\ldots ,n}\, {\hat{x}}_j&> -\frac{1}{n} \max _{j=1,\ldots ,n}\, {\hat{x}}_j =\frac{1}{n} \min _{j=1,\ldots ,n}\, (1-{\hat{x}}_j)-\frac{1}{n}\nonumber \\&>-\frac{1}{n^2} \max _{j=1,\ldots ,n}\, (1-{\hat{x}}_j)-\frac{1}{n} = \frac{1}{n^2} \min _{j=1,\ldots ,n}\, {\hat{x}}_j -\frac{1}{n^2} - \frac{1}{n}, \end{aligned}$$
(14)

which yields \(\min _{j=1,\ldots ,n}\, {\hat{x}}_j> -\frac{1}{n-1}\). Using (14) again, we then obtain

$$\begin{aligned} -\frac{1}{n} \max _{j=1,\ldots ,n}\, {\hat{x}}_j&> \frac{1}{n^2} \min _{j=1,\ldots ,n}\, {\hat{x}}_j -\frac{1}{n^2} - \frac{1}{n} >-\frac{1}{n-1}, \end{aligned}$$

which shows \(\max _{j=1,\ldots ,n}\, {\hat{x}}_j<\frac{n}{n-1}\). \(\square \)
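The monotonicity guarantee of Theorem 2(b) can also be observed directly on the toy example from the sketch after Theorem 2 (reusing `F` and `dF` from there):

```python
# Track 0 <= M x^(k+1) <= M x^(k) along the Newton iterates of the toy F:
x = np.ones(3)
M = np.outer(np.ones(3), np.ones(3)) + np.eye(3)
for _ in range(8):
    x_new = x - np.linalg.solve(dF(x), F(x))
    assert np.all(M @ x_new <= M @ x + 1e-12)   # M x^(k+1) <= M x^(k)
    assert np.all(M @ x_new >= -1e-12)          # 0 <= M x^(k+1)
    x = x_new
```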

2.3 A result with tighter domain assumptions

Our results in Sects. 2.1 and 2.2 require the considered function F to be defined (and convex and monotonic) on a much larger set than \([0,1]^n\). For some applications (such as the inverse coefficient problem in Sect. 3), the following more technical variant of Theorem 2 is useful, as it allows us to treat the case where the domain of definition is an arbitrarily small neighbourhood of \([0,1]^n\).

Theorem 3

Let \(\epsilon >0\) and \(c\ge 2+\frac{2}{\epsilon }\). Let \(F:\ U\subseteq \mathbb {R}^n\rightarrow \mathbb {R}^n\), \(n\ge 2\), be continuously differentiable, pointwise convex and monotonic on a convex open set U. If \([-\frac{1+\epsilon }{cn}-\frac{\epsilon }{2cn},1+2\epsilon ]^n\subset U\), and

$$\begin{aligned} F'(z^{(j,k)})d^{(j)}\not \le 0 \quad \text { for all } j\in \{1,\ldots ,n\},\ k\in \{1,\ldots ,K\}, \end{aligned}$$
(15)

where \(K:={\text {ceil}}(\frac{2cn}{\epsilon }\left( 1+\epsilon + \frac{1+\epsilon }{cn}\right) )\in \mathbb {N}\),

$$\begin{aligned} z^{(j,k)}&:=\left( -\frac{1+\epsilon }{cn}+(k-2)\frac{\epsilon }{2cn} \right) e_j + \left( 1+2\epsilon \right) e_j', \end{aligned}$$
(16)
$$\begin{aligned} d^{(j)}&:=\frac{1}{2} e_j - \frac{1+\epsilon +cn+2\epsilon cn}{\epsilon } e_j', \end{aligned}$$
(17)

then the following holds on \(O:=(-\frac{1+\epsilon }{cn},1+\epsilon )^n\supset [0,1]^n\).

(a)

    F is injective on \({\overline{O}}\). For all \(x,y\in {\overline{O}}\), \(F'(x)\) is invertible and

    $$\begin{aligned} \Vert x-y\Vert _\infty \le L \Vert F(x)-F(y)\Vert _\infty , \quad \text { and } \quad \Vert F'(x)^{-1}\Vert _\infty \le L, \end{aligned}$$

    where

    $$\begin{aligned} L:= \left( \min _{\begin{array}{c} j=1,\ldots ,n,\\ k=1,\ldots ,K \end{array}}\ \max _{l=1,\ldots ,n} e_l^T F'(z^{(j,k)})d^{(j)}\right) ^{-1}>0. \end{aligned}$$
    (18)
(b)

    If, additionally, \(F(0)\le 0\le F(\mathbb {1})\), then there exists a unique

    $$\begin{aligned} {\hat{x}}\in {\overline{O}} \quad \text { with } \quad F({\hat{x}})=0. \end{aligned}$$

    The Newton iteration sequence

    $$\begin{aligned} x^{(i+1)}:=x^{(i)}- F'(x^{(i)})^{-1}F(x^{(i)})\quad \text {with initial value}\, x^{(0)}:=\mathbb {1} \end{aligned}$$
    (19)

is well defined (i.e., \(F'(x^{(i)})\) is invertible in each step) and converges to \({\hat{x}}\). For all \(i\in \mathbb {N}\)

    $$\begin{aligned} x^{(i)}&\in O , \quad \text { and } \quad 0\le M{\hat{x}}\le M x^{(i+1)}\le M x^{(i)}\le Mx^{(0)}=(1+cn)\mathbb {1}, \end{aligned}$$

where \(M:=\mathbb {1} \mathbb {1}^T+(1+(c-1)n)I_n\in \mathbb {R}^{n\times n}\). The rate of convergence of \(x^{(i)}\rightarrow {\hat{x}}\) is superlinear. If \(F'\) is locally Lipschitz, then the rate of convergence is quadratic.
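The evaluation points and directions in (16)-(17), as well as the number K of points per component, are fully explicit. A small helper sketch (reusing the `e`, `e_prime` conventions from the beginning of Sect. 2) could look as follows:

```python
import math

def evaluation_points(n, eps, c):
    """Evaluation points z^(j,k) from (16), directions d^(j) from (17),
    and the number K of evaluation points per component."""
    assert c >= 2.0 + 2.0 / eps        # the standing assumption on c
    K = math.ceil(2*c*n/eps * (1.0 + eps + (1.0 + eps)/(c*n)))
    z = {(j, k): (-(1.0 + eps)/(c*n) + (k - 2)*eps/(2*c*n)) * e(j, n)
                 + (1.0 + 2.0*eps) * e_prime(j, n)
         for j in range(1, n + 1) for k in range(1, K + 1)}
    d = {j: 0.5*e(j, n) - (1.0 + eps + c*n + 2.0*eps*c*n)/eps * e_prime(j, n)
         for j in range(1, n + 1)}
    return z, d, K
```

Note that K grows like \(cn/\epsilon \), so shrinking the admissible neighbourhood of \([0,1]^n\) is paid for with more directional derivative evaluations in (15).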

To prove Theorem 3 we first prove a variant of Lemma 2 with tighter domain assumptions.

Lemma 4

Under the assumptions and with the notations of Theorem 3, the following holds:

(a)

    For all \(x\in {\overline{O}}\), and \(j\in \{1,\ldots ,n\}\),

    $$\begin{aligned} \max _{l=1,\ldots ,n} e_l^T F'(x)(e_j-c n e_j')\ge L^{-1}. \end{aligned}$$
(b)

    F is injective on \({\overline{O}}\). For all \(x,y\in {\overline{O}}\), \(F'(x)\) is invertible,

    $$\begin{aligned} \Vert x-y\Vert _\infty \le L \Vert F(x)-F(y)\Vert _\infty , \quad \text { and } \quad \Vert F'(x)^{-1}\Vert _\infty \le L. \end{aligned}$$
(c)

    For all \(x\in {\overline{O}}\), and \(0\ne y\in \mathbb {R}^n\)

    $$\begin{aligned} F'(x)y\ge 0 \ \text { implies } \ \max _{j=1,\ldots ,n} y_j= \Vert y\Vert _\infty \ \text { and } \ \min _{j=1,\ldots ,n} y_j> -\frac{1}{c n} \Vert y\Vert _\infty . \end{aligned}$$
(d)

    M is invertible, \(M^{-1}=\frac{1}{1+(c-1)n} \left( I_n- \frac{1}{1+cn} \mathbb {1} \mathbb {1}^T \right) \). For all \(x\in {\overline{O}}\), and \(y\in \mathbb {R}^n\)

    $$\begin{aligned} F'(x)y\ge 0 \quad \text { implies } \quad My\ge 0, \end{aligned}$$

    and thus \(M F'(x)^{-1}\ge 0\).

Proof

We use the same arguments as in Lemma 2. To prove (a) let \(j\in \{1,\ldots ,n\}\) and \(x\in {\overline{O}}=[-\frac{1+\epsilon }{cn},1+\epsilon ]^n\). Then, by the definition of K, there exists \(k\in \{1,\ldots ,K\}\), so that

$$\begin{aligned} -\frac{1+\epsilon }{cn}+(k-1)\frac{\epsilon }{2cn} \le x_j\le -\frac{1+\epsilon }{cn}+k\frac{\epsilon }{2cn}. \end{aligned}$$

It follows from the definition of \(z^{(j,k)}\) and \(d^{(j)}\) in (16) and (17) that

$$\begin{aligned} x-z^{(j,k)}&\ge \frac{\epsilon }{2cn} e_j - \left( \frac{1+\epsilon }{cn}+1+2\epsilon \right) e_j' =\frac{\epsilon }{cn} d^{(j)},\\ x-z^{(j,k)}&\le \frac{\epsilon }{cn} e_j -\epsilon e_j'. \end{aligned}$$

We thus obtain

$$\begin{aligned}&F'(x) \left( e_j - cn e_j'\right) =\frac{cn}{\epsilon } F'(x) \left( \frac{\epsilon }{cn} e_j - \epsilon e_j'\right) \ge \frac{cn}{\epsilon } F'(x) \left( x-z^{(j,k)}\right) \\&\quad \ge \frac{cn}{\epsilon } \left( F(x) - F(z^{(j,k)}) \right) \ge \frac{cn}{\epsilon } F'(z^{(j,k)}) \left( x - z^{(j,k)}\right) \ge F'(z^{(j,k)}) d^{(j)} , \end{aligned}$$

which proves (a). Since this also implies a fortiori that

$$\begin{aligned} \max _{l=1,\ldots ,n} e_l^T F'(x)(e_j- e_j')\ge L^{-1} \quad \text { for all}\, x\in {\overline{O}}, \end{aligned}$$

(b) follows from Lemma 1.

To show (c) by contraposition, let \(x\in {\overline{O}}\), \(0\ne y\in \mathbb {R}^n\), and assume that for some index \(j\in \{1,\ldots ,n\}\), we have that \(y_j\le -\frac{1}{cn} \Vert y\Vert _\infty \). Then \(y\le -\frac{1}{cn} \Vert y\Vert _\infty e_j + \Vert y\Vert _\infty e_j'\), so that

$$\begin{aligned} - \frac{cn}{ \Vert y\Vert _\infty } y \ge e_j -cn e_j', \quad \text { and thus, by (a),} \quad - \frac{cn}{ \Vert y\Vert _\infty } F'(x) y\not \le 0. \end{aligned}$$

By contraposition, this shows that

$$\begin{aligned} F'(x)y\ge 0 \quad \text { implies } \quad \min _{j=1,\ldots ,n} y_j> -\frac{1}{cn} \Vert y\Vert _\infty , \end{aligned}$$

and the latter also implies that \( \Vert y\Vert _\infty = \max _{j=1,\ldots ,n} y_j\).

For the proof of (d), it is easily checked that

$$\begin{aligned} \frac{1}{1+(c-1)n} \left( I_n- \frac{1}{1+cn} \mathbb {1} \mathbb {1}^T \right) \left( \mathbb {1} \mathbb {1}^T+(1+(c-1)n)I_n \right) =I_n, \end{aligned}$$

which shows the invertibility of M and the asserted formula for \(M^{-1}\). Moreover, for all \(y\in \mathbb {R}^n\), and \(l\in \{1,\ldots ,n\}\),

$$\begin{aligned} e_l^T M y&= \sum _{j=1}^n y_j + (1+(c-1)n)y_l\\&\ge \max _{j=1,\ldots ,n} y_j + (n-1) \min _{j=1,\ldots ,n} y_j + (1+(c-1)n)y_l\\&=\max _{j=1,\ldots ,n} y_j + cn \min _{j=1,\ldots ,n} y_j. \end{aligned}$$

Hence, using (c), for all \(x\in {\overline{O}}\), \(F'(x)y\ge 0\) implies \(My\ge 0\). As this also shows that

$$\begin{aligned} F'(x) F'(x)^{-1} y=y\ge 0 \quad \text { implies } \quad M F'(x)^{-1} y\ge 0, \end{aligned}$$

we have \(M F'(x)^{-1}\ge 0\). \(\square \)

Proof of Theorem 3

We proceed as in the proof of Theorem 2. Assertion (a) has already been proven in Lemma 4(b).

To prove (b), let \(x^{(i)}\in O\) with \(F(x^{(i)})\ge 0\) and \(0\le M x^{(i)}\le M\mathbb {1}\). Then, by (a), \(F'(x^{(i)})\in \mathbb {R}^{n\times n}\) is invertible, so that

$$\begin{aligned} x^{(i+t)}:=x^{(i)}- t F'(x^{(i)})^{-1}F(x^{(i)})\in \mathbb {R}^n, \quad t\in [0,1] \end{aligned}$$

is well defined.

We will prove that \(x^{(i+1)}\in O=(-\frac{1+\epsilon }{cn},1+\epsilon )^n\). We argue by contradiction, and assume that this is not the case. Then, by continuity, there exists \(t\in (0,1]\) with \(x^{(i+t)}\in {\overline{O}}{\setminus } O\), so that, by Lemma 3,

$$\begin{aligned} F(x^{(i+t)})\ge 0 \quad \text { and } \quad 0\le M x^{(i+t)}\le M x^{(i)}\le M\mathbb {1}. \end{aligned}$$
(20)

Convexity (2) then yields that

$$\begin{aligned} F'(x^{(i+t)})(x^{(i+t)}-0)\ge F(x^{(i+t)})-F(0)\ge 0, \end{aligned}$$

and using Lemma 4(c) this would imply

$$\begin{aligned} \min _{j=1,\ldots ,n}\, x^{(i+t)}_j&>-\frac{1}{cn}\, \max _{j=1,\ldots ,n}\, x^{(i+t)}_j \ge -\frac{1+\epsilon }{cn}. \end{aligned}$$
(21)

For all \(l\in \{1,\ldots ,n\}\), we obtain from (20) and (21)

$$\begin{aligned} 1+cn&= e_l^T M\mathbb {1}\ge e_l^T M x^{(i+t)} = (1 + (c-1)n)x^{(i+t)}_l + \sum _{j=1}^n x^{(i+t)}_j\\&\ge (2 + (c-1)n)x^{(i+t)}_l -(n-1)\frac{1+\epsilon }{cn}, \end{aligned}$$

and thus

$$\begin{aligned} x^{(i+t)}_l&\le \frac{1+cn + (n-1)\frac{1+\epsilon }{cn}}{2 + (c-1)n} = 1+ \frac{(n-1)(1+\epsilon +cn)}{(2 + (c-1)n)cn}. \end{aligned}$$

An elementary computation shows that

$$\begin{aligned} \frac{1+\epsilon + cn}{(2 + (c-1)n)cn}< \frac{(1+\epsilon )cn + cn}{(2 + (c-1)n)cn} = \frac{2+\epsilon }{2 + (c-1)n} < \frac{2+\epsilon }{(c-1)n} \le \frac{\epsilon }{n}, \end{aligned}$$

where we used \(cn> 1\) for the first inequality, and we used the assumption \(c\ge 2+\frac{2}{\epsilon }=\frac{2+\epsilon }{\epsilon }+1\) for the last inequality. Hence, for all \(l\in \{1,\ldots ,n\}\),

$$\begin{aligned} x^{(i+t)}_l<1+\epsilon \frac{n-1}{n}<1+\epsilon , \quad \text { so that } \quad x^{(i+t)}\in O. \end{aligned}$$

This contradicts \(x^{(i+t)}\in {\overline{O}}{\setminus } O\), and thus shows that \(x^{(i+1)}\in O\).

Using Lemma 3 again, this shows that for all

$$\begin{aligned} x^{(i)}\in O\ \text { with } F(x^{(i)})\ge 0, \ \text { and } \ 0\le M x^{(i)}\le M\mathbb {1}, \end{aligned}$$

the next Newton iterate \(x^{(i+1)}\) is well-defined and also fulfills

$$\begin{aligned} x^{(i+1)}\in O\ \text { with } F(x^{(i+1)})\ge 0, \ \text { and } \ 0\le M x^{(i+1)}\le M x^{(i)}\le M\mathbb {1}. \end{aligned}$$

Hence, for \(x^{(0)}:=\mathbb {1}\), the Newton algorithm produces a well-defined sequence \(x^{(i)}\in O\) for which \(Mx^{(i)}\) is monotonically non-increasing and bounded. Hence, \((Mx^{(i)})_{i\in \mathbb {N}}\) and thus also \((x^{(i)})_{i\in \mathbb {N}}\) converge. We define

$$\begin{aligned} {\hat{x}}:=\lim _{i\rightarrow \infty } x^{(i)}\in {\overline{O}}. \end{aligned}$$

Since F is continuously differentiable and \(F'({\hat{x}})\) is invertible, it follows from the Newton iteration formula that \(F({\hat{x}})=0\). Also, the monotone convergence of \((Mx^{(i)})_{i\in \mathbb {N}}\) shows that

$$\begin{aligned} 0\le M{\hat{x}} \le Mx^{(i)} \le M\mathbb {1} \quad \text { for all } i\in \mathbb {N}. \end{aligned}$$

Moreover, since this is the standard Newton iteration, the convergence speed is superlinear and the speed is quadratic if \(F'\) is Lipschitz continuous in a neighbourhood of \({\hat{x}}\). \(\square \)

3 Application to an inverse Robin transmission problem

We will now show how to use our result to obtain uniqueness, stability and global convergence results for an inverse coefficient problem with finitely many measurements. More precisely, we show how to choose finitely many measurements so that they uniquely determine the unknown coefficient function with a given resolution, Lipschitz stability holds, and Newton’s method globally converges.

3.1 The setting

We consider the inverse Robin transmission problem for the Laplace equation from [53], that is motivated by corrosion detection. Note that similar problems have also been studied for the Helmholtz equation under the name conductive boundary condition or transition boundary condition [8, 84]. We first introduce the idealized infinite-dimensional forward and inverse problem following [53] and then study the case of finitely many measurements.

3.1.1 The infinite-dimensional forward and inverse problem

Let \({\varOmega } \subset \mathbb {R}^d\) (\(d\ge 2\)), be a bounded domain with Lipschitz boundary \(\partial {\varOmega }\) and let D be an open subset of \({\varOmega }\), with \({\overline{D}}\subset {\varOmega }\), Lipschitz boundary \({\varGamma }:=\partial D\) and connected complement \({\varOmega }{\setminus }\overline{D}\), cf. Fig. 1 in the numerical section for a sketch of the setting.

Fig. 1 Setting considered for the numerical results

We assume that \({\varOmega }\) describes an electrically conductive imaging domain, with a-priori known conductivity that we set to 1 for the ease of presentation. (Note that all of the following results remain valid if the conductivity in D and in \({\varOmega }{\setminus }\overline{D}\) are a-priori known spatially dependent functions as long as they are sufficiently regular to allow unique continuation arguments). We assume that corrosion effects on the interface \({\varGamma }\) can be modelled with an unknown Robin transmission parameter \(\gamma \in L^\infty _+({\varGamma })\), where \(L^\infty _+\) denotes the subset of \(L^\infty \)-functions with positive essential infima.

Applying an electrical current flux \(g\in L^2(\partial {\varOmega })\) on the boundary \(\partial {\varOmega }\) then yields an electric potential \(u\in H^1({\varOmega })\) solving the following Robin transmission problem with Neumann boundary values

$$\begin{aligned} {\varDelta } u&=0 \quad \text {in } {\varOmega }{\setminus } {\varGamma }, \end{aligned}$$
(22)
$$\begin{aligned} \partial _{\nu } u|_{\partial {\varOmega }}&= g \quad \text {on }\partial {\varOmega }, \end{aligned}$$
(23)
$$\begin{aligned} \llbracket u \rrbracket _{\varGamma }&= 0 \quad \text {on } {\varGamma }, \end{aligned}$$
(24)
$$\begin{aligned} \llbracket \partial _{\nu } u\rrbracket _{\varGamma }&= \gamma u \quad \text {on } {\varGamma }, \end{aligned}$$
(25)

where \(\nu \) is the unit normal vector to the interface \({\varGamma }\) or \(\partial {\varOmega }\) pointing outward of D, resp., \({\varOmega }\), and

$$\begin{aligned} \llbracket u \rrbracket := u^+|_{{\varGamma }}-u^-|_{{\varGamma }}, \quad \text { and } \quad \llbracket \partial _{\nu }u \rrbracket :=\partial _{\nu }u^+|_{\varGamma } - \partial _{\nu }u^-|_{\varGamma }, \end{aligned}$$

denote the jump of the Dirichlet, resp., Neumann trace values on \({\varGamma }\), with the superscript "\(+\)" denoting that the trace is taken from \({\varOmega }{\setminus } D\) and "\(-\)" denoting the trace taken from D. In the following, we often denote the solution of (22)–(25) by \(u_\gamma ^{(g)}\) to point out its dependence on the Robin transmission coefficient \(\gamma \) and the Neumann boundary data g.

It is easily seen that this problem is equivalent to the variational formulation of finding \(u_\gamma ^{(g)}\in H^1({\varOmega })\) such that

$$\begin{aligned} \int _{\varOmega } \nabla u_\gamma ^{(g)}\cdot \nabla w\,{\mathrm{d}} x+\int _{\varGamma } \gamma u_\gamma ^{(g)} w \,{\mathrm{d}} s=\int _{\partial {\varOmega }}gw\,{\mathrm{d}} s \quad \text { for all } w\in H^1({\varOmega }), \end{aligned}$$
(26)

and that (26) is uniquely solvable by the Lax–Milgram theorem. Hence, we can define the Neumann-to-Dirichlet map

$$\begin{aligned} {\varLambda }(\gamma ):\ L^2(\partial {\varOmega })\rightarrow L^2(\partial {\varOmega }), \ g\mapsto u_\gamma ^{(g)}|_{\partial {\varOmega }}, \ \text { where}\, u_\gamma ^{(g)}\in H^1({\varOmega }) \,\text {solves}\, (26). \end{aligned}$$

It is easy to show that \({\varLambda }(\gamma )\) is a compact self-adjoint linear operator.

One may regard \({\varLambda }(\gamma )\) as an idealized model (the so-called continuum model) of all electric current/voltage measurements that can be carried out on the outer boundary \(\partial {\varOmega }\). Hence the infinite-dimensional inverse coefficient problem of determining a Robin transmission coefficient from boundary measurements can be formulated as the problem to

$$\begin{aligned} \text { reconstruct } \quad \gamma \in L^\infty _+({\varGamma }) \quad \text { from } \quad {\varLambda }(\gamma )\in {\mathcal {L}}(L^2(\partial {\varOmega })). \end{aligned}$$

We summarize some more properties of the infinite-dimensional forward mapping \({\varLambda }\) in the following lemma.

Lemma 5

(a)

    The non-linear mapping

    $$\begin{aligned} {\varLambda }:\ L^\infty _+({\varGamma })\rightarrow {\mathcal {L}}(L^2(\partial {\varOmega })), \quad \gamma \mapsto {\varLambda }(\gamma ) \end{aligned}$$

    is Fréchet differentiable. Its derivative

    $$\begin{aligned} {\varLambda }':\ L^\infty _+({\varGamma })\rightarrow {\mathcal {L}}(L^\infty ({\varGamma }),{\mathcal {L}}(L^2(\partial {\varOmega }))) \end{aligned}$$

    is given by the bilinear form

    $$\begin{aligned} \int _{\partial {\varOmega }} g \left( {\varLambda }'(\gamma )\delta \right) h\,{\mathrm{d}} s = -\int _{\varGamma } \delta u_\gamma ^{(g)}u_\gamma ^{(h)} \,{\mathrm{d}} s, \end{aligned}$$
    (27)

    for all \(\gamma \in L^\infty _+({\varGamma })\), \(\delta \in L^\infty ({\varGamma })\), and \(g,h\in L^2(\partial {\varOmega })\), where \(u_\gamma ^{(g)}\in H^1({\varOmega })\) solves (26). \({\varLambda }'\) is locally Lipschitz continuous and \({\varLambda }'(\gamma )\delta \in {\mathcal {L}}(L^2(\partial {\varOmega }))\) is compact and self-adjoint.

(b)

For all \(g\in L^2(\partial {\varOmega })\) and all \(\gamma _1,\gamma _2\in L^\infty _+({\varGamma })\),

    $$\begin{aligned} \int _{\partial {\varOmega }} g \left( {\varLambda }(\gamma _2)-{\varLambda }(\gamma _1) \right) g \,{\mathrm{d}} s \ge \int _{\partial {\varOmega }} g \left( {\varLambda }'(\gamma _1) (\gamma _2-\gamma _1)\right) g \,{\mathrm{d}} s. \end{aligned}$$
(c)

For all \(\gamma \in L^\infty _+({\varGamma })\), \(\delta \in L^\infty ({\varGamma })\), and \(g\in L^2(\partial {\varOmega })\),

$$\begin{aligned} \delta (x)\ge 0 \text { for}\, x\in {\varGamma }\,\text { a.e.} \quad \text { implies } \quad \int _{\partial {\varOmega }} g \left( {\varLambda }'(\gamma )\delta \right) g \,{\mathrm{d}} s\le 0. \end{aligned}$$

Proof

Obviously, for all \(\gamma \in L^\infty _+({\varGamma })\) and \(\delta \in L^\infty ({\varGamma })\), (27) defines a compact self-adjoint linear operator \({\varLambda }'(\gamma )\delta \in {\mathcal {L}}(L^2(\partial {\varOmega }))\). Moreover, it follows from the monotonicity estimate in [53, Lemma 4.1] that for all \(\delta \in L^\infty ({\varGamma })\) (that are sufficiently small so that \(\gamma +\delta \in L^\infty _+({\varGamma })\))

$$\begin{aligned} \int _{\varGamma } \delta |u_\gamma ^{(g)}|^2 \,{\mathrm{d}} s \ge \int _{\partial {\varOmega }} g \left( {\varLambda }(\gamma )-{\varLambda }(\gamma +\delta )\right) g \,{\mathrm{d}} s \ge \int _{\varGamma } \left( \gamma -\frac{\gamma ^2}{\gamma +\delta } \right) |u_\gamma ^{(g)}|^2 \,{\mathrm{d}} s \end{aligned}$$

and thus

$$\begin{aligned}&\Vert {\varLambda }(\gamma +\delta )-{\varLambda }(\gamma )-{\varLambda }'(\gamma )\delta \Vert _{{\mathcal {L}}(L^2(\partial {\varOmega }))}\\&\quad =\sup _{\Vert g\Vert _{L^2(\partial {\varOmega })}=1} \left| \int _{\partial {\varOmega }} g \left( {\varLambda }(\gamma +\delta )-{\varLambda }(\gamma )-{\varLambda }'(\gamma )\delta \right) g \,{\mathrm{d}} s\right| \\&\quad \le \sup _{\Vert g\Vert _{L^2(\partial {\varOmega })}=1} \int _{\varGamma } \frac{\delta ^2 }{\gamma +\delta } |u_\gamma ^{(g)}|^2 \,{\mathrm{d}} s = O( \Vert \delta \Vert ^2). \end{aligned}$$

This shows that \({\varLambda }\) is Fréchet differentiable for all \(\gamma \in L^\infty _+({\varGamma })\), and that \({\varLambda }'(\gamma )\) is its derivative. Since it is easily shown that \(u_\gamma ^{(g)}\in H^1({\varOmega })\) depends locally Lipschitz continuously on \(\gamma \in L^\infty _+({\varGamma })\), it also follows that \({\varLambda }'\) is locally Lipschitz continuous. This proves (a).

(b) is shown in [53, Lemma 4.1], and (c) follows from (27). \(\square \)

Note that Lemma 5 shows that \({\varLambda }\) is a convex, anti-monotone function with respect to the pointwise partial order on \(L^\infty _+({\varGamma })\), and the Loewner partial order in the space of compact self-adjoint operators on \(L^2(\partial {\varOmega })\). These properties are the key to formulating the finite-dimensional inverse problem as a zero-finding problem for a pointwise convex and monotonic forward function.

3.1.2 The inverse problem with finitely many measurements

In practical applications, it is natural to assume that the unknown Robin transmission coefficient is a piecewise constant function, i.e., \(\gamma (x)=\sum _{j=1}^n \gamma _j \chi _j(x)\) where \(\gamma _j>0\) and \(\chi _j:=\chi _{{\varGamma }_j}\) are the characteristic functions on pairwise disjoint subsets \({\varGamma }_j\subseteq {\varGamma }\) of a given partition \({\varGamma }=\bigcup _{j=1}^n {\varGamma }_j\). For the ease of notation, here and in the following, we identify a piecewise constant function \(\gamma (x)=\sum _{j=1}^n \gamma _j \chi _j(x)\in L^\infty ({\varGamma })\) with the vector \(\gamma =(\gamma _1,\ldots ,\gamma _n)\in \mathbb {R}^n\). We also simply write a for the constant function \(\gamma (x)=a\), and for the vector \((a,\ldots ,a)\in \mathbb {R}^n\) (and use b analogously). Throughout this work we always assume that \(n\ge 2\).

It is also natural to assume that one knows bounds \(a,b\in \mathbb {R}\) on the unknown Robin transmission coefficient, so that \(0<a\le \gamma _j\le b\) for all \(j=1,\ldots ,n\). For the semi-discretized inverse problem of reconstructing the finite-dimensional coefficient vector \(\gamma \in [a,b]^n\subset \mathbb {R}^n\) from the (infinite-dimensional) measurements \({\varLambda }(\gamma )\), the results in [53, Thm. 2.1 and 2.2] show unique solvability and Lipschitz stability. Moreover, [53, Thm. 5.2] shows how to explicitly calculate the Lipschitz constant for a given setting using arguments similar to (and inspiring) Sect. 2 in this work.

We now go one step further and assume that we can only measure finitely many components of \({\varLambda }(\gamma )\), i.e., that we can measure

$$\begin{aligned} F(\gamma )=\left( \int _{\partial {\varOmega }} g_j {\varLambda }(\gamma ) h_j \,{\mathrm{d}} s\right) _{j=1}^m\in \mathbb {R}^m \end{aligned}$$
(28)

for a finite number of Neumann boundary data \(g_j,h_j\in L^2(\partial {\varOmega })\). Hence, the fully discretized inverse Robin transmission problem leads to the finite-dimensional non-linear inverse problem to

$$\begin{aligned} \text { determine } \quad \gamma \in [a,b]^n\subset \mathbb {R}^n \quad \text { from } \quad F(\gamma )\in \mathbb {R}^m. \end{aligned}$$

The following practically important questions are then to be answered: Given bounds [a, b] and a partition of \({\varGamma }\) (i.e., a desired resolution), how many and which Neumann boundary functions \(g_j\), \(h_j\) should be used, so that \(F(\gamma )\) uniquely determines \(\gamma \)? How good is the stability of the resulting inverse problem with regard to noisy measurements? How can one construct a globally convergent numerical algorithm to practically determine \(\gamma \) from \(F(\gamma )\)? And how good will the solution be in the case that the true Robin transmission function \(\gamma \) is not piecewise constant?

The following subsections show how these questions can be answered using the theory developed in Sect. 2. For this, let us first observe that the symmetric choice \(g_j=h_j\) leads to an inverse problem with a pointwise convex and monotonic forward function.

Lemma 6

Let \(b>a>0\), \(g_1,\ldots ,g_n\in L^2(\partial {\varOmega })\), and

$$\begin{aligned} F:\ (0,\infty )^n\rightarrow \mathbb {R}^n, \quad \gamma \mapsto F(\gamma ):=\left( \int _{\partial {\varOmega }} g_j {\varLambda }(\gamma ) g_j \,{\mathrm{d}} s\right) _{j=1}^n. \end{aligned}$$

F is a pointwise convex and anti-monotone, continuously differentiable function with locally Lipschitz continuous derivative \(F'(\gamma )\in \mathbb {R}^{n\times n}\), where

$$\begin{aligned} e_j^T F'(\gamma )e_l= \frac{\partial F_j(\gamma )}{\partial \gamma _l}=-\int _{{\varGamma }_l} |u_{\gamma }^{g_j}|^2 \,{\mathrm{d}} s \ \text { for all } \gamma \in (0,\infty )^n,\ j,l\in \{1,\ldots ,n\}. \end{aligned}$$

Given a vector \(y=(y_j)_{j=1}^n\in \mathbb {R}^n\) with \(F(b)\le y\le F(a)\), we define the rescaled function

$$\begin{aligned} {\varPhi }:\ U:=(-\infty ,\textstyle \frac{b}{b-a})^n \subseteq \mathbb {R}^n \rightarrow \mathbb {R}^n, \quad \displaystyle {\varPhi }(\xi ):=\frac{1}{b-a} \left( F(r(\xi ))-y \right) , \end{aligned}$$

with \(r(\xi ):=b\mathbb {1}-(b-a)\xi \). Then \({\varPhi }\) is a pointwise convex and monotonic, continuously differentiable function with locally Lipschitz continuous derivative, and \({\varPhi }'(\xi )=-F'(r(\xi ))\) for all \(\xi \in U\). Also, \({\varPhi }(0)\le 0\le {\varPhi }(\mathbb {1})\).

Proof

This follows immediately from Lemma 5. \(\square \)
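In code, the rescaling of Lemma 6 is a thin wrapper around the measurement map. The following sketch assumes that `F` and `dF`, standing for the map (28) with \(g_j=h_j\) and its Jacobian, are supplied by some PDE solver (not provided here):

```python
def make_phi(F, dF, a, b, y):
    """The rescaled function Phi from Lemma 6 and its Jacobian, given the
    measurement map F with Jacobian dF, bounds 0 < a < b, and data y."""
    r = lambda xi: b - (b - a) * xi          # r(xi) = b*1 - (b-a)*xi, componentwise
    Phi = lambda xi: (F(r(xi)) - y) / (b - a)
    dPhi = lambda xi: -dF(r(xi))             # chain rule: Phi'(xi) = -F'(r(xi))
    return Phi, dPhi
```

Since \({\varPhi }(0)\le 0\le {\varPhi }(\mathbb {1})\), this puts the inverse Robin problem precisely into the form required by Theorems 2 and 3, so the Newton sketch after Theorem 2 applies verbatim to `Phi` and `dPhi`.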

3.2 Uniqueness, stability and global Newton convergence

We summarize the assumptions on the setting: Let \({\varOmega } \subset \mathbb {R}^d\) (\(d\ge 2\)), be a bounded domain with Lipschitz boundary \(\partial {\varOmega }\) and let D be an open subset of \({\varOmega }\), with \({\overline{D}}\subset {\varOmega }\), Lipschitz boundary \({\varGamma }:=\partial D\) and connected complement \({\varOmega }{\setminus }\overline{D}\). We assume that the true unknown Robin transmission coefficient \({{\hat{\gamma }}}\in L^\infty _+({\varGamma })\) is bounded by \(b\ge {{\hat{\gamma }}}(x)\ge a\) for \(x\in {\varGamma }\) a.e., with a-priori known bounds \(b>a>0\), and that \({{\hat{\gamma }}}=\sum _{j=1}^n {{\hat{\gamma }}}_j \chi _j\), \(\chi _j:=\chi _{{\varGamma }_j}\), is piecewise constant on an a-priori known partition \({\varGamma }=\bigcup _{j=1}^n {\varGamma }_j\) into \(n\in \mathbb {N}\), \(n\ge 2\), pairwise disjoint measurable subsets \({\varGamma }_j\subset {\varGamma }\) with positive measure.

We will show how to construct n Neumann boundary functions \(g_j\in L^2(\partial {\varOmega })\), so that \(\gamma \in [a,b]^n\) is uniquely determined by the n measurements

$$\begin{aligned} F(\gamma )=\left( \int _{\partial {\varOmega }} g_j {\varLambda }(\gamma ) g_j \,{\mathrm{d}} s\right) _{j=1}^n\in \mathbb {R}^n. \end{aligned}$$

We also quantify the Lipschitz stability constant of this inverse problem, and show that the inverse problem can be numerically solved with a globally convergent Newton method. Our main tool is to reformulate the finite-dimensional inverse problem as a zero finding problem for the pointwise monotonic and convex function \({\varPhi }\) introduced in Lemma 6. Then we use a relation to the concept of localized potentials [41] to prove that we can choose the measurements \(g_j\) in such a way that \({\varPhi }\) also fulfills the additional assumptions on the directional derivative of the forward function as required by our Newton convergence theory in Sect. 2.

At this point, let us stress that our theory in Sect. 2 allows us to find \(m=n\) measurements that uniquely recover the n unknown components of the Robin coefficient, but it also requires using exactly those \(m=n\) measurements to ensure global convergence of Newton’s method. In practice, it would be highly desirable to utilize additional measurements for the sake of redundancy and error reduction. But our convergence theory in Sect. 2 does not cover the case \(m>n\) yet, and an extension to, e.g., Gauss–Newton or Levenberg–Marquardt methods seems far from trivial. Moreover, for some applications, it would be desirable to also treat the interior boundary \({\varGamma }\) as unknown. But it is not clear whether the interplay of parametrization, differentiability and localized potentials results that is required to fulfill the assumptions of Sect. 2 can also be extended to this case. Hence, in all of the following, we will only consider the case where the interior boundary is known and utilize exactly \(m=n\) measurements.

3.2.1 Choosing the measurements (for specific bounds)

To demonstrate the key idea, we will first consider the specific (and rather restrictive) case where the bounds a, b fulfill

$$\begin{aligned} n(n+3)<\frac{b}{b-a}, \end{aligned}$$

since this case can be treated by simply combining Theorem 2 with a known localized potentials result from [53]. The case of general bounds \(b>a>0\) is more technical and requires an extended result on simultaneously localized potentials. It will be treated in Sect. 3.2.2.

Theorem 4

Given bounds \(b>a>0\) that fulfill \(n(n+3)<\frac{b}{b-a}\), we define the piecewise constant functions \(\kappa ^{(j)}\in L^\infty _+({\varGamma })\), \(j=1,\ldots ,n\), by setting

$$\begin{aligned} \kappa ^{(j)}:= \left\{ \begin{array}{l l} 3b-2a &{}\quad \text { on } {\varGamma }_j,\\ b- n(n+3)(b-a) &{}\quad \text { on } {\varGamma }{\setminus } {\varGamma }_j. \end{array} \right. \end{aligned}$$
  1. (a)

    If, for all \(j=1,\ldots ,n\), \(g_j\in L^2(\partial {\varOmega })\) fulfills

    $$\begin{aligned} \lambda ^{(j)}:= \int _{{\varGamma }_j} |u_{\kappa ^{(j)}}^{g_j}|^2 \,{\mathrm{d}} s - (n^2+3n+1)\int _{{\varGamma }{\setminus } {\varGamma }_j} |u_{\kappa ^{(j)}}^{g_j} |^2 \,{\mathrm{d}} s>0, \end{aligned}$$
    (29)

    then the finite-dimensional non-linear inverse problem of determining

    $$\begin{aligned} \gamma =\left( \gamma _j\right) _{j=1}^n\in [a,b]^n \quad \text { from }\quad F(\gamma ):=\left( \int _{\partial {\varOmega }} g_j {\varLambda }(\gamma ) g_j\,{\mathrm{d}} s\right) _{j=1}^n\in \mathbb {R}^n \end{aligned}$$

    has a unique solution in \([a,b]^n\), and \(\gamma \) depends Lipschitz continuously on \(F(\gamma )\). Moreover, the iterates of Newton’s method applied to the problem of determining \(\gamma \) from \(F(\gamma )\), with initial value \(\gamma ^{(0)}=a\in \mathbb {R}^n\), quadratically converge to the unique solution \(\gamma \) (see Lemma 7 for more details on the properties of F).

  2. (b)

In any large enough finite-dimensional subspace of \(L^2(\partial {\varOmega })\), one can find \(g_1,\ldots ,g_n\) fulfilling (29) by the following construction (see also the code sketch after the theorem):

    Let \((f_m)_{m\in \mathbb {N}}\subseteq L^2(\partial {\varOmega })\) be a sequence of vectors with dense linear span in \(L^2(\partial {\varOmega })\). Let \(j\in \{1,\ldots ,n\}\), \(m\in \mathbb {N}\), and let \(M^{(j)}_m\in \mathbb {R}^{m\times m}\) be the symmetric matrix with entries (\(l,k=1,\ldots ,m\))

    $$\begin{aligned} e_l^T M^{(j)}_m e_k:=\int _{{\varGamma }_j} u_{\kappa ^{(j)}}^{f_l} u_{\kappa ^{(j)}}^{f_k} \,{\mathrm{d}} s - (n^2+3n+1)\int _{{\varGamma }{\setminus } {\varGamma }_j} u_{\kappa ^{(j)}}^{f_l} u_{\kappa ^{(j)}}^{f_k} \,{\mathrm{d}} s. \end{aligned}$$

    Then, for sufficiently large dimension \(m\in \mathbb {N}\), the matrix \(M^{(j)}_m\in \mathbb {R}^{m\times m}\) has a positive eigenvalue \(\lambda ^{(j)}>0\). For a corresponding normalized eigenvector \(v^{(j)}=(v^{(j)}_1,\ldots ,v^{(j)}_m)\in \mathbb {R}^m\), (29) is fulfilled by

    $$\begin{aligned} g_j:=\sum _{k=1}^m v^{(j)}_k f_k\in L^2(\partial {\varOmega }). \end{aligned}$$
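The construction in part (b) translates directly into a numerical procedure. The following is a minimal sketch, assuming a hypothetical solver interface `trace_solution(kappa, f, part)` that returns the values of the solution \(u_{\kappa }^{f}\) of the Robin transmission problem on a quadrature grid of the boundary part `part` together with the quadrature weights; everything else only uses numpy.

```python
import numpy as np

def construct_g(n, kappa_j, basis, trace_solution, m_max=50):
    """Sketch of Theorem 4(b): enlarge the trial space until the symmetric
    matrix M_m^{(j)} has a positive eigenvalue, then return lambda^{(j)} and
    the coefficients of g_j = sum_k v_k f_k in the basis (f_1, ..., f_m).

    trace_solution(kappa, f, part) is an assumed user-supplied PDE solver
    returning (values, weights) of u_kappa^f on 'Gamma_j' or 'complement'.
    """
    w = n**2 + 3 * n + 1
    for m in range(1, m_max + 1):
        U_on, U_off = [], []
        for l in range(m):
            u_on, w_on = trace_solution(kappa_j, basis[l], 'Gamma_j')
            u_off, w_off = trace_solution(kappa_j, basis[l], 'complement')
            U_on.append(u_on)
            U_off.append(u_off)
        U_on, U_off = np.array(U_on), np.array(U_off)
        # entry (l,k): int_{Gamma_j} u_l u_k ds - (n^2+3n+1) int_{rest} u_l u_k ds
        M = (U_on * w_on) @ U_on.T - w * ((U_off * w_off) @ U_off.T)
        lam, V = np.linalg.eigh(M)       # eigenvalues in ascending order
        if lam[-1] > 0:                  # positive eigenvalue found
            return lam[-1], V[:, -1]     # lambda^{(j)} and coefficients of g_j
    raise RuntimeError("no positive eigenvalue found up to m_max")
```

In practice the PDE solves can be cached across the loop, so enlarging the trial space by one basis function costs only one additional solve.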

To prove Theorem 4, we formulate the consequences of our Newton convergence theory in Sect. 2 in the following lemma.

Lemma 7

If \(n(n+3)<\frac{b}{b-a}\) then

$$\begin{aligned} \textstyle [a,b]^n\subset I\subset I' \subset (0,\infty )^n, \end{aligned}$$

where \(I:=(a-\frac{b-a}{n-1},b+\frac{b-a}{n-1})^n\), and \(I':=\left[ b-n(b-a),2b-a\right] ^n\). If, for all \(j=1,\ldots ,n\), \(g_j\in L^2(\partial {\varOmega })\) fulfills (29), then the function

$$\begin{aligned} F:\ (0,\infty )^n\rightarrow \mathbb {R}^n, \quad F(\gamma ):=\left( \int _{\partial {\varOmega }} g_j {\varLambda }(\gamma ) g_j \,{\mathrm{d}} s\right) _{j=1}^n\in \mathbb {R}^n \end{aligned}$$

has the following properties:

  1. (a)

    F is pointwise convex, anti-monotone, and continuously differentiable.

  2. (b)

    F is injective on \(I'\), and its Jacobian \(F'(\gamma )\) is invertible for all \(\gamma \in I'\).

  3. (c)

    With \(L:=(n+2) \left( \min _{j=1,\ldots ,n}\ \lambda ^{(j)} \right) ^{-1}\), we have that for all \(\gamma ,\gamma '\in I'\)

    $$\begin{aligned} \Vert \gamma -\gamma '\Vert _\infty \le L \Vert F(\gamma )-F(\gamma ')\Vert _\infty \quad \text { and } \quad \Vert F'(\gamma )^{-1}\Vert _\infty \le L. \end{aligned}$$
  4. (d)

    For all \(\gamma \in [a,b]^n\), \(F(b)\le F(\gamma ) \le F(a)\). Moreover, for all \(y\in \mathbb {R}^n\) with \(F(b)\le y \le F(a)\) there exists a unique \(\gamma \in I\) with \(F(\gamma )=y\). The Newton iteration

    $$\begin{aligned} \gamma ^{(i+1)}:=\gamma ^{(i)}-F'(\gamma ^{(i)})^{-1} \left( F(\gamma ^{(i)}) - y \right) , \ \text { started with}\, \gamma ^{(0)}:=a \in \mathbb {R}^n, \end{aligned}$$

    produces a sequence \((\gamma ^{(i)})_{i\in \mathbb {N}}\subset I'\) that converges quadratically to \(\gamma \).

Proof

We define r and \({\varPhi }\) as in Lemma 6. Then,

$$\begin{aligned} \textstyle r([0,1]^n)=[a,b]^n, \quad r((-\frac{1}{n-1}, 1+\frac{1}{n-1} ) ^n)=I, \quad r([-1,n]^n)=I', \end{aligned}$$

so that the interval inclusions follow from the anti-monotonicity of r.

Lemma 6 yields assertion (a) and that \({\varPhi }:\ U\subseteq \mathbb {R}^n\rightarrow \mathbb {R}^n\) is a continuously differentiable pointwise convex and monotonic function on \(U=(-\infty ,\textstyle \frac{b}{b-a})^n\). The assumption \(n(n+3)<\frac{b}{b-a}\) guarantees that U contains \([-2,n(n+3)]^n\). Moreover, using that

$$\begin{aligned} \kappa ^{(j)} =r(-2 e_j + n(n+3) e_j')\in (0,\infty )^n, \end{aligned}$$

it follows from Lemma 6 and (29) that

$$\begin{aligned}&e_j^T {\varPhi }'(-2 e_j + n(n+3) e_j')\left( e_j - (n^2+3n+1) e_j' \right) \\&\quad =\int _{{\varGamma }_j} |u_{\kappa ^{(j)}}^{g_j}|^2 \,{\mathrm{d}} s - (n^2+3n+1) \int _{{\varGamma }{\setminus } {\varGamma }_j} |u_{\kappa ^{(j)}}^{g_j}|^2 \,{\mathrm{d}} s =\lambda ^{(j)}>0, \end{aligned}$$

so that \({\varPhi }\) fulfills the assumptions of Theorem 2.

It follows that \({\varPhi }\) is injective on \([-1,n]^n\), and that for all \(\xi ,\xi '\in [-1,n]^n\), \({\varPhi }'(\xi )\) is invertible,

$$\begin{aligned} \Vert \xi -\xi '\Vert _\infty \le L \Vert {\varPhi }(\xi )-{\varPhi }(\xi ')\Vert _\infty , \quad \text { and } \quad \Vert {\varPhi }'(\xi )^{-1}\Vert _\infty \le L. \end{aligned}$$

This proves that F fulfills (b) and (c) on \(r([-1,n]^n)=I'\).

The first assertion in (d) follows from the anti-monotonicity of F. For the remaining assertions in (d) note that, by Lemma 6, \({\varPhi }'\) is also locally Lipschitz continuous, and \({\varPhi }(0)\le 0\le {\varPhi }(1)\). Hence, Theorem 2(b) yields that there exists a unique \(\xi \in (-\frac{1}{n-1}, 1+\frac{1}{n-1} ) ^n\) with \({\varPhi }(\xi )=0\), and thus a unique \(\gamma \in I\) that solves \(F(\gamma )=y\).

Moreover, Theorem 2(b) yields that the Newton iteration applied to \({\varPhi }\) with initial value \(\xi ^{(0)}=\mathbb {1}\) does not leave the interval \((-1,n)^n\) and converges quadratically to the unique solution \(\xi \) of \({\varPhi }(\xi )=0\). Since the Newton iteration is invariant under invertible affine transformations, this yields that the Newton iteration applied to F with \(\gamma ^{(0)}:=a\in \mathbb {R}^n\) does not leave \(I'\) and converges quadratically to the unique solution \(\gamma \) of \(F(\gamma )=y\). \(\square \)
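The affine invariance invoked in this last step is classical and easy to check numerically: if \(g(y)=f(T^{-1}(y-b))\) with invertible T, then the Newton iterates for g are exactly the Newton iterates for f transformed by \(y=Tx+b\). A small self-contained one-dimensional illustration (independent of our particular F):

```python
import numpy as np

def newton_steps(f, fp, x0, k=6):
    """Record k steps of the standard Newton iteration for a scalar function."""
    xs = [x0]
    for _ in range(k):
        xs.append(xs[-1] - f(xs[-1]) / fp(xs[-1]))
    return np.array(xs)

f  = lambda x: x**2 - 2.0            # zero at sqrt(2)
fp = lambda x: 2.0 * x
g  = lambda y: f((y - 1.0) / 3.0)    # affine change of variables y = 3x + 1
gp = lambda y: fp((y - 1.0) / 3.0) / 3.0

xs = newton_steps(f, fp, 2.0)
ys = newton_steps(g, gp, 3.0 * 2.0 + 1.0)
print(np.allclose(ys, 3.0 * xs + 1.0))  # True: iterates transform consistently
```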

Proof of Theorem 4

  1. (a)

    follows from Lemma 7.

  2. (b)

    Let \(j\in \{1,\ldots ,n\}\). From the localized potentials result in [53, Lemma 4.3], it follows that there exists \(g\in L^2(\partial {\varOmega })\) with

    $$\begin{aligned} \int _{{\varGamma }_j} |u_{\kappa ^{(j)}}^{g}|^2 \,{\mathrm{d}} s - (n^2+3n+1)\int _{{\varGamma }{\setminus } {\varGamma }_j} |u_{\kappa ^{(j)}}^{g} |^2 \,{\mathrm{d}} s>1. \end{aligned}$$

    By density and continuity, for sufficiently large \(m\in \mathbb {N}\), there exists a function \(f\in {\mathrm {span}}\{f_1,\ldots ,f_m \}\) with

    $$\begin{aligned} \int _{{\varGamma }_j} |u_{\kappa ^{(j)}}^{f}|^2 \,{\mathrm{d}} s - (n^2+3n+1)\int _{{\varGamma }{\setminus } {\varGamma }_j} |u_{\kappa ^{(j)}}^{f} |^2 \,{\mathrm{d}} s>\frac{1}{2}. \end{aligned}$$

    Writing \(f=\sum _{l=1}^m v_l f_l\) with \(v_l\in \mathbb {R}\), and \(v:=(v_l)_{l=1}^m\in \mathbb {R}^m\), we thus have

    $$\begin{aligned} v^T M_m^{(j)} v=\int _{{\varGamma }_j} |u_{\kappa ^{(j)}}^{f}|^2 \,{\mathrm{d}} s - (n^2+3n+1)\int _{{\varGamma }{\setminus } {\varGamma }_j} |u_{\kappa ^{(j)}}^{f} |^2 \,{\mathrm{d}} s>\frac{1}{2}. \end{aligned}$$

This shows that the symmetric matrix \(M_m^{(j)}\in \mathbb {R}^{m\times m}\) must have a positive eigenvalue when its dimension \(m\in \mathbb {N}\) is sufficiently large. On the other hand, every normalized eigenvector \(v^{(j)}=(v^{(j)}_1,\ldots ,v^{(j)}_m)\in \mathbb {R}^m\) corresponding to a positive eigenvalue \(\lambda ^{(j)}>0\) fulfills

    $$\begin{aligned} 0 < \lambda ^{(j)}&= (v^{(j)})^T M_m^{(j)} v^{(j)}\\&= \int _{{\varGamma }_j} |u_{\kappa ^{(j)}}^{g_j}|^2 \,{\mathrm{d}} s - (n^2+3n+1)\int _{{\varGamma }{\setminus } {\varGamma }_j} |u_{\kappa ^{(j)}}^{g_j} |^2 \,{\mathrm{d}} s, \end{aligned}$$

    with \(g_j:=\sum _{k=1}^m v^{(j)}_k f_k\in L^2(\partial {\varOmega })\), so that (b) is proven.

\(\square \)

3.2.2 Choosing the measurements (for general bounds)

We now show how to treat the case of general bounds \(b>a>0\).

Theorem 5

Given \(b>a>0\), choose \(0<\epsilon <\frac{a}{2(b-a)}\), and set \(c:=2+\frac{2}{\epsilon }\), \( K:={\text {ceil}}\left( \frac{2cn}{\epsilon }\left( 1+\epsilon + \frac{1+\epsilon }{cn}\right) \right) , \) and \(\beta := \frac{1+\epsilon +cn+2\epsilon cn}{\epsilon }\) (a short code sketch evaluating these constants follows after the theorem). Let \(\kappa ^{(j,k)}\in L^\infty _+({\varGamma })\) denote the piecewise constant function

$$\begin{aligned} \kappa ^{(j,k)}&:= \left\{ \begin{array}{l l} b- (b-a)\left( -\frac{1+\epsilon }{cn}+(k-2)\frac{\epsilon }{2cn} \right) &{}\quad \text { on } {\varGamma }_j,\\ a-2\epsilon (b-a) &{}\quad \text { on } {\varGamma }{\setminus } {\varGamma }_j. \end{array} \right. \end{aligned}$$
(30)
  1. (a)

    If \(g_j\in L^2(\partial {\varOmega })\), \(j=1,\ldots ,n\), fulfills

    $$\begin{aligned} \lambda _{j,k}&:= \frac{1}{2}\int _{{\varGamma }_j} |u_{\kappa ^{(j,k)}}^{g_j}|^2 \,{\mathrm{d}} s - \beta \int _{{\varGamma }{\setminus } {\varGamma }_j} |u_{\kappa ^{(j,k)}}^{g_j}|^2 \,{\mathrm{d}} s>0, \end{aligned}$$
    (31)

    for all \(k=1,\ldots ,K\), then the finite-dimensional non-linear inverse problem of determining

    $$\begin{aligned} \gamma =\left( \gamma _j\right) _{j=1}^n\in [a,b]^n \quad \text { from }\quad F(\gamma ):=\left( \int _{\partial {\varOmega }} g_j {\varLambda }(\gamma ) g_j\,{\mathrm{d}} s\right) _{j=1}^n\in \mathbb {R}^n \end{aligned}$$

has a unique solution in \([a,b]^n\), and \(\gamma \) depends Lipschitz continuously on \(F(\gamma )\). Moreover, the iterates of Newton’s method applied to the problem of determining \(\gamma \) from \(F(\gamma )\), with initial value \(\gamma ^{(0)}=a\in \mathbb {R}^n\), quadratically converge to the unique solution \(\gamma \) (see Lemma 8 for more details on the properties of F).

  2. (b)

In any large enough finite-dimensional subspace of \(L^2(\partial {\varOmega })\), one can find \(g_1,\ldots ,g_n\) fulfilling (31) by the following construction:

    Let \((f_m)_{m\in \mathbb {N}}\subseteq L^2(\partial {\varOmega })\) be a sequence of vectors with dense linear span in \(L^2(\partial {\varOmega })\). Let \(j\in \{1,\ldots ,n\}\), \(m\in \mathbb {N}\), and \(v^{(j)}=(v^{(j)}_1,\ldots ,v^{(j)}_m)\in \mathbb {R}^m\) be a normalized eigenvector corresponding to a largest eigenvalue of the symmetric matrix

    $$\begin{aligned} M_m^{(j)}:=\frac{\alpha }{2} S^{(j)} - \beta \sum _{k=1}^K R^{(j,k)} \in \mathbb {R}^{m\times m}, \end{aligned}$$

    where \(\alpha :=( 1+ \frac{b-a}{b} (1+\epsilon + \frac{1+\epsilon }{cn}))^{-1}\), and for \(i,l=1,\ldots ,m\),

$$\begin{aligned} e_i^T S^{(j)} e_l:=\int _{{\varGamma }_j} u_{\kappa ^{(j,1)}}^{f_i} u_{\kappa ^{(j,1)}}^{f_l} \,{\mathrm{d}} s, \ \text { and } \ e_i^T R^{(j,k)} e_l:=\int _{{\varGamma }{\setminus } {\varGamma }_j} u_{\kappa ^{(j,k)}}^{f_i} u_{\kappa ^{(j,k)}}^{f_l} \,{\mathrm{d}} s. \end{aligned}$$

    Then, for sufficiently large dimension \(m\in \mathbb {N}\), (31) is fulfilled by

    $$\begin{aligned} g_{j}:=\sum _{l=1}^m v^{(j)}_l f_l\in {\mathrm {span}}\{f_1,\ldots ,f_m \}\subset L^2(\partial {\varOmega }). \end{aligned}$$
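The constants in Theorem 5 are fully explicit and cheap to evaluate. As a plausibility check (a sketch, not part of the proof), the following lines compute c, K, \(\alpha \) and \(\beta \); with the values \(a=1\), \(b=2\), \(n=4\) and \(\epsilon =0.9\,\frac{a}{2(b-a)}=0.45\) used in Sect. 3.3 below, they reproduce \(K=173\).

```python
import math

def theorem5_constants(a, b, n, eps):
    """Evaluate the constants c, K, alpha, beta from Theorem 5."""
    assert 0 < eps < a / (2 * (b - a))
    c = 2 + 2 / eps
    K = math.ceil(2 * c * n / eps * (1 + eps + (1 + eps) / (c * n)))
    beta = (1 + eps + c * n + 2 * eps * c * n) / eps
    alpha = 1 / (1 + (b - a) / b * (1 + eps + (1 + eps) / (c * n)))
    return c, K, alpha, beta

c, K, alpha, beta = theorem5_constants(a=1.0, b=2.0, n=4, eps=0.45)
print(K)  # 173
```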

As in Sect. 3.2.1, we prove Theorem 5(a) by applying our Newton convergence theory from Sect. 2 in the following lemma.

Lemma 8

We have that

$$\begin{aligned} \textstyle [a,b]^n\subseteq I:=\left[ a-(b-a)\epsilon , b+(b-a)\frac{1+\epsilon }{cn}\right] ^n \subseteq (0,\infty )^n. \end{aligned}$$

If, for all \(j=1,\ldots ,n\), \(g_j\in L^2(\partial {\varOmega })\) fulfills (31) for all \(k=1,\ldots ,K\), then the function

$$\begin{aligned} F:\ (0,\infty )^n\rightarrow \mathbb {R}^n, \quad F(\gamma ):=\left( \int _{\partial {\varOmega }} g_j {\varLambda }(\gamma ) g_j \,{\mathrm{d}} s\right) _{j=1}^n\in \mathbb {R}^n \end{aligned}$$

has the following properties:

  1. (a)

    F is pointwise convex, anti-monotone, and continuously differentiable.

  2. (b)

    F is injective on I, and its Jacobian \(F'(\gamma )\) is invertible for all \(\gamma \in I\).

  3. (c)

    For all \(\gamma ,\gamma '\in I\),

    $$\begin{aligned} \Vert \gamma -\gamma '\Vert _\infty \le L \Vert F(\gamma )-F(\gamma ')\Vert _\infty \quad \text { and } \quad \Vert F'(\gamma )^{-1}\Vert _\infty \le L, \end{aligned}$$

    where \(L:=\left( \min _{j=1,\ldots ,n,\ k=1,\ldots ,K}\ \lambda _{j,k} \right) ^{-1}\).

  4. (d)

    For all \(\gamma \in [a,b]^n\), \(F(b)\le F(\gamma ) \le F(a)\). Moreover, for all \(y\in \mathbb {R}^n\) with \(F(b)\le y \le F(a)\) there exists a unique \(\gamma \in I\) with \(F(\gamma )=y\). The Newton iteration

    $$\begin{aligned} \gamma ^{(i+1)}:=\gamma ^{(i)}-F'(\gamma ^{(i)})^{-1} \left( F(\gamma ^{(i)}) - y \right) , \ \text { started with } \ \gamma ^{(0)}:=a \in \mathbb {R}^n, \end{aligned}$$

    produces a sequence \((\gamma ^{(i)})_{i\in \mathbb {N}}\subset I\) that converges quadratically to \(\gamma \).

Proof

We define r and \({\varPhi }\) as in Lemma 6. Then \(r([0,1]^n)=[a,b]^n\) and

$$\begin{aligned} \textstyle r([-\frac{1+\epsilon }{cn},1+\epsilon ]^n)=[a-(b-a)\epsilon , b+(b-a)\frac{1+\epsilon }{cn}]^n. \end{aligned}$$

Lemma 6 yields assertion (a) and that \({\varPhi }:\ U\subseteq \mathbb {R}^n\rightarrow \mathbb {R}^n\) is a continuously differentiable pointwise convex and monotonic function on \(U=(-\infty ,\textstyle \frac{b}{b-a})^n\), with locally Lipschitz continuous \({\varPhi }'\), and \({\varPhi }(0)\le 0\le {\varPhi }(1)\). Also, U contains \([-\frac{1+\epsilon }{cn}-\frac{\epsilon }{2cn},1+2\epsilon ]^n\) since \(\epsilon <\frac{a}{2(b-a)}\).

Moreover, with \(z^{(j,k)},d^{(j)}\in \mathbb {R}^n\) defined by (16) and (17), we have that

$$\begin{aligned} \kappa ^{(j,k)}=r(z^{(j,k)}) \quad \text { and } \quad d^{(j)}=\frac{1}{2}e_j-\beta e_j'. \end{aligned}$$

It thus follows from Lemma 6 and (31) that

$$\begin{aligned} e_j^T {\varPhi }'(z^{(j,k)})d^{(j)} =\frac{1}{2}\int _{{\varGamma }_j} |u_{\kappa ^{(j,k)}}^{g_j}|^2 \,{\mathrm{d}} s - \beta \int _{{\varGamma }{\setminus } {\varGamma }_j} |u_{\kappa ^{(j,k)}}^{g_j}|^2 \,{\mathrm{d}} s=\lambda _{j,k}>0, \end{aligned}$$

so that \({\varPhi }\) fulfills the assumptions of Theorem 3, which then yields the assertions (b)–(d). \(\square \)

To prove Theorem 5(b), we need to ensure that there exist Neumann data \(g_j\in L^2(\partial {\varOmega })\) so that the corresponding solutions \(u_{\kappa ^{(j,k)}}^{g_j}\) are much larger on \({\varGamma }_j\) than on \({\varGamma }{\setminus } {\varGamma }_j\), and this property has to hold for several Robin transmission coefficients \(\kappa ^{(j,k)}\), \(k=1,\ldots ,K\), simultaneously.

Note that for fixed \(j\in \{1,\ldots ,n\}\) the Robin transmission coefficients \(\kappa ^{(j,k)}\) only differ on \({\varGamma }_j\). Hence, the following lemma will allow us to estimate \(u_{\kappa ^{(j,k)}}^{g_j}\) on \({\varGamma }_j\) by \(u_{\kappa ^{(j,1)}}^{g_j}\).

Lemma 9

Let \(\gamma ^{(1)},\gamma ^{(2)}\in L^\infty _+({\varGamma })\) with \(\gamma ^{(1)}=\gamma ^{(2)}\) on \({\varGamma }{\setminus } {\varGamma }_0\), where \({\varGamma }_0\) is a measurable subset of \({\varGamma }\) with positive measure. Then, for all \(g\in L^2(\partial {\varOmega })\), the corresponding solutions \(u_1,u_2\in H^1({\varOmega })\) of (22)–(25) with \(\gamma =\gamma ^{(1)}\), and \(\gamma =\gamma ^{(2)}\), respectively, fulfill

$$\begin{aligned} \Vert u_2\Vert _{L^2({\varGamma }_0)} \le \left( 1+\frac{ \Vert \gamma ^{(1)}-\gamma ^{(2)}\Vert _{L^\infty ({\varGamma }_0)}}{\inf (\gamma ^{(2)}|_{{\varGamma }_0})} \right) \Vert u_1\Vert _{L^2({\varGamma }_0)}. \end{aligned}$$

Proof

We proceed analogously to [50, Lemma 3.6]. It follows from the variational formulation (26) that

$$\begin{aligned}&\inf (\gamma ^{(2)}|_{{\varGamma }_0}) \Vert u_2-u_1\Vert _{L^2({\varGamma }_0)}^2\\&\quad \le \int _{\varOmega }\sigma |\nabla (u_2-u_1)|^2 \,{\mathrm{d}} x+\int _{\varGamma } \gamma ^{(2)} |u_2-u_1|^2 \,{\mathrm{d}} s\\&\quad =\int _{\partial {\varOmega }}g(u_2-u_1)\,{\mathrm{d}} s - \int _{\varOmega }\sigma \nabla u_1 \cdot \nabla (u_2-u_1) \,{\mathrm{d}} x-\int _{\varGamma } \gamma ^{(2)} u_1 (u_2-u_1) \,{\mathrm{d}} s\\&\quad = \int _{\varGamma } (\gamma ^{(1)}-\gamma ^{(2)}) u_1 (u_2-u_1) \,{\mathrm{d}} s= \int _{{\varGamma }_0} (\gamma ^{(1)}-\gamma ^{(2)}) u_1 (u_2-u_1) \,{\mathrm{d}} s\\&\quad \le \Vert \gamma ^{(1)}-\gamma ^{(2)}\Vert _{L^\infty ({\varGamma }_0)} \Vert u_1\Vert _{L^2({\varGamma }_0)} \Vert u_2-u_1\Vert _{L^2({\varGamma }_0)}. \end{aligned}$$

Hence, we obtain

$$\begin{aligned} \Vert u_2\Vert _{L^2({\varGamma }_0)}- \Vert u_1\Vert _{L^2({\varGamma }_0)}&\le \Vert u_2-u_1\Vert _{L^2({\varGamma }_0)}\\&\le \frac{ \Vert \gamma ^{(1)}-\gamma ^{(2)}\Vert _{L^\infty ({\varGamma }_0)}}{\inf (\gamma ^{(2)}|_{{\varGamma }_0})} \Vert u_1\Vert _{L^2({\varGamma }_0)}, \end{aligned}$$

and the assertion follows. \(\square \)

The next lemma will allow us to construct boundary currents \(g_j\) for which \(u_{\kappa ^{(j,1)}}^{g_j}\) is large on \({\varGamma }_j\) (and thus, by Lemma 9, all \(u_{\kappa ^{(j,k)}}^{g_j}\) are large on \({\varGamma }_j\)), and at the same time all \(u_{\kappa ^{(j,k)}}^{g_j}\) are small on \({\varGamma }{\setminus } {\varGamma }_j\).

Lemma 10

Let \({\varGamma }_0\) be a measurable subset of \({\varGamma }\) with positive measure, \(K\in \mathbb {N}\), and \(\gamma ^{(1)},\gamma ^{(2)},\ldots ,\gamma ^{(K)}\in L^\infty _+({\varGamma })\).

Then, for all \(C>0\), there exists \(g\in L^2(\partial {\varOmega })\), so that the corresponding solutions \(u_1,u_2,\ldots ,u_K\in H^1({\varOmega })\) of (22)–(25) fulfill

$$\begin{aligned} \int _{{\varGamma }_0} |u_1|^2 \,{\mathrm{d}} s\ge C \quad \text { and } \quad \sum _{k=1}^K \int _{{\varGamma }{\setminus } {\varGamma }_0} |u_k|^2 \,{\mathrm{d}} s\le \frac{1}{C}. \end{aligned}$$

Proof

The existence of simultaneously localized potentials for the fractional Schrödinger equation has recently been shown in [51, Theorem 3.11], and we proceed similarly in this proof. Following the original localized potentials approach in [41], we start by reformulating the assertion as operator range (non-)inclusions, by introducing the operators

$$\begin{aligned}&A_0:\ L^2({\varGamma }_0)\rightarrow L^2(\partial {\varOmega }), \quad f\mapsto A_0f:=v_0|_{\partial {\varOmega }},\\&A_k:\ L^2({\varGamma }{\setminus } {\varGamma }_0)\rightarrow L^2(\partial {\varOmega }), \quad f\mapsto A_kf:=v_k|_{\partial {\varOmega }}, \end{aligned}$$

where \(k\in \{1,\ldots ,K\}\), \(v_0\in H^1({\varOmega })\) solves

$$\begin{aligned} \int _{\varOmega } \sigma \nabla v_0\cdot \nabla w \,{\mathrm{d}} x+ \int _{{\varGamma }} \gamma ^{(1)} v_0 w\,{\mathrm{d}} s = \int _{{\varGamma }_0} f w \,{\mathrm{d}} s \quad \text { for all } w\in H^1({\varOmega }), \end{aligned}$$
(32)

and \(v_k\in H^1({\varOmega })\) solves

$$\begin{aligned} \int _{\varOmega } \sigma \nabla v_k\cdot \nabla w \,{\mathrm{d}} x+ \int _{{\varGamma }} \gamma ^{(k)} v_k w\,{\mathrm{d}} s = \int _{{\varGamma }{\setminus } {\varGamma }_0} f w \,{\mathrm{d}} s \quad \text { for all } w\in H^1({\varOmega }). \end{aligned}$$
(33)

It is easily shown (see, e.g., the proof of [53, Theorem 3.1]) that the adjoints of these operators are given by

$$\begin{aligned} A_0^*:&\ L^2(\partial {\varOmega })\rightarrow L^2({\varGamma }_0),&\quad g&\mapsto u_1|_{{\varGamma }_0},\\ A_k^*:&\ L^2(\partial {\varOmega })\rightarrow L^2({\varGamma }{\setminus } {\varGamma }_0),&\quad g&\mapsto u_k|_{{\varGamma }{\setminus } {\varGamma }_0}, \end{aligned}$$

where \(u_1,\ldots ,u_K\in H^1({\varOmega })\) solve (22)–(25) with Neumann boundary data g and Robin transmission coefficients \(\gamma ^{(1)},\ldots ,\gamma ^{(K)}\), respectively.

By a simple normalization argument, the assertion is now equivalent to showing that

$$\begin{aligned} \not \exists C>0:\ \Vert A_0^* g\Vert ^2 \le C \sum _{k=1}^K \Vert A_k^* g\Vert ^2 =C \left\| \begin{pmatrix} A_1^*\\ \vdots \\ A_K^* \end{pmatrix} g \right\| ^2 \quad \forall g\in L^2(\partial {\varOmega }). \end{aligned}$$
(34)

Using a functional analytic relation between operator ranges and the norms of their adjoints (cf., [41, Lemma 2.5], [36, Cor. 3.5]), the property (34) (and thus the assertion) is proven if we can show that

$$\begin{aligned} {\mathcal {R}}(A_0)\not \subseteq {\mathcal {R}}\begin{pmatrix} A_1&\cdots&A_K \end{pmatrix}= {\mathcal {R}}(A_1)+\cdots +{\mathcal {R}}(A_K). \end{aligned}$$
(35)

We prove (35) by contradiction, and assume that

$$\begin{aligned} {\mathcal {R}}(A_0)\subseteq {\mathcal {R}}(A_1)+\cdots +{\mathcal {R}}(A_K). \end{aligned}$$

Then, for every \(f_0\in L^2({\varGamma }_0)\), there exist \(f_1,\ldots ,f_K\in L^2({\varGamma }{\setminus } {\varGamma }_0)\), so that

$$\begin{aligned} A_0 f_0= A_1 f_1+\cdots + A_K f_K. \end{aligned}$$

Let \(v_0,\ldots ,v_K\in H^1({\varOmega })\) be the associated solutions from the definition of \(A_0,\ldots ,A_K\) (with \(f=f_k\)), and set \(v:=v_1+\cdots +v_K-v_0\). Then \(v|_{\partial {\varOmega }}=0\), and \(\partial _\nu v|_{\partial {\varOmega }}=0\), so that by unique continuation \(v=0\) in \({\varOmega }{\setminus } {\overline{D}}\). But this also yields that \(v|_{{\varGamma }}=0\), and from this we obtain that \(v=0\) in D, so that \(v=0\) in all of \({\varOmega }\).

Hence, using (32) and (33), we obtain for all \(w\in H^1({\varOmega })\),

$$\begin{aligned} 0&=\int _{\varOmega } \sigma \nabla v\cdot \nabla w \,{\mathrm{d}} x+ \int _{{\varGamma }} \gamma ^{(1)} v w\,{\mathrm{d}} s\\&=\sum _{k=1}^K \left( \int _{\varOmega } \sigma \nabla v_k\cdot \nabla w \,{\mathrm{d}} x+ \int _{{\varGamma }} \gamma ^{(k)} v_k w\,{\mathrm{d}} s + \int _{{\varGamma }} (\gamma ^{(1)}-\gamma ^{(k)}) v_k w\,{\mathrm{d}} s\right) \\&\quad - \int _{\varOmega } \sigma \nabla v_0\cdot \nabla w \,{\mathrm{d}} x- \int _{{\varGamma }} \gamma ^{(1)} v_0 w\,{\mathrm{d}} s\\&=\sum _{k=1}^K \left( \int _{{\varGamma }{\setminus }{\varGamma }_0} f_k w\,{\mathrm{d}} s + \int _{{\varGamma }} (\gamma ^{(1)}-\gamma ^{(k)}) v_k w\,{\mathrm{d}} s\right) - \int _{{\varGamma }_0} f_0 w\,{\mathrm{d}} s, \end{aligned}$$

and this shows that

$$\begin{aligned} f_0=\sum _{k=2}^K (\gamma ^{(1)}-\gamma ^{(k)}) v_k|_{{\varGamma }_0}. \end{aligned}$$

However, since this holds for all \(f_0\in L^2({\varGamma }_0)\), this would imply that

$$\begin{aligned} L^2({\varGamma }_0)={\mathcal {R}}( {\mathcal {M}}_2 {\mathrm {tr}}_{{\varGamma }_0} )+\cdots +{\mathcal {R}}( {\mathcal {M}}_K {\mathrm {tr}}_{{\varGamma }_0} ) = {\mathcal {R}}\left( \begin{pmatrix} {\mathcal {M}}_2 {\mathrm {tr}}_{{\varGamma }_0}&\ldots&{\mathcal {M}}_K {\mathrm {tr}}_{{\varGamma }_0} \end{pmatrix}\right) \end{aligned}$$

where

$$\begin{aligned} {\mathrm {tr}}_{{\varGamma }_0}:\ H^1({\varOmega })\rightarrow L^2({\varGamma }_0), \quad \text { and } \quad {\mathcal {M}}_k:\ L^2({\varGamma }_0)\rightarrow L^2({\varGamma }_0) \end{aligned}$$

are the compact trace operator and the continuous operator of multiplication by \(\gamma ^{(1)}-\gamma ^{(k)}\), respectively. Hence, the closed infinite-dimensional space \(L^2({\varGamma }_0)\) would be the range of a compact operator, which is not possible, cf., e.g., [92, Thm. 4.18]. This contradiction shows that (35) must hold, and thus the assertion is proven. \(\square \)
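The range-versus-norm equivalence behind (34) and (35) has a transparent finite-dimensional analogue: since ranges of matrices are closed, \({\mathcal {R}}(A_0)\not \subseteq {\mathcal {R}}(A_1)\) yields a vector g orthogonal to \({\mathcal {R}}(A_1)\) with \(A_0^*g\ne 0\), so that no constant C as in (34) can exist. A toy numpy illustration (purely for intuition, not part of the proof):

```python
import numpy as np

# R(A0) = span{e1} is not contained in R(A1) = span{e2}, so there is
# no constant C with ||A0^T g|| <= C * ||A1^T g|| for all g.
A0 = np.array([[1.0], [0.0]])
A1 = np.array([[0.0], [1.0]])

g = np.array([1.0, 0.0])                 # g is orthogonal to R(A1)
print(np.linalg.norm(A0.T @ g))          # 1.0: the "localized" side stays large
print(np.linalg.norm(A1.T @ g))          # 0.0: the bounding side vanishes
```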

Proof of Theorem 5

  1. (a)

    follows from Lemma 8.

  2. (b)

    Let \(j\in \{1,\ldots ,n\}\). As in the proof of Theorem 4(b), it follows from the simultaneously localized potentials result in Lemma 10, and a density argument, that \(M_m^{(j)}\in \mathbb {R}^{m\times m}\) will have a positive eigenvalue if the dimension \(m\in \mathbb {N}\) is sufficiently large. Hence, for sufficiently large m, an eigenvector \(v^{(j)}\in \mathbb {R}^m\) corresponding to a largest eigenvalue of \(M_m^{(j)}\) will fulfill (with \(g_j:=\sum _{l=1}^m v^{(j)}_l f_l\in L^2(\partial {\varOmega })\))

    $$\begin{aligned} 0 < (v^{(j)})^T M_m^{(j)} v^{(j)}&=\frac{\alpha }{2} \int _{{\varGamma }_j} |u_{\kappa ^{(j,1)}}^{g_j}|^2 \,{\mathrm{d}} s - \beta \sum _{k=1}^K \int _{{\varGamma }{\setminus } {\varGamma }_j} |u_{\kappa ^{(j,k)}}^{g_j}|^2 \,{\mathrm{d}} s. \end{aligned}$$

    Using that for all \(k\in \{1,\ldots ,K\}\)

    $$\begin{aligned} \inf (\kappa ^{(j,1)}|_{{\varGamma }_j})&= b+ (b-a)\left( \frac{2+3\epsilon }{2cn}\right) \ge b,\\ \Vert \kappa ^{(j,k)} - \kappa ^{(j,1)}\Vert _{L^\infty ({\varGamma }_j)}&\le (b-a) (K-1)\frac{\epsilon }{2cn} \le (b-a) \left( 1+\epsilon + \frac{1+\epsilon }{cn}\right) , \end{aligned}$$

    we have that

    $$\begin{aligned} 1+ \frac{ \Vert \kappa ^{(j,k)} - \kappa ^{(j,1)}\Vert _{L^\infty ({\varGamma }_j)}}{\inf (\kappa ^{(j,1)}|_{{\varGamma }_j})} \le 1+ \frac{b-a}{b} \left( 1+\epsilon + \frac{1+\epsilon }{cn}\right) =\frac{1}{\alpha }, \end{aligned}$$

    and thus it follows from Lemma 9 that

    $$\begin{aligned}&\frac{1}{2} \int _{{\varGamma }_j} |u_{\kappa ^{(j,k)}}^{g_j}|^2 \,{\mathrm{d}} s - \beta \int _{{\varGamma }{\setminus } {\varGamma }_j} |u_{\kappa ^{(j,k)}}^{g_j}|^2 \,{\mathrm{d}} s\\&\quad \ge \frac{\alpha }{2} \int _{{\varGamma }_j} |u_{\kappa ^{(j,1)}}^{g_j}|^2 \,{\mathrm{d}} s - \beta \sum _{k'=1}^K \int _{{\varGamma }{\setminus } {\varGamma }_j} |u_{\kappa ^{(j,k')}}^{g_j}|^2 \,{\mathrm{d}} s > 0. \end{aligned}$$

    Hence, (31) is fulfilled.

\(\square \)

Remark 2

Regarding the formulation of Theorem 5(b), note that we actually prove that the matrix \(M_m^{(j)}\in \mathbb {R}^{m\times m}\) has a positive eigenvalue if the dimension m is sufficiently large, and that an eigenvector corresponding to a positive eigenvalue leads to a boundary current \(g_j\) that fulfills (31). But the estimates that we use in the proof of 5(b) are far from being sharp, so it seems worth checking (31) already for eigenvectors corresponding to a largest eigenvalue even while that eigenvalue is not yet positive.

3.3 Numerical results

We test our results on the simple example setting shown in Fig. 1. \({\varOmega }\subset \mathbb {R}^2\) is the two-dimensional unit disk, and D is a square with corner coordinates \((0,-0.1)\), (0, 0.3), \((-0.4,0.3)\) and \((-0.4,-0.1)\). The boundary \({\varGamma }=\partial D\) is decomposed into \(n=4\) parts \({\varGamma }_1,\ldots ,{\varGamma }_4\) denoting (in this order) the right, top, left, and bottom side of the square. We assume that the unknown true Robin transmission coefficient \({{\hat{\gamma }}}:\ {\varGamma }\rightarrow \mathbb {R}\) is a-priori known to be bounded by \(a:=1\) and \(b:=2\) and that it is known to be piecewise-constant with respect to the partition of \({\varGamma }\), i.e.,

$$\begin{aligned} {{\hat{\gamma }}}(x)={{\hat{\gamma }}}_1 \chi _{{\varGamma }_1} + {{\hat{\gamma }}}_2 \chi _{{\varGamma }_2} + {{\hat{\gamma }}}_3 \chi _{{\varGamma }_3} + {{\hat{\gamma }}}_4 \chi _{{\varGamma }_4}\quad \text { with } \quad {{\hat{\gamma }}}_1,\ldots ,{{\hat{\gamma }}}_4\in [a,b]=[1,2]. \end{aligned}$$

Recall that, for the ease of notation, we identify a piecewise-constant function \(\gamma \in L^2({\varGamma })\) with the vector \((\gamma _1,\ldots ,\gamma _4)\in \mathbb {R}^4\), and simply write a for the constant function \(\gamma (x)=a\), and for the vector \((a,a,a,a)\in \mathbb {R}^4\) (and use b analogously).

3.3.1 Choosing the measurements

We first apply Theorem 5 to construct \(n=4\) Neumann boundary functions \(g_1,\ldots , g_4\in L^2(\partial {\varOmega })\) so that the four measurements

$$\begin{aligned} F(\gamma ):=\left( \int _{\partial {\varOmega }} g_j {\varLambda }(\gamma ) g_j \,{\mathrm{d}} s\right) _{j=1}^4\in \mathbb {R}^4 \text { uniquely determine } \gamma \in [a,b]^4\subset \mathbb {R}^4, \end{aligned}$$

and the Newton method applied to F globally converges.

To implement Theorem 5, we choose \(\epsilon :=0.9\,\frac{a}{2(b-a)}\), which yields \(K=173\), and \((f_m)_{m\in \mathbb {N}}\subset L^2(\partial {\varOmega })\) as the standard trigonometric polynomial basis functions

$$\begin{aligned} (1,\cos (\varphi ),\sin (\varphi ),\cos (2\varphi ),\sin (2\varphi ),\ldots )\subseteq L^2(\partial {\varOmega }) \end{aligned}$$

with \(\varphi \) denoting the angle of a point on the unit circle \(\partial {\varOmega }\). For each \(j\in \{1,\ldots ,n\}\), we then calculate the matrix \(M_m^{(j)}\in \mathbb {R}^{m\times m}\) starting with \(m=1\), and increase the dimension \(m\in \mathbb {N}\), until an eigenvector \(v^{(j)}\in \mathbb {R}^m\) corresponding to a largest eigenvalue of this matrix has the property that

$$\begin{aligned} g_j:=\sum _{l=1}^m v^{(j)}_l f_l = v^{(j)}_1 + v^{(j)}_2 \cos (\varphi )+ v^{(j)}_3 \sin (\varphi ) + v^{(j)}_4 \cos (2\varphi )+\ldots \in L^2(\partial {\varOmega }) \end{aligned}$$

fulfills (31) for all \(k=1,\ldots ,K\). To calculate the entries of \(M_m^{(j)}\), and for checking (31), the required solutions \(u^{f_i}_{\kappa ^{(j,k)}}\) of the Robin transmission problem (22)–(25) with Neumann boundary functions \(f_i\), \(i=1,\ldots ,m\) and Robin transmission coefficients \(\kappa ^{(j,k)}\), as defined in (30), were obtained using the commercial finite element software Comsol.

For our setting we had to increase the dimension to at most \(m=15\), so that all \(g_j\) are trigonometric polynomials of order at most 7. Figure 2 shows the boundary functions \(g_j\) plotted with respect to the boundary angle on the unit circle \(\partial {\varOmega }\).
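Checking (31) for a candidate boundary current, and extracting the Lipschitz constant of Lemma 8(c) from the same quantities, can be sketched as follows; `trace_solution` is again the assumed PDE-solver interface from the sketch in Sect. 3.2.1, and `kappas` holds the coefficients \(\kappa ^{(j,k)}\), \(k=1,\ldots ,K\), from (30).

```python
import numpy as np

def check_condition_31(g, kappas, trace_solution, beta):
    """Return the values lambda_{j,k} from (31) for one boundary current g,
    or None as soon as (31) fails (early exit after few PDE solves)."""
    lams = []
    for kappa in kappas:                 # kappa^{(j,1)}, ..., kappa^{(j,K)}
        u_on, w_on = trace_solution(kappa, g, 'Gamma_j')
        u_off, w_off = trace_solution(kappa, g, 'complement')
        lam = 0.5 * np.sum(w_on * u_on**2) - beta * np.sum(w_off * u_off**2)
        if lam <= 0:
            return None
        lams.append(lam)
    return np.array(lams)

# Lipschitz constant of Lemma 8(c), once all g_j pass the check:
#   L = 1 / min over all j and k of lambda_{j,k}
```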

Fig. 2

Four boundary currents \(g_1,\ldots ,g_4\in L^2(\partial {\varOmega })\) for which the measurements \(F(\gamma ):=\left( \int _{\partial {\varOmega }} g_j {\varLambda }(\gamma ) g_j \,{\mathrm{d}} s\right) _{j=1}^4\) uniquely determine \(\gamma \in [a,b]^4\subset \mathbb {R}^4\) by a globally convergent Newton method

From checking (31), we also obtain the Lipschitz stability constant for \(F(\gamma ):=\left( \int _{\partial {\varOmega }} g_j {\varLambda }(\gamma ) g_j \,{\mathrm{d}} s\right) _{j=1}^4\) as described in Lemma 8(c). For our setting we obtain the stability estimate

$$\begin{aligned} {} \Vert \gamma -\gamma '\Vert _\infty \le 7.5 \frac{ \Vert F(\gamma )-F(\gamma ')\Vert _\infty }{ \Vert F(2)-F(1)\Vert _\infty } \quad \text { for all } \gamma ,\gamma '\in I\supseteq [1,2]^4, \end{aligned}$$
(36)

where I is the slightly enlarged interval from Lemma 8.

Note that here and in the following we consider the measurement error relative to \( \Vert F(2)-F(1)\Vert _\infty \) as this is the width of the measuring range \(F(1)\ge F(\gamma )\ge F(2)\).

The property (31) can be interpreted in the sense that the boundary current \(g_j\) generates an electric potential for which the corresponding solution \(|u_{\kappa }^{g_j}|^2\) is much larger on \({\varGamma }_j\) than on the remaining boundary \({\varGamma }{\setminus } {\varGamma }_j\), and that this simultaneously holds for several (but finitely many) Robin transmission coefficients \(\kappa =\kappa ^{(j,k)}\). To illustrate this localization property, Fig. 3 shows \(|u_{\kappa }^{g_j}|^2\) (in logarithmic scale) for \(k=1\) and \(k=K\).

Fig. 3

Plot of \(\log |u_{\kappa }^{g_j}|^2\) generated by the four boundary currents \(g_j\) in Fig. 2 for \(j=1,\ldots ,4\) (first to fourth column) and \(\kappa =\kappa ^{(j,k)}\) for \(k=1\) (first line) and \(k=K\) (last line) showing the localization on boundary part \({\varGamma }_j\)

Let us make a comment on improving the computation time. Note that the properties of F only depend on whether the used Neumann functions \(g_j\) have the desired property (31), and that our rigorous approach of constructing \(g_j\) is computationally more expensive than checking whether some given \(g_j\) fulfills (31). In fact, for fixed \(j\in \{1,\ldots ,n\}\), the construction of the matrix \(M_m^{(j)}\) requires solving the PDE (22)–(25) for all combinations of K different Robin transmission coefficients and m different Neumann boundary values. On the other hand, checking whether a given Neumann boundary function \(g_j\) has the desired property (31) only requires solving the PDE (22)–(25) for K different Robin transmission coefficients and the single Neumann boundary function \(g_j\). Moreover, if \(g_j\) does not fulfill (31), the check may require only very few PDE solutions, since it can be aborted as soon as (31) fails for some small k.

Hence, one might try computationally cheaper heuristic approaches to construct \(g_j\) that satisfy (31). In our experiments, we successfully used the ad-hoc approximation

$$\begin{aligned} M_m^{(j)}\approx \frac{\alpha }{2} S^{(j)} - \beta \frac{K}{2} (R^{(j,1)}+R^{(j,K)}) \in \mathbb {R}^{m\times m}, \end{aligned}$$

(which only requires 2m PDE solutions), and always found that increasing the dimension \(m\in \mathbb {N}\) led to Neumann boundary functions \(g_j\) fulfilling (31) for all \(k\in \{1,\ldots ,K\}\). Moreover, in our experiments, the functions \(g_j\) constructed with this faster heuristic approach were virtually identical to those constructed with the exact matrix \(M_m^{(j)}\) from Theorem 5.
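In code, the heuristic simply replaces the K-term sum from Theorem 5(b) by a two-point surrogate; a sketch, with `S`, `R_first`, `R_last` denoting the matrices \(S^{(j)}\), \(R^{(j,1)}\), \(R^{(j,K)}\) assembled as in Theorem 5(b):

```python
def heuristic_matrix(S, R_first, R_last, alpha, beta, K):
    """Cheap surrogate for M_m^{(j)}: the sum over all K matrices R^{(j,k)}
    is replaced by (K/2) * (R^{(j,1)} + R^{(j,K)}), so assembling it needs
    only 2m PDE solves instead of K*m."""
    return alpha / 2 * S - beta * K / 2 * (R_first + R_last)
```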

3.3.2 Global convergence of Newton’s method

We numerically study the theoretically predicted global convergence of the standard Newton method when applied to the measurements \(g_1,\ldots ,g_4\) constructed in the last subsection. We slightly change the definition of F and define the measurements relative to the known lower bound \(\gamma =a\):

$$\begin{aligned} F(\gamma ):=\left( \int _{\partial {\varOmega }} g_j \left( {\varLambda }(\gamma ) - {\varLambda }(a)\right) g_j \,{\mathrm{d}} s\right) _{j=1}^4, \end{aligned}$$

and numerically evaluate F using that, for all \(g\in L^2(\partial {\varOmega })\), and \(\gamma \in L^\infty _+({\varGamma })\),

$$\begin{aligned} \int _{\partial {\varOmega }} g \left( {\varLambda }(\gamma ) - {\varLambda }(a)\right) g \,{\mathrm{d}} s =\int _{\varGamma } (a-\gamma ) u_a^{(g)} u_\gamma ^{(g)}\,{\mathrm{d}} s, \end{aligned}$$

which immediately follows from the variational formulation of the Robin transmission problem (26). Note that this approach is numerically more stable than calculating \({\varLambda }(\gamma )\) and \({\varLambda }(a)\) separately as it avoids loss of significance effects.

We choose the true coefficient value as \({{\hat{\gamma }}}:=(1.3,1.8,1.5,1.9)\), and first test the reconstruction for noiseless data \(y^\delta :={\hat{y}}:=F({{\hat{\gamma }}})\). Starting with the lower bound \(\gamma ^{(0)}:=(1,1,1,1)\), we implement the standard Newton method

$$\begin{aligned} \gamma ^{(i+1)}:=\gamma ^{(i)}-F'(\gamma ^{(i)})^{-1} \left( F(\gamma ^{(i)}) - y^\delta \right) , \end{aligned}$$

where the (j,l)th entry of the Jacobian matrix \(F'(\gamma )\in \mathbb {R}^{4\times 4}\) is given by \(-\int _{{\varGamma }_l} |u^{(g_j)}_\gamma |^2 \,{\mathrm{d}} s\), cf. Lemma 6.
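The Newton loop itself then takes only a few lines. A sketch, assuming callables `F` and `F_prime` that evaluate the measurements via the identity above and the Jacobian with the entries just stated:

```python
import numpy as np

def newton(F, F_prime, y, a, n, tol=1e-12, max_iter=50):
    """Standard Newton iteration from Lemma 8(d), started at the lower
    bound gamma^(0) = a; F and F_prime are assumed callables built on
    top of the PDE solver."""
    gamma = np.full(n, a, dtype=float)
    for _ in range(max_iter):
        step = np.linalg.solve(F_prime(gamma), F(gamma) - y)
        gamma = gamma - step
        if np.linalg.norm(step, ord=np.inf) < tol:
            break
    return gamma
```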

We repeat this for noisy data with relative noise level \(\delta >0\), which we obtain by adding a vector with random entries to \({\hat{y}}\), so that \( \Vert y^\delta -{\hat{y}}\Vert _\infty =\delta \Vert F(b)\Vert _\infty \). Note that \(F(a)=0\), so that this chooses the noise level relative to the measurement range. For the noiseless case \(\delta =0\) we committed the so-called inverse crime of using the same forward solver (i.e., the same finite element mesh) for simulating the data \({\hat{y}}=F({{\hat{\gamma }}})\) and for evaluating F and \(F'\) in the Newton iteration. For the noisy cases \(\delta >0\), we used a different mesh for the forward and inverse solvers.

Fig. 4

Global convergence of Newton method when applied to the right measurements

Figure 4 shows the error of the first Newton iterations for the cases \(\delta =0\), \(\delta =10^{-5}\), \(\delta =10^{-3}\), and \(\delta =10^{-1}\), and demonstrates the theoretically predicted quadratic convergence properties. At this point, let us stress that also for noisy data \(y^\delta \), Lemma 8 yields that there exists a unique solution \(\gamma ^\delta \in I\supseteq [a,b]^n\) of

$$\begin{aligned} F(\gamma ^\delta )=y^\delta , \end{aligned}$$

and that the standard Newton method converges to this solution \(\gamma ^\delta \), as long as \(y^\delta \) lies within the bounds \(F(a)\ge y^\delta \ge F(b)\) (which is easily guaranteed by capping or flooring the values in \(y^\delta \)). Moreover, the obtained solution will satisfy the error estimate

$$\begin{aligned} \Vert \gamma ^\delta -{{\hat{\gamma }}}\Vert _\infty \le 7.34\,\delta \end{aligned}$$

due to the stability estimate (36) obtained in the last subsection.

We finish this subsection with an example where the true Robin transmission coefficient \({{\hat{\gamma }}}\in L^\infty _+({\varGamma })\) is not piecewise constant but within the a-priori known bounds

$$\begin{aligned} a\le {{\hat{\gamma }}}(x)\le b\quad \text { for } x\in {\varGamma } \text { a.e.} \end{aligned}$$

Then \({\hat{y}}=\left( \int _{\partial {\varOmega }} g_j \left( {\varLambda }({{\hat{\gamma }}}) - {\varLambda }(a)\right) g_j \,{\mathrm{d}} s\right) _{j=1}^4\in \mathbb {R}^4\) will still satisfy \(F(a)\ge {\hat{y}} \ge F(b)\), so that there exists a unique solution \(\gamma \in I\supseteq [a,b]^n\) of \(F(\gamma )={\hat{y}}\), i.e., there exists a unique piecewise constant Robin transmission coefficient that leads to the same measured data as the true non-piecewise-constant coefficient function. The Newton iteration applied to \({\hat{y}}\) (or a noisy version \(y^\delta \)) will globally converge to this piecewise constant solution \(\gamma \) (or an approximation \(\gamma ^\delta \)), see Fig. 5 for a numerical example.

Fig. 5

Reconstructions for non-piecewise-constant Robin transmission coefficients

3.3.3 Effect of interval width and number of unknowns

Our result in Theorem 5 holds for any a-priori known bounds \(b>a>0\) and any number of unknowns \(n\in \mathbb {N}\). Thus, in theory, we can treat arbitrarily large intervals \([a,b]\) and arbitrarily fine resolutions of \({\varGamma }\). However, numerically, the constructed trigonometric polynomials \(g_j\) quickly become more and more oscillatory, and the calculated Lipschitz constants quickly increase.

To demonstrate the effect of the interval width, we proceed as in Sect. 3.3.1 to calculate four boundary currents \(g_1,\ldots ,g_4\in L^2(\partial {\varOmega })\) that uniquely determine \(\gamma \in [a,b]^4\) and yield global Newton convergence for \(a=1\), and \(b\in \{2,3,4,5\}\). Table 1 shows the dimension of the trigonometric polynomial subspace of \(L^2(\partial {\varOmega })\) that contains \(g_1,\ldots ,g_4\) and the obtained Lipschitz constant for the inverse problem of determining \(\gamma \) from the corresponding measurements.

Table 1 Effect of interval width on trigonometric degree and Lipschitz constant

To demonstrate the effect of the number of unknowns, we then replace the square D by regular polygons with \(n=3\), \(n=4\), \(n=5\), and \(n=6\) sides keeping the polygon center and circumradius the same as in the square (\(n=4\)) case. We assume that \(\gamma \) is piecewise constant with respect to the polygon sides. As in Sect. 3.3.1 we then calculate n boundary currents \(g_1,\ldots ,g_n\in L^2(\partial {\varOmega })\) that uniquely determine \(\gamma \in [1,2]^n\) and yield global Newton convergence. The required dimension of the trigonometric polynomial subspace of \(L^2(\partial {\varOmega })\) and the obtained Lipschitz constant are shown in Table 2.

Table 2 Effect of number of unknowns n on trigonometric degree and Lipschitz constant

In both situations, the boundary currents quickly become highly oscillatory, and the calculated stability constant worsens. Hence, in its current state, our approach will only be feasible for moderate contrasts and relatively few unknowns as stated in the introduction. It should be noted, however, that our criterion (31) in Theorem 5 is sufficient but possibly not necessary for uniqueness, Lipschitz stability and global Newton convergence. The constructed boundary currents and the calculated Lipschitz constants may be far from optimal. Since our result is (to the knowledge of the author) the very first on uniqueness, global convergence and explicit Lipschitz stability constants for a discretized inverse coefficient problem, there may well be room for improvement, and significantly sharper estimates could in practice yield less oscillatory boundary currents and better stability constants.

4 Conclusions

We have derived a method to determine which (finitely many) measurements uniquely determine the unknown coefficient in an inverse coefficient problem with a given resolution, and proved global convergence of Newton’s method for the resulting discretized non-linear inverse problem. Our method also allows to explicitly calculate the Lipschitz stability constant, and yields an error estimate for noisy data. To the knowledge of the author, these are the first such results for discretized inverse coefficient problems.

Our method rests on an extension of classical global Newton convergence theory from convex inverse-monotonic to convex (forward-)monotonic functions that arise in elliptic inverse coefficient problems. The extension required an extra assumption on the directional derivatives of the considered function that we were able to fulfill by choosing the right measurements.

Our proofs mainly utilized monotonicity ideas and localized potential techniques that are also known for several other elliptic inverse coefficient problems, so the ideas in this work might carry over to those settings as well. A particularly interesting extension would be the case of EIT where it has recently been shown [48] that an unknown conductivity distribution with a given resolution is uniquely determined by voltage-current measurements on sufficiently many electrodes, but the number of required electrodes is not known. The main difficulty of such an extension is that localized potentials in EIT cannot concentrate on each domain part separately as in the simpler Robin transmission problem considered in this work. Roughly speaking, a localized potential in EIT with high energy in some region D will also have a high energy on its way from the boundary electrodes to D. This behavior will make the application of our herein presented ideas more challenging.