Abstract
The paper has two relatively distinct but connected goals; the first is to define the notion of Padé approximation of Weyl–Stiltjes transforms on an arbitrary compact Riemann surface of higher genus. The data consists of a contour in the Riemann surface and a measure on it, together with the additional datum of a local coordinate near a point and a divisor of degree g. The denominators of the resulting Padé-like approximation also satisfy an orthogonality relation and are sections of appropriate line bundles. A Riemann–Hilbert problem for a square matrix of rank two is shown to characterize these orthogonal sections, in a similar fashion to the ordinary orthogonal polynomial case. The second part extends this idea to explore its connection to integrable systems. The same data can be used to define a pairing between two sequences of line bundles. The locus in the deformation space where the pairing becomes degenerate for fixed degree coincides with the zeros of a “tau” function. We show how this tau function satisfies the Kadomtsev–Petviashvili hierarchy with respect to either deformation parameters, and a certain modification of the 2-Toda hierarchy when considering the whole sequence of tau functions. We also show how this construction is related to the Krichever construction of algebro-geometric solutions.
Similar content being viewed by others
1 Introduction
The theory of Hermite–Padé approximation is intimately connected with the theory of (mutliple) orthogonal polynomials. The prototypical of these connections is as follows: one considers a measure \(\mathrm d\mu \) with finite moments on the real axis and its Stiltjes transform
Then we find polynomials \(Q_{n-1}, P_{n}\) (of the degree suggested by the subscript) such that \(W(z) = \frac{Q_{n-1}(z)}{P_n(z)} + \mathcal O(z^{-2n-1})\) as \(|z|\rightarrow \infty \) (in the sense of asymptotic expansion). A simple computation shows that the denominators \(P_n\) are orthogonal polynomials in the sense that
While this connection is classical, a more recent result [12, 13] connects the construction of the orthogonal polynomials with a Riemann–Hilbert problem. This connection was instrumental in the theory of random matrices to provide the first rigorous proof of several universality results [7].
If the measure is made to depend on (formal) parameters \(\mathrm d\mu (x;\mathbf{t}) = \mathrm{e}^{\sum _{j\ge 0} t_j x^j} \mathrm d\mu (x;0) \) then the recurrence coefficients of the polynomials \(P_n(x; \mathbf{t})\) provide a solution to the Toda lattice equations (see for example the review in [8]). Furthermore, the Hankel determinant of the corresponding moments
provide tau functions for the Kadomtsev–Petviashvili hierarchy. This type of interplay between (multiple) Padé approximation (and related multiple orthogonality) and integrable systems has been exploited in numerous papers, to name a few [1, 5, 6, 14, 16, 17].
On a seemingly disconnected track, the theory of integrable systems, notably the theory of the Kadomtsev–Petviashvili (KP) hierarchy is famously intertwined with the theory of Riemann surfaces [15, 18] in the class of algebro-geometric solutions.
There seems to be little or no literature attempting to connect the worlds of Padé approximation and the algebro-geometric setup.
The present paper is a first foray in the sparsely populated landscape between these areas.
On the side of Padé approximation theory, we mention the recent work [10] where the authors consider a sequence of functions on elliptic curves with antiholomorphic involution which are orthogonal with respect to a measure on the fixed ovals. The setup is comparable to, but not the same as, the class of examples we consider here in Sect. 2.3.1. We could not find other literature which is relevant to our present approach.
Before describing the results we add a few words of caution: by the nature of this paper there are potentially two classes of mathematicians that could be interested. On one side the community of approximation theory and on the other side the community of integrable systems. Inevitably here we are obliged to use certain notions of the theory of Riemann surfaces that are rather common in the integrable-system community but less so in the approximation theory one. The author is leaning more towards the first and therefore the language used in the paper tends to reflect this bias. I have tried to clarify certain terminology wherever possible.
Description of results. We fix a Riemann surface \(\mathcal {C}\) of genus \(g\ge 1\), and a divisor of degree g (i.e. a collection of points counted with multiplicity so that the total number is g). On \(\mathcal {C}\) we fix a contour \(\gamma \) and a weight differential \(\mathrm d\mu (q)\) (details are in Sect. 2). We choose a distinguished point on \(\mathcal {C}\) which we denote by \(\infty \) (since it plays the role of the point at infinity in the complex plane). The last piece of data is a choice of local coordinate \(z:\mathcal U{\setminus } \{\infty \}\rightarrow \mathbb {C}\) in a neighbourhood \(\mathcal U\) of \(\infty \) such that \(\lim _{p\rightarrow \infty } z(p)=\infty \). The main results are listed below:
-
We start from the description of a suitable extension of the Padé approximation problem for Weyl–Stiltjes transforms in higher genus. The Weyl–Stiltjes function, W, analog to (1.1), is defined in terms the given data in Definition 2.2; the definition requires the use of a suitable Cauchy kernel which replaces the expression \({\frac{1}{z-x}}\). In fact the object we define is not a “function” but a holomorphic differential on \(\mathcal {C}{\setminus } \gamma \cup \{\infty \}\) with a jump discontinuity along \(\gamma \) equal to the chosen measure. Further motivation for this choice is descriped in Sect. 2.
-
We define the Padé approximation problem in Definition 2.3: instead of a ratio of polynomials the relevant generalization requires the ratio of a meromorphic differential \(\mathfrak Q_{n-1}\) and a meromorphic function \(P_n\) such that it approximates the Weyl–Stiltjes function at the point \(\infty \in \mathcal {C}\) to appropriate order. The denominators \(P_n\) are shown to be orthogonal (in the sense of non-Hermitean orthogonality) with respect to the measure \(\mathrm d\mu \) on \(\gamma \).
-
One of the most versatile tools for the study of asymptotic of orthogonal polynomials has proven to be the formulation in terms of a Riemann–Hilbert problem (RHP) [8, 12, 13]. For this reason we formulate the precise analog in this context in Sect. 2.2. The situation for higher genus curves is, expectedly, more complicated: the RHP is still a problem for a \(2\times 2\) matrix but the existence of the solution is not sufficient to guarantee its uniqueness (contrary to what happens in genus zero). The existence and uniqueness of the solution is, however, equivalent to the non-vanishing of a determinant \(D_n\) (2.31) which generalizes the Hankel determinant of the moments: this is Theorem 2.11.
-
The familiar determinantal expression and Heine formulas for orthogonal polynomials have a strict counterpart in (2.32) and Proposition 2.12, respectively.
-
In Section 3 we consider a generalization of the relation between biorthogonal polynomials and KP tau functions/random matrices [1]. With the choice of a local coordinate 1/z(p) near the point \(\infty \in \mathcal {C}\) we have the same data (curve, line bundle, local coordinate) which was used by Krichever to construct algebro-geometric solutions. We define two sequences of biorthogonal sections of certain line bundles and a pairing between them in terms of integration along the curve \(\gamma \) with the given measure. The tau function is defined in Definition 3.5: it depends on an integer n (the dimension of the spaces of sections being paired) and for \(n=0\) it factorizes into the product of two algebro-geometric KP tau functions. For general \(n>0\) the tau function vanishes only if the pairing is degenerate or the Krichever line bundle is special. This tau function depends on two infinite sets of “times”. We prove (Theorem 3.7) that it is a KP tau function (Definition 3.1) in both sets of times. The proof uses the general Hirota bilinear relations in integral formulation. We compute explicitly the Baker and dual-Baker functions in terms of the bi-orthogonal sections of the two line bundles (Propositions 3.8, 3.9). We finally identify the sequence as an instance of a (suitably modified) solution of the 2-Toda hierarchy, as presented in [1, 20].
2 Weyl–Stiltjes function of a measure
Let \(\mathcal {C}\) be a Riemann surface of genus g and \(\mathscr {D}\) a non-special divisor on \(\mathcal {C}\) of degree g. Let \(\infty \in \mathcal {C}{\setminus } \mathscr {D}\). We recall that this means that there are no nontrivial (i.e. non-constant) meromorphic functions with poles only at the points of \(\mathscr {D}\) and of degree not greater than the corresponding multiplicity of the point. Equivalently (by the Riemann–Roch theorem), there are no non-zero holomorphic differentials vanishing at the points of \(\mathscr {D}\) of the corresponding order. These data define uniquely a Cauchy kernel \( \mathbf {C}(p,q)\) [11, 21]. This is the unique function w.r.t. q and meromorphic differential w.r.t. p with the following divisor properties (the subscript refers to the variable for which the divisor properties are being assessed):
and normalized by the requirement \(\mathop {\mathrm {res}}\limits _{p=q} \mathbf {C}(p,q)=1 = -\mathop {\mathrm {res}}\limits _{p=\infty } \mathbf {C}(p,q)\).
Example 2.1
In genus 0 by choosing \(\infty \) as the point at infinity, the kernel takes the familiar form \( \mathbf {C}(w,z) = \frac{\mathrm dw}{w-z}\). In this case \(\mathscr {D}\) is the empty divisor. In genus 1, by representing the elliptic curve \(\mathcal {C}\) as the quotient \(\mathbb {C}/ \mathbb {Z}+ \tau \mathbb {Z}\), we can write it in terms of the Weierstraß \(\zeta \) function; if \(\mathscr {D}=(a)\) and we choose the point \(\infty \) as the origin, for example,
One can write an expression for \( \mathbf {C}\) in terms of Theta functions; this can be found in more general setting in Sect. 3. For hyperelliptic curves we give some really explicit expression in Sect. 2.3.
Let \(\gamma \) be a closed contour avoiding \(\mathscr {D}, \infty \) and \(\mathrm d\mu \) a smooth complex valued measure on it: with this we mean that in the neighbourhood of each point \(p\in \gamma \), with z a local coordinate in the neighbourhood, we can write \(\mathrm d\mu (p) = f(z,\overline{z}) \mathrm dz\) where \(f(z, \overline{z}):\gamma \rightarrow \mathbb {C}\) is a smooth function.
Definition 2.2
The Weyl(Stiltjes) function of \(\mathrm d\mu \) is the following differential on \(\mathcal {C}{\setminus } \gamma \):
We note that \((W)\ge \mathscr {D}-\infty \) on \(\mathcal {C}{\setminus } \gamma \) and that the residue at \(\infty \) is simply the total mass of \(\mathrm d\mu \) on \(\gamma \).
We want to construct a Padé-like approximation to W on \(\mathcal {C}\); in the standard setting \(\mathcal {C}= \mathbb {P}^1\) and \(\mathscr {D}\) is empty and the Cauchy kernel is \( \mathbf {C}(z,w) = \frac{\mathrm dw}{w-z}\). Omitting the \(\mathrm dw\), the Weyl function is really a function and not a differential: \(W(w) = \int \frac{\mathrm d\mu (z)}{w-z}\). In this case the typical Padé approximation problem is that of finding polynomials \(P_n(x)\) of degree \(\le n\) and \(Q_{n-1}\) of degree \(\le n-1\) such that
If we interpret the above equation as a statement about the vanishing at infinity of the meromorphic differential \(\frac{Q_{n-1}(z)}{P_n(z)}\mathrm dz\) we see that the order of vanishing is \(2n-1\): the reader should not be confused here by the apparent discrepancy with the usual Padé requirement that the order of vanishing is \(2n+1\) because here we are considering the left side of (2.4) as a differential on \(\mathbb {P}^1\) and \(\mathrm dz\) has a double pole at infinity.
We should then interpret the numerator as a meromorphic differential \(Q_{n-1}\mathrm dz\) with a single pole at \(\infty \) of order \(n+1\).
With this interpretation the Padé problem can be similarly stated on \(\mathcal {C}\). The main difference in the higher-genus case is that for a given measure \(\mathrm d\mu \) there is a g-parametric family of Weyl functions parametrized by the choice of divisor \(\mathscr {D}\).
Definition 2.3
(Padé approximation) Given \((\mathrm d\mu , \gamma ,\mathscr {D}, \infty )\) as above, the nth Padé approximation is the datum of \(P_n\in \mathscr {L}(\mathscr {D}+n\infty )\) and \(\mathfrak Q_{n-1}\in \mathcal K((n+1)\infty )\) such that
We recall that the symbol \(\mathscr {L}(\mathscr {D}+n\infty )\) denotes the vector space of meromorphic functions f such that \((f)\ge - \mathscr {D}-n\infty \) and, similarly, the symbol \(\mathcal K((n+1)\infty )\) denotes the vector space of meromorphic differentials \(\omega \) such that \((\omega )\ge - (n+1)\infty \). Under our non-specialty assumption the Riemann–Roch theorem implies that generically the dimension of \(\mathscr {L}(\mathscr {D}+n\infty )\) is \(\dim \mathscr {L}(\mathscr {D}+n\infty )= n+1\). Similarly \(\dim (\mathcal K((n+1)\infty )) = g+n\).
Let us draw some consequences from (2.5); multiplying by \(P_n\) the equation becomes
Recalling the Definition 2.3 of W we can rewrite the above as follows:
where the remainder \(\mathfrak R_n(p)\) is defined by
The \(\mathcal O\) notation above is used as follows: to say that \(f=\mathcal O(\mathcal V)\) for a divisor \(\mathcal V = \sum _{j} k_j p_j\), \(p_j\in \mathcal {C}\), means that near each of the points \(p_j\) the function (or differential) has a pole of order at most \(k_j\) if \(k_j>0\) and a zero of order at least \(-k_j\) if \(k_j<0\). We are following here the convention of algebraic geometry. Namely, in (2.7) the notation means that the differential vanishes at \(\mathscr {D}\), and at infinity of order at least \(n-1\). Note that the piecewise analytic differential \(\mathfrak R_n\) in (2.8) has at most a simple pole at \(\infty \) and vanishes at \(\mathscr {D}\).
Since we impose vanishing at \(\infty \) to order \(n-1\) on the right side of (2.7) we deduce that
which—we observe—is a meromorphic differential on \(\mathcal {C}\) (there is no jump on \(\gamma \)) with only one pole of order \(n+1\) at \(\infty \) (n come from \(P_n\) and \(+1\) from the Cauchy kernel). In principle one may want to add a differential \(\omega _0\) to \(\mathfrak Q_{n-1}\) to have a more general solution. However this \(\omega _0\) should have at most one simple pole at \(\infty \) (and hence no pole at all given that the sum of all residues must vanish) and moreover vanish at \(\mathscr {D}\) since \(\mathfrak R_n\) already vanishes there. By the property of non-special divisors recalled at the beginning of this section we conclude that \(\omega _0\equiv 0\).
Now, the Padé approximation requires that the remainder term \(\mathfrak R_n\) also vanishes at \(\infty \) of order \(n-1\).
Since it already vanishes at \(\mathscr {D}\) by the definition of the Cauchy kernel, the extra requirements give n linear constraints on \(P_n\) and hence generically we can expect a unique solution. We investigate these conditions in the following sections.
2.1 Pseudo moments and constructions of the Padé approximants
Let z(p) be a local coordinate in the neighbourhood of \(\infty \) such that \(1/z(\infty )=0\) (i.e. mapping a punctured neighbourhood of \(\infty \) to the outside of the unit disk).
Proposition 2.4
The following functions provide a basis of \(\mathscr {L}(\mathscr {D}+n\infty )\)
with the property
Proof
Given the divisor properties of \( \mathbf {C}\) in (2.1) it is evident that \(\zeta _j\) has poles at \(\mathscr {D}\) of the appropriate orders. For the behaviour near \(\infty \) we work in the local coordinate z(p); let \(z = z(p)\) and \(w=z( q)\). Then the residue formula (2.10) becomes
where orientation of the integration is counterclockwise and the Cauchy kernel can be written \(C(w,z) = \frac{1}{w-z} + H(w,z)\) with H(w, z) jointly analytic in z, w in the neighbourhood of \(z=\infty =w\) and \(H(w,z) = \mathcal O(1/z) \mathcal O(1/w^2)\). Then a simple application of Cauchy’s residue theorem yields
This immediately shows that \(\zeta _j\) are linearly independent and span the required space of meromorphic functions. \(\square \)
Note that \(\zeta _0 \equiv 1\) since \(\mathscr {D} \) is non-special. Consider the coefficients \(\mu _{n,j}\) defined by the following expansion:
We call them pseudo-moments because in the case \(\mathcal {C}=\mathbb {P}^1\) they correspond to the usual moments of the measure. Note however that they do not form a Hankel matrix in general.
Theorem 2.5
(1) The pseudo-moments \(\mu _{j,k}\) in (2.14) are symmetric \(\mu _{j,k}=\mu _{k,j}\) and can be written as
(2) More generally, for any two holomorphic sections \(\phi , \psi \) of \(\mathscr {L}(\mathscr {D})\bigg |_{\mathcal {C}{\setminus } \{\infty \}}\) the following pairing
is symmetric and equals
Remark 2.6
In the genus zero case we have trivially \(\zeta _j =z^j\) and the matrix of coefficients is a Hankel matrix.
Remark 2.7
In the second statement of Theorem 2.5 the wording simply means that \(\phi ,\psi \) are meromorphic functions on the punctured surface (i.e. at most with an isolated singularity at \(\infty \)) and such that their divisor of poles is bounded by \(-\mathscr {D}\). We are mostly interested in the case when the singularity at \(\infty \) is a pole of finite order, but the statement itself allows for functions with essential singularities.
Proof
(1) Let
This is a differential with a discontinuity across \(\gamma \) and at most a simple pole at \(p=\infty \). The coefficient \(\mu _{n,j}\) can then be written as
where the second equality follows from the fact that \(\zeta _j(p)- z(p)^j\) vanishes at \(p=\infty \). Rewriting this latter equality in terms of the definition of \(\phi _n\) we have
where the last equality follows from Fubini’s theorem because \(\infty \not \in \gamma \). Now we observe that the differential with respect to p given by \(\zeta _j(p) \mathbf {C}(p,q)\) has only poles at \(p=\infty \) and \(p=q\) (no poles at \(p\in \mathscr {D}\) because of (2.1)) with opposite residues. Since \(\mathop {\mathrm {res}}\limits _{p=q} \mathbf {C}(p,q)=1\) the Cauchy theorem shows that
Substituting (2.21) into (2.20) yields the proof.
(2) The equality of (2.16) and (2.17) is proved exactly as above and then the symmetry is evident in (2.17). \(\square \)
The solution of the Padé approximation problem (2.5) is then predicated on the existence of \(P_n\in \mathscr {L}(\mathscr {D}+n\infty )\) such that
If we write \(P_n = \sum _{\ell =0}^n \pi _{n\ell } \zeta _\ell (p)\) the condition becomes that \(\sum _{\ell =0}^n \pi _{n,\ell } \mu _{\ell ,j}=0\) for \(j=0,1,\dots , n-1\). In view of the Theorem 2.5 this can be written as
which is the proxy of the usual orthogonality property for the ordinary Padé approximants.
The study of the compatibility of the above system in the zero genus case is part of the theory of the Padé table, [4], which is critically reliant upon the fact that the matrix of moments is a Hankel matrix.
2.2 Riemann–Hilbert problem
Like in the standard Fokas–Its–Kitaev [12, 13] formulation of orthogonal polynomials, we can setup a Riemann–Hilbert problem on the Riemann surface \(\mathcal {C}\) which characterizes these Padé denominators.
Riemann–Hilbert Problem 2.8
Let \(Y_n\) be a \(2\times 2\) matrix with functions in the first column and differentials in the second column, meromorphic in \(\mathcal {C}{\setminus } \gamma \) and admitting boundary values on \(\gamma \) that satisfy the jump relation
In addition we require that the matrix is such that it has poles at \(\mathscr {D}\) in the first column and zeros in the second column, and also the following growth condition at \(\infty \):
Exactly like in the genus zero case, the relevance of the RHP 2.8 is that if a solution exists, then the (1, 1) entry provides the orthogonal section \(P_n(p)\). See Theorem 2.11 below.
Example 2.9
(The case \(n=0\).) If \(n=0\) we see that \((Y_0)_{11}\) must be the constant 1 and \((Y_0)_{21}\) must vanish because it would be a meromorphic function with poles at \(\mathscr {D}\) and a simple zero at \(\infty \) (which is then identically zero thanks to the assumption of non-specialty of \(\mathscr {D}\)). Then the solution is given by
Note that the (2, 2) entry is the unique meromorphic differential with a single double pole at \(\infty \) (normalized according to the choice of coordinate z) and zeros at \(\mathscr {D}\).
Uniqueness of the solution: algebro-geometric approach. The determinant of \(Y_n\) does not have a jump across \(\gamma \) because the jump matrix in (2.24) is of unit determinant. It is therefore a meromorphic differential: from the growth conditions (2.25) it follows that it can only have a double pole at \(\infty \):
Since \(\Delta _n(p)\) is a differential with a double pole, it must have 2g zeros (counting multiplicity). Therefore the usual argument about the uniqueness of the solution to the problem (2.24), (2.25) fails from the start because the matrix \(Y_n^{-1}(p)\) has 2g poles. Indeed the usual reasoning would be to assume that \(\widetilde{Y}_n\) is another solution to the same problem and then consider the ratio
This matrix of functions does not have a jump across \(\gamma \) and it is therefore a priori a matrix of meromorphic functions. If we could conclude immediately that they are—in fact—holomorphic, the Liouville theorem would imply that they are constants and \(R_n\) is then the identity matrix because of the normalization condition (2.26).
However, so far, we can only conclude that \(R_n\) has poles at the 2g zeros of \(\Delta _n\). We denote by \(\mathscr {T}\) the divisor of zeros of \(\Delta _n\) and call it the Tyurin divisor.
Consider a row \(\sigma (p)\) of \(R_n(p)\); it is a meromorphic function such that \(\sigma (p) Y_n(p)\) is holomorphic at all points of \(\mathscr {T}\); this allows us to interpret \(\sigma (p)\) as a global holomorphic section of a vector bundle, \(\mathscr {E}\) of rank 2 and degree 2g described hereafter.
For each \(p_\alpha \in \mathscr {T}\) let \(\mathbb D_\alpha \) be a small disk covering the point \(p_\alpha \) in such a way that these disks are pairwise disjoint; let \(\mathbb D_0\) be \(\mathcal {C}{\setminus } \mathscr {T}\). Then we define the vector bundle by the transition functions
Then we see that the row \(\sigma \) of \(R_n(p)\) is a holomorphic section restricted to the trivializing set \(\mathbb D_0\) of the above bundle.
The Riemann–Roch theorem implies that generically such a bundle has only 2 holomorphic sections; they are the sections such that their restriction to \(\mathbb D_0\) are the constant vectors \(\mathbf{e}_1^t, \mathbf{e}_2^t\). This shows that generically the solution of the Riemann–Hilbert problem is unique.
This reasoning is probably a bit mysterious for the reader accustomed to usual Padé approximants: in the next section we clarify the uniqueness in a completely elementary way which is much closer to usual methods of Padé theory. This is accomplished in Theorem 2.11.
2.2.1 Genericity
Define the determinant
The second equality is an application of the Andréief identity. We observe, and leave the verification to the reader, that a change of coordinate around \(\infty \) from z to \(\widetilde{z}\) modifies these determinants only by a non-zero constant (the nth power of the differential of the change of coordinate from 1/z to \(1/\widetilde{z}\) evaluated at \(\infty \)).
Remark 2.10
In the genus zero case the determinants (2.31) are Hankel determinants of the moments of the measure \(\mathrm d\mu \).
The following theorem is the higher genus counterpart of the characterization theorem for orthogonal polynomials in terms of a Riemann–Hilbert problem [13]. Note, however, that there is a difference between the genus zero and higher genus cases: in genus zero the uniqueness and existence of the solution go hand-in-hand, namely if the solution exists, then it is unique. In higher genus the solution may exists but not unique, although generically it is unique.
The next theorem shows that the existence\(+\)uniqueness is completely predicated upon the non-vanishing of a principal minor of the matrix of moments, much in the same way as in the genus zero case. However, it may happen that the determinant vanishes and yet we have a solution (not unique). This occurrence is precisely the non-vanishing of \(h^1(\mathscr {E})\) discussed above.
Theorem 2.11
If the determinant \(D_n\) in (2.31) does not vanish then the solution to the RHP 2.8 exists and is unique. Viceversa if the solution exists and it is unique, then \(D_n \ne 0\). Moreover the (1, 1) entry is the “monic” orthogonal section \(P_n\in \mathscr {L}(n\infty + \mathscr {D})\) of the form \(P_n (p) = \zeta _n(p)+ \sum _{j=0}^{n-1} c_j \zeta _j(p)\).
Proof
Suppose \(D_n\ne 0\). Define, in a similar vein to the usual case of orthogonal polynomials,
This is a section of \(\mathscr {L}(\mathscr {D}+n\infty )\) of the form \(P_n(p)= \zeta _n + \mathbb {C}\{\zeta _0,\dots , \zeta _{n-1}\}\) and hence behaves as \(z(p)^n\) as \(p\rightarrow \infty \). Similarly we define
Finally we set
Consider then the matrix
A simple application of the Sokhostki-Plemelj formula shows that it satisfies (2.24). Near the divisor \(\mathscr {D}\) it has the required growth in (2.25) because of the properties (2.1) of the Cauchy kernel. It remains to verify the growth near \(\infty \) and the normalization condition (2.26).
The first column is clearly of the form \([z^n + \mathcal O(z^{n-1}), \mathcal O(z^{n-1})]^t\) and hence we need to focus only on the behaviour of the second column near \(\infty \).
Consider the expansion of \(\mathfrak R_n\) near \(\infty \):
According to Theorem 2.5 we have
This expression clearly vanishes for \(\ell \le n-1\) and hence indeed \(\mathfrak R_n(p) = \mathcal O(z^{-n}) \frac{\mathrm dz}{z}\) near \(\infty \). In particular the leading coefficient of the expansion is
The same computation for \(\widetilde{\mathfrak R_n}\) gives that
which satisfies the growth condition (2.25) and the normalization (2.26) as well.
Having shown the existence, we now need to address the uniqueness of the solution. Let \(\widetilde{Y}\) (we omit the subscript \(_n\) for brevity) be a solution of RHP 2.8: the jump condition (2.24) implies that the first column of the solution must be made of sections of \(\mathscr {L}(\mathscr {D}+n\infty )\) and \(\mathscr {L}(\mathscr {D}+(n-1)\infty )\). The same jump condition implies that the second column is obtained from the first by the integral against the Cauchy kernel: this is so because the divisor \(\mathscr {D}\) is non-special and there is no nontrivial holomorphic differential that vanishes at \(\mathscr {D}\).
Next, the order of vanishing at \(\infty \) of \(\widetilde{Y}_{12}\) must be \(n-1\) (i.e. it must be of the form \(\mathcal O(\frac{1}{z^{n+1}})\mathrm dz\)); the same computation used above implies then that
for \(P_n\) as in (2.32) and some coefficients \(\alpha _\ell \). These coefficients must satisfy the linear system
which has only the trivial solution because of the assumption \(D_n\ne 0\). Next, the component \(\widetilde{Y}_{21}\in \mathscr {L}(\mathscr {D}+(n-1)\infty )\) is subject to similar constraints: writing it as a linear combination \(\sum _{\ell =0}^{n-1} \beta _\ell \zeta _\ell \) we see that the asymptotic constraint that \(\int _\gamma \mathbf {C}(p,q) \widetilde{Y}_{21}(q)\mathrm d\mu (q) = \frac{1}{z^n} (1+\mathcal O(z^{-1})\mathrm dz\) translates in the linear system:
which, again, has a unique solution thanks to the assumption \(D_n\ne 0\).
We now show the converse statement. Suppose a solution Y exists and is unique. Denote by \(\mathbf{A}, \mathbf{B}\) the two columns of Y. The jump condition (2.24) together with the growth condition (2.25) at the divisor \(\mathscr {D}\) implies that
-
(i)
\(\mathbf{A}\) is meromorphic on \(\mathcal C\) and the first entry, \(A_1(p)\), is a section in \(\mathscr {L}(\mathscr {D} +n\infty )\);
-
(ii)
\(\mathbf{B}(p) = \int _\gamma \mathbf {C}(p,q) \mathbf{A}(q) \mathrm d\mu (q)\).
From (2.26) it follows that the entry \(A_1(q)\) has a pole of order exactly n at \(\infty \) and hence it can be written as a linear combination
The vanishing of order \(n-1\) at \(\infty \) of the first entry \(B_1(p) = \int _\gamma \mathbf {C}(p,q) A_1(q) \mathrm d\mu (q)\) is equivalent to the condition that the coefficients \(c_0,\dots , c_{n}\) satisfy the system
where \(\star \) denotes a possibly nonzero coefficient.
So far we have used only the existence of the solution; now we use the uniqueness assumption to show that \(D_n\ne 0\). Suppose, by contrapositive, that \(D_n=0\) and let \((\alpha _0,\dots , \alpha _{n-1})\) be a nontrivial solution of (2.41). Then the vector \(\vec {\widetilde{c}}= (c_0+\alpha _0, \dots , c_{n-1}+\alpha _{n-1}, 1)\) is another solution of the same equation (2.44), which then violates the uniqueness. \(\square \)
The Theorem 2.5 allows us to interpret the vanishing condition of \(\mathfrak R_n\) (2.38) precisely as an “orthogonality”
This latter equation, in turn implies
If all the sequence of determinants \(\{D_n\}_{n\ge 0}\) does not vanish, the above condition is then the usual (non-hermitean) orthogonality.
Existence without uniqueness. Suppose that \(D_n=0\); the solution to the RHP 2.8 may still exist. For this to happen we must find a linear combination \(P_n(p) = \zeta _n(p)- \sum _{\ell =0}^{n-1} \alpha _\ell \zeta _\ell (p)\) which is orthogonal to \(\zeta _j, \ j=0,\dots , n-1\). Moreover there must be also a \(\widetilde{P}_n\in \mathscr {L}(\mathscr {D}+(n-1)\infty )\) such that its Cauchy transform is appropriately normalized. This may happen if both the following systems are simultaneously compatible:
where \(\widetilde{P}_n(p) = \sum _{\ell =0}^{n-1} \beta _\ell \zeta _\ell (p)\). The expression
belongs to \(\mathscr {L}(\mathscr {D}+(n-1)\infty )\) (the coefficient in front of \(\zeta _n\) vanishes in the Laplace expansion) and has also the property that its Cauchy transform vanishes at \(\infty \) like \(\mathrm dz/z^{n+1}\) since it is orthogonal to all \(\zeta _0, \dots , \zeta _{n-1}\). Thus the row-vector
is a row-vector solution that can be added to either rows of \(Y_n\) and the uniqueness of the solution is lost.
Note that in genus zero \(\mu _{a,b} = f_{a+b}\) is a Hankel matrix and the vanishing of \(D_n\) makes the two systems (2.47) incompatible. A simple way to convince ourselves of this fact is to count the number of constraints versus the number of equations. Indeed, if \(D_n=0\) (viewed as a polynomial relation in the coefficients), the compatibility of the two systems imposes additional 2n equations for the 2n indeterminates \(f_0,\dots , f_{2n-1}\); these additional equations are given by the vanishing of all the determinants obtained by replacing the n columns of the Hankel matrix by either of the two vectors on the right sides of (2.47). Thus in the end one has \(2n+1\) polynomial equations in 2n variables. This argument, while simple and perhaps convincing, is not entirely satisfactory because the \(2n+1\) polynomial relations should be proved algebraically independent.
A complete but indirect proof is obtained by invoking the fact that, in genus zero, the existence of the solution to the Riemann-Hilbert problem implies automatically its uniqueness. Indeed the system (2.47) provides the solution by setting \(Y_{11}(z) = z^n + \sum _{j=0}^{n-1} \alpha _j z^j,\ \ Y_{21}(z) = \sum _{j=0}^{n-1} \beta _j z^j\) and defining the second column as \(Y_{\bullet 2}(z) = \int _\gamma \frac{Y_{\bullet 1}(w)\mathrm d\mu (w)\mathrm dw}{z-w}\). If we assume by contrapositive that the two systems (2.47) are compatible under the assumption \(D_n=0\), we obtain a contradiction with the uniqueness of the solution of the Riemann–Hilbert problem since neither \(\vec {\alpha }\) nor \(\vec {\beta }\) are uniquely defined.
The case \(\varvec{\infty \in \gamma }\) or \(\varvec{\mathscr {D}\cap \gamma \ne \emptyset }\). We have assumed, for simplicity, that \(\infty \) does not belong to the support \(\gamma \) of the measure \(\mathrm d\mu \). We can lift this restriction easily without modifying any of the substance. In this case we must assume that the functions \(z^\ell \) are locally integrable at \(\infty \) with respect to the measure \(\mathrm d\mu \), for all \(\ell \in \mathbb N\). Some modification in the statements about the growth then needs to be made but it is of the same nature as the case of ordinary orthogonal polynomials. Similarly, if a point \(p_0\) of the divisor \(\mathscr {D}\) (of multiplicity \(k_0\)) belongs to \(\gamma \) we need to assume that the function \(1/\kappa ^{2k_0}\) (with \(\kappa \) a local coordinate at \(p_0\)) is locally integrable in the measure \(\mathrm d\mu \) at \(p_0\). Some technical considerations will have to be modified accordingly but the essential picture remains the same.
Heine formula. In the genus zero case the orthogonal polynomials can be expressed in terms of a multiple integral that goes under the name of Heine formula [19]. The following simple proposition expresses the orthogonal sections in a similar fashion.
Proposition 2.12
(Heine formula) The following section \(\Psi _n(p)\in \mathscr {L}_n:= \mathscr {L} (\mathscr {D}+n\infty )\) is orthogonal to \( \mathscr {L}_{n-1}\):
If \(D_n\ne 0\) in (2.31) then the “monic” orthogonal section \(P_n = \zeta _n + \dots \in \mathscr {L}_n {\setminus }\mathscr {L}_{n-1}\) defined in (2.32) is given in terms of \(\Psi _n\) by \(P_n(p) = \frac{1}{n! D_n} \Psi _n(p)\).
Proof
Let \(j\le n-1\) and consider \(\int _\gamma \Psi _n(p) \zeta _j(p) \mathrm d\mu (p)\). Using the Laplace expansion we have
Using the Andreief identity we obtain
The latter expression is the Laplace expansion of the determinant
At this point it is clear that the expression is orthogonal to \(\zeta _j, \ j=0\dots , n-1\), spanning \(\mathscr {L}(\mathscr {D}+(n-1)\infty )\). If \(D_n\ne 0\) then \(\Psi _n\) has actually a pole of order n and can be “normalized” to be monic. \(\square \)
2.3 Example: the hyperelliptic case
Let \(\mathcal C\) be a hyperelliptic curve of the form
where the numbers \(t_j\) are pairwise distinct. This is a Riemann surface of genus g (compactified by adding two points above \(z=\infty \)). The reader may visualize it as a two-sheeted cover of the z-plane, branched at the points \(t_j\)’s. A simple way of doing so is to glue two copies of the z-plane dissected along pairwise disjoint segments joining the branchpoints in pairs (for example \([t_1,t_2]\), \([t_3,t_4]\) etc.)
A point \(p\in \mathcal C\) is a pair of values \(p=(z,y)\) satisfying the equation (2.54). It is well known [9] that a degree g non-special divisor is any divisor \(\mathscr {D} = \sum _{\ell =1}^g p_\ell \) (points may be repeated) as long as \(p_\ell = (z_\ell , y_\ell )\) are such that \(y_\ell \ne -y_k, \ \ \ell \ne k\). Note that the points \(p_\ell \) may be equal to one of the branch-points \(t_\ell \)’s but then it must be of multiplicity one. For added simplicity in this example we assume that \(z_j\ne z_k,\ j\ne k\).
We choose \(\infty \) to be the point \(z=\infty \) and on the sheet where \(y(z)/z^{g+1}\rightarrow 1\): we denote this point as \(\infty _+\), whereas \(\infty _-\) is the point \(z=\infty \) where \(y(z)/z^{g+1}\rightarrow -1\).
A simple exercise shows that the Cauchy kernel subordinated to the choice of divisor \(\mathscr {D}\) and with pole at \(\infty _+\) is given by (here \(p=(w,y(w))\) and \(q=(z,y(z))\))
where \(L_\ell (w)\) are the elementary Lagrange interpolation polynomials:
To verify that this is the correct Cauchy kernel, one has to verify the divisor properties (2.1): the least obvious might be the vanishing, as a function of z, when \(q=(z,y(z))\) tends to \(\infty _+\).
This can be seen as follows: the part that does not obviously vanish comes from the term
Expanding the bracket in (2.57) in geometric series w.r.t z we have
The last equality is due to the fact that, for \(k\le g-1\) the polynomial of w in the bracket has degree \(\le g-1\) and vanishes at the g points \(z_1,\dots , z_g\); for \(k=g\) it is a polynomial of degree g with leading coefficient \(-1\) and vanishing at all the g points, so that necessarily equals to \(-L(w)= -\prod _{\ell =1}^g(w-z_\ell )\). Multiplying (2.58) by y(z) we see that the expression (2.57) vanishes at \(\infty _+\).
A basis of functions \(\zeta _j\) such that \(\mathscr {L}(\mathscr {D}+n\infty ) = \mathrm{Span}\{\zeta _0,\dots , \zeta _n\}\) can be taken to be
where \(P_j(z)\) is the polynomial given by
with the subscript indicating the polynomial part and the determination of y(z) being the one such that \(y(z)/z^{g+1}\rightarrow 1\).
2.3.1 Curves with antiholomorphic involution
These are curves with an antiholomorphic diffeomorphism \(\nu :\mathcal {C}\rightarrow \mathcal {C}\). Without entering into details, in the case of hyperelliptic curves above, these are curves such that the set of branch-points \(\{t_1,\dots , t_{2g+2}\}\) is invariant under complex conjugation, and in this case the map \(\nu \) is the map \(\nu (z,y) = (\overline{z}, \overline{y})\) (or \(\nu (z,y)= (\overline{z},-\overline{y})\)). In general, for a plane algebraic curve defined as the polynomial equation \(E(z,y)=0\) this means that all the coefficients of E are real.
If we choose also the divisor \(\mathscr {D}\) invariant under \(\nu \) and \(\infty \) as a fixed point (in our case both points at \(z=\infty \) are fixed by the map \(\nu \)), then the Cauchy kernel is also a real-valued kernel in the sense that \( \mathbf {C}(\nu (p),\nu (q)) = \overline{ \mathbf {C}(p,q)}\). The basis of sections \(\zeta _j\) of \(\mathscr {L}\) is then real-valued as well so that \(\zeta _j(\nu (p))=\overline{\zeta _j(p)}\).
We can then choose \(\gamma \) to be a closed contour fixed by \(\nu \), \(\nu (p)=p\, \ \ \forall p\in \gamma \) and \(\mathrm d\mu \) a positive real-valued measure on \(\gamma \). In this case the determinants \(D_n\) (2.31) will be strictly positive and hence the Theorem 2.11 shows that the solution of the RHP 2.8 exists and is unique for all \(n\in \mathbb N\). Therefore we have an infinite basis of orthogonal functions exactly as in the usual case of orthogonal polynomials for an \(L^2(\mathbb {R}, \mathrm d\mu )\).
Genus 1. An example where we can write in great details the objects described above is the case of an elliptic curve \(E_\tau \) realized as quotient of the plane \(\mathbb {C}\) by the lattice \(2\omega _1\mathbb {Z}+ 2\omega _2 \mathbb {Z}\), with \(\tau = \omega _2/\omega _1 \in i\mathbb {R}_+\). Without loss of generality, we can choose \(\omega _1\in \mathbb {R}\). In Weierstraß form the elliptic curve is
with \(e_1+e_2+e_3=0\) and \(e_1<e_2<e_3\) (all real) or \(e_1=\overline{e}_2\), \(e_3\in \mathbb {R}\). For definiteness we consider the case where \(e_1,e_2,e_3\in \mathbb {R}\): then \(\omega _1 = \int _{e_1}^{e_2} \frac{\mathrm dX}{Y}\) (with \(Y= \sqrt{4X^3 - g_2 X-g_3}\) chosen so that it is positive in \([e_1,e_2]\)) and the Weierstraß functions provide the uniformization of (2.61). Setting
then the Weierstraß \(\wp \) function is \(\wp =-\zeta '\):
The classical result of uniformization is then obtained by setting \(X = \wp \) and \(Y = \wp '\).
The resulting elliptic curve \(E_\tau \) admits the obvious antiholomorphic involution \(z\rightarrow \frac{\omega _1}{\overline{\omega _1}} \overline{z}= \overline{z}\). We choose \(\infty =\{0 \}\) and \(\mathscr {D}=\{ a\}\), with \(a\in (0,2\omega _1)\). A basis of sections of \(\mathscr {L}(\mathscr {D}+n\infty )\) is provided in terms of the Weierstraß \(\zeta \) and \(\wp \) functions
Note that all these functions are real-analytic: \(\overline{f(z)} = f(\overline{z})\). The Cauchy kernel is given by
As for contour of integration \(\gamma \) we choose the a-cycle, which is represented as either \([\omega _1, \omega _1+2\omega _2]\) in the z-plane or the segment \([e_2,e_3]\) in the X-plane, on both sheets of the curve. Note that \(\overline{\gamma }= \gamma \,\mathrm {mod}\,\mathbb {Z}+ \tau \mathbb {Z}\) (pointwise)
The simplest case of Weyl differential is for the flat measure \(\mathrm d\mu (x) = \mathrm dx\) on \([\omega _2,\omega _2 + 2\omega _1]\), thought of as the a-cycle on the elliptic curve \(E_\tau \). The Cauchy kernel is given by The Weyl differential W(p) is then
where \(\eta _j = \zeta (\omega _j)\) are Weierstraß eta functions (not to be confused with Dedekind’s \(\eta \) function) and satisfying the Lagrange identity
The identity (2.67) implies, as it should be, that W(z) has a jump-discontinuity along the segment \((0,2\omega _1)\) and its translates thanks to the quasi-periodicity of the \(\zeta \) function
Namely: \(W(z +2\omega _2) - W(z) = 2i\pi \mathrm dz\) (as it should be from the definition).
The matrix of moments is almost a Hankel matrix because
In particular the integral is zero if j, k have different parity. This latter integral in principle can be computed in closed form; indeed since the integrand is an elliptic function with only one pole, it can be written as a linear combination of \(\wp ^{(\ell )}\), \(\ell =0,\dots , j+k\). Only the coefficient of \(\wp \) in this latter expansion survives because all other terms integrate to zero. Using the well known formula for the expansion of \(\wp \)
one finds easily
when j, k have the same parity (and zero otherwise). This is not of much use at any rate because there are no closed formulas for \(\mu _{1,j}\). We note only that the first two rows and columns of the moment matrix do not satisfy the Hankel property. For example
A numerical evaluation can be performed. The resulting first few orthonormal sections are plotted in Fig. 1 (with the independent variable \(z=\omega _2+s\), and \(s\in (0,2\omega _1)\). We observe that certain common theorems that apply to orthogonal polynomials do not apply here. In particular there are orthogonal sections of degree n which have \(n-2\) “real” (i.e. on \(\gamma \)) zeros.
Some remarks. We conclude this section with some remarks. The author could not find any literature discussing orthogonal section of line bundles in the sense of generalization of Padé approximants with the partial exception of [10] where, however, only the orthogonal “polynomials” and not the Padé problem are considered. Therefore there would be many questions regarding which of the classical results can be generalized in this context.
For example, (for the curves with antiholomorphic involution discussed in this last section) a natural question is where the zeros of the orthogonal sections are, and if something can be said for general (positive) measures. The ordinary proof of the reality and interlacing of orthogonal polynomials rely ultimately on the fact that polynomials are also an algebra, which is no longer the case in higher genus: indeed, the graded vector space \(\bigoplus _{n\ge 0} \mathscr {L}(n\infty +\mathscr {D})\) is the analog of the space of polynomials but is not an algebra.
Even the simple example indicated above (genus 1 and flat measure) would seem something of classical nature and possibly more properties of these orthogonal sections can be determined. For example it is tempting to conjecture that the number of zeros on the contour \(\gamma \) (fixed by the anti-involution) should be increasing by 2g every g steps and that an interlacing of the zeros is still a universal feature.
Much more interesting, and challenging, is the asymptotic description of the density of zeros, or even more, a strong asymptotic description of the orthogonal sections for large degree.
In this context, one could hope to adapt the techniques of the Deift–Zhou Steepest descent for Riemann–Hilbert problems (for either fixed measures or scaling weights) as in the literature for orthogonal polynomials [7]. This indeed was the main impetus for seeking the Riemann–Hilbert Problem 2.8. The immediate obstacle is the presence of the Tyurin data, which depend in a transcendental way on the measure.
3 Construction of KP tau functions: generalizing algebro-geometric tau functions
A famous construction of Krichever’s [15] gives rise to the so-called “algebro-geometric” solutions of the Kadomtsev–Petviashvili (KP) hierarchy. We are not recalling the whole construction here because it is well known in the community of integrable systems; for a modern review see [3]. Here we simply recall that the data are
-
a non-special divisor \(\mathscr {D}\) of degree g,
-
a point \(\infty \in \mathcal {C}\) and a local coordinate 1/z(p) (such that \(1/z(\infty )=0\)).
This is a subset of the data of our present setup: in addition to the above we have a measure \(\mathrm d\mu \) on a contour \(\gamma \subset \mathcal {C}\). It is then natural to investigate if we can extend that construction. This is indeed possible as we see in Theorem 3.7.
We are now going to consider the family of degree zero line-bundles \(\mathscr {L}_\mathbf{t}\) trivialized on the two sets of a disk around \(\infty \) and the punctured surface \(\mathcal {C}{\setminus } \{\infty \}\) with transition function \(\mathrm{e}^{\xi (p;\mathbf{t})}\), where we have set
for brevity. In concrete terms, a meromorphic section of \(\mathscr {L}_\mathbf{t}\) is simply a function f(p) which is meromorphic on \(\mathcal {C}{\setminus } \{\infty \}\), with an essential singularity at \(\infty \) and such that \(f(p)\mathrm{e}^{-\xi (p;\mathbf{t})}\) is meromorphic in a neighbourhood of \(\infty \).
Then the symbol \(\mathscr {L}_\mathbf{t}(\mathscr {D}+n\infty )\) stands for the vector space of all functions f(p) such that f(p) has poles at \(\mathscr {D}\) whose order does not exceed the multiplicity of the divisor and such that \(f(p)\mathrm{e}^{-\xi (p;\mathbf{t})} z(p)^{-n}\) is locally analytic near \(\infty \).
In Krichever’s approach the Baker–Akhiezer function is a spanning element of \(\mathscr {L}_\mathbf{t}(\mathscr {D})\) and in general one easily shows that
with the equality holding for a divisor \(\mathscr {D}\) and \(\mathbf{t}\) in generic position. For convenience we denote with \(\widehat{\mathscr {L}}_\mathbf{t}(\mathscr {D})= \sum _{n\ge 0} \mathscr {L}_\mathbf{t}(\mathscr {D}+n\infty )\). Namely, this is the infinite dimensional space of all meromorphic sections of \(\mathscr {L}_\mathbf{t}\) whose poles are at most at \(\mathscr {D}\) and with order which does note exceed the multiplicity of the points of \(\mathscr {D}\).
Consider now the following pairing on \(\widehat{\mathscr {L}}_\mathbf{t}(\mathscr {D})\otimes \widehat{\mathscr {L}}_\mathbf{s}(\mathscr {D})\):
given by
Our ultimate goal is to define a sequence of functions \(\tau _n(\mathbf{t},\mathbf{s}),\ n\ge 0\) (see Definition 3.5) with the following properties:
-
1.
\(\tau _n(\mathbf{t},\mathbf{s})=0\) if and only if the pairing (3.3) restricted to \({\mathscr {L}}_\mathbf{t}(\mathscr {D}+(n-1)\infty )\otimes {\mathscr {L}}_\mathbf{s}(\mathscr {D}+(n-1)\infty )\) is degenerate or \(\dim _\mathbb {C}{\mathscr {L}}_{\mathbf{t}}(\mathscr {D})>1\) or \(\dim _\mathbb {C}{\mathscr {L}}_{\mathbf{s}}(\mathscr {D})>1\);
-
2.
It satisfies the Kadomtsev–Petviashvili (KP) hierarchy in both sets of infinite variables \(\mathbf{t}, \mathbf{s}\);
-
3.
It satisfies the 2-Toda hierarchy.
Before proceeding with this plan, we provide a (formal) definition of KP tau functions which is convenient for us; historically this is not the definition but a theorem that characterizes KP tau functions [3]. However it is expedient for us to flip history on its head and use this characterization as a definition.
Definition 3.1
A (formal) KP tau function is a function \(\tau (t_1,t_2,\dots ) = \tau (\mathbf{t})\) of infinitely many variables that satisfies the Hirota bilinear identity (HBI)
for all \(\mathbf{t},\widetilde{\mathbf{t}}\). Here we use the notation
Notations for algebro-geometric objects. We choose a Torelli marking \(\{a_1,\dots , a_g, b_1,\dots , b_g\}\) for \(\mathcal {C}\) in terms of which we construct the classical Riemann Theta functions. We refer to [11], Ch. 1-2 for a review of these classical notions. Here we shall denote by \(\Theta _\Delta \) the Theta function with characteristc \(\Delta \), which is chosen as a nonsingular half-integer characteristics in the Jacobian of the curve. We remind the reader that this implies that \(\Theta _\Delta (\mathbf{z}),\ \mathbf{z}\in \mathbb {C}^g\) is an odd function on \(\mathbb J(\mathcal {C})\) and that the gradient at \(\mathbf{z}=0\) does not vanish. We also need the normalized holomorphic differentials \(\omega _1,\dots , \omega _g\) such that
The Abel map \(\mathfrak {A}_\infty (p)\) will be defined with basepoint chosen at \(\infty \):
Following a common use in the literature (see [11]), we omit the notation of the Abel map when it is composed with the Riemann Theta function; to wit, for example, if \(p\in \mathcal {C}\) is a point on the curve we shall write \(\Theta (p)\) to mean \(\Theta (\mathfrak {A}_\infty (p))\). Similarly, if \(\mathscr {D}= \sum k_j p_j\) is a divisor on \(\mathcal {C}\) with \(k_j\in \mathbb {Z}\) and \(p_j\) a collection of points, the writing \(\Theta (\mathscr {D})\) stands for \(\Theta \left(\sum k_j \mathfrak {A}_\infty (p_j)\right)\). Finally we denote by \( \mathscr {K}\) the vector of Riemann constants (e.g. [11], pag. 8). Note the \( \mathscr {K}\) depends on the choice of basepoint of the Abel map.
Let \(\Omega (p,q)\) be the “fundamental bidifferential” ([11], pag 20);
This is the unique bi-differential with the properties that it is symmetric in the exchange of arguments, its a-periods vanish and it has a unique pole for \(p=q\) with bi-residue equal to one (more details can be found in loc. cit.).
Let \(\Omega _\ell ,\ \ell \ge 1\) be the unique meromorphic differentials of the second kind with a single pole of order \(\ell +1\) at the point \(\infty \) and such that
They can be written in terms of the fundamental bidifferential as follows
With these notations we now remind that the (multi-valued) function \(\Theta _\Delta (p)\) has g zeros at the points \(\infty \) and \(\mathscr {D}_{\Delta }\) (a divisor of degree \(g-1\)). The following holomorphic differential:
vanishes at \(2\mathscr {D}_\Delta \) (i.e. has only zeros of even order and double that of the points of \(\mathscr {D}_\Delta \)). For later convenience we define the constant
From the definition of \(\omega _\Delta \) (3.12) a simple local analysis shows that \({\varkappa }\) can be also expressed as follows also the property
To shorten formulas we lift \(\xi \) to a function in the neighbourhood of \(\infty \) by using the local coordinate z:
Note that this (formal) function is defined only in the coordinate chart covered by our chosen local coordinate z. In terms of \(\xi \) we define the differential
This can be described as the unique differential on \(\mathcal {C}{\setminus } \{\infty \}\) with prescribed singular part near \(\infty \) given by \(\mathrm d\xi (p;\mathbf{t})\) and normalized so that its a-periods vanish. We denote by \(\vartheta (p;\mathbf{t}) \) its antiderivative with the constant of integration adjusted so that
I.e., \(\vartheta (p;\mathbf{t}) = \sum _{\ell \ge 1} t_\ell z(p)^\ell + \mathcal O(z(p)^{-1}).\)
Finally we denote by \(\mathbb V(\mathbf{t})\in \mathbb {C}^g\) the vector in the Jacobian \(\mathbb J(\mathcal {C})\) with components
The second equality is a consequence of Riemann bilinear identities. For \(x\in \mathcal {C}\) a point of the Riemann surface in the coordinate patch of z, we use the notation
The role of the Cauchy kernel (2.1) is now played by the following generalization.
Definition 3.2
The twisted Cauchy kernel is the following expression:
where
It can be characterized as the unique kernel on \(\mathcal {C}{\setminus } \{\infty \}\) which is a differential w.r.t. p, meromorphic function w.r.t. q and with the properties:
-
1.
w.r.t. p it has zero divisor \(\ge \mathscr {D}\) and a simple pole at q of residue 1;
-
2.
w.r.t. q it has pole divisor \(\ge -\mathscr {D}\) and a simple pole at p;
-
3.
when p, q are in a neighbourhood of \(\infty \) and hence fall within the same coordinate patch z, it can be written
$$\begin{aligned} \mathbf {C}(p,q;\mathbf{t}) = \mathrm{e}^{\xi (q;\mathbf{t})-\xi (p;\mathbf{t})} \left(\frac{1}{z-w} + \mathcal O(z^{-2})\mathcal O(w^{-1}) \right) \mathrm dz \end{aligned}$$(3.22)where \(z = z(p)\) and \(w = z(q)\).Footnote 1
We observe that for \(\mathbf{t}=0\) it coincides with the Cauchy kernel (2.1).
Remark 3.3
Observe that the Cauchy kernel ceases to exist when \(\Theta (\mathbb F(\mathbf{t}))=0\); this corresponds, by a consequence of Riemann vanishing theorem and Riemann–Roch’s theorem to the statement that \(\dim _\mathbb {C}{\mathscr {L} }_{\mathbf{t}}(\mathscr {D})> 1\), namely, that there is no Baker-Akhiezer function in Krichever’s setup. \(\triangle \)
The twisted kernel (3.20) allows us to construct spanning sections of \(\mathscr {L}_\mathbf{t}(\mathscr {D}+n\infty )\) by the following formula
A simple local analysis shows that the behaviour of \(\zeta _n\) near \(\infty \) is of the form
and hence \(\zeta _n(q;\mathbf{t})\in \mathscr {L}_\mathbf{t}(\mathscr {D}+n\infty ){\setminus } \mathscr {L}_\mathbf{t}(\mathscr {D}+(n-1)\infty )\) so that \( \mathscr {L}_\mathbf{t}(\mathscr {D}+n\infty ) = \mathrm{Span}_{\mathbb {C}}\left\rbrace \zeta _a; \ \ a=0,\dots , n\right\lbrace \).
Remark 3.4
Observe that \(\zeta _0(p;\mathbf{t})\) is the usual Baker–Akhiezer function of Krichever’s; then another convenient spanning set can be defined by \(\partial _{t_1}^\ell \zeta _0(p;\mathbf{t})\), \(\ell =0,\dots , n\).
Biorthogonal sections. A simple exercise shows that the following two sections \(P_n\in \mathscr {L}_\mathbf{t}(\mathscr {D}+n\infty )\) and \(Q_n\in \mathscr {L}_\mathbf{s}(\mathscr {D}+n\infty )\) are “biortogonal” with respect to the pairing (3.3) in the sense that \(P_n\perp \mathscr {L}_\mathbf{t}(\mathscr {D}+(n-1)\infty )\) and \(Q_n\perp \mathscr {L}_\mathbf{t}(\mathscr {D}+(n-1)\infty )\):
where we have introduced the generalized bi-moments
For example
We now come to the main object of the section;
Definition 3.5
(The Tau function) The Tau function is defined by
The expression \(Q(\mathbf{t})\) in (3.28) is the quadratic form
and \(A (\mathbf{t})\) is the linear form
where \(c_\ell \) are the coefficients of the expansion of \(\ln (\Theta _\Delta (x)z(x))\) near \(\infty \) in the coordinate z(x):
The equivalence of (3.28) and (3.29) follows from the Andreief identity [2]. The formula (3.29) shows clearly that \(\tau _n(\mathbf{t},\mathbf{s})=0\) if and only if the pairing (3.3) is degenerate or \(\Theta (\mathbb F(\mathbf{t}))\Theta (\mathbb F(\mathbf{s}))=0\).
Remark 3.6
The expression \(\Theta (\mathbb F)\mathrm{e}^{Q(\mathbf{t})}\) is the algebro-geometric KP tau function corresponding to Krichever’s construction (see [3], Ch. 8). Thus, for \(n=0\) the tau function is just the product of two independent Krichever algebro-geometric KP tau functions. For \(n\ge 1\) the determinant of the moments “entangles” them into a single object.
We now state the first main theorem
Theorem 3.7
For every \(n\in \mathbb N\), the tau function \(\tau _n (\mathbf{t},\mathbf{s})\) is a KP tau function separately in each set of variables \(\mathbf{t}, \mathbf{s}\). Namely it satisfies the two Hirota Bilinear Identities
where [x] is defined in (3.19) and \(\xi \) is defined in (3.15).
It is clear that the roles of \(\mathbf{t}, \mathbf{s}\) are completely symmetric in the definition (3.28), and hence it suffices to give the proof of (3.33). For this reason we will focus on the \(\mathbf{t}\) dependence, leaving the reader the exercise to reformulate similar statements for the \(\mathbf{s}\) dependence.
The argument in the residue formula (3.33) is usually split into the product of the so-called Baker–Akhiezer (and dual partner) functions. In fact these functions have their own definition (see for example [18]) and their relationship with the tau function is rather a theorem that generally goes under the name of Sato’s formula. Here we do not make this distinction because it is not relevant to the paper and we identify the Baker–Akhiezer functions with their expression in terms of Sato’s formula.
Proposition 3.8
The Baker–Akhiezer function is
where \(P_n\) is the biorthogonal section defined by (3.25) (the constant \({\varkappa }\) is defined in (3.13)).
The proof is in Sect. A.2. The second component of the HBI’s is the dual Baker function. For this reason we need the analog of Proposition 3.8 with the opposite shift in the times.
Proposition 3.9
The dual Baker function is
where \(\mathfrak R_n(x;\mathbf{t},\mathbf{s})\) is the following differential with a discontinuity across \(\gamma \):
and \(Q_n\) is the biorthogonal section (3.25). The jump discontinuity of \({\mathfrak {R}}_n\) across the contour \(\gamma \) is given by
The proof is in Sect. A.3.
With the aid of the two Propositions 3.8, 3.9 the proof of the main theorem is now a simple conclusion.
Proof of Theorem 3.7
Using 3.36 and 3.35 for the tau functions we see that their product in (3.33) extends to a well-defined holomorphic differential in the variable x defined on \(\mathcal {C}{\setminus } \gamma \cup \{\infty \}\). Thus we need to compute the following residue:
The differential \(\mathfrak R_n\) has a jump discontinuity across the contour \(\gamma \), an essential singularity at \(\infty \) and it is otherwise holomorphic with zeros at \(\mathscr {D}\) that cancel the poles of \(P_n\).
Thus, the computation of the residue 3.39 can be performed alternatively (Cauchy’s theorem) by integrating along the contour \(\gamma \) the jump discontinuity of the integrand using (3.38) and hence
Since \( Q_{n-1}(x;\widetilde{\mathbf{t}}, \mathbf{s})\in \mathscr {L}_{\mathbf{s}}(\mathscr {D}+(n-1)\infty )\), it follows that the integral vanishes because of the orthogonality of \(P_n\) to the whole subspace (3.27). \(\square \)
3.1 The 2-Toda hierarchy
A simple modification of the computation above allows us to prove the following corollary, which gives a modification of the 2-Toda bilinear identities [1, 20].
Corollary 3.10
The following modified 2-Toda bilinear identities hold:
where A it the linear expression (3.31) in terms of the times \(\mathbf{t}\).
Proof
For brevity we denote the pre-factor in the Definition 3.5 of the tau function (Formula (3.28)) by
We can recast (3.35) (3.36) as
Thus we have
Then after taking the residue and converting the residue to an integral over \(\gamma \) as in Theorem 3.7, we obtain
Repeating the same computation on the right side of (3.41), we have to use the formulæ (which are simply a rephrasing of Propostions 3.8 and 3.9)
Using these formulas on the right side of (3.41) we obtain
where we observe that the integral is the same as (3.46). Now, the ratio of the constants gives
and this produces the statement of the theorem. \(\square \)
We note that the bilinear identities (3.41) reduce to the standard ones in [1] in genus \(g=0\) where the linear form A vanishes.
Notes
We hope that the notation here is not too confusing; z is the value z(p) and w is the value of z(q). They are simply the local coordinates of the points p, q in the coordinate z near \(\infty \).
Since the coordinate local coordinate at \(\infty \) is \(\frac{1}{z(p)}\) the integral defining \((-\mathop {\mathrm {res}}\limits _{p=\infty })\) in the z-plane is a counterclockwise large circle.
References
Adler, M., van Moerbeke, P.: The spectrum of coupled random matrices. Ann. Math. (2) 149(3), 921–976 (1999)
Andréief, C.: Note sur une relation entre les intégrales définies des produits des fonctions. Mém. de la Soc. Sci. Bordeaux 3(2), 1–14 (1883)
Babelon, O., Bernard, D., Talon, M.: Introduction to Classical Integrable Systems. Cambridge Monographs on Mathematical Physics. Cambridge University Press, Cambridge (2003)
Baker, G.A., Jr.: Essentials of Padé Approximants. Academic Press, New York-London (1975)
Bertola, M., Gekhtman, M., Szmigielski, J.: The Cauchy two-matrix model. Commun. Math. Phys. 287(3), 983–1014 (2009)
Bertola, M.: Moment determinants as isomonodromic tau functions. Nonlinearity 22(1), 29–50 (2009)
Deift, P., Kriecherbauer, T., McLaughlin, K.T.-R., Venakides, S., Zhou, X.: Uniform asymptotics for polynomials orthogonal with respect to varying exponential weights and applications to universality questions in random matrix theory. Commun. Pure Appl. Math. 52(11), 1335–1425 (1999)
Deift, P.A.: Orthogonal Polynomials and Random Matrices: A Riemann–Hilbert Approach, Courant Lecture Notes in Mathematics, vol. 3. New York University Courant Institute of Mathematical Sciences, New York (1999)
Farkas, H.M., Kra, I.: Riemann Surfaces. Graduate Texts in Mathematics, 2nd edn. Springer, New York (1992)
Fasondini, M., Olver, S., Xu, Y.: Orthogonal polynomials on planar cubic curves. arXiv:2011.10884
Fay, J.: Theta Functions on Riemann Surfaces. Lecture Notes in Mathematics, vol. 352. Springer, New York (1970)
Fokas, A.S., Its, A.R., Kitaev, A.V.: Discrete Painlevé equations and their appearance in quantum gravity. Commun. Math. Phys. 142(2), 313–344 (1991)
Fokas, A., Its, A., Kitaev, A.: The isomonodromy approach to matric models in 2D quantum gravity. Commun. Math. Phys. (1992)
Faybusovich, L., Gekhtman, M.: Elementary Toda orbits and integrable lattices. J. Math. Phys. 41(5), 2905–2921 (2000)
Krichever, I.M.: Methods of algebraic geometry in the theory of non-linear equations. Russ. Math. Surv. 32(6), 185–213 (1977)
Kuijlaars, A.B.J., McLaughlin, K.T.-R.: A Riemann–Hilbert problem for biorthogonal polynomials. J. Comput. Appl. Math. 178(1–2), 313–320 (2005)
Lundmark, H., Szmigielski, J.: Degasperis–Procesi peakons and the discrete cubic string. IMRP Int. Math. Res. Pap. 2, 53–116 (2005)
Segal, G., Wilson, G.: Loop groups and equations of KdV type. Inst. Hautes Études Sci. Publ. Math. 61, 5–65 (1985)
Szegö, G.: Orthogonal Polynomials, vol. XXIII, 4th edn. American Mathematical Society, Providence (1975)
Ueno, K., Takasaki, K.: Toda lattice hierarchy. In: Group Representations and Systems of Differential Equations (Tokyo, 1982), Volume 4 of Adv. Stud. Pure Math., pp. 1–95. North-Holland, Amsterdam (1984)
Zverovic, E.I.: Boundary value problems in the theory of analytic functions in Hölder classes on Riemann surfaces. Uspekhi Mat. Nauk. 157, 113–179 (1971)
Acknowledgements
The work was supported in part by the Natural Sciences and Engineering Research Council of Canada (NSERC) Grant RGPIN-2016-06660.
Funding
Open access funding provided by Scuola Internazionale Superiore di Studi Avanzati - SISSA within the CRUI-CARE Agreement.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The author has no conflicts of interest to declare that are relevant to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Proofs
Proofs
1.1 The Sato shift
Here we call “Sato shift” the shift of times \(\mathbf{t}\) occurring in Sato’s formulas for the Baker–Akhiezer functions (Propositions 3.8, 3.9). The following lemmas show how the various ingredients of our formula transform when \(\mathbf{t}\mapsto \mathbf{t}\pm [x]\) [with the definition (3.19)]. This is all in preparation of expressing the tau function and the HBI (3.5). These lemmas can be traced in the literature in several places but we refer comprehensively (at least for part of them) to [3]. We provide our own proofs for the convenience of the reader.
The simplest result is the following one, which is a simple exercise from the definition (3.31)):
We remind the reader of the definition of \(\varkappa \) in (3.13) and of our stipulation (discussed after (3.8)) that the writing \(\Theta _\Delta (x)\) is a shorthand for \(\Theta _\Delta (\int _\infty ^x \vec {\omega })\).
Lemma A.1
Under the Sato shift, the vector \(\mathbb V(\mathbf{t})\) defined in (3.18) transforms as follows
Proof
Observe that \(\xi (p;\mathbf{t}-[x]) =\xi (p;\mathbf{t})- \sum _{\ell \ge 1} \frac{1}{\ell }\left(\frac{z(p)}{z(x)}\right)^\ell = \xi (p;\mathbf{t}) + \ln \left(1 - \frac{z(p)}{z(x)} \right)\), where the resummation holds as long as \(|z(x)|>|z(p)|\). Using this simple observation we obtain, using (3.18):
where the contour integral is counterclockwiseFootnote 2 in the z(q)-plane and \(|z(x)|>R\). \(\square \)
The quadratic form (3.30) is well known in the Krichever approach [3]. The main property that we are going to use is reported below
Proposition A.2
The quadratic form (3.30) has the properties:
with the notation (3.19) and \({\varkappa }\) the constant defined in (3.13).
Proof
From the definition of \(\xi \) (3.15) and (3.19) it follows that \(\xi (q;\mathbf{t}\pm [x]) = \xi (q;\mathbf{t}) \mp \ln (1 - z(q)/z(x))\). This computation assumes that \(|z(x)|>|z(q)|\) and hence, to make rigorous sense, we should realize the residues in (3.30) as counterclockwise contour integrals in the z-plane along the circle \(|z(q)|=R\) (with \(R<|z(x)|\)). Keeping this in mind, denote temporarily by \(\mathcal Q(\mathbf{t}, \mathbf{t})\) the polarization of the quadratic form Q: then we need to compute
We start from the last term Q([x]) and we compute it using local coordinates \(z = z(q), \widetilde{z}= z(p)\), and \(w=z(x)\) letting \(F(z,\widetilde{z}) = \Theta _\Delta (q-p)\). We then have, using integration by parts and the Cauchy theorem:
The logarithm now has a branch-cut from \(z=w\) to \( z=\infty \) and is analytic along \(|z|=R\) (recall that \(|w|>R\)) so that can integrate by part along the circle obtaining
The computation of this last integral needs to be done with care because of the branch-cut. We regularize the integral by adding \(\oint _{|z|=R} \ln (\frac{1}{w-z})\frac{\mathrm dz}{ z-w}\), which is zero because it is analytic in \(|z|<R\) (since \(|w|>R\): the branchcut of \(\ln \) is from \(z=w\) to \(z=\infty \)). Then (A.8) becomes
The integrand is now meromorphic for \(|z|>R\) with only first order poles at \(z=w\) and \(z=\infty \) and hence we obtain
where \({\varkappa }= -\lim _{z\rightarrow \infty } z{F(z,\infty )}\) (recall that \(F(z(p),\infty )= \Theta _\Delta (\int _{\infty }^p \vec {\omega })\) vanishes of first order as \(p\rightarrow \infty \)). The derivative of F(z, w) on the diagonal in the local coordinate z is precisely \(\omega _\Delta (p)/\mathrm dz(p)\) and hence we have the final result
Here the constant guarantees that the right side tends to one as \(x\rightarrow \infty \) as the left side does. To complete the proof we need to evaluate \(2\mathcal Q(\mathbf{t}, [x])\) in (A.6); using the definition (3.30) we have
with \(R, \widetilde{R}<|z(x)|\). A simple computation following the same steps as above yields then the regular part near \(x=\infty \) of the Abelian integral \(\vartheta (x;\mathbf{t})\), namely,
which completes the proof. \(\square \)
We will also need the formula for the Sato shift on the Abelian integral \(\vartheta \):
Lemma A.3
We have the formula:
Proof
Observe that \(\xi (p;\mathbf{t}-[x]) =\xi (p;\mathbf{t})- \sum _{\ell \ge 1} \frac{1}{\ell }\left(\frac{z(p)}{z(x)}\right)^\ell = \xi (p;\mathbf{t}) + \ln \left(1 - \frac{z(p)}{z(x)} \right)\), where the resummation holds as long as \(|z(x)|>|z(p)|\). Moreover, since \(\Omega (p,q)\) is given by (3.9) we can use integration by parts in the computation below: note that \(-\mathop {\mathrm {res}}\limits _{q=\infty }\) is an integration in the counterclockwise orientation in the z(q)-plane. Using these observations we obtain:
where we have used that \(|z(x)|>R\). \(\square \)
Lemma A.4
Let us pose
where \(\mathbb F := \mathbb F(\mathbf{t})\) [in (3.21)]. Then we have
where the constant \(K_n\) is independent of \(\mathbf{t}\) and it is given by (\({\varkappa }\) defined in (3.13))
Proof
Both sides of (A.17) are skew-symmetric single-valued functions of the n-tuple \((r_1,\dots , r_n)\in \mathcal C^n\), so it suffices to consider them as a function of—say—\(r_n\). As such, both sides behave near \(\infty \) as \(z^{n-1}(r_n)\mathrm{e}^{\xi (r_n;\mathbf{t})} (1+ \mathcal O(1/z))\) and have poles at \(\mathscr {D}\) and zeros at \(r_1,\dots , r_{n-1}\). There are g other zeros in generic position. The ratio of the two sides is thus a meromorphic function with g poles (and g zeros) in generic position, but this can only be a constant (by Riemann–Roch’s theorem). To compute this constant we proceed by induction on n; for \(n=1\) the statement is a tautology with \(K_1=1\). Consider now the induction step; multiply both sides by \(\mathrm{e}^{-\xi (r_n)} z(r_n)^{1-n}\) and send \(r_n\) to \(\infty \). In the left side the limit is the coefficient in the Laplace expansion of the determinant in front of \(\zeta _{n-1}(r_n)\). This is exactly the same determinant in one variable less. We now turn to the right side of (A.17), which we denote by \(R_n(r_1,\dots , r_n)\). We have
Therefore we must have \(K_n = {K_{n-1}\Theta (\mathscr {D}+ \mathscr {K})}{(-{\varkappa })^{n-1}}\). The proof then follows by induction. \(\square \)
1.2 Proof of Proposition 3.8
With the definition (A.16) the tau-function can be written as
Combining (A.14), (A.4), Lemma A.1 and the definition of \(\mathbb F\) (3.21) we see that the following holds
where \(\mathscr {H}_n\) is defined in (A.16). The expression (A.21) can be rewritten as follows:
We now integrate (A.22) with respect to the variables \(r_j\)’s on \(\gamma \) using the formula (A.20) for the tau functions;
Consider the integral in (A.23): if we use (A.22) for \(\mathscr {H}_n ( \vec {r}; \mathbf{t}- [x])\) together with the definition (A.16) of \(\mathscr {H}_n\) as a determinant, the computation to be performed involves now (up to inessential multiplicative constants)
where, for notational simplicity, in the first determinant we have set \(r_{n+1}=x\) (and this variable is not integrated upon). Expanding this determinant along the last row and then using Andreief’s identity on each of the coefficients, we obtain \(n!P_n(x;\mathbf{t},\mathbf{s})\) as in (3.25). Collecting now the factors in (A.22) and this last computation together with (A.1), the equation (A.23) becomes
The constant \(\star \) can be computed by carefully tracking all the steps or simply by asymptotic analysis near \(\infty \); indeed we see from (3.25) that
Then also we need that (recall that \(\mathrm dz(x)\) has a second order pole at \(\infty \), while \(\omega _\Delta \) is holomorphic, so that the square root of their ratio is a locally defined function with a simple zero)
The conclusion follows from elementary algebra. \(\square \)
1.3 Proof of Proposition 3.9
Similarly to the proof of Proposition 3.8, combining (A.14), (A.5) and Lemma A.1 we have
Now consider the expression
With respect to x, \(\phi _n\) is a single-valued one-form in \( \mathcal K_{-\mathbf{t}}( -\mathscr {D} +\sum r_j - (n-2)\infty )\), while, with respect to each of the \(r_j\) it is a section of \(\mathscr {L}_{ \mathbf{t}}(\mathscr {D}+(n-2)\infty )\) and it is skew-symmetric under the action of permutations of the \(r_j\) variables. Using the same logic as in the proof of Lemma A.4 we deduce that, up to a constant, the following holds
To verify that the behaviour near \(x=\infty \) of the right side of (A.30) is indeed the correct one we use the following local analysis: from the formula for \(\zeta _a(r;\mathbf{t})\) it follows that
with the series converging for x, r in a neighbourhood of \(\infty \) as long as \(|z(x)|>|z(r)|\). Inserting this expansion in the first column of the determinant (A.30) one sees that all the coefficients of \(\frac{\mathrm dz(x)}{z(x)^a}\) vanish up to \(a=n-1\) included and
We now integrate \(\phi _n\) as follows
We want to identify \(\mathfrak R_n\) with a more transparent formula involving the Cauchy transform of the biorthogonal section \(Q_{n-1}\) defined in (3.25); to this end we observe that \(\mathfrak R_n\) is holomorphic on \(\mathcal {C}{\setminus } \gamma \cup \{\infty \}\) and has a jump discontinuity across \(\gamma \) which we can compute with the aid of Sokhotski–Plemelj formula together with the symmetry under permutation of the dummy variables \(r_j\):
Using the same reasoning that followed (A.24) we arrive at the following formula for the integral:
Using the fact that \(\mathfrak R_n\) vanishes at the non-special divisor \(\mathscr {D}\) (and the behaviour at \(\infty \)) we then conclude that
From (A.32) it follows that near \(\infty \) the differential \(\mathfrak R_n\) has the expansion
Now we can collect this information in the final computation:
Using (A.1), (A.28) and the resulting expression of \(\mathfrak R_n\) we obtain, up to a multiplicative constant to be determined
The proportionality constant can be evaluated by asymptotic expansion at infinity using (A.37) and (3.14). \(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bertola, M. Padé approximants on Riemann surfaces and KP tau functions. Anal.Math.Phys. 11, 149 (2021). https://doi.org/10.1007/s13324-021-00585-2
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13324-021-00585-2