Partial Diffusion Markov Model of Heterogeneous TCP Link: Optimization with Incomplete Information

Borisov, Andrey; Bosov, Alexey; Miller, Gregory; Sokolov, Igor

doi:10.3390/math9141632

¹ Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44/2 Vavilova Str., 119333 Moscow, Russia

² Moscow Aviation Institute, 4, Volokolamskoe Shosse, 125993 Moscow, Russia

³ Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University, GSP-1, 1-52 Leninskiye Gory, 119991 Moscow, Russia

⁴ Moscow Center for Fundamental and Applied Mathematics, Lomonosov Moscow State University, GSP-1, Leninskie Gory, 119991 Moscow, Russia

* Author to whom correspondence should be addressed.

Mathematics 2021, 9(14), 1632; https://doi.org/10.3390/math9141632

Submission received: 13 June 2021 / Revised: 7 July 2021 / Accepted: 8 July 2021 / Published: 10 July 2021

(This article belongs to the Special Issue Markov and Semi-markov Chains, Processes, Systems and Emerging Related Fields)

Abstract

The paper presents a new mathematical model of TCP (Transmission Control Protocol) link functioning in a heterogeneous (wired/wireless) channel. It represents a controllable, partially observable stochastic dynamic system. The system state describes the status of the modeled TCP link and expresses it via an unobservable controllable MJP (Markov jump process) with finite-state space. Observations are formed by low-frequency counting processes of packet losses and timeouts and a high-frequency compound Poisson process of packet acknowledgments. The information transmission through the TCP-equipped channel is considered a stochastic control problem with incomplete information. The main idea to solve it is to impose the separation principle on the problem. The paper proposes a mathematical framework and algorithmic support to implement the solution. It includes a solution to the stochastic control problem with complete information, a diffusion approximation of the high-frequency observations, a solution to the MJP state filtering problem given the observations with multiplicative noises, and a numerical scheme of the filtering algorithm. The paper also contains the results of a comparative study of the proposed state-based congestion control algorithm with the contemporary TCP versions: Illinois, CUBIC, Compound, and BBR (Bottleneck Bandwidth and RTT).

Keywords:

controllable Markov jump processes; compound Poisson processes; diffusion limits; stochastic control problem with incomplete information; novel queuing models in applications

1. Introduction

Despite being almost 50 years old, the Transmission Control Protocol (TCP) [1] is still subject to continual modernization and improvement, and this evolution is a natural, ongoing process. It is driven by the incessant challenges posed by the wide variety of computer networks, rapid progress in the design of communication devices, and ever stricter requirements for information transmission [2,3,4]. Guaranteeing data transfer independently of the hardware platform is the key task of TCP; stable functioning and effective use of the available channel bandwidth are the performance characteristics of each specific TCP version. The congestion control algorithms are responsible for implementing all these functions. They use two characteristics as control actions. The basic one is the congestion window size (cwnd), i.e., the number of packets that can be sent without acknowledgment. A less influential one is the retransmission timeout, i.e., the waiting time for the acknowledgment of successful packet reception; if it is exceeded, the congestion control algorithm treats this as a packet loss.

When most channels were wired and had relatively small capacity and queue waiting time, the “Additive Increase–Multiplicative Decrease” (AIMD) congestion control rule demonstrated good performance. It presumed a linear growth of the cwnd between two successive packet losses, with an abrupt, jump-like decrease of the cwnd at each loss. The effectiveness of this strategy for such channels was transparent. First, the small channel capacity made it possible to reach the bandwidth limit linearly, without losses, in a rather short time. Second, wired hops were so reliable that a sudden packet loss almost surely indicated congestion at some “bottleneck”. Therefore, the loss indicated the necessity to reduce the sending rate. This simple reasoning was the basis for developing such loss-based versions of TCP as Tahoe, New Reno, etc. [5].
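The AIMD rule can be illustrated by a minimal sketch. The loss model (a loss occurs whenever the window exceeds a fixed capacity), the function name `aimd_trace`, and all numeric values are assumptions made for illustration only; they do not reproduce any specific TCP version.

```python
def aimd_trace(capacity, steps, cwnd0=1.0, add=1.0, mult=0.5):
    """Return the cwnd value at each round under a toy AIMD rule."""
    cwnd = cwnd0
    trace = []
    for _ in range(steps):
        trace.append(cwnd)
        if cwnd > capacity:
            # Assumed loss model: exceeding the capacity causes a loss,
            # which is interpreted as congestion (multiplicative decrease).
            cwnd = max(1.0, cwnd * mult)
        else:
            # No loss: additive (linear) increase.
            cwnd += add
    return trace

trace = aimd_trace(capacity=10, steps=30)
```

The resulting trace has the familiar sawtooth shape: a linear ramp up to the capacity followed by a halving of the window.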

In the case of “long fat” channels (ones with huge capacity and long queue waiting times), AIMD-based versions of TCP turned out to be ineffective: they significantly underused the channel bandwidth. In high-capacity channels, linear growth does not allow the congestion window to quickly reach values close to the available bandwidth, and the loss of even a single packet reduces the data transfer speed further. In addition, if a channel includes a wireless hop, single packet losses are not an explicit indicator of congestion. The round-trip time (RTT) parameter therefore starts to play a notable role in the congestion control algorithm, which has led to a variety of TCP versions: delay-sensitive, hybrid loss-delay, bandwidth estimation-based, etc. [2]. All these modifications make the congestion control algorithm more tolerant to packet losses: after each loss, it decreases the cwnd not multiplicatively but more sparingly. At the same time, the cwnd grows more aggressively to reach the channel bandwidth faster. The bandwidth value is unknown but is estimated from all past statistics of the channel functioning. The algorithm probes the channel by a more or less gentle enlargement of the cwnd to give a chance to use all channel resources. Hence, the typical cwnd curve between two packet losses has a concave [6] or mixed concave-convex character [7].

The ubiquitous application of wireless technologies in computer networks is a challenge to TCP performance and calls for its further enhancement. Jitter and periodic signal fading in the wireless channel hops are extra sources of uncertainty about the real channel throughput. These physical phenomena affect both the new mathematical models of the channel functioning and the congestion control algorithms.

Mathematical models of computer network traffic are also being developed intensively. Without aiming at a comprehensive overview of these models, we only mention their major classes:

Markov and hidden Markov models [8,9,10,11],
queuing systems [12,13],
models, based on the fluid or diffusion approximation of jump processes [14,15,16],
network calculus models [17,18,19],
models involving self-similar processes [20,21,22],
concurrent models and games [23,24,25], etc.

Generally speaking, a prospective mathematical model of a channel should satisfy the conditions below.

A model should describe the data transferring process adequately.
A model should strike a trade-off between simplicity and the complexity of the real object, with its many parameters, their uncertainty, and the uncertainty introduced by external disturbances.
A model should operate with the same collection of statistical information as the one available in the real channel.
A model should make it possible to simulate the collection of recent “competing” versions of TCP.
A model should be supported by a developed mathematical framework for solving the whole complex of analysis, estimation/identification, and optimization/control problems. Availability of both theoretical solutions to these problems and their efficient numerical realizations is strongly desirable.

The aim of the paper is two-fold. The first aim is to present a new mathematical model of TCP link functioning over a heterogeneous (wired/wireless) channel. It represents a controllable, partially observable stochastic dynamic system. The system state describes the status of the modeled TCP link and is expressed via a controllable Markov jump process (MJP) with a finite state space. This space can be chosen arbitrarily depending on the desired level of detail of the link description. In this paper, we consider four possible channel states:

$e_{1}$ : the channel is idle,
$e_{2}$ : the channel is loaded moderately,
$e_{3}$ : congestion in the wired segment,
$e_{4}$ : signal fading in the wireless hop.

Although it looks rather simple, this model successfully describes such problematic link phenomena as congestion at a channel “bottleneck” and fading of the carrier radio signal.

The observations included in the model correspond to those available to a TCP control algorithm on the sending side. Two observable processes describe the flow of packet losses and the flow of timeouts. They are represented by controllable Cox processes whose intensities depend on both the control and the unobserved link state. The third observation is the flow of acknowledgments of successful packet reception at the receiving node. This flow is expressed in terms of a compound Poisson process (CPP). Its first component is a counting process of the acknowledgment reception moments, and the second one registers the corresponding individual values of the round-trip time (RTT).

In the paper, we control the TCP varying the cwnd value only; however, the proposed model allows other control parameters, e.g., RTO (retransmission timeout). We also demonstrate how the proposed mathematical model can describe various contemporary versions of the TCP: Illinois, CUBIC, BBR, and Compound.

The second aim of the paper is to present a new TCP prototype version. Its mathematical background comprises the solution to the optimal MJP state control problem under complete information and the solution to the optimal MJP state filtering problem given the diffusion and counting observations. The performance of the proposed prototype is demonstrated in a series of numerical experiments.

The paper is organized as follows. Section 2 contains a detailed description of the TCP link mathematical model in terms of the controllable stochastic observation system, along with the optimization problem of data transmission through this link.

One can enhance the use of the channel resources by means of optimal stochastic control with incomplete information. However, this approach is complicated to realize, from proving the existence of the optimal solution to implementing bulky numerical algorithms. Hence, we propose a rather simple suboptimal solution to the problem along with its effective numerical implementation.

To develop the TCP prototype, we need a substantial mathematical framework, which is introduced in Section 3:

Section 3.1 contains the solution to the optimal MJP control problem with instant geometric control constraints and complete information [26],
Section 3.2 introduces a diffusion approximation for the high-frequency CPP describing the packet acknowledgment flow [27],
Section 3.3 presents a solution to the optimal MJP state filtering problem given both counting and diffusion observations with state-dependent noise [28],
Section 3.4 contains a numerical algorithm for the optimal filtering realization [28].

In general, the articles [26,27,28] provide a formal, detailed mathematical background for all applied inferences presented in this paper. We use it in Section 4 to develop a new congestion control algorithm as follows. At the first stage, we calculate a high-precision channel state estimate based on the available observations discretized in time. At the second stage, we apply a separation principle: the obtained filtering estimate replaces the actual MJP state in the optimal control synthesized for the case of complete information.

The aim of Section 5 is two-fold. First, it demonstrates the potential of the proposed mathematical model to describe various versions of the TCP: classic AIMD congestion control scheme and TCP Illinois (Section 5.1), TCP CUBIC (Section 5.2), TCP Compound (Section 5.3), TCP BBR (Section 5.4).

Second, the section contains the comparison of the proposed state-based TCP with versions mentioned above: Section 5.5 highlights some details of the numerical realization of the proposed TCP version, and Section 5.6 represents the summary of the performed numerical experiments. Section 6 contains concluding remarks.

2. Problem of Optimal Data Transmission through TCP Channel

On the canonical Wiener–Poisson space with filtration $(Ω, F, P, \{F_{t}\})$ [29,30], we consider the following controllable stochastic system describing the TCP link functioning:

X_{t} = X_{0} + \int_{0}^{t} A (u_{s}) X_{s} d s + α_{t},

(1)

Y_{t} = \int_{0}^{t} B (u_{s}) X_{s} d s + β_{t},

(2)

Z_{t} = \int_{0}^{t} C (u_{s}) X_{s} d s + γ_{t},

(3)

\{(τ_{n}, V_{n})\}_{n \in \mathbb{N}} .

(4)

Here the TCP link state $X_{t}$ is a controllable finite-state MJP with values in the set $S^{N} ≜ \{e_{1}, \dots, e_{N}\}$ formed by the unit coordinate vectors of the Euclidean space $R^{N}$. The initial value $X_{0}$ has a known distribution $π$, $A (u) = ∥ A^{i j} (u) ∥_{i, j = \overline{1, N}}$ is a controllable transition intensity matrix, and $α_{t}$ is an $F_{t}$-adapted martingale with the quadratic characteristic [31]

\langle α, α \rangle_{t} = \int_{0}^{t} (\operatorname{diag} (A (u_{s}) X_{s}) - A (u_{s}) \operatorname{diag} (X_{s}) - \operatorname{diag} (X_{s}) A^{⊤} (u_{s})) d s .

The link state is unobservable, and the complex of observations $(Y_{t}, Z_{t}, \{(τ_{n}, V_{n})\})$ includes three components.

$Y_{t}$ is a counting process (flow) of packet losses described by its martingale representation (2): $β_{t}$ is an $F_{t}$-adapted martingale with the quadratic characteristic

$\langle β, β \rangle_{t} = \int_{0}^{t} B (u_{s}) X_{s} d s,$

$B (u) ≜ \operatorname{row} (B^{1} (u), \dots, B^{N} (u))$ represents the collection of the loss intensities given the conditions $X_{t} = e_{n}$, $n = \overline{1, N}$.
$Z_{t}$ is a counting process (flow) of packet timeouts described by its martingale representation (3): $γ_{t}$ is an $F_{t}$-adapted martingale with the quadratic characteristic

$\langle γ, γ \rangle_{t} = \int_{0}^{t} C (u_{s}) X_{s} d s,$

$C (u) ≜ \operatorname{row} (C^{1} (u), \dots, C^{N} (u))$ represents the collection of the timeout intensities given the conditions $X_{t} = e_{n}$, $n = \overline{1, N}$.
$\{(τ_{n}, V_{n})\}$ is a flow of successful packet acknowledgments: here $τ_{n}$ stands for the time instant of the $n$-th acknowledgment arrival and $V_{n}$ for the RTT of the $n$-th acknowledgment. It represents a controllable compound Poisson process (CPP) with the intensity driven by the Markov state $X_{t}$: the predictable measure generated by $\{(τ_{n}, V_{n})\}$, conditioned on the MJP state $X$, takes the form

$μ_{p} (ω, d t, d v) = λ (u_{t}) \operatorname{diag} (X_{t -}) Λ (u_{t}, v) d t d v .$

Here $λ (u_{t}) ≜ \operatorname{row} (λ^{1} (u_{t}), \dots, λ^{N} (u_{t}))$ is a vector-valued function with continuous positive components, whose $n$-th component represents the conditional intensity of acknowledgment arrivals given $X_{t} = e_{n}$; $Λ (u_{t}, v) ≜ \operatorname{col} (Λ^{1} (u_{t}, v), \dots, Λ^{N} (u_{t}, v))$ is a vector-valued function with continuous components, whose $n$-th component represents the conditional probability density function (pdf) with respect to $v$ given $X_{t} = e_{n}$ for each fixed $u_{t}$.

All martingale terms in the processes $X$, $Y$, $Z$, and $(τ, V)$ are strongly orthogonal.
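To make the structure of the state equation (1) and the loss observations (2) more tangible, the following minimal sketch simulates a state path and the loss flow for one fixed control value. It assumes, as the form of (1) suggests, that $A^{i j}$ is the intensity of the transition from $e_{j}$ to $e_{i}$, so that the columns of $A$ sum to zero. The function name `simulate_link` and all numeric intensities are hypothetical and chosen only for illustration.

```python
import numpy as np

def simulate_link(A, B, x0, T, rng):
    """Simulate the state path and the loss times on [0, T] for one fixed control value."""
    state, t = x0, 0.0
    jump_times, states, loss_times = [0.0], [x0], []
    while True:
        exit_rate = -A[state, state]
        hold = rng.exponential(1.0 / exit_rate) if exit_rate > 0 else np.inf
        end = min(t + hold, T)
        # While the state is constant, losses form a Poisson flow with intensity B[state].
        n_loss = rng.poisson(B[state] * (end - t))
        loss_times.extend(np.sort(rng.uniform(t, end, n_loss)))
        if end >= T:
            break
        # Next state: column `state` of A holds the intensities of leaving that state.
        probs = A[:, state].copy()
        probs[state] = 0.0
        state = int(rng.choice(len(B), p=probs / exit_rate))
        t = end
        jump_times.append(t)
        states.append(state)
    return np.array(jump_times), np.array(states), np.array(loss_times)

# Hypothetical intensities for the four states e_1..e_4 (indices 0..3).
A = np.array([[-0.5,  0.2,  0.0,  0.0],
              [ 0.5, -0.6,  0.3,  0.4],
              [ 0.0,  0.2, -0.3,  0.0],
              [ 0.0,  0.2,  0.0, -0.4]])
B = np.array([0.01, 0.1, 2.0, 1.0])   # loss intensities per state
jump_times, states, loss_times = simulate_link(A, B, x0=0, T=200.0,
                                               rng=np.random.default_rng(0))
```

The timeout flow (3) could be generated in the same way with the intensities $C^{n}$ in place of $B^{n}$.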

The control $u_{t}$ represents the current size of the congestion window, i.e., the portion of packets that can be transmitted instantly. The set of admissible controls contains all $O_{t}$-predictable processes ($O_{t} ≜ σ \{Y_{s}, Z_{s}, (τ_{n}, V_{n}) : s, τ_{n} \in [0, t]\}$ stands for the natural filtration induced by all observations available up to the moment $t$) satisfying the geometric constraint

u_{s} \in U ≜ [\underline{u}, \overline{u}] \subset R_{+} \quad P\text{-a.s. for all } s ⩾ 0 .

(5)

The intensity of acknowledgment arrivals is much higher than the intensities of state transitions, packet losses, and timeouts:

\min_{n, u} λ^{n} (u) ≫ \max_{n, u} (| A^{n n} (u) |, B^{n} (u), C^{n} (u)) .

The performance criterion

J (U) ≜ E \{ψ X_{T} + \int_{0}^{T} (ϕ (u_{s}) - u_{s} ξ) X_{s} d s\} \to \max_{U}

(6)

represents an average profit for the transmitted information, which should be maximized. Here

$ψ ≜ row (ψ^{1}, \dots, ψ^{N})$ is a vector of conditional gains given the terminal state $X_{T}$ ,
$ϕ (u_{s}) ≜ row (ϕ^{1} (u_{s}), \dots, ϕ^{N} (u_{s}))$ includes strictly concave components, which represent conditional instant gains for the transmitted information given the current link state $X_{s}$ ,
$ξ ≜ row (ξ^{1}, \dots, ξ^{N})$ is a vector of specific transmission expenses per information unit in each link state.

The problem under consideration is challenging. First, optimal control problems for stochastic jump processes with incomplete information are, in general, rather complicated [31,32,33,34]. Their proper statement and solution depend on the answers to several auxiliary questions: the martingale problem [35], the existence and uniqueness of a strong solution, and the measurable selection of a control (see [36] and references therein). Without positive answers to these questions, we cannot use the martingale theory [35,37] to express the optimal control in terms of either variational inequalities (with the dynamic programming equation as the preferable outcome) or the stochastic maximum principle. Note that negative answers imply only that the mathematical tools mentioned above cannot be used. The control problem could presumably be modified slightly so that a solution exists, but finding it may require other, yet undeveloped frameworks.

Second, both the dynamic programming equation and the stochastic maximum principle have a forward-backward form, which complicates the synthesis of the optimal control in explicit form. The authors of [36] solved the analogous problem of controlling the MJP state (1) given observations of the flow of packet losses (2) only. The theoretical optimal solution was characterized via both the dynamic programming equation and the maximum principle. At the same time, the authors presented a numerical realization of the obtained result only for the case when the transition intensity matrix of the MJP is independent of the control (i.e., the state is uncontrollable) and the control affects the intensity of the losses only. Despite these restrictive conditions, the practical results looked rather promising: the optimal policy demonstrated a piecewise concave nature similar to that of modern TCP versions: Illinois [6], CUBIC without probe phase [38,39], Compound [40,41], etc.

Third, the essential weak point of an optimal control implementation is its poor robustness to imprecise knowledge of the control system characteristics and to small perturbations of the synthesized control. This means that either control system parameters slightly misspecified relative to their unknown nominal values, or “instrumental errors” in the control caused by imperfections of its numerical realization, could nullify the gain of the sophisticated optimal control in comparison with a stable suboptimal algorithm.

Fourth, the flow of packet acknowledgments has high intensity and hence leads to a high-frequency control, which is resource intensive.

Keeping in mind all the arguments above, we avoid the direct solution to the optimal stochastic control problem (6) for the MJP (1) state given the observations (2), (3), and (4), including the martingale problem and the problems of solution existence and uniqueness. Instead, we use solutions to a complex of adjacent problems and propose a suboptimal control algorithm of high performance.

3. Mathematical Background

As a basis of the proposed suboptimal control algorithm, we use the following arguments and mathematical results.

1. The solution to the optimal stochastic control problem for the MJP (1) state with complete information exists and can be defined as a solution to the dynamic programming equation [26].
2. The high frequency of the acknowledgment flow allows us to approximate the observable controlled CPP (4) by a Brownian motion with drift [42] whose parameters are modulated by the MJP state [27]. The distribution of the diffusion approximation is described by a few moment characteristics only, which makes the subsequent state filtering algorithm robust to imprecise knowledge of the specific distribution of the compound Poisson process jumps.
3. The conversion of the high-frequency acknowledgment flow to a diffusion process makes it possible to use the solution to the optimal MJP (1) state filtering problem given the “diffusion” and counting observations [43]. This is an extension of the Wonham filter [44] to the case of diffusion observations with state-dependent noises. Under rather mild identifiability conditions, the optimal filtering estimate coincides with the exact MJP state.
4. The dynamic programming equation corresponding to the control problem with complete information mentioned in item 1 is a system of ordinary differential equations, for which well-developed numerical methods exist. By contrast, the equations of the generalized Wonham filter [43] require the design of special numerical procedures similar to [28].
5. To complete the control synthesis, we postulate a separation principle: we substitute the state filtering estimate mentioned in items 3 and 4 into the control strategy defined in item 1.

3.1. Optimal Control Strategy with Complete Information

Let us consider the controllable MJP (1), which should be optimized with respect to the optimality criterion (6), where the set $\mathcal{U}$ of all admissible controls $U$ includes all $O_{t}$-predictable processes satisfying the geometric constraint (5).

Let us define the Bellman function $B (t, x) : [0, T] \times S^{N} \to R$:

B (t, x) ≜ \sup_{U \in \mathcal{U}} E \{ψ X_{T} + \int_{t}^{T} (ϕ (u_{s}) - u_{s} ξ) X_{s} d s | X_{t} = x\} .

(7)

Since $x$ takes values in the set $S^{N}$ of unit coordinate vectors, the function $B (t, x)$ can be presented in the form $B (t, x) = η^{⊤} (t) x$, where $η (t) ≜ \operatorname{col} (η^{1} (t), \dots, η^{N} (t)) = \operatorname{col} (B (t, e_{1}), \dots, B (t, e_{N}))$ is a vector-valued function.

Theorem 1.

The assertions below are true [26].

1. The function $η (t)$ is the unique solution to the Cauchy problem

$\begin{cases} - {\dot{η}}^{n} (t) = \max_{u \in U} [\sum_{j = 1}^{N} A^{j n} (u) η^{j} (t) + ϕ^{n} (u) - u ξ^{n}], & n = \overline{1, N}, \quad 0 ⩽ t < T, \\ η^{n} (T) = ψ^{n}, & n = \overline{1, N} . \end{cases}$

(8)
2. There exists a Borel function ${\hat{u}}_{t} (x) : [0, T] \times S^{N} \to U$ such that

${\hat{u}}_{t} (x) \in \underset{u \in U}{\operatorname{Argmax}} [\sum_{n = 1}^{N} (\sum_{j = 1}^{N} A^{j n} (u) η^{j} (t) + ϕ^{n} (u) - u ξ^{n}) x^{n}]$

(9)

for any $(t, x) \in [0, T] \times S^{N}$.
3. The random process ${\hat{U}}_{t} ≜ {\hat{u}}_{t} (X_{t -})$ is an optimal control strategy for the problem (1), (6).
4. The optimal value of criterion (6) has the form $\max_{U \in \mathcal{U}} J (U) = J (\hat{U}) = η^{⊤} (0) π$; moreover, the supremum in (7) is attained for any $(t, x) \in [0, T] \times S^{N}$ at the strategy $\{{\hat{U}}_{s}, s \in [t, T]\}$.

The theorem establishes the basis for the practical control realization. Indeed, all variants of the possible optimal controls (9) can be calculated and stored in advance by solving (8), before the control synthesis. The synthesis itself amounts to selecting the suitable control from the set of possible ones using the “current” MJP state $X_{t -}$.
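The precomputation described above can be sketched numerically. The following minimal example assumes that the running gain in state $e_{n}$ is $ϕ^{n} (u) - u ξ^{n}$, as in criterion (6), that the control set is replaced by a finite grid, and that $η$ is integrated backward from the terminal condition $η (T) = ψ$ by an explicit Euler scheme, so that the gains accumulate toward $t = 0$. The function name `solve_bellman`, the grid, and the toy gain functions are hypothetical; this is an illustration, not the numerical method of [26].

```python
import numpy as np

def solve_bellman(A_of_u, gain_of_u, psi, T, u_grid, n_steps):
    """Backward Euler sweep: returns eta on the time grid and the maximizing control per state."""
    dt = T / n_steps
    N = len(psi)
    eta = np.empty((n_steps + 1, N))
    policy = np.empty((n_steps, N))
    eta[-1] = psi                                   # terminal condition eta(T) = psi
    for k in range(n_steps - 1, -1, -1):
        # Row: candidate control u; column n: sum_j A^{jn}(u) eta^j + gain^n(u).
        values = np.array([A_of_u(u).T @ eta[k + 1] + gain_of_u(u) for u in u_grid])
        policy[k] = u_grid[values.argmax(axis=0)]   # stored control table, one entry per state
        eta[k] = eta[k + 1] + dt * values.max(axis=0)
    return eta, policy

# Hypothetical two-state example without transitions, where the optimum is known:
# gain c*sqrt(u) - u*xi is maximal at u = (c / (2*xi))**2 with value c**2 / (4*xi).
c = np.array([2.0, 1.0])
xi = np.array([1.0, 1.0])
eta, policy = solve_bellman(
    A_of_u=lambda u: np.zeros((2, 2)),
    gain_of_u=lambda u: c * np.sqrt(u) - u * xi,
    psi=np.zeros(2), T=2.0, u_grid=np.linspace(0.0, 2.0, 201), n_steps=100)
```

At run time, the control for the current state index `n` at time step `k` is simply read from `policy[k, n]`, which mirrors the table lookup described in the text.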

3.2. Diffusion Approximation of High-Frequency Counting Observations

Using the “genuine” acknowledgment flow (4) to synthesize the control leads to a discontinuous control that changes with high frequency. Its calculation may be resource intensive: each newly arriving acknowledgment triggers the control recalculation algorithm. The contemporary TCP versions work exactly like this, but they are relatively simple and hence not too “costly”.

Once we consider (4) discretized in time with some appropriate time increment, the probability distributions of the observation increments look like mixtures of Gaussians due to the central limit theorem for renewal-reward processes (CLTRRP). In this subsection, we answer two questions. First, we determine the characteristics of these mixtures. Second, we form recommendations on how to choose the time increment so that the real distribution of the discretized observations is appropriately close to the theoretical mixture above.

First, to perceive the nature of the diffusion approximation, we investigate the CPPs with a fixed control $u \in U$. We consider a collection of CPPs $\{(τ_{n}^{j}, V_{n}^{j})\}_{n \in \mathbb{N}}$, $j = \overline{1, N}$, $u \in U$, with the predictable measures

μ_{p}^{j} (d t, d v) ≜ λ^{j} (u) Λ^{j} (u, v) d t d v .

Probabilistically, they correspond to the initial CPP $\{(τ_{n}, V_{n})\}$ staying in a “single mode”, $X_{t} \equiv e_{j}$, with a fixed control value $u_{t} \equiv u$. Each CPP generates a stochastic measure

μ^{j} (ω, d t, d v) ≜ \sum_{n \in \mathbb{N}} δ_{(τ_{n}^{j} (ω), V_{n}^{j} (ω))} (d t, d v) .

Keeping in mind the specific form of the predictable measures $μ_{p}^{j}$, we can compute the moment characteristics for one jump of the CPPs:

m_{τ}^{j} ≜ E \{τ_{1}^{j}\} = \frac{1}{λ^{j} (u)}, \quad m_{V}^{j} ≜ E \{V_{1}^{j}\} = \int_{R} v Λ^{j} (u, v) d v,

(10)

σ_{τ}^{j} ≜ \sqrt{\operatorname{var} (τ_{1}^{j})} = \frac{1}{λ^{j} (u)}, \quad σ_{V}^{j} ≜ \sqrt{\operatorname{var} (V_{1}^{j})} = \sqrt{\int_{R} v^{2} Λ^{j} (u, v) d v - {(m_{V}^{j})}^{2}},

κ^{j} ≜ \operatorname{cov} (τ_{1}^{j}, V_{1}^{j}) = 0 .

We investigate the asymptotic behavior of the distribution of the two-dimensional random process

Θ_{t}^{j} ≜ \begin{bmatrix} \int_{[0, t] \times R} μ^{j} (d s, d v) \\ \int_{[0, t] \times R} v μ^{j} (d s, d v) \end{bmatrix} = \begin{bmatrix} \sum_{n \in \mathbb{N}} I (t - τ_{n}^{j}) \\ \sum_{n \in \mathbb{N}} V_{n}^{j} I (t - τ_{n}^{j}) \end{bmatrix}

(11)

as $t \to \infty$. The first component represents the total number of acknowledgments received at the sender over the time interval $[0, t]$; the second component, in turn, stands for the corresponding cumulative RTT value. The author of [42] proved a version of the CLTRRP:

\frac{1}{\sqrt{λ^{j} (u) t}} (Θ_{t}^{j} - \begin{bmatrix} λ^{j} (u) t \\ m_{V}^{j} λ^{j} (u) t \end{bmatrix}) \overset{\text{Law}}{\longrightarrow} \mathcal{N} (\begin{bmatrix} 0 \\ 0 \end{bmatrix}, \begin{bmatrix} 1 & m_{V}^{j} \\ m_{V}^{j} & {(m_{V}^{j})}^{2} + {(σ_{V}^{j})}^{2} \end{bmatrix})

(12)

as $t \to \infty$. In other words, multiplying (12) by $\sqrt{λ^{j} (u)}$ and moving the centering term to the right-hand side, we obtain for rather large $t$

\frac{1}{\sqrt{t}} Θ_{t}^{j} ≃ \mathcal{N} (\begin{bmatrix} λ^{j} (u) \sqrt{t} \\ m_{V}^{j} λ^{j} (u) \sqrt{t} \end{bmatrix}, \begin{bmatrix} λ^{j} (u) & λ^{j} (u) m_{V}^{j} \\ λ^{j} (u) m_{V}^{j} & λ^{j} (u) [{(m_{V}^{j})}^{2} + {(σ_{V}^{j})}^{2}] \end{bmatrix}) .
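The centering and the limit covariance in (12) can be checked by a small Monte Carlo experiment for a single mode. The sketch below assumes a Poisson arrival flow with intensity `lam` and independent RTT values; the Gamma law of the RTT, the function name `normalized_theta`, and all numeric values are hypothetical choices made for illustration only.

```python
import numpy as np

def normalized_theta(lam, t, draw_rtt, m_v, n_paths, rng):
    """Samples of (Theta_t - [lam*t, m_v*lam*t]) / sqrt(lam*t) for a single-mode CPP."""
    counts = rng.poisson(lam * t, size=n_paths)                   # acknowledgments on [0, t]
    totals = np.array([draw_rtt(rng, n).sum() for n in counts])   # cumulative RTT on [0, t]
    theta = np.column_stack([counts, totals])
    center = np.array([lam * t, m_v * lam * t])
    return (theta - center) / np.sqrt(lam * t)

rng = np.random.default_rng(1)
# Hypothetical RTT law: Gamma with mean 0.1 and standard deviation 0.05.
samples = normalized_theta(lam=50.0, t=20.0,
                           draw_rtt=lambda r, n: r.gamma(4.0, 0.025, size=n),
                           m_v=0.1, n_paths=4000, rng=rng)
empirical_cov = np.cov(samples.T)
# Limit covariance from (12): [[1, m_V], [m_V, m_V^2 + sigma_V^2]].
limit_cov = np.array([[1.0, 0.1], [0.1, 0.1**2 + 0.05**2]])
```

Comparing `empirical_cov` with `limit_cov` and the sample mean with zero gives a quick sanity check of the moment characteristics used in the approximation.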

Let us complicate the model by mixing the CPPs $\{(τ_{n}^{j}, V_{n}^{j})\}_{n \in \mathbb{N}}$, $j = \overline{1, N}$, introduced above with probabilities $π = \operatorname{col} (π^{1}, \dots, π^{N})$:

\begin{bmatrix} {\bar{τ}}_{n} \\ {\bar{V}}_{n} \end{bmatrix} = \sum_{j = 1}^{N} X_{0}^{j} \begin{bmatrix} τ_{n}^{j} \\ V_{n}^{j} \end{bmatrix} .

(13)

Here $X_{0} ≜ \operatorname{col} (X_{0}^{1}, \dots, X_{0}^{N}) \in S^{N}$ is an $F_{0}$-measurable random vector independent of $\{(τ_{n}^{j}, V_{n}^{j})\}_{n \in \mathbb{N}}$, $j = \overline{1, N}$, and $X_{0} \sim π$. It is easy to verify that the predictable measure generated by $\{({\bar{τ}}_{n}, {\bar{V}}_{n})\}$, conditioned on $X_{0}$, takes the form

{\bar{μ}}_{p} (ω, d t, d v) = λ (u_{t}) \operatorname{diag} (X_{0}) Λ (u_{t}, v) d t d v .

Please note that the mixed CPP (13) represents a specific case of the observations (4) with a “single mode” MJP $X$: $A (u) \equiv 0$, $X_{0} \sim π$.

Arguing as above, we conclude that for rather large $t$

\frac{1}{\sqrt{t}} \begin{bmatrix} \sum_{k \in \mathbb{N}} I (t - {\bar{τ}}_{k}) \\ \sum_{k \in \mathbb{N}} {\bar{V}}_{k} I (t - {\bar{τ}}_{k}) \end{bmatrix} ≃ \sum_{j = 1}^{N} π^{j} \mathcal{N} (\begin{bmatrix} λ^{j} (u) \sqrt{t} \\ m_{V}^{j} λ^{j} (u) \sqrt{t} \end{bmatrix}, \begin{bmatrix} λ^{j} (u) & λ^{j} (u) m_{V}^{j} \\ λ^{j} (u) m_{V}^{j} & λ^{j} (u) [{(m_{V}^{j})}^{2} + {(σ_{V}^{j})}^{2}] \end{bmatrix}) .

(14)

Therefore, given some distribution (conditional or unconditional) ${\hat{X}}_{s}$ of the MJP state $X_{s}$ at the time instant $s$ and a constant control $u_{q} \equiv u \in U$, $q \in [s, s + h)$, we assume that the cumulative observation increment over the interval $[s, s + h)$ is distributed approximately in the following way:

\frac{1}{\sqrt{h}} \begin{bmatrix} \sum_{k \in \mathbb{N}} I (s + h - τ_{k}) I (τ_{k} - s) \\ \sum_{k \in \mathbb{N}} V_{k} I (s + h - τ_{k}) I (τ_{k} - s) \end{bmatrix} ≃ \sum_{j = 1}^{N} {\hat{X}}_{s}^{j} \mathcal{N} (\begin{bmatrix} λ^{j} (u) \sqrt{h} \\ m_{V}^{j} λ^{j} (u) \sqrt{h} \end{bmatrix}, \begin{bmatrix} λ^{j} (u) & λ^{j} (u) m_{V}^{j} \\ λ^{j} (u) m_{V}^{j} & λ^{j} (u) [{(m_{V}^{j})}^{2} + {(σ_{V}^{j})}^{2}] \end{bmatrix}) .

(15)

By analogy with (15) for the cumulative process, corresponding to the acknowledgment flow (4)

Q_{t} ≜ [\begin{matrix} \sum_{k \in N} I (t - τ_{k}) \\ \sum_{k \in N} V_{k} I (t - τ_{k}) \end{matrix}]

(16)

we propose the following approximate diffusion model

Q_{t} = \int_{0}^{t} D (u_{s}) X_{s} d s + \int_{0}^{t} \sum_{n = 1}^{N} e_{n}^{⊤} X_{s} E_{n}^{\frac{1}{2}} (u_{s}) d W_{s},

(17)

where

D (u) ≜ \begin{bmatrix} λ^{1} (u) & λ^{2} (u) & \dots & λ^{N} (u) \\ m_{V}^{1} λ^{1} (u) & m_{V}^{2} λ^{2} (u) & \dots & m_{V}^{N} λ^{N} (u) \end{bmatrix},

E_{n} (u) ≜ \begin{bmatrix} λ^{n} (u) & λ^{n} (u) m_{V}^{n} \\ λ^{n} (u) m_{V}^{n} & λ^{n} (u) ({(m_{V}^{n})}^{2} + {(σ_{V}^{n})}^{2}) \end{bmatrix} .

Model (17) makes it possible both to solve the MJP state filtering problem given the diffusion and counting observations and to develop the corresponding numerical algorithms.

In contrast to the weak convergence in (12), there is no convergence in (15). First, the right-hand side (RHS) of (15) contains a mathematical expectation that is an increasing function of the interval length $h$. Second, we derive (15) under the hypothesis that the MJP state $X$ remains unchanged over the discretization interval: $X_{q} \equiv X_{s}$, $q \in [s, s + h)$. In the general case, the probability of an MJP state transition tends to 1 as the interval length $h$ increases without bound.

Using the time-discretized observations (4) at the first stage of the control synthesis, i.e., MJP state filtering, presumes the calculation of likelihood ratios for the single Gaussian modes and their mixtures. Therefore, the filtering performance depends on both the “theoretical” pdf (15) and the closeness of the real distribution of the observation increments to (15).
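The role of the per-mode Gaussian likelihoods can be illustrated by a single Bayes update of the mode probabilities given one observation increment. The sketch assumes that the state does not change over the interval of length $h$ and takes the mean and covariance of the increment in mode $j$ as $h λ^{j} [1, m_{V}^{j}]^{⊤}$ and $h λ^{j}$ times the covariance matrix in (12), which is what the centering and scaling in (12) give. The function names and all numbers are hypothetical; this is not the filtering algorithm of the paper, only an illustration of the likelihood computation.

```python
import numpy as np

def mode_log_likelihoods(dq, h, lam, m_v, s_v):
    """Gaussian log-likelihood of the increment dq = (count, cumulative RTT) for every mode."""
    out = np.empty(len(lam))
    for j in range(len(lam)):
        mean = h * lam[j] * np.array([1.0, m_v[j]])
        cov = h * lam[j] * np.array([[1.0, m_v[j]],
                                     [m_v[j], m_v[j] ** 2 + s_v[j] ** 2]])
        r = dq - mean
        out[j] = -0.5 * (r @ np.linalg.solve(cov, r)
                         + np.log(np.linalg.det(cov)) + 2.0 * np.log(2.0 * np.pi))
    return out

def posterior(prior, loglik):
    """Bayes update of the mode probabilities, computed stably in the log domain."""
    w = np.log(prior) + loglik
    w -= w.max()
    p = np.exp(w)
    return p / p.sum()

# Hypothetical two-mode example: a fast mode with short RTT and a slow mode with long RTT.
lam = np.array([100.0, 20.0])
m_v = np.array([0.05, 0.2])
s_v = np.array([0.01, 0.05])
dq = np.array([100.0, 5.0])            # increment observed over h = 1
post = posterior(np.array([0.5, 0.5]), mode_log_likelihoods(dq, 1.0, lam, m_v, s_v))
```

In this example the observed increment equals the mean of the first mode, so nearly all posterior mass is assigned to it.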

We form recommendations for an appropriate choice of the time interval for the discretization of (4). On the one hand, the interval should be long enough to provide appropriate accuracy of the diffusion approximation (15) when there are no MJP state transitions over the time interval. On the other hand, it should be short enough to guarantee a small probability of such state transitions.

In the CLT, the closeness of the limit distribution and the pre-limit one is described by the Berry–Esseen inequality in terms of either the uniform metric or the total variation one [45,46,47]. By contrast, we are interested in the closeness of the corresponding pdfs, and the appropriate results are valid for the “classic” CLT, not for the CLTRRP.

We propose a heuristic technique for choosing the discretization interval length, based on a performance criterion of the distribution approximation.

We refer to the “single mode” processes

Θ_{t}^{j}

and construct the processes

{\bar{Θ}}_{h}^{j} ≜ {(\sqrt{Θ_{h}^{j, 1}})}^{+} \frac{1}{σ_{V}^{j}} (Θ_{h}^{j, 2} - m_{V}^{j} Θ_{h}^{j, 1}) .

(18)

From the definition one can conclude that

{\bar{Θ}}_{h}^{j}

represents the normalized sum of a random number of independent identically distributed normalized random summands. We investigate the closeness of its distribution to the standard Gaussian one depending on the time h.

Below in the filtering algorithm we operate with various likelihood ratios calculated via the pdfs, hence we need to characterize a distance between the pre-limit pdf and its limit one. The precise distance is difficult to calculate, and we must turn to some upper bound of this quantity.

Let

μ (d x)

be some positive measure on

(R, B (R))

, and there exist both the pdf

\frac{d P_{a}}{d μ}

of the pre-limit distribution and the limit one

\frac{d P_{ℓ}}{d μ}

. Then the relative approximation error takes the form

Δ (x) ≜ \frac{|\frac{d P_{a}}{d μ} - \frac{d P_{ℓ}}{d μ}|}{\frac{d P_{ℓ}}{d μ}} (x),

and its average

\int Δ (x) \frac{d P_{ℓ}}{d μ} (x) μ (d x) = V a r (P_{a}, P_{ℓ})

coincides with the total variation distance (TVD) between

P_{a}

and

P_{ℓ}

.
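As a small numerical illustration of the identity above, the following sketch evaluates the averaged relative error for two discrete distributions, taking the counting measure as μ. The function name `tvd` and the example distributions are illustrative assumptions and do not come from the paper. With this normalization the distance lies in [0, 2], and the value 2 is attained for distributions with disjoint supports.

```python
def tvd(p_a, p_l):
    """Total variation distance sum_x |p_a(x) - p_l(x)| for discrete pmfs.

    Both arguments are dicts mapping points to probabilities. The
    normalization follows the text, so the result lies in [0, 2].
    """
    support = set(p_a) | set(p_l)
    return sum(abs(p_a.get(x, 0.0) - p_l.get(x, 0.0)) for x in support)


# Hypothetical example distributions (not taken from the paper).
fair = {0: 0.5, 1: 0.5}
biased = {0: 0.8, 1: 0.2}
shifted = {2: 1.0}

print(tvd(fair, biased))   # partially overlapping distributions
print(tvd(fair, shifted))  # disjoint supports give the maximal value
```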

We use the notation

P^{j} (x, h) ≜ P {{\bar{Θ}}_{h}^{j} ⩽ x}

for the pre-limit distribution function,

P_{n}^{j} (x)

stands for the distribution function of the normalized sum of n independent equally distributed normalized random summands with the pdf

Λ^{j} (u)

, and

Φ (x) ≜ \int_{- \infty}^{x} \frac{1}{\sqrt{2 π}} e^{- \frac{z^{2}}{2}} d z

denotes the distribution function of the standard Gaussian random variable. From the total probability formula, it follows that

P^{j} (x, h) = e^{- λ^{j} (u) h} (I (x) + \sum_{n \in N} \frac{{(λ^{j} (u) h)}^{n}}{n!} P_{n}^{j} (x)),

(19)

where

I (x)

is the Heaviside function.

Proposition 1.

For

λ^{j} (u) h ⩾ \frac{3 + \sqrt{13}}{2}

an approximate upper bound of

V a r (P^{j}, Φ)

can be written as

J^{j} (h) = e^{- λ^{j} (u) h} (2 + C_{1} (2 Φ (- 3) + (\frac{1}{\sqrt{1 - \frac{3}{\sqrt{λ^{j} (u) h}}}} + \frac{1}{\sqrt{1 + \frac{3}{\sqrt{λ^{j} (u) h}}}}))),

(20)

where

C_{1} = C_{1} (Λ^{j} (u, \cdot))

is some parameter.

Proof.

From (19) and the results of [48] (Theorem 1.1) and [49] (Theorem 2.6) the following inequalities are true

V a r (P^{j}, Φ) ⩽ e^{- λ^{j} (u) h} (2 + \sum_{n \in N} \frac{{(λ^{j} (u) h)}^{n}}{n!} V a r (P_{n}^{j}, Φ)) ⩽ e^{- λ^{j} (u) h} (2 + C_{1} \sum_{n \in N} \frac{{(λ^{j} (u) h)}^{n}}{\sqrt{n} n!}),

(21)

where

C_{1} = C_{1} (Λ^{j} (u, \cdot))

is some parameter (see [48,49] for details).

Under the Proposition conditions the approximation of the Poisson distribution by the Gaussian one is valid

\begin{matrix} \sum_{n \in N} \frac{{(λ^{j} (u) h)}^{n}}{\sqrt{n} n!} & \approx \int_{1}^{\infty} \frac{1}{\sqrt{x}} \frac{1}{\sqrt{2 π λ^{j} (u) h}} e^{- \frac{{(x - λ^{j} (u) h)}^{2}}{2 λ^{j} (u) h}} d x \\ ⩽ 2 Φ (- 3) + \int_{λ^{j} (u) h - 3 \sqrt{λ^{j} (u) h}}^{λ^{j} (u) h + 3 \sqrt{λ^{j} (u) h}} (a x + b) \frac{1}{\sqrt{2 π λ^{j} (u) h}} e^{- \frac{{(x - λ^{j} (u) h)}^{2}}{2 λ^{j} (u) h}} d x, \end{matrix}

(22)

where

\{\begin{matrix} a ≜ \frac{\sqrt{λ^{j} (u) h - 3 \sqrt{λ^{j} (u) h}} - \sqrt{λ^{j} (u) h + 3 \sqrt{λ^{j} (u) h}}}{6 \sqrt{λ^{j} (u) h} \sqrt{{(λ^{j} (u) h)}^{2} - 9 λ^{j} (u) h}}, \\ b ≜ \frac{1}{\sqrt{λ^{j} (u) h - 3 \sqrt{λ^{j} (u) h}}} - \frac{λ^{j} (u) h - 3 \sqrt{λ^{j} (u) h} - \sqrt{λ^{j} (u) h - 9}}{6 \sqrt{λ^{j} (u) h + 3 \sqrt{λ^{j} (u) h}}} . \end{matrix}

(23)

Coefficients a and b above correspond to a piecewise linear majorant for

y (x) = \frac{1}{\sqrt{x}}

over the interval

[1, + \infty)

(see Figure 1).

We can calculate the last integral analytically

\begin{matrix} \sum_{n \in N} \frac{{(λ^{j} (u) h)}^{n}}{\sqrt{n} n!} ≲ 2 Φ (- 3) + (1 - 2 Φ (- 3)) (a λ^{j} (u) h + b) ⩽ 2 Φ (- 3) + a λ^{j} (u) h + b \\ = 2 Φ (- 3) + \frac{1}{2 \sqrt{λ^{j} (u) h}} (\frac{1}{\sqrt{1 - \frac{3}{\sqrt{λ^{j} (u) h}}}} + \frac{1}{\sqrt{1 + \frac{3}{\sqrt{λ^{j} (u) h}}}}) . \end{matrix}

(24)

Using the RHS of (24) in (21) we obtain the approximate upper bound (20). This ends the sketch of the proof of the Proposition. □

To characterize the distance between the

Q_{t}

(16) increment distribution and its diffusion approximation (17) we should take into account the chance of the MJP transition during the discretization interval. Let us suppose

X_{t}^{u} = e_{j}

, then, taking into account (20), the upper bound of

V a r (P^{u}, Φ | X_{t} = e_{j})

can be obtained by the total probability formula:

\begin{matrix} V a r (P, Φ | X_{t} = e_{j}) ⩽ J^{j} (u, h) ≜ \\ ≜ e^{(A^{j j} (u) - λ^{j} (u)) h} (2 + C_{1} (2 Φ (- 3) + (\frac{1}{\sqrt{1 - \frac{3}{\sqrt{λ^{j} (u) h}}}} + \frac{1}{\sqrt{1 + \frac{3}{\sqrt{λ^{j} (u) h}}}}))) + 2 (1 - e^{A^{j j} (u) h}) . \end{matrix}

(25)

The second summand in (25) accounts for the possibility that the MJP leaves the state

e_{j}

during the time interval with probability

1 - e^{A^{j j} (u) h}

, and the multiplier 2 is the upper bound of the TVD for any distributions.

To take into account the statistical uncertainty of the current state

X_{t}^{u}

, we must consider the following averaged criterion:

J (u, p_{1}, \dots, p_{N}, h) ≜ \sum_{j = 1}^{N} p_{j} J^{j} (u, h),

(26)

which describes the guaranteeing estimate of distribution distance for the case of the fixed control

u \in U

and

X_{t}^{u} \sim col (p_{1}, \dots, p_{N})

.

From the practical point of view, a “rational” value of the time increment h can be chosen according to one of the following policies:

Numerical analysis of the values $J^{j} (u, h)$ for various $(j, u, h)$ for the choice of an appropriate value for h.
Solution to the individual minimax problems

$J^{j} (u, h) \to min_{h : λ^{j} (u) h ⩾ \frac{3 + \sqrt{13}}{2}, u \in U} max_{u \in U}, j = \bar{1, N}$

with subsequent choice of the maximal h from the set of the individual solutions.
Solution to the general minimax problem

$J (u, p_{1}, \dots, p_{N}, h) \to min_{h : λ^{j} (u) h ⩾ \frac{3 + \sqrt{13}}{2}, u \in U, j = \bar{1, N}} max_{\binom{u \in U,}{(p_{1}, \dots, p_{N}) \in Π}} .$

In this paper, we use the first policy as the most economical one.
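The first policy can be sketched as a plain grid scan. The code below evaluates the expressions (20), (25) and (26) as they are printed above for several candidate values of h. All numerical values (acknowledgment intensities, diagonal entries of the intensity matrix, the state distribution and the parameter C1) are hypothetical placeholders, since the text does not give them at this point. The sketch also requires λh > 9 so that the square roots in (20) are real, which is an assumption of the code rather than a statement from the paper.

```python
import math


def std_normal_cdf(x):
    """Distribution function of the standard Gaussian law."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bound_single_mode(lam, h, c1):
    """Approximate bound (20), implemented as printed in the text."""
    r = 3.0 / math.sqrt(lam * h)
    if r >= 1.0:
        raise ValueError("the sketch needs lam * h > 9 for real square roots")
    tail = 2.0 * std_normal_cdf(-3.0)
    bracket = 1.0 / math.sqrt(1.0 - r) + 1.0 / math.sqrt(1.0 + r)
    return math.exp(-lam * h) * (2.0 + c1 * (tail + bracket))


def bound_with_transitions(lam, a_jj, h, c1):
    """Bound (25); a_jj <= 0 is the diagonal entry of the intensity matrix."""
    stay = math.exp(a_jj * h)  # probability of no exit from the state
    return stay * bound_single_mode(lam, h, c1) + 2.0 * (1.0 - stay)


def averaged_bound(p, lams, a_diag, h, c1):
    """Averaged criterion (26) for a state distribution p."""
    return sum(pj * bound_with_transitions(lj, aj, h, c1)
               for pj, lj, aj in zip(p, lams, a_diag))


# Hypothetical inputs: intensities in 1/s, C1 set to 1 for illustration.
lams = [5000.0, 8000.0, 12000.0]
a_diag = [-0.5, -1.0, -2.0]
p = [0.5, 0.3, 0.2]
for h in (0.002, 0.005, 0.01, 0.05):
    print(h, averaged_bound(p, lams, a_diag, h, c1=1.0))
```

For these placeholder values the exponential factor makes the first summand of (25) negligible, so the scan is dominated by the transition term and the smallest admissible h gives the smallest bound.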

3.3. Optimal Filtering of MJP State Given Counting and Diffusion Observations

In this section, we investigate the filtering problem for the MJP state (1) given the counting observations (2), (3) and the diffusion observations (17). To simplify the presentation and the subsequent analysis of the solution, we introduce the following additional assumptions, which do not restrict generality.

The control $u_{t}$ represents an observable nonrandom càdlàg process.
The noises in $Q_{t}$ are uniformly nondegenerate [50], i.e., $min_{\binom{1 ⩽ n ⩽ N,}{u \in U}} E_{n} (u) > α I$ for some $α > 0$ .
The processes $K_{i j} (u_{t}) ≜ I_{{0}} (E_{i} (u_{t}) - E_{j} (u_{t}))$ , $i, j = \bar{1, N}$ have a finite local variation (here and below $0$ stands for a zero matrix of appropriate dimensionality); $K (u_{t}) ≜ {∥ K_{i j} (u_{t}) ∥}_{i, j = \bar{1, N}}$ is the corresponding $N \times N$ -dimensional matrix-valued function.
The optimal filtering problem is to find a Conditional Mathematical Expectation (CME) ${\hat{X}}_{t}^{u} ≜ E_{} \{X_{t}^{u} | O_{t +}\}$ , where $O_{t} ≜ σ {Y_{s}, Z_{s}, Q_{s}, s \in [0, t]}$ is a natural flow of $σ$ -algebras generated by the observations (2), (3) and (17).

The noise intensity in the observations (17) depends on the estimated state X, and this fact prevents direct application of the known results of optimal nonlinear filtering [37]. To overcome this obstacle, we use a special transformation of the available diffusion observations [28]. Here we present a sketch of this transformation.

The Itô rule makes it possible to obtain the observable quadratic characteristic of Q:

{〈 Q, Q 〉}_{t} = \int_{0}^{t} \sum_{n = 1}^{N} e_{n}^{⊤} X_{s} E_{n} (u_{s}) d s .

(27)

We use the normalized diffusion observations

{\bar{Q}}_{t} ≜ \int_{0}^{t} {(\frac{d {〈 Q, Q 〉}_{s}}{d s})}^{- \frac{1}{2}} d Q_{s} .

(28)

as the first block component of the transformed observations. The model of this process is the following

{\bar{Q}}_{t} = \int_{0}^{t} \bar{D} (u_{s}) X_{s} d s + {\bar{W}}_{t},

(29)

where

\bar{D} (u) ≜ \sum_{n = 1}^{N} E_{n}^{- \frac{1}{2}} (u) D (u) diag e_{n}

, and

{\bar{W}}_{t}

is a standard Wiener process of appropriate dimensionality.
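To make the normalization concrete, the sketch below computes one column of the normalized drift matrix, that is, the product of the inverse square root of a 2×2 diffusion matrix and the matching drift column. It uses the closed-form square root of a symmetric positive definite 2×2 matrix M, namely (M + sI)/t with s = sqrt(det M) and t = sqrt(tr M + 2s), which follows from the Cayley–Hamilton theorem. The function names and the numerical values are hypothetical and serve only as an illustration.

```python
import math


def inv_sqrt_spd_2x2(m):
    """Inverse square root of a symmetric positive definite 2x2 matrix."""
    (a, b), (_, d) = m
    s = math.sqrt(a * d - b * b)        # sqrt(det M), also det of sqrt(M)
    t = math.sqrt(a + d + 2.0 * s)
    ra, rb, rd = (a + s) / t, b / t, (d + s) / t   # entries of sqrt(M)
    return [[rd / s, -rb / s], [-rb / s, ra / s]]  # inverse of sqrt(M)


def normalized_drift_column(e_n, d_n):
    """Column E_n^{-1/2} D^n of the normalized drift matrix."""
    w = inv_sqrt_spd_2x2(e_n)
    return [w[0][0] * d_n[0] + w[0][1] * d_n[1],
            w[1][0] * d_n[0] + w[1][1] * d_n[1]]


# Hypothetical diffusion matrix and drift column for a single state.
E_n = [[100.0, 10.0], [10.0, 1.04]]
D_n = [100.0, 10.0]
print(normalized_drift_column(E_n, D_n))
```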

The quadratic characteristic

〈 Q, Q 〉

contains essential statistical information that should be included in the estimation algorithm. This process is a linear transformation of the estimated MJP state.

It is easy to verify that

F (u_{t}, X_{t}) ≜ \frac{d {〈 Q, Q 〉}_{s}}{d s} |_{s = t +} = \sum_{n = 1}^{N} e_{n}^{⊤} X_{t} E_{n} (u_{t}),

however, the result of direct differentiation is a matrix-valued function of excess dimensionality. All its statistical information is contained in the complete preimage of F:

F = F (u, x) \overset{F^{- 1}}{\to} {e_{n} \in S^{N} : E_{n} (u) = F} .

In [28] we explain in detail how to reduce the “rough” process F to the N-dimensional “compressed” process

H_{t}

, which has the model

H_{t} = L (u_{t}) X_{t},

(30)

where

L (u_{t})

is an

N \times N

-dimensional matrix-valued function with càdlàg components; its rows are orthogonal and contain only 0 or 1.

One can rewrite the process

H_{t}

as a cumulative sum of the jumps occurred at some nonrandom (or

O_{t}

-predictable) moments

τ

(the term

H_{t}^{D}

) and one, which accumulates jumps at the random (totally inaccessible) moments (the term

H_{t}^{R}

):

H_{t} = \underset{≜ H_{t}^{D}}{\underset{⏟}{L (u_{0}) X_{0} + \sum_{τ ⩽ t} Δ L (u_{τ}) X_{τ}}} + \underset{≜ H_{t}^{R}}{\underset{⏟}{\int_{0}^{t} L (u_{s}) d X_{s}}} .

The process

H_{t}^{D}

represents the second block component of the transformed diffusion observations. To obtain the third component we must express

H_{t}^{R}

through the equivalent complex of the counting processes

G_{t} = col (G_{t}^{1}, \dots, G_{t}^{N})

:

G_{t} ≜ \int_{0}^{t} (I - diag H_{s -}) d H_{s} - H_{t}^{D} .

The components of the process have the following properties.

Each component $G_{t}^{n}$ has the martingale representation

$G_{t}^{n} = \int_{0}^{t} 1 Γ_{n} (u_{s}) X_{s} d s + \int_{0}^{t} (1 - L_{n} (u_{s}) X_{s -}) L_{n} (u_{s}) d α_{s}^{u},$

(31)

where $α_{t}^{u}$ is the martingale from the state representation (1), $L_{n} (u) ≜ e_{n}^{⊤} L (u)$ and

$Γ_{n} (u) ≜ diag (L_{n} (u)) Λ^{⊤} (u) (I - diag (L_{n} (u))) .$
${[G^{n}, G^{m}]}_{t} \equiv 0$ for any $n \neq m$ , and ${〈 G^{n}, G^{n} 〉}_{t} = \int_{0}^{t} 1 Γ_{n} (u_{s}) X_{s} d s$ .

Below we present a stochastic system for the CME

{\hat{X}}_{t}

along with its properties.

Proposition 2.

The following assertions are true.

1.: The CME $\hat{X}$ is the unique strong solution to the stochastic system

$\begin{matrix} {\hat{X}}_{t} = {({(H_{0}^{D})}^{⊤} L (u_{0}) π_{0})}^{+} diag (H_{0}^{D}) L (u_{0}) π_{0} + \int_{0}^{t} Λ^{⊤} (u_{s}) {\hat{X}}_{s} d s + \int_{0}^{t} {\hat{k}}_{s} {\bar{D}}^{⊤} (u_{s}) d ω_{s} \\ + \sum_{n = 1}^{N} \int_{0}^{t} (Γ_{n} (u_{s}) - 1 Γ_{n} (u_{s}) {\hat{X}}_{s -} I) {\hat{X}}_{s -} {(1 Γ_{n} (u_{s}) {\hat{X}}_{s -})}^{+} d ν_{s}^{n} \\ + \int_{0}^{t} {\hat{k}}_{s} B^{⊤} (u_{s}) {(B (u_{s}) {\hat{X}}_{s -})}^{+} d {\hat{β}}_{s} + \int_{0}^{t} {\hat{k}}_{s} C^{⊤} (u_{s}) {(C (u_{s}) {\hat{X}}_{s -})}^{+} d {\hat{γ}}_{s} \\ + \sum_{τ ⩽ t} ({({(Δ H_{τ}^{D})}^{⊤} Δ L (u_{τ}) {\hat{X}}_{τ -})}^{+} diag (Δ H_{τ}^{D}) L (u_{τ}) - I) {\hat{X}}_{τ -}, \end{matrix}$

(32)

where

$\begin{matrix} {\hat{k}}_{t} ≜ diag {\hat{X}}_{t} - {\hat{X}}_{t} {({\hat{X}}_{t})}^{⊤} = E_{} \{({\hat{X}}_{t} - X_{t}) {({\hat{X}}_{t} - X_{t})}^{⊤} | O_{t +}\}, \\ ω_{t} ≜ \int_{0}^{t} (d {\bar{Q}}_{s} - \bar{D} (u_{s}) {\hat{X}}_{s} d s), \\ ν_{t}^{n} ≜ \int_{0}^{t} (d G_{s}^{n} - 1 Γ_{n} (u_{s}) {\hat{X}}_{s -} d s), n = \bar{1, N}, \\ {\hat{β}}_{t} ≜ \int_{0}^{t} (d Y_{s} - B (u_{s}) {\hat{X}}_{s -} d s), \\ {\hat{γ}}_{t} ≜ \int_{0}^{t} (d Z_{s} - C (u_{s}) {\hat{X}}_{s -} d s) . \end{matrix}$
2.: The estimate of the maximum a posteriori probability (MAP) ${\tilde{X}}_{t} = e_{n}$ : $n \in \underset{1 ⩽ m ⩽ N}{Argmax} e_{m}^{⊤} {\hat{X}}_{t}$ minimizes the $L_{1}$ -criterion, i.e., ${\tilde{X}}_{t} \in \underset{{\bar{X}}_{t}}{Argmin} E_{} \{∥ {\bar{X}}_{t} - X_{t} ∥_{1}\}$ .
3.: If $E_{n} (u) \neq E_{m} (u)$ for any $n \neq m$ almost everywhere on $[0, t]$ , then ${\hat{X}}_{t} = X_{t}$ $P$ a.s.

Items 1 and 3 of Proposition 2 can be proved by complete analogy with [28] (Theorem 1, Corollary 1), while item 2 is proved in [51].
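Two objects from Proposition 2 are simple functions of the CME and can be sketched directly: the conditional covariance, equal to the diagonal matrix of the estimate minus its outer product with itself, and the MAP estimate from item 2. The function names and the example posterior vector are hypothetical. Ties in the maximum are resolved in favor of the smallest index, which is a choice of this sketch.

```python
def conditional_covariance(x_hat):
    """k_hat = diag(x_hat) - x_hat x_hat^T for a probability vector x_hat."""
    n = len(x_hat)
    return [[(x_hat[i] if i == j else 0.0) - x_hat[i] * x_hat[j]
             for j in range(n)] for i in range(n)]


def map_estimate(x_hat):
    """Unit vector e_n whose index n maximizes the posterior probability."""
    n_star = max(range(len(x_hat)), key=lambda n: x_hat[n])
    return [1.0 if n == n_star else 0.0 for n in range(len(x_hat))]


# Hypothetical posterior distribution over four states.
x_hat = [0.1, 0.6, 0.25, 0.05]
print(map_estimate(x_hat))
print(conditional_covariance(x_hat)[1][1])
```

Each row of the conditional covariance sums to zero because the components of the estimate sum to one, which gives a quick sanity check for an implementation.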

The theoretical assertions above are also meaningful from the practical point of view for subsequent design of the suboptimal control of MJP state under incomplete information. First, the CME

{\hat{X}}_{t}

represents a solution to some closed finite-dimensional stochastic system, by contrast with the general case of the optimal filtering problem [37]. Second, the paths of the CME

{\hat{X}}_{t}

usually are piecewise continuous functions with values in

Π

, meanwhile the MJP X state trajectories are

P

-a.s. piecewise constant functions with values in

S^{N}

. Therefore, we cannot directly substitute the state X by its estimate

\hat{X}

, imposing the separation principle to this control problem. The CME

\hat{X}

can be easily transformed into the MAP estimate

\tilde{X}

whose paths have the same properties as those of X. Assertion 2 of Proposition 2 indicates that the proposed MAP estimate is also

L_{1}

-optimal. Third, if the observation system satisfies the identifiability conditions (see Assertion 3 of Proposition 2), then the MJP state can be restored exactly from the indirect noisy observations. This crucial property makes it possible to reduce the initial control problem with incomplete information to one with complete information. Any numerical realization of the filtering estimate introduces approximation errors; nevertheless, Assertion 3 suggests that small filtering errors lead to acceptable control performance.

At the same time, the results of Proposition 2 are difficult to apply directly. First, the diffusion model (17) only approximates the acknowledgment flow (4), and this approximation is valid and can be applied effectively only to observation increments over time intervals of significant length (see Section 3.2). Second, the process

H_{t}

, playing the key role in the estimation, is not observable directly, and represents a result of some stochastic limit passage since it is based on the quadratic characteristic

〈 Q, Q 〉

. Because the time increment of the diffusion observations is bounded from below, direct calculation of

H_{t}

appears impossible. In the next subsection, based on the time-discretized diffusion observations, we present a special numerical algorithm for nonlinear filtering together with its performance characteristics.

3.4. Numerical Realization of Filtering Algorithm

To construct the numerical algorithm of the MJP state filtering given the combination of both the diffusion and counting observations, we consider a time-invariant version of the observation system (1)–(3), (17) with the observations discretized in time with the time increment

h > 0

(

t_{r} ≜ r h, r \in N

):

X_{t} = X_{0} + \int_{0}^{t} A X_{s} d s + α_{t},

(33)

Y_{r} = \int_{t_{r - 1}}^{t_{r}} B X_{s} d s + (β_{t_{r}} - β_{t_{r - 1}}),

(34)

Z_{r} = \int_{t_{r - 1}}^{t_{r}} C X_{s} d s + (γ_{t_{r}} - γ_{t_{r - 1}}),

(35)

Q_{r} = \int_{t_{r - 1}}^{t_{r}} D X_{s} d s + \int_{t_{r - 1}}^{t_{r}} \sum_{n = 1}^{N} e_{n}^{⊤} X_{s} E_{n}^{\frac{1}{2}} d W_{s},

(36)

and

O_{r} ≜ σ {Y_{n}, Z_{n}, Q_{n}, n ⩽ r}

is a natural filtration generated by the discretized observations.

An assumption that coefficients

A, B, C, D

and E are constant is not too restrictive in practice, because below we construct an MJP control that is constant over the time discretization intervals. Note that the discretized observations

Y_{r}, Z_{r}

and

Q_{r}

are conditionally independent given

F_{t_{r}}^{X} \lor O_{r - 1}

due to the properties of the Wiener-Poisson canonical space and the result of [50] (Lemma 7.5). Specifically, the distribution of

Y_{r}, Z_{r}

and

Q_{r}

depends on the random vector

η_{r} = col (η_{r}^{1}, \dots, η_{r}^{N}) = \int_{t_{r - 1}}^{t_{r}} X_{s} d s

composed of the occupation times of the process X in each state

e_{n}

during the interval

[t_{r - 1}, t_{r}]

. Then

conditional distribution of $Y_{r}$ given $F_{t_{r}}^{X} \lor O_{r - 1}$ is the Poisson one with the parameter $B η_{r}$ ,
conditional distribution of $Z_{r}$ given $F_{t_{r}}^{X} \lor O_{r - 1}$ is the Poisson one with the parameter $C η_{r}$ ,
conditional distribution of $Q_{r}$ given $F_{t_{r}}^{X} \lor O_{r - 1}$ is the Gaussian one with the mean $D η_{r}$ and covariance matrix $\sum_{n = 1}^{N} η_{r}^{n} E_{n}$ .
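The three conditional laws listed above are fully determined by the occupation time vector, so their parameters can be assembled with a few sums. The following sketch does this for a two-dimensional diffusion observation. The function name and all numerical values are hypothetical, and the matrices are stored as nested lists: `D` holds 2 rows of N entries and `E` holds N matrices of size 2×2.

```python
def conditional_parameters(eta, B, C, D, E):
    """Parameters of the conditional laws of Y_r, Z_r and Q_r given eta.

    eta: occupation times of the N states over one step (they sum to h);
    B, C: per-state intensities of losses and timeouts;
    D: 2 x N drift matrix; E: list of N diffusion matrices (2 x 2).
    """
    n_states = len(eta)
    y_rate = sum(B[n] * eta[n] for n in range(n_states))  # Poisson parameter
    z_rate = sum(C[n] * eta[n] for n in range(n_states))  # Poisson parameter
    q_mean = [sum(D[i][n] * eta[n] for n in range(n_states)) for i in range(2)]
    q_cov = [[sum(eta[n] * E[n][i][j] for n in range(n_states))
              for j in range(2)] for i in range(2)]
    return y_rate, z_rate, q_mean, q_cov


# Hypothetical two-state example with step h = 0.1 and one state change.
eta = [0.075, 0.025]
B = [0.1, 5.0]
C = [0.01, 0.5]
D = [[100.0, 50.0], [10.0, 20.0]]
E = [[[100.0, 10.0], [10.0, 1.04]], [[50.0, 20.0], [20.0, 8.5]]]
print(conditional_parameters(eta, B, C, D, E))
```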

Below in the presentation we use the following notations:

$\bar{A} ≜ {max}_{n = \bar{1, N}} | A_{n n} |$ ;
$D ≜ {u = col (u^{1}, \dots, u^{N}) : u^{n} ⩾ 0, \sum_{n = 1}^{N} u^{n} = h}$ is an $(N - 1)$ -dimensional simplex in the space $R^{N}$ ; $D$ is the support of the distribution of the vector $η_{r}$ ;
$Π ≜ {π = col (π^{1}, \dots, π^{N}) : π^{n} ⩾ 0, \sum_{n = 1}^{N} π^{n} = 1}$ is a “probabilistic simplex” formed by the possible values of $π$ ;
$N_{r}^{X}$ is the random number of transitions of the state $X_{t}$ that occurred on the interval $[t_{r - 1}, t_{r}]$ ,
$ρ^{k, ℓ, q} (d u)$ is a conditional distribution of the vector $X_{t_{r}}^{ℓ} I_{{q}} (N_{r}^{X}) η_{r}$ given $X_{t_{r - 1}} = e_{k}$ , i.e., for any $G \in B (R^{N})$ the following equality is true:

$E_{} \{I_{G} (η_{r}) I_{{q}} (N_{r}^{X}) X_{t_{r}}^{ℓ} | X_{t_{r - 1}} = e_{k}\} = \int_{G} ρ^{k, ℓ, q} (d u);$
${∥ α ∥}_{K}^{2} ≜ α^{⊤} K α$ , ${〈 α, β 〉}_{K} ≜ α^{⊤} K β$ ;
$N (q, m, K) ≜ {(2 π)}^{- M / 2} \det^{- 1 / 2} K exp \{- \frac{1}{2} {∥ q - m ∥}_{K^{- 1}}^{2}\}$ is an M-dimensional Gaussian probability density function (pdf) with the expectation m and nondegenerate covariance matrix K;
$P (n, a) ≜ e^{- a} \frac{a^{n}}{n!}$ is a Poisson distribution with the parameter a;
$Υ^{k, j, s} (y, z, q) ≜ \int_{D} P (y, B v) P (z, C v) N (q, D v, \sum_{i = 1}^{N} v^{i} E_{i}) ρ^{k, j, s} (d v)$ .

Below is an assertion introducing the algorithm for calculating the MJP state estimate given the discretized observations

{\hat{X}}_{r} ≜ E_{} \{X_{t_{r}} | O_{r}\}

.

Proposition 3.

The filtering estimate

{\hat{X}}_{r}

can be calculated by the following recursive algorithm

{\hat{X}}_{r}^{j} = \frac{\sum_{n_{1} = 1}^{N} {\hat{X}}_{r - 1}^{n_{1}} \sum_{s_{1} = 0}^{\infty} Υ^{n_{1}, j, s_{1}} (Y_{r}, Z_{r}, Q_{r})}{\sum_{n_{2}, j_{2} = 1}^{N} {\hat{X}}_{r - 1}^{n_{2}} \sum_{s_{2} = 0}^{\infty} Υ^{n_{2}, j_{2}, s_{2}} (Y_{r}, Z_{r}, Q_{r})}, j = \bar{1, N},

(37)

and initial condition

{\hat{X}}_{0} = π_{0} .

(38)

Proof of Proposition 3 can be performed similarly to [28] (Lemma 3).

To construct a numerically realizable algorithm we must restrict the sums both in the numerator and denominator of (37)

{\bar{X}}_{r}^{j} (S) = \frac{\sum_{n_{1} = 1}^{N} {\bar{X}}_{r - 1}^{n_{1}} (S) \sum_{s_{1} = 0}^{S} Υ^{n_{1}, j, s_{1}} (Y_{r}, Z_{r}, Q_{r})}{\sum_{n_{2}, j_{2} = 1}^{N} {\bar{X}}_{r - 1}^{n_{2}} (S) \sum_{s_{2} = 0}^{S} Υ^{n_{2}, j_{2}, s_{2}} (Y_{r}, Z_{r}, Q_{r})}, j = \bar{1, N},

(39)

and obtain the analytical approximation of the Sth order.

We present some summands

Υ

of the low order s:

Υ^{k, j, 0} (y, z, q) = δ_{k j} e^{A_{k k} h} P (y, B^{k} h) P (z, C^{k} h) N (q, h D^{k}, h E_{k}),

\begin{matrix} Υ^{k, j, 1} (y, z, q) = (1 - δ_{k j}) A_{j k} e^{A_{j j} h} \\ \times \int_{0}^{h} e^{(A_{k k} - A_{j j}) v} P (y, B^{k} v + B^{j} (h - v)) P (z, C^{k} v + C^{j} (h - v)) \\ \times N (q, v D^{k} + (h - v) D^{j}, v E_{k} + (h - v) E_{j}) d v, \end{matrix}

\begin{matrix} Υ^{k, j, 2} (y, z, q) \\ = \sum_{i : i \neq k, i \neq j} A_{i k} A_{j i} e^{A_{j j} h} \int_{0}^{h} \int_{0}^{h - v^{k}} e^{(A_{k k} - A_{j j}) v^{k} + (A_{i i} - A_{j j}) v^{i}} P (y, B^{k} v^{k} + B^{i} v^{i} + B^{j} (h - v^{k} - v^{i})) \\ \times P (z, C^{k} v^{k} + C^{i} v^{i} + C^{j} (h - v^{k} - v^{i})) \\ \times N (q, v^{k} D^{k} + v^{i} D^{i} + (h - v^{k} - v^{i}) D^{j}, v^{k} E_{k} + v^{i} E_{i} + (h - v^{k} - v^{i}) E_{j}) d v^{i} d v^{k}, \end{matrix}

where

D^{k}

is the kth column of the matrix D. Other summands are also determined by the total probability formula and have complicated form. Obviously, the integrals above cannot be calculated analytically, and we approximate them by some integral sums

{\tilde{Υ}}^{k, j, s} (y, z, q) ≜ \sum_{ℓ = 1}^{L} P (y, B v_{ℓ}) P (z, C v_{ℓ}) N (q, D v_{ℓ}, \sum_{i = 1}^{N} v_{ℓ}^{i} E_{i}) ϱ_{ℓ}^{k j},

(40)

where

{v_{ℓ}}_{ℓ = \bar{1, L}} \subset D

is a collection of points, and

{ϱ_{ℓ}^{k j}}_{ℓ = \bar{1, L}}

are corresponding weights, such that

\sum_{j = 1}^{N} \sum_{ℓ = 1}^{L} ϱ_{ℓ}^{k j} ⩽ 1

. Therefore, we calculate the filtering estimate by the recursion

{\tilde{X}}_{r}^{j} (S) = \frac{\sum_{n_{1} = 1}^{N} {\tilde{X}}_{r - 1}^{n_{1}} (S) \sum_{s_{1} = 0}^{S} {\tilde{Υ}}^{n_{1}, j, s_{1}} (Y_{r}, Z_{r}, Q_{r})}{\sum_{n_{2}, j_{2} = 1}^{N} {\tilde{X}}_{r - 1}^{n_{2}} (S) \sum_{s_{2} = 0}^{S} {\tilde{Υ}}^{n_{2}, j_{2}, s_{2}} (Y_{r}, Z_{r}, Q_{r})}, j = \bar{1, N},

(41)

and refer to it as the numerical approximation of the Sth order, corresponding to a chosen numerical integration scheme.

Let us fix a time instant t, and consider the asymptotic performance of approximation (41) as

h \to 0

. The performance index is

{sup}_{π \in Π} E_{π} \{∥ {\tilde{X}}_{r} - {\hat{X}}_{r} ∥_{1}\}

, i.e., an average of the

L_{1}

-norm of the filtering error calculated at the step r for the worst initial distribution of the MJP.

Proposition 4.

If the condition

max_{k = \bar{1, N}, (y, z) \in Z_{+}^{2}} \sum_{j = 1}^{N} \int_{R^{2}} |\sum_{s = 0}^{S} ({\tilde{Υ}}^{k, j, s} (y, z, q) - Υ^{k, j, s} (y, z, q))| d q < δ,

holds, then for small enough h

sup_{π \in Π} E_{π} \{∥ {\tilde{X}}_{t / h} - {\hat{X}}_{t / h} ∥_{1}\} ⩽ 2 t (2 \bar{A} \frac{{(\bar{A} h)}^{S}}{(S + 1)!} + \frac{δ}{h}) .

(42)

Proof of Proposition 4 can be performed similarly to [28] (Lemma 4, Theorem 2). The first term in (42) characterizes the error of the analytical approximation: formula (39) takes into account at most S possible state transitions occurred during the time discretization interval

[t_{r - 1}, t_{r}]

. The second term in (42) describes an impact of numerical integration error to the overall performance of the filtering approximation. We can deduce that the effective choice of the integration scheme should provide the equal contribution of both summands in (42).
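The balance between the two summands of (42) can be explored numerically. The sketch below evaluates the right-hand side of (42) and the value of δ at which the numerical integration term equals the analytical truncation term. The function names and the numbers in the example are hypothetical.

```python
import math


def filtering_error_bound(t, h, a_bar, order, delta):
    """Right-hand side of (42) for horizon t, step h and order S = order."""
    analytic = 2.0 * a_bar * (a_bar * h) ** order / math.factorial(order + 1)
    numeric = delta / h
    return 2.0 * t * (analytic + numeric)


def balancing_delta(h, a_bar, order):
    """Integration accuracy delta that equalizes the two summands of (42)."""
    return 2.0 * a_bar * h * (a_bar * h) ** order / math.factorial(order + 1)


# Hypothetical values: horizon 1 s, step 10 ms, maximal exit intensity 2 1/s.
t, h, a_bar, order = 1.0, 0.01, 2.0, 1
delta = balancing_delta(h, a_bar, order)
print(delta, filtering_error_bound(t, h, a_bar, order, delta))
```

Under these assumptions a first-order scheme needs an integration accuracy of order h to the power S + 2, so refining the step without refining the quadrature leaves the second summand dominant.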

For the numerical study we choose the analytical approximation of the 1st order realized by the midpoint scheme:

{\tilde{Υ}}^{k, j, 0} (y, z, q) = Υ^{k, j, 0} (y, z, q),

{\tilde{Υ}}^{k, j, 1} (y, z, q) = (1 - δ_{k j}) A_{j k} e^{\frac{h}{2} (A_{k k} + A_{j j})} h P (y, \frac{h}{2} (B^{k} + B^{j})) P (z, \frac{h}{2} (C^{k} + C^{j})) N (q, \frac{h}{2} (D^{k} + D^{j}), \frac{h}{2} (E_{k} + E_{j})) .
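A minimal sketch of one step of the first-order filter with the midpoint rule is given below for a two-dimensional diffusion observation. The one-jump term is obtained by evaluating the integrand of the first-order summand at v = h/2 and multiplying by h, so the exponential factor combines the exit intensities of both states. The convention that `A[j][k]` is the intensity of the transition from state k to state j, the function names and all numerical values are assumptions of this sketch.

```python
import math


def poisson_pmf(n, a):
    """Poisson probability P(n, a) = exp(-a) a^n / n!."""
    return math.exp(-a) * a ** n / math.factorial(n)


def gauss_pdf_2d(q, m, k):
    """Two-dimensional Gaussian pdf N(q, m, K) with nondegenerate K."""
    det = k[0][0] * k[1][1] - k[0][1] * k[1][0]
    d0, d1 = q[0] - m[0], q[1] - m[1]
    quad = (k[1][1] * d0 * d0 - 2.0 * k[0][1] * d0 * d1
            + k[0][0] * d1 * d1) / det
    return math.exp(-0.5 * quad) / (2.0 * math.pi * math.sqrt(det))


def upsilon(k, j, y, z, q, h, A, B, C, D, E):
    """Zero-jump term for k == j, midpoint one-jump term otherwise."""
    if k == j:
        mean = [h * D[0][k], h * D[1][k]]
        cov = [[h * E[k][r][c] for c in range(2)] for r in range(2)]
        return (math.exp(A[k][k] * h) * poisson_pmf(y, B[k] * h)
                * poisson_pmf(z, C[k] * h) * gauss_pdf_2d(q, mean, cov))
    half = 0.5 * h  # each state is occupied for half of the step
    mean = [half * (D[0][k] + D[0][j]), half * (D[1][k] + D[1][j])]
    cov = [[half * (E[k][r][c] + E[j][r][c]) for c in range(2)]
           for r in range(2)]
    return (A[j][k] * math.exp(half * (A[k][k] + A[j][j])) * h
            * poisson_pmf(y, half * (B[k] + B[j]))
            * poisson_pmf(z, half * (C[k] + C[j]))
            * gauss_pdf_2d(q, mean, cov))


def filter_step(x_prev, y, z, q, h, A, B, C, D, E):
    """One normalized step of the first-order recursion."""
    n = len(x_prev)
    w = [sum(x_prev[k] * upsilon(k, j, y, z, q, h, A, B, C, D, E)
             for k in range(n)) for j in range(n)]
    total = sum(w)
    return [wj / total for wj in w]


# Hypothetical two-state model; the columns of A sum to zero.
h = 0.1
A = [[-1.0, 2.0], [1.0, -2.0]]
B = [0.1, 5.0]
C = [0.01, 0.5]
D = [[100.0, 50.0], [10.0, 20.0]]
E = [[[100.0, 10.0], [10.0, 1.04]], [[50.0, 20.0], [20.0, 8.5]]]
# No losses or timeouts, and a diffusion increment typical of state 0.
print(filter_step([0.5, 0.5], 0, 0, [10.0, 1.0], h, A, B, C, D, E))
```

With the observation chosen to match the mean increment of the first state, the posterior mass concentrates on that state after a single step, which is the expected qualitative behavior.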

4. State-Based Modification of TCP

In this section, we describe the mathematical model of a TCP channel that we later use to simulate some modern TCP versions and compare them with the state-based optimal control policy. The model generally follows that of [52]. Its main distinctive feature is the channel state allocation: we use three states to describe the condition of the wired channel and add one extra state to cover the issues of the wireless connection. This allocation is a reasonable trade-off between a comprehensive connection state model that takes into account all possible features (including the data flows from every channel user, the current packet distribution over all channel hops and buffer queues, and the signal quality in the wireless channel segment) and the feasibility of the mathematical modeling.

Thus, we suppose that the link state from a sender to a receiver is described by a controllable MJP

X_{t}

(1) with four possible states:

$e_{1}$ is assigned for low channel load,
$e_{2}$ is for moderate load,
$e_{3}$ is for wired segment congestion,
$e_{4}$ is for signal fading in the wireless segment.

The intensity matrix

A (u) = ∥ A^{i j} (u) ∥_{i, j = \bar{1, 4}}

is defined based on the following assumptions: the link has a single bottleneck device, which remains the same during the whole transmission, this bottleneck device uses Random Early Detection (RED) queuing discipline [53], its buffer capacity is Q, and the RED threshold of guaranteed packet rejection is

W^{″}

(

W^{″} ⩽ Q

).

We also assume that the wireless connection quality does not depend on the data flow, hence the intensities

A^{\cdot 4}

and

A^{4 \cdot}

corresponding to the transitions from/into the state

e_{4}

are independent of the control

u_{s}

. Furthermore, the direct transitions between the

e_{1}

and

e_{3}

without passing through the

e_{2}

are assumed impossible, i.e.,

A^{13} = A^{31} \equiv 0

.

The controllable components of

A (u_{t})

have the form

\begin{matrix} A^{21} (u_{t}) = \{\begin{matrix} A_{0}^{21} + \frac{C^{21}}{U_{b d p} - u_{t}}, & if u_{t} < U_{b d p}, \\ \bar{A}, & otherwise; \end{matrix} \\ A^{12} (u_{t}) = A_{0}^{12} + C^{12} max (U_{b d p} - u_{t}, 0); \\ A^{32} (u_{t}) = \{\begin{matrix} A_{0}^{32} + \frac{C^{32}}{W^{″} - u_{t}}, & if u_{t} < W^{″}, \\ \bar{A}, & otherwise; \end{matrix} \\ A^{23} (u_{t}) = A_{0}^{23} + C^{23} max (W^{″} - u_{t}, 0), \end{matrix}

where

U_{b d p}

is the control value corresponding to the bandwidth-delay product (BDP), i.e., the maximum window size yielding a throughput equal to the channel bandwidth. The constant

\bar{A}

is a level of intensity which guarantees the state transition during the forthcoming RTT.
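The controllable intensities above translate directly into code. In the sketch below the thresholds 1250 and 1350 are the values of the BDP control and of the upper RED threshold used later in the paper, while the base intensities, the proportionality constants and the guaranteeing level are hypothetical placeholders. The dictionary keys repeat the superscripts of the corresponding matrix entries.

```python
def transition_intensities(u, u_bdp, w_upper, a_bar, a0, c):
    """Controllable entries of the intensity matrix for a window size u.

    a0 and c are dicts keyed by "21", "12", "32", "23" that hold the
    constant terms and the proportionality constants of each entry.
    """
    a21 = a0["21"] + c["21"] / (u_bdp - u) if u < u_bdp else a_bar
    a12 = a0["12"] + c["12"] * max(u_bdp - u, 0.0)
    a32 = a0["32"] + c["32"] / (w_upper - u) if u < w_upper else a_bar
    a23 = a0["23"] + c["23"] * max(w_upper - u, 0.0)
    return {"21": a21, "12": a12, "32": a32, "23": a23}


# Hypothetical constants; only the two thresholds come from the paper.
a0 = {"21": 0.1, "12": 0.1, "32": 0.1, "23": 0.1}
c = {"21": 5.0, "12": 0.01, "32": 5.0, "23": 0.01}
for u in (1000.0, 1250.0, 1400.0):
    print(u, transition_intensities(u, 1250.0, 1350.0, 50.0, a0, c))
```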

The dependence of

A^{j i} (u_{t})

on control

u_{t}

is straightforward. In the state

e_{1}

, the number of packets in the link is less than

U_{b d p}

; and in the state

e_{2}

the “bottleneck” buffer begins to fill. The inverse proportionality of

A^{21} (u_{t})

on

u_{t}

and guaranteeing intensity

\bar{A}

provides the increasing probability of

e_{1} \to e_{2}

transition as

u_{t}

approaches

U_{b d p}

and guarantees the transition when the threshold

U_{b d p}

is reached. The constant additive term

A_{0}^{21}

stands for a chance of the

e_{1} \to e_{2}

transition under low control values

u < U_{b d p}

, which are probable due to the external flows. When

u_{t}

decreases to levels less than

U_{b d p}

, the probability of backward transition

e_{2} \to e_{1}

increases linearly due to the constant flow processing rate. The transition intensities

e_{2} ⇆ e_{3}

act the same way, but with a different threshold, namely

W^{″}

.

The conditional intensities of the acknowledgment arrivals

λ^{j} (u)

depend on the control u and, according to (10), are inversely proportional to the average time between the acknowledgment arrivals:

λ^{j} (u) = \frac{1}{m_{τ}^{j} (u)} .

We assume that if no packets are lost, then during each RTT cycle, the sender receives back the acknowledgments for all the packets currently being sent into the network; hence we assume that the following relation is valid:

m_{τ}^{j} (u) = \frac{m_{V}^{j} (u)}{u} .

The average RTT for each state

m_{V}^{j} (u)

is assumed to be a sum of the following components:

constant propagation delay, $δ_{0}$ ,
average queuing delay caused by external data flows, $m_{V, e x t}^{j}$ ,
average queuing delay caused by the data flow under control, $u \cdot m_{V, s e l f}^{j}$ .

Summing up the assumptions, we have the following relation for the conditional intensity of the acknowledgment arrivals:

λ^{j} (u) = \frac{u}{δ_{0} + m_{V, e x t}^{j} + u m_{V, s e l f}^{j}} .

(43)
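Relation (43) is easy to explore numerically. The sketch below uses the propagation delay of 0.1 s and the per-packet self-induced delay of 8·10⁻⁵ s that the paper adopts for the comparative study, and sets the external queuing delay to zero as a hypothetical simplification. Under the model the intensity grows with the window size and saturates at the reciprocal of the per-packet delay.

```python
def ack_intensity(u, delta0, m_ext, m_self):
    """Conditional acknowledgment intensity (43) for a window size u."""
    return u / (delta0 + m_ext + u * m_self)


delta0, m_self = 0.1, 8e-5   # values used in the comparative study
m_ext = 0.0                  # hypothetical: no external queuing delay
for u in (100.0, 1250.0, 1350.0, 1e6):
    print(u, ack_intensity(u, delta0, m_ext, m_self))
print("saturation level:", 1.0 / m_self)
```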

The counting processes for losses (2) and timeouts (3) can now be defined as thinned versions of the acknowledgment flow with the following conditional intensities:

\begin{matrix} B^{j} (u) = B_{0}^{j} + λ^{j} (u) P_{l}^{j} (u), \\ C^{j} (u) = λ^{j} (u) P_{t o}^{j} . \end{matrix}

(44)

Here

P_{t o}^{j}

denotes the conditional probabilities of a timeout in the corresponding states. For the states

e_{1, 2, 3}

, which are related to the wired part of the link, we assume that the only cause for a timeout is a temporary communication hardware fault; and hence the probabilities for these states are constant and equal to each other:

P_{t o}^{1} = P_{t o}^{2} = P_{t o}^{3}

. In the state

e_{4}

, the timeouts follow the wireless carrier signal fading; hence the probability of a timeout

P_{t o}^{4}

is different but still independent of the control u.

The packet loss conditional probabilities, on the contrary, are the functions of the control u. If the control value is less than the RED threshold

u < W^{″}

, then

P_{l}^{1} (u) = P_{0}, P_{l}^{2} (u) = P_{0} + max (\frac{u - W^{'}}{W^{″} - W^{'}} (P_{1} - P_{0}), 0), P_{l}^{3} (u) = 1, P_{l}^{4} (u) = P_{l}^{4},

where

P_{0}

is the probability of a packet loss in the wired segment during its propagation through the media,

W^{'}

is the lower RED threshold (

W^{'} < W^{″}

). If the threshold of guaranteed packet loss is exceeded, then the loss is inevitable, thus

P_{l}^{j} (u) = 1

for any j, if

u \geq W^{″}

.

To conclude the definition of the loss and timeout intensities, it remains to mention that the additive terms

B_{0}^{j}

in the loss intensity

B (u)

stand for the losses caused by the external flows.
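The loss and timeout intensities can be sketched as follows. The section does not define the constant P₁ or give numerical values for the lower RED threshold and the probabilities, so the code takes all of them as inputs and the example uses hypothetical numbers. The acknowledgment intensity is passed in as an argument so that the sketch stays independent of the particular form of (43).

```python
def loss_probability(j, u, p0, p1, w_lower, w_upper, p_l4):
    """Conditional packet loss probability in state j = 1, ..., 4."""
    if u >= w_upper:       # above the threshold of guaranteed rejection
        return 1.0
    if j == 1:
        return p0
    if j == 2:
        ramp = (u - w_lower) / (w_upper - w_lower) * (p1 - p0)
        return p0 + max(ramp, 0.0)
    if j == 3:
        return 1.0
    return p_l4


def loss_intensity(lam, b0, p_loss):
    """Loss intensity from (44): external losses plus thinned ack flow."""
    return b0 + lam * p_loss


def timeout_intensity(lam, p_to):
    """Timeout intensity from (44)."""
    return lam * p_to


# Hypothetical parameters; only w_upper = 1350 comes from the paper.
p0, p1, w_lower, w_upper, p_l4 = 0.001, 0.1, 1300.0, 1350.0, 0.05
print(loss_probability(2, 1325.0, p0, p1, w_lower, w_upper, p_l4))
print(loss_intensity(6000.0, 0.2, p0), timeout_intensity(6000.0, 1e-4))
```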

5. Comparative Study with Modern Versions of TCP

We have completely described the observation system (1)–(4) and its parameters’ dependence on the control u. Let

O_{t} ≜ σ {Y_{s}, Z_{s}, Q_{s}, 0 ⩽ s ⩽ t}

be the natural filtration generated by the observations available up to the moment t. Generally speaking, any

O_{t}

-predictable nonnegative control

u_{t}

is admissible to (1)–(4).

In this section, we present the control processes, which describe the modern versions of TCP in terms of the presented model of channel state and observations. We also present here a state-based TCP control modification, which is based on the optimal state filtering and optimal control strategy. The section will be concluded by a comparative analysis of the TCP versions’ performance.

In what follows we will assume that the constant values

U_{b d p}

,

W^{″}

,

δ_{0}

and

m_{V, s e l f}^{j}

are selected so as to comply with the link of

C = 100

Mbps capacity, propagation delay of

δ_{0} = 0.1

s, bottleneck queue limit of

Q = 100

packets, and

M S S = 1000

bytes:

U_{b d p} = \frac{10^{6} C δ_{0}}{8 M S S} = 1250, W^{″} = U_{b d p} + Q = 1350, m_{V, s e l f}^{j} = \frac{8 M S S}{10^{6} C} = 8 \cdot 10^{- 5} .
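The arithmetic behind these constants can be reproduced with a short script. The function name and argument names are illustrative; the input values are the ones stated above.

```python
def link_constants(capacity_mbps, delta0, queue_limit, mss_bytes):
    """Return (U_bdp, upper RED threshold, per-packet self-induced delay)."""
    bits_per_packet = 8 * mss_bytes
    packets_per_second = 1e6 * capacity_mbps / bits_per_packet
    u_bdp = packets_per_second * delta0   # packets in flight at full rate
    w_upper = u_bdp + queue_limit         # BDP plus the bottleneck queue
    m_self = 1.0 / packets_per_second     # service time of one packet, s
    return u_bdp, w_upper, m_self


u_bdp, w_upper, m_self = link_constants(100, 0.1, 100, 1000)
print(u_bdp, w_upper, m_self)
```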

5.1. AIMD Scheme and TCP Illinois

In [52] we presented an AIMD type control

u_{t}

policy, which remains the same for the present channel model:

\{\begin{matrix} u_{t} = u_{0} + \int_{0}^{t} I_{[\underline{W}, W_{s -}^{t h})} (u_{s -}) \frac{u_{s -}}{r_{s -}} d s + \int_{0}^{t} I_{[W_{s -}^{t h}, + \infty)} (u_{s -}) \frac{α_{s}}{r_{s -}} d s - \int_{0}^{t} β_{s} u_{s -} d Y_{s} + \int_{0}^{t} (\underline{W} - u_{s -}) d Z_{s}, \\ W_{t}^{t h} = W_{0}^{t h} + \int_{0}^{t} (\frac{1}{2} u_{s -} - W_{s -}^{t h}) d Y_{s}, \end{matrix}

(45)

where

$I_{S} (u)$ is an indicator function equal to one, if $u \in S$ , and zero otherwise,
$\underline{W}$ is the minimal window size,
$W_{t}^{t h}$ is a threshold actuating congestion avoidance phase,
$r_{t}$ is the exponential smoothing estimate of RTT,
$α_{t}$ and $β_{t}$ are $O_{t}$ -predictable coefficients of additive increase and multiplicative decrease.

The first term in (45) describes the slow start mode, the second and the third stand for the linear increase and the multiplicative decrease in the congestion avoidance phase, and the fourth provides the window rollback to the minimal value $\underline{W}$ and the return to the slow start mode when a timeout event occurs.
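The structure of (45) may be easier to follow in a time-discretized form. The sketch below is a plain explicit Euler step, not the simulation code used for the figures; the function name, the step size, and the assumption that at most one event occurs per step (with a timeout taking precedence over a loss) are ours.

```python
def aimd_step(u, w_th, rtt, dt, alpha, beta, w_min, loss=False, timeout=False):
    """One explicit Euler step of (45); returns the updated (u, w_th).

    `loss` / `timeout` flag a jump of Y / Z inside the step. We assume at most
    one event per step and let a timeout take precedence over a loss.
    """
    if timeout:
        # dZ term: roll back to the minimal window, threshold unchanged
        return w_min, w_th
    if loss:
        # dY terms: multiplicative decrease; the threshold jumps to u / 2
        return u - beta * u, 0.5 * u
    if u < w_th:
        u += (u / rtt) * dt       # slow start: exponential growth
    else:
        u += (alpha / rtt) * dt   # congestion avoidance: additive increase
    return u, w_th


# New Reno corresponds to alpha = 1, beta = 0.5 (illustrative initial values).
u, w_th = 2.0, 64.0
for _ in range(100):  # one second without losses, dt = 0.01 s
    u, w_th = aimd_step(u, w_th, rtt=0.1, dt=0.01, alpha=1.0, beta=0.5, w_min=2.0)
u, w_th = aimd_step(u, w_th, rtt=0.1, dt=0.01, alpha=1.0, beta=0.5, w_min=2.0, loss=True)
```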

In the case $α_{t} \equiv 1$ and $β_{t} \equiv 0.5$, Equation (45) represents the New Reno algorithm. The Illinois concave control policy is defined by convex $α_{t}$ and increasing linear $β_{t}$ functions of the average queuing delay $d_{a} = \sum_{j=1}^{4} (m_{V,ext}^{j} + u m_{V,self}^{j}) e_{j}^{T} X_{t}$:

\begin{aligned} α_{t}(d_{a}) &= \begin{cases} α_{max} & \text{if } d_{a} \leqslant d_{1}, \\ \frac{κ_{1}}{κ_{2} + d_{a}} & \text{otherwise}, \end{cases} \\ β_{t}(d_{a}) &= \begin{cases} β_{min} & \text{if } d_{a} \leqslant d_{2}, \\ κ_{3} + κ_{4} d_{a} & \text{if } d_{2} < d_{a} < d_{3}, \\ β_{max} & \text{otherwise}. \end{cases} \end{aligned}

(46)

The parameters $κ_{i}$ and $d_{i}$ and other details of the Illinois control scheme can be found in [6]. The most important parameters are the bounds of the additive increase and multiplicative decrease coefficients, which in the standard implementation are set to $[α_{min}, α_{max}] = [0.3, 10]$ and $[β_{min}, β_{max}] = [0.125, 0.5]$. In Figure 2, we present the simulation results for the Illinois TCP control policy with these standard parameters. The upper plot presents the dynamics of the channel parameters, including the RTT (in red), losses (black triangles), and timeouts (red crosses). The fill color indicates the channel state: white for idle, green for moderate load, red for congestion in the wired segment, and grey for signal fading in the wireless segment. The lower plot shows the control dynamics and the critical thresholds: $U_{bdp}$, which corresponds to the bandwidth-delay product of the channel, and the lower bound of buffer overflow $W^{″} = U_{bdp} + Q$.
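The coefficient functions in (46) can be sketched as follows. The values of $κ_{i}$ are not listed in this section, so the sketch derives them from the requirement that $α_{t}$ and $β_{t}$ be continuous in $d_{a}$; the maximal delay `d_m` at which $α_{t}$ reaches $α_{min}$ and the threshold values in the example are our illustrative assumptions, not the settings used in the simulations.

```python
def illinois_kappas(alpha_min, alpha_max, beta_min, beta_max, d1, d2, d3, d_m):
    """Return (kappa1, kappa2, kappa3, kappa4) that make (46) continuous.

    d_m is a hypothetical maximal queuing delay at which alpha equals alpha_min.
    """
    kappa1 = (d_m - d1) * alpha_min * alpha_max / (alpha_max - alpha_min)
    kappa2 = (d_m - d1) * alpha_min / (alpha_max - alpha_min) - d1
    kappa3 = (beta_min * d3 - beta_max * d2) / (d3 - d2)
    kappa4 = (beta_max - beta_min) / (d3 - d2)
    return kappa1, kappa2, kappa3, kappa4


def illinois_alpha(d_a, alpha_max, d1, kappa1, kappa2):
    """Additive increase coefficient: constant, then convex decreasing."""
    return alpha_max if d_a <= d1 else kappa1 / (kappa2 + d_a)


def illinois_beta(d_a, beta_min, beta_max, d2, d3, kappa3, kappa4):
    """Multiplicative decrease coefficient: constant, linear, constant."""
    if d_a <= d2:
        return beta_min
    if d_a < d3:
        return kappa3 + kappa4 * d_a
    return beta_max


# Illustrative thresholds (assumed), expressed as fractions of d_m.
d_m = 0.008
d1, d2, d3 = 0.01 * d_m, 0.1 * d_m, 0.8 * d_m
k1, k2, k3, k4 = illinois_kappas(0.3, 10.0, 0.125, 0.5, d1, d2, d3, d_m)
print(illinois_alpha(0.5 * d_m, 10.0, d1, k1, k2))
print(illinois_beta(0.5 * d_m, 0.125, 0.5, d2, d3, k3, k4))
```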

One can notice that, by processing only the RTT information, the algorithm succeeds in identifying $U_{bdp}$ and becomes much more prudent once the bottleneck buffer starts to fill. This results in long periods of relatively high transmission rates without buffer overflows and with rare losses. Nevertheless, during the intervals when the channel is idle, the control grows too slowly, which results in underuse of the channel resources and, in the end, in a lower average transmission rate.

5.2. TCP CUBIC

In contrast with TCP Illinois, this version of TCP does not rely on RTT observations most of the time. Instead, it treats the control value at which the last loss occurred,

W_{t}^{max} = \int_{0}^{t} (u_{s-} - W_{s-}^{max}) dY_{s},

as the control that yields the highest network utilization and tends to form a plateau in the vicinity of this point. To that end, it keeps track of the time elapsed since the last loss or timeout,

T_{t}^{loss} = t - \int_{0}^{t} t \, dY_{s} - \int_{0}^{t} t \, dZ_{s},

and sets the control according to a cubic function of $T_{t}^{loss}$ that forms two regions: a concave region, in which the control returns to the last maximum value $W_{t}^{max}$, and a convex region of network probing, in which the control grows faster as the time without losses increases. Upon a loss event, the control is reduced according to a constant multiplicative decrease coefficient $β$, and when a timeout occurs, the control is reset to the minimal window size $\underline{W}$. Summing up, the TCP CUBIC control can be represented as follows:

u_{t} = W_{t}^{max} + C \left(T_{t}^{loss} - \left(\frac{W_{t}^{max} (1 - β)}{C}\right)^{\frac{1}{3}}\right)^{3} - \int_{0}^{t} β u_{s-} dY_{s} + \int_{0}^{t} (\underline{W} - u_{s-}) dZ_{s},

(47)

where $C$ is a constant that determines the aggressiveness of the control growth: with higher values of $C$ (for example, $C = 4.0$), CUBIC tends to be more aggressive, which can be quite useful in high-BDP networks.
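The cubic part of (47) can be illustrated by a short sketch. It covers only the deterministic growth between events, not the jump terms, and the name `scale` stands for the constant $C$ of (47) to avoid confusion with the link capacity. With this parametrization, the window implied by the cubic term immediately after a loss is $β W^{max}$, and the last maximum is reached again after $(W^{max}(1 - β)/C)^{1/3}$ seconds.

```python
def cubic_window(t_loss, w_max, beta, scale):
    """Cubic term of (47): window as a function of the time since the last loss."""
    # Time needed to climb back to w_max; the curve is concave before it
    # and convex (probing) after it.
    k = (w_max * (1.0 - beta) / scale) ** (1.0 / 3.0)
    return w_max + scale * (t_loss - k) ** 3


# Illustrative values only.
for t in (0.0, 2.0, 4.0, 6.0, 8.0):
    print(t, cubic_window(t, w_max=1250.0, beta=0.9, scale=4.0))
```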

In Figure 3, we present the simulation results for the TCP CUBIC control with the multiplicative decrease coefficient $β = 0.9$ and the scale constant $C = 4.0$. It should be noted that this simulation is based on a more precise model of the protocol described in [38] and takes into account such details as the TCP-friendly region and the fast convergence heuristics. These details are not reflected in Equation (47) to avoid unnecessary complications. As in the previous figure, the upper plot presents the channel dynamics (RTT, losses, timeouts, and state), and the lower plot shows the dynamics of the control.

One can see that TCP CUBIC manages to keep the control close to the desired value $U_{bdp}$, allowing fast recovery after losses. At the same time, the probing phase, which is symmetrical to the recovery phase, is too aggressive, and the average throughput would benefit from longer “plateau” periods. Another advantage worth mentioning is the ability to adjust to dramatic changes in the medium: in contrast with TCP Illinois, the CUBIC protocol keeps the control at low values throughout the whole period of wireless signal degradation, which results in fewer losses.

5.3. TCP Compound

The TCP Compound algorithm tries to benefit from both the loss-based and the delay-based approaches. To that end, the authors enhance the standard AIMD congestion avoidance scheme with an additional component, which allows faster growth on an idle channel, where the standard AIMD control underuses the resources [40]. When congestion is detected, the window is adjusted to avoid packet losses. To detect congestion, the TCP Compound scheme compares the estimated number of backlogged packets (the bottleneck queue size) $d_{t}$ with a known threshold value $γ$. The estimate of the queue size is computed as follows:

d_{t} = u_{t} \left(1 - \frac{V_{t}^{min}}{V_{t}}\right),

where $V_{t}$ is the current RTT value and $V_{t}^{min}$ is the minimum registered RTT value.

The entire TCP Compound control scheme can be represented by the following expression:

\begin{aligned} u_{t} = {} & u_{0} + \int_{0}^{t} I_{[\underline{W}, W_{t}^{th})}(u_{s-}) \frac{u_{s-}}{r_{s-}} ds \\ & + \int_{0}^{t} I_{[W_{t}^{th}, +\infty)}(u_{s-}) \left(I_{[0, γ)}(d_{s-}) u_{s-}^{κ} \frac{α}{r_{s-}} - I_{[γ, +\infty)}(d_{s-}) ζ d_{s-}\right) ds \\ & - \int_{0}^{t} β u_{s-} dY_{s} + \int_{0}^{t} (\underline{W} - u_{s-}) dZ_{s}, \end{aligned}

(48)

where

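$I_{[\underline{W}, W_{t}^{th})}(u_{s-})$ is the slow start indicator,
$I_{[0, γ)}(d_{s-})$ and $I_{[γ, +\infty)}(d_{s-})$ are the congestion indicators: the former equals one while the backlog estimate is below the threshold $γ$, and the latter equals one once the estimate reaches it,
$α$, $β$, $κ$, $ζ$ are tunable protocol parameters.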

In (48), the first term describes the slow start mode, the second term reflects the growth phase and correction upon congestion detection, the third stands for the multiplicative decrease, and the fourth provides the window rollback and return to the slow start mode when a timeout event occurs.
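The backlog estimate and the continuous part of (48) can be sketched as follows. The function names are ours, and the sketch returns only the growth rate of the window between loss and timeout events; the jump terms behave as in the AIMD scheme.

```python
def backlog_estimate(u, rtt, rtt_min):
    """Estimated number of backlogged packets d_t = u_t (1 - V_min / V_t)."""
    return u * (1.0 - rtt_min / rtt)


def compound_drift(u, w_th, r, d, gamma, alpha, kappa, zeta):
    """Integrand of the ds-terms in (48): window growth rate between events."""
    if u < w_th:
        return u / r                    # slow start
    if d < gamma:
        return alpha * u ** kappa / r   # no congestion detected: scalable increase
    return -zeta * d                    # congestion detected: reduce by the backlog


# Illustrative values only.
d = backlog_estimate(u=1300.0, rtt=0.104, rtt_min=0.1)
print(d, compound_drift(1300.0, 64.0, 0.104, d, gamma=80.0, alpha=0.125, kappa=0.75, zeta=1.0))
```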

In Figure 4, we present the simulation results for the TCP Compound protocol with the standard parameter values $α = β = 0.125$, $κ = 0.75$, $ζ = 1.0$. The backlog estimate threshold for congestion indication is set to $γ = 80$. The upper plot presents the channel dynamics (RTT, losses, timeouts, and state), and the lower plot shows the dynamics of the control (in black) and the estimated backlog size $d_{t}$ (in blue). The figure illustrates the correction of the control when the backlog size estimate reaches the threshold and the high control values when the bottleneck buffer queue is assumed to be empty. It should be noted that TCP Compound, like the Illinois version, fails to adapt quickly to the wireless signal degradation, demonstrating high instability and a large number of losses in this channel state.

5.4. TCP BBR

The TCP BBR algorithm is purely delay-based [54]. It is designed to keep the total amount of data in the channel equal to the BDP. At this load, a connection runs with the highest throughput and the lowest delay. The BDP is estimated as the product of $RTprop$ (the round-trip propagation time) and $BtlBw$ (the bottleneck bandwidth, or delivery rate). The propagation time is estimated by the minimum RTT registered over a long window:

RTprop_{t} = \min \{RTT_{s}\}, \quad s \in [t - W_{R}, t],

where $W_{R}$ typically varies from tens of seconds to minutes. To estimate the delivery rate, BBR calculates the ratio of the amount of data delivered to the time elapsed since the start of the delivery. Since this ratio is calculated for every acknowledgment received, it is natural to take the data “inflight” at the moment the packet was sent as the amount delivered and the RTT of this acknowledgment as the elapsed time. The estimated delivery rate is then the maximum of such ratios taken over a period $W_{B}$ equal to 6–10 RTTs:

BtlBw_{t} = \max \left\{\frac{u(s - RTT_{s})}{RTT_{s}}\right\}, \quad s \in [t - W_{B}, t].
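The two windowed estimates can be sketched as below. This is a simplified illustration rather than the BBR implementation: the per-acknowledgment sample format, a list of `(time, rtt, inflight_at_send)` tuples, is an assumption made for the example.

```python
def bbr_estimates(samples, t, w_r, w_b):
    """Windowed BBR estimates at time t.

    samples: list of (time, rtt, inflight_at_send) tuples, one per acknowledgment
             (a hypothetical format chosen for this sketch).
    Returns (RTprop_t, BtlBw_t); an estimate is None if its window is empty.
    """
    rtts = [rtt for (s, rtt, _) in samples if t - w_r <= s <= t]
    rates = [inflight / rtt for (s, rtt, inflight) in samples if t - w_b <= s <= t]
    rt_prop = min(rtts) if rtts else None   # minimum RTT over the long window W_R
    btl_bw = max(rates) if rates else None  # maximum delivery rate over W_B
    return rt_prop, btl_bw


# Illustrative samples only.
acks = [(0.0, 0.12, 1000.0), (5.0, 0.10, 1250.0), (9.5, 0.11, 1300.0), (9.9, 0.125, 1250.0)]
print(bbr_estimates(acks, t=10.0, w_r=10.0, w_b=1.0))
```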

The main problem of this approach is that the propagation time and the delivery rate cannot be observed at the same time. Indeed, the bottleneck buffer must be empty to observe RTT values close to the propagation time, whereas it must be overfilled to observe the capacity of the channel. This problem is solved by two modes of the steady-state regime: ProbeBW and ProbeRTT. In ProbeBW, the algorithm cycles through eight phases with the pacing gain values $p_{t} = (5/4, 3/4, 1, 1, 1, 1, 1, 1)$. The length of each phase is equal to the current estimate of the propagation time $RTprop_{t}$. Thus, the capacity of the channel is reached by a periodic increase of the sending rate followed by a rollback that drains the queue. ProbeRTT is turned on when the value of $RTprop_{t}$ has not been updated for a long time. In this mode, the transmission almost stops for a short time to drain the queue completely. Simulation experiments show that in the present model, the latter mode is redundant, since BBR maintains a very precise estimate of the propagation delay while spending the whole time in the ProbeBW mode. In addition, we excluded the Startup and Drain modes from consideration, since they are usually very short.

Thus, finally, the BBR control is defined as follows:

u_{t} = RTprop_{t} \cdot BtlBw_{t} \cdot p_{t}^{T} e[\lfloor t / RTprop_{t} \rfloor \% 8 + 1],

(49)

where $e[k] \in R^{8}$ is a vector with unity in the $k$-th place and zeros in all others, $\lfloor \cdot \rfloor$ is the integer part, and $\%$ is the modulo operator.
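The gain cycling in (49) amounts to an index lookup, as the following sketch shows. Taking the integer part of $t / RTprop_{t}$ before the modulo operation is our reading of the formula, and the code uses a 0-based phase index instead of the 1-based index of $e[k]$.

```python
import math

# ProbeBW pacing gains quoted in the text.
PACING_GAINS = (5 / 4, 3 / 4, 1, 1, 1, 1, 1, 1)


def bbr_control(t, rt_prop, btl_bw, gains=PACING_GAINS):
    """BBR control (49): the BDP estimate scaled by the current pacing gain."""
    phase = int(math.floor(t / rt_prop)) % len(gains)  # 0-based phase index
    return rt_prop * btl_bw * gains[phase]


# Illustrative values: RTprop = 0.1 s, BtlBw = 12500 packets/s.
for t in (0.05, 0.15, 0.25, 0.85):
    print(t, bbr_control(t, rt_prop=0.1, btl_bw=12500.0))
```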

In Figure 5, we present the simulation results for the TCP BBR protocol. The upper plot presents the channel dynamics (RTT, losses, timeouts, and state), and the lower plot shows the dynamics of the control (in black) and the estimate of the BDP control equal to $RTprop_{t} \cdot BtlBw_{t}$ (in blue). One can notice that this estimate is quite precise; nevertheless, the channel is congested almost the whole time. This means that the BBR algorithm is too aggressive for the parameters of the channel at hand: the bottleneck buffer is too small to accommodate the periodic $25\%$ increase of the sending rate.

5.5. State-Based TCP

To obtain the state-based TCP control strategy, the optimization problem (6) needs to be solved for some predefined gains (instantaneous and terminal) and transmission expenses.

It is natural to bind the transmission expense function $ξ = (ξ^{1}, \dots, ξ^{4})^{T}$ to the intensity of losses, which we aim to minimize; hence, we set

ξ^{j}(u) = k^{j} B^{j}(u),

(50)

where $k^{1}, \dots, k^{4}$ are coefficients that reflect the gravity of losses in particular channel states.

We take the same instantaneous gain as in [36]:

ϕ^{j}(u) = - \frac{a^{j}}{m_{V}^{j}(u) u} = - \frac{a^{j}}{δ_{0} + m_{V,ext}^{j} + u m_{V,self}^{j}},

(51)

where $a^{1}, \dots, a^{4}$ are coefficients that define the utility of the traffic depending on the channel state.

Analyzing the behavior of the TCP versions described earlier in the present paper, we may conclude that the most beneficial state in terms of throughput and losses is $e_{2}$ (moderate load). Hence, it is natural to design the state-based version with the goal of spending most of the time in this state. Terminal gains $ψ^{j}$ satisfying the condition $\max_{j} ψ^{j} = ψ^{2}$ reflect this idea.

In Figure 6 (left), we present a solution to problem (6) with the transmission expenses and instantaneous gains given by (50) and (51) with $k = (10^{-4}, 10, 10^{2}, 1)^{T}$ and $a = (100, 100, 1, 100)^{T}$. The terminal gains are $ψ = -10^{6} \cdot (2, 1, 2, 4)^{T}$, and the right bound of the observation interval is set to a rather small value equal to the propagation delay, $T = δ_{0} = 0.1$, so that the impact of the terminal gains on the criterion is more pronounced. The controls for the states $e_{1}$ (idle), $e_{2}$ (moderate load), $e_{3}$ (congestion), and $e_{4}$ (wireless signal fading) are shown in grey, green, red, and black, respectively.

One can observe that the optimal control we obtained is almost constant. This is a very useful property in terms of the scalability of the results. Indeed, the control strategy equal to the mean of the optimal controls

{\bar{u}}_{t} = \frac{1}{T} \int_{0}^{T} u_{s} d s,

(52)

does not depend on the interval on which the original optimization problem (6) is defined.

In Figure 6 (right), we present three plots that illustrate the behavior of the state occupation probabilities of the channel $X_{t}$ with the constant controls (52) given three different initial states: $X_{0} = e_{1}$, $X_{0} = e_{2}$, and $X_{0} = e_{3}$. The color scheme is the same: grey, green, red, and black lines show the occupation probabilities of the states $e_{1}$, $e_{2}$, $e_{3}$, and $e_{4}$, respectively. Solid lines show the probabilities obtained by solving the Kolmogorov equation, and dotted lines show the same probabilities obtained through Monte Carlo sampling (with 1000 trajectories). One can see that even on a longer time interval ($T = 5$ s), the goal of the state-based control is achieved: from any given initial condition, the channel reverts to (or stays in) the most favorable state $e_{2}$.
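For a constant control, the occupation probabilities solve the Kolmogorov forward equation $\dot{p}_{t} = Λ^{T} p_{t}$, where $Λ$ is the transition intensity matrix of the channel under this control. The sketch below integrates this system with an explicit Euler scheme. The intensity matrix in the example is purely hypothetical: it is not the matrix of the presented model and was chosen only so that $e_{2}$ is the most probable state in the long run.

```python
def occupation_probabilities(generator, p0, horizon, steps):
    """Integrate dp/dt = Lambda^T p with explicit Euler and return p(horizon).

    generator[i][j] is the transition intensity from state i to state j;
    each row must sum to zero.
    """
    n = len(p0)
    dt = horizon / steps
    p = list(p0)
    for _ in range(steps):
        dp = [sum(p[i] * generator[i][j] for i in range(n)) for j in range(n)]
        p = [p[j] + dt * dp[j] for j in range(n)]
    return p


# Hypothetical intensities (1/s) for the states e1..e4; not taken from the paper.
LAMBDA = [
    [-2.0, 1.9, 0.0, 0.1],
    [0.2, -0.5, 0.2, 0.1],
    [0.0, 3.0, -3.1, 0.1],
    [0.5, 0.5, 0.0, -1.0],
]

for start in ([1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0]):
    print(occupation_probabilities(LAMBDA, start, horizon=5.0, steps=5000))
```

Comparing such a solution with the empirical state frequencies of simulated trajectories is the kind of consistency check that the solid and dotted lines in Figure 6 (right) represent.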

In Figure 7, we present the simulation results for the state-based control policy. The upper plot presents the channel dynamics (RTT, losses, timeouts, and state), and the lower plot shows the optimal channel state estimate $\hat{X}_{t}$ as a stack plot: the height of the white/green/red/grey area at a given time corresponds to the conditional probability of the idle/moderate load/congestion/wireless signal fading state. This plot demonstrates that the estimates are of good quality and that the hidden channel state can be adequately recovered from the available information. In the lower plot of Figure 7, we also show the dynamics of the control

\hat{u}_{t} = \bar{u}_{t}^{T} \hat{X}_{t},

where $\bar{u}_{t}$ is given by (52). One can see that even on a longer interval, the main property of the proposed control strategy is preserved: the channel spends most of the time in the state $e_{2}$, which results in better throughput and fewer losses.
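The two steps that turn the solution of problem (6) into the applied control, namely the time averaging (52) and the weighting of the per-state controls by the filtering estimate, can be sketched as follows. The per-state control values and the estimate in the example are placeholders, not the values obtained in the paper.

```python
def mean_control(times, controls):
    """Time average (52) of one state's control, computed by the trapezoidal rule."""
    total = 0.0
    for k in range(1, len(times)):
        total += 0.5 * (controls[k] + controls[k - 1]) * (times[k] - times[k - 1])
    return total / (times[-1] - times[0])


def state_based_control(u_bar, x_hat):
    """Control u_hat = u_bar^T X_hat: per-state controls weighted by the estimate."""
    return sum(u * p for u, p in zip(u_bar, x_hat))


# Placeholder per-state controls for e1..e4 and a placeholder filter estimate.
u_bar = [1500.0, 1250.0, 900.0, 20.0]
x_hat = [0.1, 0.8, 0.1, 0.0]
print(state_based_control(u_bar, x_hat))
```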

5.6. Comparison

To compare the performance of the TCP control schemes discussed above, we use statistical modeling. The performance metrics, namely the average throughput (a measure of the effectiveness of bandwidth usage) and the loss percentage (a measure of the predisposition to congestion, which affects other users), are calculated on samples long enough to make the variance negligible. This approach is preferable to averaging over many short-term samples, since it diminishes the effect of the transient phases: the initial probing for the available channel characteristics is implemented differently in different TCP versions but is an essential part of all of them.

On samples of $10^{6}$ seconds, we compare the state-based control with the TCP Illinois, CUBIC, Compound, and BBR versions. To make the comparison fairer, we vary, where possible, the parameters of the TCP control algorithms to achieve better performance. For TCP CUBIC, we take three values of the multiplicative decrease coefficient, $β \in \{0.7, 0.8, 0.9\}$; for TCP Compound, we consider nine values of the backlog estimate threshold, $γ \in \{10, 20, 30, 40, 50, 60, 70, 80, 90\}$. The other parameters of the protocols are the same as defined in Section 5.2 and Section 5.3, since varying them has little or negative effect on the performance.

For the state-based version described in Section 5.5, one can tune the protocol behavior by choosing different optimization criteria (6). Nevertheless, since in our case the optimal control is constant, instead of varying the coefficients of the transmission expenses (50) and the instantaneous gain (51), we can directly manipulate the constant control values assigned to the channel states. The experiments show that changing the controls for the states $e_{1}$, $e_{2}$, $e_{3}$, which correspond to the wired part of the transmission channel, makes the performance worse. At the same time, varying the control for the state $e_{4}$ (wireless signal fading) can be beneficial; hence, we consider four cases: $u_{t}^{4} \in \{20, 50, 100, 200\}$.

The simulation results are summarized in Figure 8, which presents the average throughput and the loss percentage, and are detailed in Table 1, which also lists the control algorithm parameters and the state occupation times.

One can immediately observe the same occupation time value for the state $e_{4}$, which is an indirect indicator that the chosen simulation sample length is sufficient: since the transitions to and from the state of wireless signal fading do not depend on the control values, the limit probability of this state should be the same for all algorithms.

The highest occupation time for the state $e_{2}$ of moderate channel load is demonstrated by the state-based control. This, in turn, allows this control algorithm to demonstrate better performance: for $u_{t}^{4} = 20$, the losses are minimal, and the average throughput is the second best. It should be noted that the best throughput value, demonstrated by the BBR protocol, is only possible at the cost of huge losses. This is a characteristic feature of this control algorithm on shallow buffers [55]: it is too aggressive for a channel with the chosen characteristics, and a small buffer cannot accommodate frequent 25% speed jumps.

The last point worth mentioning is the ability of the state-based protocol to be tuned specifically for the cases of wireless channel issues. Depending on the application, it may try to maintain the maximal possible transmission rate at the cost of huge losses or, vice versa, drop the speed and wait until the connection recovers.

6. Conclusions

The class of controllable Markov jump processes, equipped with the stochastic analysis framework, represents an effective tool for the description of a TCP-governed communication connection. The hidden channel state is described by a Markov jump process with a finite state space, characterizing both the current channel load and the physical “health status”. The state equation allows both for the inclusion of various existing congestion control algorithms (Illinois, CUBIC, Compound, BBR, etc.) and for the incorporation of novel ones.

The available observations represent the Markov jump processes, namely the Cox processes of the packet losses and timeouts and compound Poisson processes of the packet reception acknowledgments.

The available mathematical framework makes it possible to design the complete technological chain of the TCP congestion control optimization, namely:

to describe properly the congestion control problem as the stochastic control one,
to solve the problem above in the case of complete information under the admissible controls with geometric constraints,
to simplify the mathematical model of available observations, replacing the high-frequency packet acknowledgments flow by its diffusion limit,
to solve the connection state filtering by the available observations and obtain high-precision state estimates,
to design effective numerical algorithms for the filtering and control problems solution,
to apply the separation principle and the loop of congestion control synthesis, using the connection state estimates instead of their exact values.

The result of this optimization represents the proposed state-based version of TCP. The paper contains a comparative analysis of the proposed algorithm against the other contemporary TCP versions and demonstrates its advantages.

The potential of the controllable Markov jump processes for the description of the transport and application layer communication protocols is far from being exhausted. In the future, one can use it both for the enhancement of the existing protocols (see, e.g., multi-path TCP [56]) and for the development of new ones (see, e.g., “TCP-free” protocols such as QUIC [57]).

In conclusion, we should also note that the mathematical potential of Markov chains and Markov jump processes allows designing complete technological chains “mathematical model, properly formulated mathematical problem, theoretical solution, efficient numerical algorithm” to solve many applied problems of analysis, estimation, and control in such areas as biology [58,59,60], epidemiology [61,62,63], inventory control [64], mathematical finance [65], insurance [66,67], etc.

Author Contributions

Conceptualization, A.B. (Andrey Borisov), I.S.; methodology, A.B. (Andrey Borisov), G.M.; software, G.M.; validation, A.B. (Alexey Bosov); formal analysis and investigation, A.B. (Andrey Borisov), G.M.; writing—original draft preparation, A.B. (Andrey Borisov), G.M.; writing—review and editing, A.B. (Alexey Bosov), I.S.; visualization, G.M.; supervision, A.B. (Alexey Bosov), I.S. All authors have read and agreed to the published version of the manuscript.

Funding

The work of Andrey Borisov, Alexey Bosov, and Gregory Miller was partially supported by the Russian Foundation of Basic Research (RFBR Grant No. 19-07-00187-A).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

BBR	Bottleneck Bandwidth and RTT
BDP	bandwidth-delay product
CLT	central limit theorem
CLTRRP	central limit theorem for renewal-reward processes
CME	conditional mathematical expectation
CPP	compound Poisson process
cwnd	congestion window size
MAP	maximum a posteriori probability
MJP	Markov jump process
pdf	probability density function
RHS	right-hand side
RTO	retransmission timeout
RTT	round-trip time
TCP	Transmission Control Protocol
TVD	the total variation distance

References

Cerf, V.; Kahn, R. A Protocol for Packet Network Intercommunication. IEEE Trans. Commun. 1974, 22, 637–648.
Al-Saadi, R.; Armitage, G.; But, J.; Branch, P. A Survey of Delay-Based and Hybrid TCP Congestion Control Algorithms. IEEE Commun. Surv. Tutor. 2019, 21, 3609–3638.
Polese, M.; Chiariotti, F.; Bonetto, E.; Rigotto, F.; Zanella, A.; Zorzi, M. A Survey on Recent Advances in Transport Layer Protocols. IEEE Commun. Surv. Tutor. 2019, 21, 3584–3608.
Mishra, A.; Sun, X.; Jain, A.; Pande, S.; Joshi, R.; Leong, B. The Great Internet TCP Congestion Control Census. In Abstracts of the 2020 SIGMETRICS/Performance Joint International Conference on Measurement and Modeling of Computer Systems; Association for Computing Machinery: New York, NY, USA, 2020; pp. 59–60.
Sikdar, B.; Kalyanaraman, S.; Vastola, K. Analytic models for the latency and steady-state throughput of TCP Tahoe, Reno, and SACK. IEEE/ACM Trans. Netw. 2003, 11, 959–971.
Liu, S.; Başar, T.; Srikant, R. TCP-Illinois: A Loss- and Delay-Based Congestion Control Algorithm for High-Speed Networks. Perform. Eval. 2008, 65, 417–440.
Wang, J.; Wen, J.; Han, Y.; Zhang, J.; Li, C.; Xiong, Z. CUBIC-FIT: A High Performance and TCP CUBIC Friendly Congestion Control Algorithm. IEEE Commun. Lett. 2013, 17, 1664–1667.
Altman, E.; Avrachenkov, K.; Barakat, C. TCP in Presence of Bursty Losses. In Proceedings of the 2000 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, Santa Clara, CA, USA, 18–21 June 2000; Association for Computing Machinery: New York, NY, USA, 2000; pp. 124–133.
Ephraim, Y.; Merhav, N. Hidden Markov processes. IEEE Trans. Inf. Theory 2002, 48, 1518–1569.
Borisov, A.V.; Miller, G.B. Analysis and filtration of special discrete-time Markov processes. II: Optimal filtration. Autom. Remote Control 2005, 66, 1125–1136.
Hasslinger, G.; Hohlfeld, O. The Gilbert-Elliott Model for Packet Loss in Real Time Services on the Internet. In Proceedings of the 14th GI/ITG Conference-Measurement, Modelling and Evaluation of Computer and Communication Systems, Dortmund, Germany, 31 March–2 April 2008; pp. 269–283.
Kleinrock, L. Queueing Systems: Volume 2: Computer Applications; John Wiley & Sons: New York, NY, USA, 1976.
Bertsekas, D.; Gallager, R. Data Networks; Prentice-Hall: Hoboken, NJ, USA, 1992.
Misra, V.; Gong, W.B.; Towsley, D. Fluid-Based Analysis of a Network of AQM Routers Supporting TCP Flows with an Application to RED. SIGCOMM Comput. Commun. Rev. 2000, 30, 151–160.
Kushner, H. Heavy Traffic Analysis of Controlled Queueing and Communication Networks; Springer: New York, NY, USA, 2001.
Whitt, W. Stochastic-Process Limits. An Introduction to Stochastic-Process Limits and their Application to Queues; Springer: New York, NY, USA, 2002.
Le Boudec, J.Y.; Thiran, P. Network Calculus: A Theory of Deterministic Queuing Systems for the Internet; Springer: Berlin/Heidelberg, Germany, 2001.
Jiang, Y.; Liu, Y. Stochastic Network Calculus; Springer: London, UK, 2008.
Fidler, M.; Rizk, A. A Guide to the Stochastic Network Calculus. IEEE Commun. Surv. Tutor. 2015, 17, 92–105.
Leland, W.; Taqqu, M.; Willinger, W.; Wilson, D. On the self-similar nature of Ethernet traffic (extended version). IEEE/ACM Trans. Netw. 1994, 2, 1–15.
Crovella, M.; Bestavros, A. Self-similarity in World Wide Web traffic: Evidence and possible causes. IEEE/ACM Trans. Netw. 1997, 5, 835–846.
Park, K.; Willinger, W. Self-Similar Network Traffic and Performance Evaluation; John Wiley & Sons: New York, NY, USA, 2000.
Altman, E.; Boulogne, T.; El-Azouzi, R.; Jiménez, T.; Wynter, L. A Survey on Networking Games in Telecommunications. Comput. Oper. Res. 2006, 33, 286–311.
Habachi, O.; El-azouzi, R.; Hayel, Y. A Stackelberg Model for Opportunistic Sensing in Cognitive Radio Networks. IEEE Trans. Wirel. Commun. 2013, 12, 2148–2159.
Liu, K.J.R.; Wang, B. Cognitive Radio Networking and Security: A Game-Theoretic View, 1st ed.; Cambridge University Press: Cambridge, UK, 2010.
Miller, B.M.; Miller, G.B.; Semenikhin, K.V. Methods to design optimal control of Markov process with finite state set in the presence of constraints. Autom. Remote Control 2011, 72, 323–341.
Borisov, A.V. Robust Filtering Algorithm for Markov Jump Processes with High-Frequency Counting Observations. Autom. Remote Control 2020, 81, 575–588.
Borisov, A.; Sokolov, I. Optimal Filtering of Markov Jump Processes Given Observations with State-Dependent Noises: Exact Solution and Stable Numerical Schemes. Mathematics 2020, 8, 506.
Ishikawa, Y.; Kunita, H. Malliavin calculus on the Wiener-Poisson space and its application to canonical SDE with jumps. Stoch. Process. Their Appl. 2006, 116, 1743–1769.
Borisov, A.; Miller, G.; Stefanovich, A. Controllable Markov Jump Processes. I. Optimum Filtering Based on Complex Observations. J. Comput. Syst. Sci. Int. 2018, 57, 890–906.
Elliott, R.J.; Moore, J.B.; Aggoun, L. Hidden Markov Models: Estimation and Control; Springer: New York, NY, USA, 1995.
Fleming, W.; Rishel, R. Deterministic and Stochastic Optimal Control; Applications of Mathematics; Springer: Berlin/Heidelberg, Germany, 1975.
Davis, M. Markov Models & Optimization; Chapman & Hall/CRC Monographs on Statistics & Applied Probability; Chapman & Hall/CRC: London, UK, 1993.
Cohen, S.; Elliott, R. Stochastic Calculus and Applications; Probability and Its Applications; Springer: New York, NY, USA, 2015.
Jacod, J.; Shiryaev, A. Limit Theorems for Stochastic Processes; Grundlehren der Mathematischen Wissenschaften; Springer: Berlin/Heidelberg, Germany, 2013.
Miller, B.M.; Avrachenkov, K.; Stepanyan, K.V.; Miller, G.B. Flow control as stochastic optimal control problem with incomplete information. In Proceedings of the INFOCOM 2005 24th Annual Joint Conference of the IEEE Computer and Communications Societies, Miami, FL, USA, 13–17 March 2005; pp. 1328–1337.
Liptser, R.; Shiryaev, A. Theory of Martingales; Mathematics and its Applications; Springer: Dordrecht, The Netherlands, 1989.
Ha, S.; Rhee, I.; Xu, L. CUBIC: A New TCP-Friendly High-Speed TCP Variant. SIGOPS Oper. Syst. Rev. 2008, 42, 64–74.
Kato, T.; Haruyama, S.; Yamamoto, R.; Ohzahata, S. mpCUBIC: A CUBIC-like Congestion Control Algorithm for Multipath TCP. In Trends and Innovations in Information Systems and Technologies; Rocha, Á., Adeli, H., Reis, L.P., Costanzo, S., Orovic, I., Moreira, F., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 306–317.
Tan, K.; Song, J.; Zhang, Q.; Sridharan, M. A Compound TCP Approach for High-Speed and Long Distance Networks. In Proceedings of the IEEE INFOCOM 2006 25th IEEE International Conference on Computer Communications, Barcelona, Catalunya, Spain, 23–29 April 2006; pp. 1–12.
Oda, H.; Hisamatsu, H.; Noborio, H. Compound TCP+: A Solution for Compound TCP Unfairness in Wireless LAN. J. Inf. Process. 2013, 21, 122–130.
Smith, W. Regenerative Stochastic Processes. Proc. R. Soc. Lond. Ser. A Math. Phys. Sci. 1955, 232, 6–31.
Borisov, A.V. Wonham Filtering by Observations with Multiplicative Noises. Autom. Remote Control 2018, 79, 39–50.
Wonham, W.M. Some Applications of Stochastic Differential Equations to Optimal Nonlinear Filtering. J. Soc. Ind. Appl. Math. Ser. A Control 1964, 2, 347–369.
Borovkov, A. Asymptotic Methods in Queuing Theory; John Wiley & Sons: Hoboken, NJ, USA, 1984.
Gut, A. Stopped Random Walks: Limit Theorems and Applications; Springer Series in Operations Research and Financial Engineering; Springer: New York, NY, USA, 2009.
Fischer, H. A History of the Central Limit Theorem: From Classical to Modern Probability Theory; Sources and Studies in the History of Mathematics and Physical Sciences; Springer: New York, NY, USA, 2010.
Bobkov, S.G.; Chistyakov, G.P.; Götze, F. Berry-Esseen bounds in the entropic central limit theorem. Probab. Theory Relat. Fields 2014, 159, 435–478.
Bally, V.; Caramellino, L. Asymptotic development for the CLT in total variation distance. Bernoulli 2016, 22, 2442–2485.
Liptser, R.; Shiryaev, A. Statistics of Random Processes II: Applications; Springer: Berlin/Heidelberg, Germany, 2001.
Borisov, A.V. Application of optimal filtering methods for on-line monitoring of queueing network states. Autom. Remote Control 2016, 77, 277–296.
Borisov, A.V.; Bosov, A.V.; Miller, G.B.; Stefanovich, A.I. Optimization of TCP Algorithm for Wired–Wireless Channels Based on Connection State Estimation. In Proceedings of the 2019 IEEE 58th Conference on Decision and Control (CDC), Nice, France, 11–13 December 2019; pp. 728–733.
Floyd, S.; Jacobson, V. Random early detection gateways for congestion avoidance. IEEE/ACM Trans. Netw. 1993, 1, 397–413.
Cardwell, N.; Cheng, Y.; Gunn, C.S.; Yeganeh, S.H.; Jacobson, V. BBR: Congestion-Based Congestion Control. Commun. ACM 2017, 60, 58–66.
Claypool, S.; Claypool, M.; Chung, J.; Li, F. Sharing but not Caring-Performance of TCP BBR and TCP CUBIC at the Network Bottleneck. In Proceedings of the 4th IARIA International Conference on Advances in Computation, Communications and Services (ACCSE), Nice, France, 28 July–2 August 2019; pp. 74–81.
Pokhrel, S.R.; Panda, M.; Vu, H.L. Analytical Modeling of Multipath TCP Over Last-Mile Wireless. IEEE/ACM Trans. Netw. 2017, 25, 1876–1891.
Kharat, P.; Kulkarni, M. Modified QUIC protocol with congestion control for improved network performance. IET Commun. 2021, 15, 1210–1222.
Krogh, A.; Brown, M.; Mian, I.; Sjölander, K.; Haussler, D. Hidden Markov Models in Computational Biology: Applications to Protein Modeling. J. Mol. Biol. 1994, 235, 1501–1531.
Huelsenbeck, J.; Larget, B.; Swofford, D. A compound Poisson process for relaxing the molecular clock. Genetics 2000, 154, 1879–1892.
Karchin, R.; Cline, M.; Mandel-Gutfreund, Y.; Karplus, K. Hidden Markov models that use predicted local structure for fold recognition: Alphabets of backbone geometry. Proteins Struct. Funct. Bioinform. 2003, 51, 504–514.
Cauchemez, S.; Carrat, F.; Viboud, C.; Valleron, A.; Boëlle, P. A Bayesian MCMC approach to study transmission of influenza: Application to household longitudinal data. Stat. Med. 2004, 23, 3469–3487.
Allen, L.J.S. An Introduction to Stochastic Epidemic Models. In Mathematical Epidemiology; Brauer, F., van den Driessche, P., Wu, J., Eds.; Springer: Berlin/Heidelberg, Germany, 2008; pp. 81–130.
Gómez, S.; Arenas, A.; Borge-Holthoefer, J.; Meloni, S.; Moreno, Y. Discrete-time Markov chain approach to contact-based disease spreading in complex networks. EPL (Europhys. Lett.) 2010, 89, 38009.
Papadopoulos, C.T.; Li, J.; O’Kelly, M.E. A classification and review of timed Markov models of manufacturing systems. Comput. Ind. Eng. 2019, 128, 219–244.
Ang, A.; Timmermann, A. Regime Changes and Financial Markets. Annu. Rev. Financ. Econ. 2012, 4, 313–337.
Paulsen, J. Risk theory in a stochastic economic environment. Stoch. Process. Appl. 1993, 46, 327–361.
Christiansen, M. Multistate models in health insurance. AStA Adv. Stat. Anal. 2012, 96, 155–186.

Figure 1. The function $y = \frac{1}{\sqrt{x}}$ and its piecewise linear majorant against the Gaussian.

Figure 2. TCP channel simulation example for Illinois control algorithm.

Figure 3. TCP channel simulation example for CUBIC control algorithm.

Figure 4. TCP channel simulation example for Compound control algorithm.

Figure 5. TCP channel simulation example for BBR control algorithm.

Figure 6. State-based control (left) and state occupation probabilities for three initial states: $X_{0} = e_{1}$, $X_{0} = e_{2}$, $X_{0} = e_{3}$ (right).

Figure 7. TCP channel simulation example for state-based control algorithm.

Figure 8. Performance of TCP control versions.

Table 1. Performance metrics.

Protocol	Parameter	Throughput	% loss	$e_{1}$	$e_{2}$	$e_{3}$	$e_{4}$
Illinois		63.97	0.011	15.3%	37.7%	24.8%	22.2%
CUBIC	$\beta = 0.7$	59.85	0.005	25.7%	31.2%	20.9%	22.2%
CUBIC	$\beta = 0.8$	63.99	0.006	17.7%	30.8%	29.3%	22.2%
CUBIC	$\beta = 0.9$	68.74	0.007	8.9%	24.2%	44.7%	22.2%
Compound	$\gamma = 10$	61.81	0.021	14.3%	42.6%	20.9%	22.2%
Compound	$\gamma = 20$	65.63	0.019	10.7%	43.4%	23.7%	22.2%
Compound	$\gamma = 30$	66.61	0.019	9.6%	39.9%	28.3%	22.2%
Compound	$\gamma = 40$	67.02	0.021	9.0%	31.1%	37.7%	22.2%
Compound	$\gamma = 50$	68.08	0.022	8.5%	26.8%	42.5%	22.2%
Compound	$\gamma = 60$	68.63	0.022	8.3%	24.1%	45.4%	22.2%
Compound	$\gamma = 70$	68.68	0.023	8.3%	22.8%	46.7%	22.2%
Compound	$\gamma = 80$	68.76	0.024	8.3%	22.9%	46.6%	22.2%
Compound	$\gamma = 90$	68.77	0.024	8.3%	22.8%	46.7%	22.2%
BBR		88.65	1.219	0.7%	8.2%	68.9%	22.2%
State-based	$u_{t}^{4} = 20$	76.15	0.004	1.9%	74.2%	1.7%	22.2%
State-based	$u_{t}^{4} = 50$	76.68	0.007	1.8%	74.3%	1.7%	22.2%
State-based	$u_{t}^{4} = 100$	77.64	0.012	1.7%	74.4%	1.7%	22.2%
State-based	$u_{t}^{4} = 200$	79.29	0.022	1.6%	74.5%	1.7%	22.2%

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
