1. Introduction
In this study, we consider the problem of approximating a locally unique solution \(x^*\) of the equation
\begin{equation}
\label{1.1}
F(x)=0,
\end{equation}
(1)
where \(F: \Omega\subseteq \mathcal{B}_1\longrightarrow\mathcal{B}_2\) be a Fréchet-differentiable operator, \(\mathcal{B}_1\) and \(\mathcal{B}_2\) are Banach spaces and \(\Omega\) is a nonempty convex subset of \(\mathcal{B}_1.\)
Numerous problems in Mathematics and computational sciences are written in the form of (1) using mathematical modeling [
1,
2,
3,
4,
5,
6,
7,
8,
9]. Most solution methods for these equations are iterative, since closed form solutions can rarely be found.
We study the local convergence of the multi-step Chebyshev-Halley-type [
9] method defined for each \(n=0,1,2,\ldots\) by
\begin{eqnarray}
\label{1.2}
\begin{cases} y_{n}=x_{n}-[I+\frac{1}{2}K(x_n)(1-\delta K(x_n))^{-1}]F(x_n)^{-1}F(x_{n}),\\
z_n=y_n-F'(x_n)^{-1}[F'(x_n)+F”(u_n)(x_n-v_n)]F'(x_n)^{-1}F(y_n),\\
x_{n+1}=z_n-[I+K(x_n)+\gamma K(x_n)^2]F'(x_n)^{-1}F(z_n),
\end{cases}
\end{eqnarray}
(2)
where \(x_0\) is an initial points, \(\gamma, \delta \in S, S=\mathbb{R}\) or \(S=\mathbb{C},\) \(u_n=x_n-\frac{2}{3}F'(x_n)^{-1}F(x_n),\) \( K(x_n)=F'(x_n)^{-1}F”(x_n)F'(x_n)^{-1}F(x_n) \) and \(v_n=x_n-F'(x_n)^{-1}F(x_n).\)
The semi-local convergence of method (2) was given in [9] under the (\(C\)) conditions:
- (\(C_1\)) \(\|F”(x)\|\leq M\) for each \(x\in \Omega\),
- (\(C_2\)) \(F'(x_0)^{-1}\in \mathcal{L}(\mathcal{B}_2, \mathcal{B}_1)\) for some \(x_0\in \Omega\) and \(\|F'(x_0)^{-1}\|\leq \beta,\)
- (\(C_3\)) \(\|F'(x_0)^{-1}F9x_0)\|\leq \eta\)
and
- (\(C_4\)) \(\|F”(x)-F”(y)\|\leq \varphi(\|x-y\|)\) for each \(x,y\in \Omega\) and \(\varphi\) satisfies \(\varphi(0)\geq 0,\, \varphi(ts)\leq t^a\varphi(s)\) for each \(t\in [0,1], s\in (0, +\infty), 0\leq a\leq 1.\)
Similar methods have been consider by other authors using conditions (\(C_1\))-(\(C_4\)) but the convergence order is smaller [
4,
5,
6,
7,
8]. The convergence order was shown to be \(4+3a\) using recurrence relations [
9]. Notice that method (2) uses the second Fréchet-derivative which is expensive in general. However, there are cases e.g., when \(F”\) is a bilinear operator or other cases [
1,
2,
3,
7,
8], when method (2) is very use full, since it is very fast. Condition (\(C_4\)) may be hard to verify or may not hold even in simple cases. Let us consider \(\varphi(t)=bt\) for some \(b > 0.\) Consider a motivational example.
Define function \(F\) on \(\Omega=[-\frac{1}{2},\frac{5}{2}]\) by
\[F(x)=\left\lbrace
\begin{array}{ll}
x^2\ln x^2+2x^4-4x^3+2x^2,\,\,\,x\neq 0,\\
0 ,\,\,\, x=0.
\end{array}\right.\]
Choose \(x^\ast =1.\)
Then, condition (\(C_4\)) is not satisfied since the third Fréchet derivative does not exists at \(x^*=1.\)
In this study, we present the local convergence not given in [9] and drop condition (\(C_4\)). This way we expand the applicability of method (2). Moreover, we refine Theorem 1 in [9] leading to a new semi-local convergence for method (2), a wider convergence region, tighter error bounds on the distances \(\|x_n-x^*\|\) and an at least as precise information on the location of the solution. These advantages are obtained, since we find a more precise region where the iterate lie resulting to tighter Lipschitz constants as well as the aforementioned advantages. The new constants are special cases of the old ones, so the advantages are obtained under the same computational cost.
The study is structured as follows; Section 2 contain the local convergence followed by the semi-local convergence in Section 3. The numerical examples in Section 4 conclude this study.
2. Local convergence
The local convergence that follows is centered on some parameters and scalar functions. Let \(\gamma,\, \delta\in \mathbb{R}, w_0:[0, +\infty)\longrightarrow [0, +\infty)\) be a continuous and non-decreasing function with \(w_0(0)=0.\) Define parameter \(\rho_0\) by
\begin{equation}
\label{2.1}
\rho_0=\sup\{t\geq 0:w_0(t) < 1\}.
\end{equation}
(3)
Let \(v_1, v:[0, \rho_0)\longrightarrow[0, +\infty)\) be continuous, non-decreasing functions and \(\delta\in \mathbb{R}.\) Define functions \(g_0\) and \(h_0\) on the interval \([0, \rho_0)\) by
\[g_0(t)=\frac{v_1(t)\int_0^1v(\theta t)d\theta}{1-w_0(t)},\]
and
\[h_0(t)=|\delta|g_0(t)t-1.\]
Suppose
\begin{equation}
\label{2.2*}
h_0(t)\longrightarrow +\infty\, \, or \, a \, positive \, constant \, as \, \,t\longrightarrow \rho_0^-,
\end{equation}
(4)
we have \(h_0(0)=-1.\) It follows by the intermediate value theorem that equation \(h_0(t)=0\) has solutions in \((0, \rho_0).\) Denote by \(\rho\) the smallest such solution in \((0, \rho_0).\) Define functions \(g_1\) and \(h_1\) on \([0, \rho)\) by
\[g_1(t)=\frac{\int_0^1w((1-\theta)t)d\theta}{1-w_0(t)}+\frac{g_0(t)t\int_0^1v(\theta t)d\theta}{2(1-|\delta|g_0(t)t)(1-w_0(t))},\]
and
\[h_1(t)=g_1(t)-1,\]
where \(w:[0, \rho)\longrightarrow [0, +\infty)\) is a continuous and non-decreasing function with \(w(0)=0.\)
Suppose
\begin{equation}
\label{2.3*}
h_1(t)\longrightarrow +\infty\,\, or \, a \, positive \, constant \, as \,\,t\longrightarrow \rho^-,
\end{equation}
(5)
we get \(h_1(0)=-1.\) It follows by the intermediate value theorem that equation \(h_1(t)=0\) has solutions in \((0, \rho).\) Denote by \(r_1\) the smallest such solution in \((0, \rho).\) Suppose that
\begin{equation}
\label{2.2}
v(0) < 3.
\end{equation}
(6)
Define functions \(p\) and \(q\) on the interval \([0, \rho_0)\) by
\[p(t)=\frac{\int_0^1w((1-\theta)t)d\theta+\frac{1}{3}\int_0^1v(\theta t)d\theta}{1-w_0(t)},\]
and
\[q(t)=p(t)t-1.\]
Suppose
\begin{equation}
\label{2.5*}
q(t)\longrightarrow +\infty\,\, or \, a \, positive \, constant \, as \,\,t\longrightarrow \rho_0^-,
\end{equation}
(7)
we obtain that \(q(0)=\frac{v(0)}{3}-1 < 0.\) Denote by \(r_q\) the smallest solution of equation \(q(t)=0\) in \((0, \rho_0).\) Define functions \(g_2\) and \(h_2\) on the interval \([0, \rho)\) by
\begin{eqnarray*}
g_2(t)&=&\frac{\int_0^1w((1-\theta)g_1(t)t)d\theta g_1(t)}{1-w_0(g_1(t)t)}
+\frac{(w_0(t)+w_0(g_1(t)t))\int_0^1v(\theta g_1(t)t)d\theta g_1(t)}{(1-w_0(t))(1-w_0(g_1(t)t))}\\
&&+\frac{\bar{v}(p(t)t)\int_)^1v(\theta t)d\theta\int_0^1v(\theta g_1(t)t)g_1(t)}{(1-w_0(t))^3},
\end{eqnarray*}
and
\[h_2(t)=g_2(t)-1,\]
where function \(\bar{v}:[0, \rho_0)\longrightarrow [0, +\infty)\) is continuous and nondecreasing.
Define parameter \(\bar{\rho}_0\) by
\[\bar{\rho}_0=\max\{t\in[0,\rho_0]:w_0(g_1(t)t) < 1\}.\]
Suppose
\begin{equation}
\label{2.6*}
h_2(t)\longrightarrow +\infty\,\, or \, a \, positive \, constant \, as \,\,t\longrightarrow \bar{\rho}_0^-,
\end{equation}
(8)
we have \(h_2(0)=-1 < 0.\)
Denote by \(r_2\) the smallest solution of equation \(h_2(t)=0\) in \((0, \bar{\rho}_0).\) Define parameter \(\bar{\bar{\rho}}_0\) by
\[\bar{\bar{\rho}}_0=\max\{t\in [0, \rho_0]:w_0(g_2(t)t) < 1\}.\]
Define functions \(g_3\) and \(h_3\) on the interval \([0, \lambda), \lambda=\min\{\bar{\rho}_0, \bar{\bar{\rho}}_0\}\) by
\begin{eqnarray*}
g_3(t)&=&\frac{\int_0^1w((1-\theta)g_2(t)t)d\theta g_2(t)}{1-w_0(g_2(t)t)}+\frac{(w_0(t)+w_0(g_2(t)t))\int_0^1v(\theta g_2(t)t)d\theta g_2(t)}{(1-w_0(t))(1-w_0(g_2(t)t))}\\
&&+\frac{g_0(t)\int_0^1v(\theta g_2(t)t)d\theta g_2(t)t}{1-w_0(t)}+\frac{|\gamma|g_0^2(t)\int_0^1v(\theta g_2(t)t)d\theta g_2(t)t^2}{1-w_0(t)},
\end{eqnarray*}
and
\[h_3(t)=g_3(t)-1.\]
Suppose
\begin{equation}
\label{2.7*}
h_3(t)\longrightarrow +\infty\,\, or \, a \, positive \, constant \, as \,\,t\longrightarrow \lambda^-,
\end{equation}
(9)
we get that \(h_3(0)=-1.\) Denote by \(r_3\) the smallest solution of equation \(h_3(t)=0\) in \((0, \lambda).\) Define the radius of convergence \(r\) by
\begin{equation}
\label{2.3}
r=\min\{r_i\},\,i=1,2,3.
\end{equation}
(10)
Then for each \(t\in [0, r)\)
\begin{align}
\label{2.4}
0&\leq g_i(t)< 1,\\\label{2.5}
\end{align}
(11)
\begin{align}
0&\leq w_0(t)< 1,\\\label{2.6}
\end{align}
(12)
\begin{align}
0&\leq w_0(g_1(t)t)< 1,\\\label{2.7}
\end{align}
(13)
\begin{align}
0&\leq w_0(g_2(t)t)< 1,
\end{align}
(14)
and
\begin{equation}
0\leq p(t)t < 1.
\end{equation}
(15)
Some alternatives to the aforementioned conditions are; Equation
\[w_0(t)=1\]
has positive solutions. Denoted by \(\rho_0\) the smallest such solution. Similarly, define \(\bar{\rho}_0, \bar{\bar{\rho}}_0.\) Functions, \(v_1, v, w, \bar{v}\) defined on the same intervals as before are continuous and increasing. Then, clearly conditions (4), (5), (7), (8) and (9) hold.
Let \(U(x,\mu), \bar{U}(x,\mu)\) denote the open and closed balls in \(\mathcal{B}_1,\) respectively of center \(x\in \mathcal{B}_1\) and of radius \(\mu > 0.\) The local
convergence of method (2) is based on the preceding notation and conditions (\(A\))
- (\(a_1\))     \(F:\Omega\subset \mathcal{B}_1\longrightarrow \mathcal{B}_2\) is a twice continuously Fréchet differentiable operator.
- (\(a_2\))     There exists \(x^*\in \Omega\) such that \(F(x^*)=0\) and \(F'(x^*)^{-1}\in \mathcal{L}( \mathcal{B}_1, \mathcal{B}_1).\)
- (\(a_3\))     There exists function \(w_0:[0, +\infty)\longrightarrow[0, +\infty)\) continuous and non-decreasing with \(w_0(0)=0\) such that for each \(x\in \Omega\)
\[\|F'(x^*)^{-1}(F'(x)-F'(x^*))\|\leq w_0(\|x-x^*\|).\]
Set \(\Omega_0=\Omega\cap U(x^*,\rho_0),\) where \(\rho_0\) is defined by (3).
- (\(a_4\))     There exist functions \(w:[0, \rho_0)\longrightarrow [0, +\infty),v:[0, \rho_0)\longrightarrow [0, +\infty), \bar{v}:[0, \rho_0)\longrightarrow [0, +\infty)\) continuous and nondecreasing with \(w(0)=0\) such that for each \(x,y\in \Omega_0\)
\[\|F'(x^*)^{-1}(F'(x)-F'(y))\|\leq w(\|x-y\|),\]
\[\|F'(x^*)^{-1}F'(x)\|\leq v(\|x-x^*\|)\]
and
\[\|F'(x^*)^{-1}F”(x)\|\leq \bar{v}(\|x-x^*\|).\]
- (\(a_5\))     \(\bar{U}(x^*,r)\subseteq D\) and (6) hold, where \(r\) is given by (10).
- (\(a_6\))     There exists \(R\geq r\) such that
\[\int_0^1w_0(\theta R)d\theta < 1.\]
Theorem 1.
Suppose that the conditions (\(A\)) and (4)-(9) hold. Then, sequence \(\{x_n\}\) generated for \(x_0\in U(x^*,r)-\{x^*\}\) by method (2) is well defined in \(U(x^*,r)\) remains in \(U(x^*,r)\) for each \(n=0,1,2,\ldots\) and converges to \(x^*.\) Moreover, the following estimates hold
\begin{equation}
\label{2.9}
\|y_n-x^*\|\leq g_1(\|x_n-x^*\|)\|x_n-x^*\|\leq \|x_n-x^*\| < r,
\end{equation}
(16)
\begin{equation}
\label{2.10}
\|z_n-x^*\|\leq g_2(\|x_N-x^*\|)\|x_n-x^*\|\leq \|x_n-x^*\|,
\end{equation}
(17)
and
\begin{equation}
\label{2.11}
\|x_{n+1}-x^*\|\leq g_3(\|x_n-x^*\|)\|x_n-x^*\|\leq \|x_n-x^*\|,
\end{equation}
(18)
where the functions \(g_i, i=1,2,3\) are defined previously. Furthermore, the point \(x^*\) is the only solution of equation \(F(x)=0\)
in \(\Omega_1=\Omega\cap \bar{U}(x^*, R).\)
Proof. Mathematical induction is employed to show estimates (16)-(18). Using (3), (\(a_1\))-(\(a_3\)) and \(x_0\in U(x^*,r),\) we have that
\begin{equation}
\label{2.12}
\|F'(x^*)^{-1}(F'(x)-F'(x^*))\|\leq w_0(\|x_0-x^*\|)\leq w_0(r) < 1.
\end{equation}
(19)
In view of (19) and the Banach lemma on invertible operators [
1,
2,
3,
7,
8], we get that \(F'(x_0)^{-1}\in \mathcal{L}(\mathcal{B}_2,\mathcal{B}_1),\)
\begin{equation}
\label{2.13}
\|F'(x)^{-1}F'(x^*)\|\leq \frac{1}{1-w_0(\|x_)-x^*\|)},
\end{equation}
(20)
and \(u_0\) is well defined. By the definition of operator \(K,\) (\(a_4\)) and (20), we obtain that
\begin{eqnarray}
\nonumber
\|K(x_0)\|&\leq&\|F'(x_0)^{-1}F'(x^*)\|\|F'(x^*)^{-1}F”(x_0)\| \|F'(x_0)^{-1}F'(x^*)\|\|F'(x^*)^{-1}F(x_0)\|\\\nonumber
&\leq&\frac{\bar{v}(\|x_0-x^*\|)\int_0^1v(\theta \|x_0-x^*\|)d\theta\|x_0-x^*\|}{(1-w_0(\|x_0-x^*\|))^2}\\\label{2.14}
&\leq&g_0(\|x_0-x^*\|)\|x_0-x^*\|,\end{eqnarray}
(21)
\begin{eqnarray}
\|\delta K(x_0)\|&\leq&|\delta|g_0(\|x_0-x^*\|)\|x_0-x^*\|\notag\\\label{2.15}
&\leq&|\delta|g_0(r) < 1,
\end{eqnarray}
(22)
so \(\delta K(x_0)^{-1}\in\mathcal{L}(\mathcal{B}_2,\mathcal{B}_1),\)
\begin{equation}
\label{2.16}
\|(I-\delta K(x_0))^{-1}\|\leq \frac{1}{1-|\delta|g_0(\|x_0-x^*\|)\|x_0-x^*\|},
\end{equation}
(23)
and
\begin{equation}
\label{2.17}
\left\|K(x_0)\left(I-\delta K(x_0)^{-1}\right)\right\|\leq \frac{g_0(\|x_0-x^*\|)\|x_0-x^*\|}{1-|\delta|g_0(\|x_0-x^*\|)\|x_0-x^*\|}.
\end{equation}
(24)
By the first substep of method (2), we can write
\begin{equation}
\label{2.18}
y_0-x^*=x_0-x^*-F'(x_0)^{-1}F(x_0)-\frac{1}{2}K(x_0)(1-\delta K(x_0))^{-1}F'(x_0)^{-1}F(x_0).
\end{equation}
(25)
Then, using (10), (11) (for \(i=1\)), (\(a_4\)), (21)-(25), we have in turn that
\begin{eqnarray}
\nonumber
\|y_0-x^*\|&\leq&\|F'(x_0)^{-1}F'(x^*)\| \int_0^1\|F'(x^*)^{-1}(F'(x^*+\theta(x_0-x^*))-F'(x_0))(x_0-x^*)d\theta\|\\\nonumber
&&+\frac{1}{2}\|K(x_0)(1-\delta K(x_0))^{-1}\|\\\label{2.19}
&\leq&g_1(\|x_0-x^*\|)\|x_0-x^*\|\leq \|x_0-x^*\| < r,
\end{eqnarray}
(26)
which shows (16) for \(n=0\) and \(y_0\in U(x^*,r).\) The point \(v_0\in U(x^*,r)\) by (26) and the choice of \(r.\) We shall show that \(u_0\in U(x^*, r).\) By the definition of \(u_0,\) we get in turn that
\begin{eqnarray}
\nonumber
\|u_0-x^*\|&=&\|(x_0-x^*-F'(x_0)^{-1}F(x_0))+\frac{1}{3}F'(x_0)^{-1}F(x_0)\|\\\nonumber
&\leq&\frac{\int_0^1w((1-\theta)\|x_0-x^*\|)d\theta\|x_0-x^*\|}{1-w_0(\|x_0-x^*\|)}+\frac{1}{3}\frac{\int_0^1v(\theta\|x_0-x^*\|)d\theta\|x_0-x^*\|}{1-w_0(\|x_0-x^*\|)}\\\label{2.20}
&\leq&p(\|x_0-x^*\|)\|x_0-x^*\|\leq \|x_0-x^*\| < r,
\end{eqnarray}
(27)
so \(u_0\in U(x^*,r).\) We can write by the second substep of method (2)
\begin{eqnarray}
\label{2.21}\nonumber
z_0-x^*&=&y_0-x^*-F'(x_0)^{-1}F(y_0)-F'(x_0)^{-1}F”(u_0)(x_0-y_0)F'(x_0)^{-1}F(y_0)\\\nonumber
&=&(y_0-x^*-F'(y_0)^{-1}F(y_0))+F'(y_0)^{-1}(F'(x_0)-F'(y_0))F'(x_0)^{-1}F(y_0)\\
&&-F'(x_0)^{-1}F”(u_0)(x_0-v_0)F'(x_0)^{-1}F(y_0),
\end{eqnarray}
(28)
so
\begin{eqnarray}
\nonumber
\|z_0-x^*\|
&=&\frac{\int_0^1w((1-\theta)\|y_0-x^*\|)d\theta\|y_0-x^*\|}{1-w_0(\|y_0-x^*\|)}\\\nonumber
&&+\frac{(w_0(\|x_0-x^*\|)+w_0(\|y_0-x^*\|))\int_0^1v(\theta\|y_0-x^*\|)d\theta\|y_0-x^*\|}{(1-w_0(\|y_0-x^*\|))(1-w_0(\|x_0-x^*\|))}\\\nonumber
&&+\bar{v}(p(\|x_0-x^*\|)\|x_0-x^*\|) \frac{\int_0^1v(\theta\|y_0-x^*\|)d\theta\|x_0-x^*\|\int_0^1v(\theta\|y_0-x^*\|)d\theta\|y_0-x^*\|}{(1-w_0(\|x_0-x^*\|))^2}\\\label{2.22}
&\leq&g_2(\|x_0-x^*\|)\|x_0-x^*\|\leq \|x_0-x^*\| < r,
\end{eqnarray}
(29)
so (17) holds for \(n=0\) and \(z_0\in U(x^*, r).\) Then, by the third substep of method (2) we can also write.
\begin{eqnarray}
\nonumber
x_1-x_0&=&z_0-x^*-F'(z_0)^{-1}F(z_0)+F'(z_0)^{-1}(F'(x_0)-F'(z_0))F'(x_0)^{-1}F(z_0)\\\label{2.23}
&&-K(x_0)F'(x_0)^{-1}F(z_0)-\gamma K(x_0)^2F'(x_0)^{-1}F(z_0),
\end{eqnarray}
(30)
so
\begin{eqnarray}
\nonumber
\|x_1-x^*\|&\leq&\frac{\int_0^1w((1-\theta)\|z_0-x^*\|)d\theta\|z_0-x^*\|}{1-w_0(\|z_0-x^*\|)}\\\nonumber
&&+\frac{(w_0(\|x_0-x^*\|)+w_0(\|z_0-x^*\|))\int_0^1v(\theta\|z_0-x^*\|)d\theta\|z_0-x^*\|}{(1-w_0(\|z_0-x^*\|))(1-w_0(\|x_0-x^*\|))}
\end{eqnarray}
\begin{eqnarray}
\nonumber
&&+\frac{g_0(\|x_0-x^*\|)\|x_0-x^*\|\int_0^1v(\theta\|z_0-x^*\|)d\theta\|z_0-x^*\|}{(1-w_0(\|x_0-x^*\|))}\\\nonumber
&&+|\gamma|\frac{g_0^2(\|x_0-x^*\|)\|x_0-x^*\|^2\int_0^1v(\theta\|z_0-x^*\|)d\theta\|z_0-x^*\|}{(1-w_0(\|x_0-x^*\|))}
\\\label{2.24}
&\leq&g_3(\|x_0-x^*\|)\|x_0-x^*\|\leq \|x_0-x^*\| < r,
\end{eqnarray}
(31)
so (18) holds for \(n=0\) and \(x_1\in U(x^*,r).\) The induction for (16)-(18) is completed, if we simply replace \(x_0, y_0, z_0, u_0, v_0, x_1\) by \(x_m,y_m, z_m., u_m, v_m, x_{m+1}\) in the preceding estimates. Then, from the estimate
\begin{equation}
\label{2.25}
\|x_{m+1}-x^*\|\leq c\|x_m-x^*\| < r, c= g_3(\|x_0-x^*\|)\in [0,1)
\end{equation}
(32)
we conclude that \(\lim_{m\longrightarrow\infty}x_m=x^*\) and \(x_{m+1}\in U(x^*,r).\) Let \(y^*\in \Omega_1\) with \(F(y^*)=0.\) Define linear operator \(T=\int_0^1 F'(x^*+\theta(y^*-x^*))d\theta.\) Then, using (\(a_3\)) and (\(a_6\)), we get that
\[\|F'(x^*)^{-1}(T-F'(x^*))\|\leq\int_0^1w_0(\theta\|y^*-x^*\|)d\theta\leq \int_0^1w_0(\theta R)d\theta < 1,\]
so \(T^{-1}\in \mathcal{L}(\mathcal{B}_2, \mathcal{B}_1).\) Using the identity \(0=F(y^*)-F(x^*)=T(y^*-x^*),\) we deduce that \(x^*=y^*,\) which completes the uniqueness part of the proof.
Remark 1.
3. Semi-local convergence analysis
Let us modify the (\(C\)) conditions ( given in a non-affine invariant form in [
9]) so as to be given in affine invariant form as well as introduce the notion of the restricted convergence region. The conditions (\(\bar{C}\)) are;
Definition 1. The set \(T=T(F,x_0,y_0)\) belong to class \(K=K(L_0, L, L_1, L_2 ,\eta _0, \eta),\) if
- (\(\bar{C}_0\))     (\(\bar{C}_0 \))=(\(a_1\)).
- (\(\bar{C}_1\))     There exists \(x_0\in \Omega\) such that \(F'(x_0)^{-1}\in \mathcal{L}(\mathcal{B}_2, \mathcal{B}_1).\)
- (\(\bar{C}_2\))   
\(\|F'(x_0)^{-1}(F'(x)-F'(x_0))\|\leq M_0\|x-x_0\| \, for \, each \, x\in \Omega.\)
Set \(\Omega_0=\Omega \cap B(x_0, \frac{1}{M_0}),\) we get
\(\|F'(x_0)^{-1}F”(x)\|\leq\bar{M} \, for \, each \, x\in \Omega_0.\)
- (\(\bar{C}_3\))    (\(\bar{C}_3\))=(\(C_3\)).
- (\(\bar{C}_4\))   
\(\|F'(x_0)^{-1}(F”(x)-F”(y))\|\leq\bar{\varphi}(\|x-y\|) \, for \, each \, x,y\in \Omega_0,\)
where \(\bar{\varphi}\) is as \(\varphi.\)
We shall compare, the old condition (\(C\)) assuming they are given in affine invariant form with the new conditions (\(\bar{C}\)). Clearly,
\begin{equation}
\label{3.1}
\Omega_0\subseteq \Omega,
\end{equation}
(37)
\begin{equation}
\label{3.2}
M_0\leq \bar{M}\leq M,
\end{equation}
(38)
\begin{equation}
\label{3.3}
\bar{\varphi}(t)\leq \varphi(t),
\end{equation}
(39)
hold in general and \(\frac{\bar{M}}{M_0}\) can be arbitrarily large [
1,
2,
3]. With these modifications as in [
9], we define the functions
\begin{eqnarray}
\label{3.4}
\bar{\varphi}(t)&=&\bar{g}_1(t)+(1+t)\bar{g}_2(t)+(1+t+|\gamma|)\bar{g}_3(t),\\
\end{eqnarray}
(40)
\begin{eqnarray}
\label{3.5}
\bar{h}(t)&=&\frac{1}{1-\bar{\varphi}(t)t},\\\nonumber
\bar{\psi}(t,u)&=&\left(\frac{2}{3}\right)^p\bar{\varphi}_2(t, u)u+t^2(1+|\gamma|+|\gamma|t)\bar{\varphi}_2(t,u)
+\frac{1}{p+1}(1+t+|\gamma|t^2)\bar{\varphi}_2(t,u)u\\\nonumber
&&+\frac{t^2(1+t+|\gamma|t^2)}{1(1-\delta t)}\bar{\varphi}_2(t,u)+ t(1+t)\bar{\varphi}_1(t,u)(1+t+|\gamma|t^2)\bar{\varphi}_2(t,u)\\
\end{eqnarray}
(41)
\begin{eqnarray}
\label{3.6}
&&+\frac{1}{2}(1+t+|\gamma|t^2)^2\bar{\varphi}_2(t,u)^2,
\end{eqnarray}
(42)
where
\begin{align*}
\bar{g}_1(t)&=1+\frac{1}{2(1-\delta t)},\\
\bar{g}_2(t)&=\frac{t}{2}+\frac{t}{2(1-\delta t)}+\frac{t^2}{2(1-\delta t)}+\frac{t^3}{8(1-\delta t)^2},\\
\bar{g}_3(t)&=t \bar{g}_2(t)+t(1+t)\bar{g}_2(t)+\frac{t^2(1+t)}{2(1-\delta t)}\bar{g}_2(t)+\frac{t}{2}(1+ t)^2\bar{g}_2^2(t),\\
\bar{\varphi}_1(t,u)&=\frac{2^{p-1}}{3^p}u+\frac{u}{(p+1)(p+2)}+\frac{(1+\delta)t^2}{2(1-\delta t)}+\frac{t^3}{8(1-\delta t)^2},\\
\bar{\varphi}_2(t,u)&=\left[(\frac{2}{3})^pu+t^2+\frac{(1+t)u}{p+1}+\frac{t^2(1+t)}{2(1-\delta t)}\right]\bar{\varphi}_1(t,u)+\frac{t}{2}(1+t)^2\bar{\varphi}_1(t,u)^2.\end{align*}
Moreover, define \(\bar{\beta}= 1, \eta_0=\eta, a_0=\bar{M}\eta, b_0=\eta \bar{\varphi}(\eta)\) and \(C_0=h(a_0)\bar{\psi}(a_0,b_0).\) Define the sequences
\begin{eqnarray}
\label{3.7}
\begin{cases}
\bar{\beta}_{n+1}=\bar{h}(a_n)\bar{\beta}_n,\, \bar{\eta}_{n+1}=\bar{C}_n\bar{\eta}_n,\\
\bar{a}_{n+1}=\bar{M}\bar{\beta}_{n+1}\bar{\eta}_{n+1}, \bar{b}_{n+1}=\bar{\beta}_{n+1}\bar{\eta}_{n+1}\bar{\varphi}(\bar{\eta}_{n+1}),\\
\bar{e}_{n+1}= \bar{h}(\bar{a}_{n+1})\bar{\psi}(\bar{a}_{n+1}, \bar{b}_{n+1}).
\end{cases}
\end{eqnarray}
(43)
Next, we present the semi-local convergence analysis of method (2) using the conditions (\(\bar{C}\)) instead of conditions (\(C\)) used in [
9].
Theorem 2.
Suppose that the conditions (\(\bar{C}\)) hold and \(\bar{U}(x_0, \bar{R} \eta)\subseteq \Omega,\) where \(\bar{R}=\frac{\bar{\varphi}(\bar{a}_0)}{1-\bar{c}_0}.\) For \(\bar{a}_0=M\eta, b_0=\eta\bar{\varphi}(\eta), \bar{c}_0=\bar{h}(\bar{a}_0)\bar{\psi}(\bar{a}_0, \bar{b}_0).\) Suppose \(\bar{a}_0 < \bar{s}^*\) and \(\bar{h}(\bar{a}_0)\bar{c}_0 < 1,\) where \(\bar{s}^*\) is the smallest positive solution of equation \(\bar{\varphi}(t)t-1 =0.\) Then, sequence \(\{x_n\}\) starting from \(x_0\in \Omega_0\) and generated by method (2) is well defined in \(U(x_0, \bar{R}\eta),\) remains in \(U(x_0, \bar{R}\eta)\) for each \(n=0,1,2,\ldots\) and converges to a unique solution \(x^*\in \Omega_1=\Omega\cap U(x_0, \tilde{R}),\) where \(\tilde{R}=\frac{2}{\bar{a}_0}-\bar{R}.\) Moreover, the following estimates hold;
\[\|x_n-x^*\|\leq e_n,\]
where \(e_n=\bar{\varphi}(\bar{a}_0)\eta \alpha^n\varepsilon^{\frac{(4+3a)^n-1}{3+3a}}\frac{1}{1-\alpha \varepsilon^{(4+3a)^n}},\ \ \ \)
\(\alpha=\frac{1}{\bar{h}(\bar{a}_0)},\ \ \ \varepsilon=\bar{h}(\bar{a}_0)\bar{c}_0.\)
Proof. Simply notice that iterate belong in \(\Omega_0\) which is a more precise location than \(\Omega\) used in [9]. Hence, the proof of Theorem 1 can be repeated but using the bar constants and functions instead of the non bar constants and functions, which is an important modification (see also the Remark that follows).
Remark 2. In view of (37)-(39), we have the advantages mentioned in the introduction of this study. For example,
\[a_0=M\eta < s^* \Longrightarrow \bar{a}_0=\bar{M}\eta < \bar{s}^*,\]
\[h(a_0)c_0 < 1 \Longrightarrow \bar{h}(\bar{a}_0)\bar{c}_0 < 1,\]
but not necessarily vice versa, where \(s^*\) is the smallest positive solution of \(\varphi(t)t-1=0.\)
4. Numerical examples
We present the following example to test the convergence criteria.
Example 1. Let \(\mathbb{B}_1=\mathbb{B}_2=\mathbb{R}^3\), \(D=B(0,1), x^*=(0,0,0)^T.\) Define \(F\) on \(D\)
by
\begin{equation}
\label{eq4.8}
F(u)=F(u_1,u_2,u_3)=(e^{u_1}-1,{u_2}^2+u_2,{u_3})^T.
\end{equation}
(44)
For the points \(u=(u_1,u_2,u_3)^T\), the Fréchet derivative is given by
\[
F'(u)=\left(
\begin{array}{ccc}
e^{u_1} & 0 & 0 \\
0 & 2u_2+1 & 0 \\
0 & 0 & 1 \\
\end{array}
\right).
\]
Using the norm of the maximum of the rows and (\(a_3\))-(\(a_4\)), we see that since \(F'(x^*)=diag(1,1,1),\) we can define functions for method (2) by
\(w_0(t)=(e-1)t,\ \ w(t)=e^{\frac{1}{e-1}}t ,\ \ v(t)=\bar{v}(t)=e^{\frac{1}{e-1}}.\) Then, the radius of convergence using (10) is given by
\begin{align*}
r_1&= 0.13043162089314655482930049856805,\\
r_2&= 0.015754168279106986472193341342063=r,\\
r_3&= 0.064012457484415613562234170785814 .\end{align*} Local results were not given in [
9] but if they were, then \(w_0(t)=w(t)=et, \ \ \bar{v}(t)=v(t)=e\), so old
\begin{align*}
r_1&= 0.052167745643160665092175065638003,\\
r_2 &= 0.0017930634000776965713414012881799 = r,\\
r_3&= 0.020054377144156372569927526683387.\end{align*}
5. Conclusion
Major concerns in the study of the convergence for iterative methods (local or semilocal) are; the size of the convergence domain, the selection of the initial point and the uniqueness of the solution. We address these problems using method (2) under sufficient convergence conditions which are weaker than the ones in [
9] for the semilocal convergence case. This way we extend the convergence domain require fewer iterates to achieve a desired error tolerance and provide a better location on the location of the solution. We also examine the local convergence case not studied in [
9]. In the future we will employ our technique to extend the applicability of other iterative methods too.
Author Contributions
All authors contributed equally to the writing of this paper. All authors read and approved the final manuscript.
Conflicts of Interest
The authors declare no conflict of interest.